Anton Baumann retweetou

We have been exploring new algorithmic frontiers and are excited to share our contributions to Self Distillation Policy Optimization (SDPO) for agentic continual learning, check out our blog post here:
trajectory.ai/field-notes/sc…
English







