Anton Baumann retweeted

We have been exploring new algorithmic frontiers and are excited to share our contributions to Self Distillation Policy Optimization (SDPO) for agentic continual learning. Check out our blog post here:
trajectory.ai/field-notes/sc…







