
@paolodiprodi @Dorian_Todd_ Thx a lot for the info! P.S., you’re doing really nice stuff with JEPA ;)
English
MR
358 posts

@MRisso
Workin’ towards (EDIT: Done) Comp Eng PhD 🇮🇹
















Finally finished! If you're interested in an overview of recent methods in reinforcement learning for reasoning LLMs, check out this blog post: aweers.de/blog/2026/rl-f… It summarizes ten methods, tries to highlight differences and trends, and has a collection of open problems

We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models.





