Zhikun Xu
64 posts

Zhikun Xu
@JerrryKun
ri @AIatMeta, pursuing CS PhD @SCAI_ASU | Prev: Applied Math (B.S. & M.S.) @FudanUni, Research Internship @awscloud @AlibabaGroup @AMD | Opinions are my own.

More AI-generated code doesn't make your team faster. It might actually slow you down.



Two students take the same exam. Both score 100 — one solved it himself, the other Googled every answer. A semester later, the gap is huge. That's the problem with today's AI agents. I write a detailed blog to share my recent thoughts on this, mainly based on Theory of Agents. I promise this is definitely worth 30 minutes of your time. Blog: notion.so/Second-Half-of… Project: hrwise-nlp.github.io/assets/website…











To the questions of “why not both?”: my dream is for LLMs to make conceptual discoveries, like Galois with group theory or Einstein with general relativity. I don’t believe breakthroughs like these would come from A* search or its more advanced version MCTS.






As far as I know, there isn't any chatbot or API that gives you access to an IMO 2025 gold-medalist model. Not only does this change today, but you get to download the weights with the Apache 2.0 open-source release of @deepseek_ai Math-V2 on @huggingface! Imagine owning the brain of one of the best mathematicians in the world for free to: - explore it for research - fine-tune it - optimize it - run it on your own hardware No limitations, no nerfing, no company or government to take it back. That's democratization of AI and knowledge at its best, literally 🤯🤯🤯 You can download the weights here: huggingface.co/deepseek-ai/De…. The frontier of AI is open-source!

Want to 𝐜𝐮𝐭 𝐑𝐅𝐓 𝐭𝐫𝐚𝐢𝐧𝐢𝐧𝐠 𝐭𝐢𝐦𝐞 𝐛𝐲 𝐮𝐩 𝐭𝐨 𝟐× and boost performance? 🚀 Meet 𝑨𝒅𝒂𝑹𝑭𝑻 — a lightweight, plug-and-play curriculum learning method you can drop into any mainstream RFT algorithms (PPO, GRPO, REINFORCE). Less compute. Better results. 🧵 1/n




