David Jin
116 posts

David Jin
@DaviJin
Ph.D. student in CSE @MIT; B.S. in Physics & Data Science @Caltech & @GrinnellCollege




People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. thinkingmachines.ai/blog/interacti…

Excited to share GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training, which received the Best Paper Award at the ICLR 2026 Workshop on Foundation Models for Science. Can we scale neural physics simulation without scaling expensive solver-generated labels? (1/6) (The below results are all predicted by GeoPT.)



Meta Platforms Inc. has acquired Assured Robot Intelligence, a startup developing artificial intelligence models for robots, as part of a major initiative to build humanoid technology. bloomberg.com/news/articles/…



Excited to share GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training, which received the Best Paper Award at the ICLR 2026 Workshop on Foundation Models for Science. Can we scale neural physics simulation without scaling expensive solver-generated labels? (1/6) (The below results are all predicted by GeoPT.)


















[LG] CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery A Qu, H Zheng, Z Zhou, Y Yan… [MIT & NUS] (2026) arxiv.org/abs/2604.01658



We made Muon run up to 2x faster for free! Introducing Gram Newton-Schulz: a mathematically equivalent but computationally faster Newton-Schulz algorithm for polar decomposition. Gram Newton-Schulz rewrites Newton-Schulz such that instead of iterating on the expensive rectangular X matrix, we iterate on the small, square, symmetric XX^T Gram matrix to reduce FLOPs. This allows us to make more use of fast symmetric GEMM kernels on Hopper and Blackwell, halving the FLOPs of each of those GEMMs. Gram Newton-Schulz is a drop-in replacement of Newton-Schulz for your Muon use case: we see validation perplexity preserved within 0.01, and share our (long!) journey stabilizing this algorithm and ensuring that training quality is preserved above all else. This was a super fun project with @noahamsel, @berlinchen, and @tri_dao that spanned theory, numerical analysis, and ML systems! Blog and codebase linked below 🧵





