Sitan Chen

198 posts

Sitan Chen

@sitanch

assistant professor of computer science @hseas, learning theorist, 🎹

Katılım Nisan 2020

203 Takip Edilen1.9K Takipçiler

Sabitlenmiş Tweet

Sitan Chen@sitanch·11 Şub

Excited about this new work where we dig into the role of token order in masked diffusions! MDMs train on some horribly hard tasks, but careful planning at inference can sidestep the hardest ones, dramatically improving over vanilla MDM sampling (e.g. 7%->90% acc on Sudoku) 1/

English

154

38.8K

Sitan Chen@sitanch·28 Nis

@thegautamkamath @eatcs_secretary Wow congrats Gautam!

Français

Gautam Kamath@thegautamkamath·27 Nis

I am honoured (and still a bit stunned) to receive the 2026 Presburger Award from @eatcs_secretary. This recognizes 1 or 2 young scientists for outstanding contributions in theoretical CS This honour is shared w my collaborators, students, institutions, & research community 1/7

English

477

30.7K

Sitan Chen@sitanch·19 Şub

@SurbhiGoel_ Congrats Surbhi!!

हिन्दी

153

Surbhi Goel@SurbhiGoel_·17 Şub

Honored and grateful to be selected as a Sloan fellow this year! A big thanks to my wonderful students, collaborators, and mentors, none of this would be possible without you all.

Sloan Foundation@SloanFoundation

Congrats to the 126 early-career scholars awarded a 2026 Sloan Research Fellowship, whose creativity and innovation set them apart as the next generation of scientific leaders! Our Fellows represent 7 fields and 44 institutions across the US and Canada. sloan.org/fellowships/20…

English

5.2K

Sitan Chen@sitanch·17 Şub

Excited about this paper where we revisit the core message of our ICML '25 work (diffusion LM training is hard, but enables any-order generation) and develop a new paradigm that achieves 2.5x training speedups by aligning the orders encountered at inference and over training!

Jaeyeon (Jay) Kim@Jaeyeon_Kim_0

🚨🚨🚨 Now you can stop training your masked diffusion models ''for the worst''. We propose 🐆PUMA🐆--Progressive UnMAsking, a simple modification of the forward masking process that speeds up the masked diffusion training.

English

4.5K

Sitan Chen retweetledi

Adil Salim@AdilSlm·16 Oca

📢New paper out! We propose an inference algorithm for diffusion models that does not explicitly depend on the ambient dimension and converges exponentially fast. That’s because, unlike most of the competition, we solve the reverse ODE via Picard and not via Euler discretization

English

210

15.1K

Sitan Chen retweetledi

Institute for Foundations of Machine Learning@MLFoundations·15 Ara

Adam Klivans Wins Test of Time Award at FOCS 2025: cs.utexas.edu/news/2025/adam…

English

33K

Sitan Chen@sitanch·8 Ara

Additionally, please check out the nice concurrent work of Lavenant & Zanella which also proved the connection to Riemann approx of the information curve, plus prior works of Li & Cai and the seminal work of Tim Austin giving operational meaning to dual total correlation. 8/8

English

968

Sitan Chen@sitanch·8 Ara

Was very fun working with my amazing coauthors Kevin Cong and Jerry Li on this project! Remarkably, Kevin is still an undergrad but could easily pass for a seasoned PhD student given the mathematical level at which he operates.. Paper link: arxiv.org/pdf/2511.04647 7/

English

1.1K

Sitan Chen@sitanch·8 Ara

Proponents of diffusion language models tout their ability to generate many tokens in parallel. Skeptics argue this is fundamentally broken as it ignores token dependencies. Who's right? 🤔🤔🤔 🚀 In a new work, we rigorously prove that the picture is a lot more nuanced... 1/

English

126

16.5K

Sitan Chen@sitanch·29 Kas

Congratulations to the authors for building this awesome resource for the community! Excited to see FlexMDM here 😄

Kalyan@nkalyanv99

We’re releasing UNI-D², a unified codebase for discrete diffusion language models 🤝🚀 Co-led with @vincentpaulinef and an amazing advisor team: @stefanAbauer, @AlexanderTong7 , @andrea_dittadi, @AMK6610, @KaplFer 🙌 🔗 GitHub: github.com/nkalyanv99/UNI… 📚 Docs: nkalyanv99.github.io/UNI-D2/ Reproduce and extend state-of-the-art baselines with one toolkit. Let’s move beyond autoregressive models and push discrete diffusion together 🧵👇

English

1.8K

Sitan Chen retweetledi

Jaeyeon (Jay) Kim@Jaeyeon_Kim_0·11 Kas

🚨🚨🚨 Now your Masked Diffusion Model can self-correct! We propose PRISM, a plug-and-play approach fine-tuning method that adds self-correction ability to any pretrained MDM! (1/N)

GIF

English

304

47.7K

Sitan Chen retweetledi

Physics Magazine@PhysicsMagazine·29 Eki

Researchers have demonstrated an algorithm that characterizes quantum systems of any size with optimal efficiency and precision without needing prior information or assumptions about the system’s structure. go.aps.org/4hr1wHO

English

2.5K

Sitan Chen retweetledi

Aayush Karan@aakaran31·17 Eki

We found a new way to get language models to reason. 🤯 No RL, no training, no verifiers, no prompting. ❌ With better sampling, base models can achieve single-shot reasoning on par with (or better than!) GRPO while avoiding its characteristic loss in generation diversity.

English

250

1.7K

277K

Sitan Chen@sitanch·11 Eki

@RichardKueng @gong_weiyuan Congrats on the beautiful result! This question has been on my mind for a while now, so it’s great to see it finally solved :)

English

Richard Kueng@RichardKueng·10 Eki

Huge thanks to Viet Tran, Mariami Gachechiladze and MVP Jan Noeller for a productive and fun collaboration! Plus, big shoutout to @sitanch, @gong_weiyuan and Qi Ye for developing elegant new frameworks for proving hardness of learning tasks which we managed to adapt to our needs.

English

427

Richard Kueng@RichardKueng·10 Eki

@RobertHuangHY, myself and @preskill identified quantum state learning tasks that look hard, but become easy if you jointly process 2 copies. I have long wondered whether such challenges exist for c>2 copies. Turns out yes, there is an infinite hierarchy: scirate.com/arxiv/2510.080…

English

1.1K

Sitan Chen@sitanch·10 Eki

Paper link: arxiv.org/abs/2510.08499 6/6

English

519

Sitan Chen@sitanch·10 Eki

Hard to believe it's been ~5 years since @JordanCotler, @RobertHuangHY, and I started working together on quantum learning under realistic constraints, and while the world looks very different these days, the sheer fun of collaborating w/ them remains a reassuring constant 😀 5/

English

525

Sitan Chen@sitanch·10 Eki

⚛️⚛️⚛️ Thrilled to share our new paper on quantum probe tomography! In this work we ask: Can one learn about a complex quantum system given only the ability to control and measure a single particle? 1/

English

7.1K

Keşfet

@thegautamkamath @eatcs_secretary @SurbhiGoel_ @RichardKueng @gong_weiyuan @RobertHuangHY @preskill @JordanCotler