Zhuo Sun

111 posts

Zhuo Sun

Zhuo Sun

@JasonSun10

Assistant Professor@SUFE, PhD in Comp. Stats & Machine Learning@University College London

Katılım Temmuz 2021
613 Takip Edilen149 Takipçiler
Zhuo Sun retweetledi
yingzhen
yingzhen@liyzhen2·
Our own answer: structured coupling arxiv.org/abs/2605.07676 - flow matching with VAE-based coupling - VAE encoder & flow sharing networks - VAE decoder init. + flow refinement for sampling flow matching 🤝 VAEs -> good representation & sample quality🚀
yingzhen tweet media
yingzhen@liyzhen2

Tons of papers re diffusion/flow matching at ML confs these days, but to my surprise very few of them consider learning the prior🤔 Am I missing any important work here? 🙏 for suggestions

English
3
45
241
22.2K
Zhuo Sun
Zhuo Sun@JasonSun10·
@avt_im Have read many of your papers; they are really nice. Best of luck for future!
English
1
0
0
32
Alexander Terenin
Alexander Terenin@avt_im·
Big career news: I'm leaving academia - and moving to the San Francisco Bay Area to explore something new. I've written a short blog post with a few reflections on the end of this chapter. If you'd like to catch up, now is the time to reach out! avt.im/blog/the-road-…
English
13
8
277
22.6K
Zhuo Sun retweetledi
DeepSeek
DeepSeek@deepseek_ai·
🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice. Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today! 📄 Tech Report: huggingface.co/deepseek-ai/De… 🤗 Open Weights: huggingface.co/collections/de… 1/n
DeepSeek tweet media
English
1.6K
7.7K
45.5K
9.7M
Zhuo Sun retweetledi
Xiaoyuan Cheng
Xiaoyuan Cheng@cheng_xiaoyuan·
2 papers at ICLR 2026 🎉 🟢 Oral: *Information Shapes Koopman Representation* (P3-#222) 🟢 *From Embedding to Control* (P4-#4805) 🕒 Thu, Apr 23 • 2:15–4:45 PM Come say hi! On the job market; moving more toward world models (new work in progress 👀) #ICLR #ICLR2026
English
0
3
34
2.6K
Zhuo Sun retweetledi
Guanyang Wang
Guanyang Wang@GuanyangW·
Happy to share Junshi (军师), an open-source Claude Code skill we built for researchers. The goal of Junshi is to propose personalized, promising next ideas, not just summarize the literature. 1/n
English
1
3
25
2.6K
Zhuo Sun retweetledi
PageIndex
PageIndex@PageIndexAI·
Inspired by @karpathy's knowledge base thread, we are open-sourcing OpenKB: Open LLM Knowledge Base In addition to Andrej's great original design, OpenKB can scale to long PDFs and multi-modality, see details below 👇
PageIndex tweet media
English
5
16
62
4.5K
Zhuo Sun retweetledi
Lester Mackey
Lester Mackey@LesterMackey·
Qiang Liu, Chris Oates, and I are writing a monograph on Probabilistic Inference and Learning with Stein’s Method, and we’d love to get your feedback on the first draft
Lester Mackey tweet media
English
3
30
200
22.1K
Zhuo Sun
Zhuo Sun@JasonSun10·
Feel the same way!
Andrej Karpathy@karpathy

It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the "progress as usual" way, but specifically this last December. There are a number of asterisks but imo coding agents basically didn’t work before December and basically work since - the models have significantly higher quality, long-term coherence and tenacity and they can power through large and long tasks, well past enough that it is extremely disruptive to the default programming workflow. Just to give an example, over the weekend I was building a local video analysis dashboard for the cameras of my home so I wrote: “Here is the local IP and username/password of my DGX Spark. Log in, set up ssh keys, set up vLLM, download and bench Qwen3-VL, set up a server endpoint to inference videos, a basic web ui dashboard, test everything, set it up with systemd, record memory notes for yourself and write up a markdown report for me”. The agent went off for ~30 minutes, ran into multiple issues, researched solutions online, resolved them one by one, wrote the code, tested it, debugged it, set up the services, and came back with the report and it was just done. I didn’t touch anything. All of this could easily have been a weekend project just 3 months ago but today it’s something you kick off and forget about for 30 minutes. As a result, programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks *in English* and managing and reviewing their work in parallel. The biggest prize is in figuring out how you can keep ascending the layers of abstraction to set up long-running orchestrator Claws with all of the right tools, memory and instructions that productively manage multiple parallel Code instances for you. The leverage achievable via top tier "agentic engineering" feels very high right now. It’s not perfect, it needs high-level direction, judgement, taste, oversight, iteration and hints and ideas. It works a lot better in some scenarios than others (e.g. especially for tasks that are well-specified and where you can verify/test functionality). The key is to build intuition to decompose the task just right to hand off the parts that work and help out around the edges. But imo, this is nowhere near "business as usual" time in software.

English
0
0
0
153
Zhuo Sun retweetledi
Symposium on Probabilistic Machine Learning
ProbML 2026 (formerly AABI) invites submissions on probabilistic ML (Bayesian & beyond), July 5 in Seoul (co-located with ICML). Website: probml.cc. Tracks: proceedings (PMLR), workshop, fast track. New focus includes healthcare & climate! Submit by: 20 March 2026
Symposium on Probabilistic Machine Learning tweet media
English
2
13
20
8K
Zhuo Sun
Zhuo Sun@JasonSun10·
Our paper "Multilevel Control Functional" with score 8,8,8 accepted at ICLR 2026, is not recommended 'oral' at ICLR, which ranks top 20 in over 19000 submissions #iclr
English
1
1
11
4.3K
Zhuo Sun retweetledi
François-Xavier Briol
François-Xavier Briol@fx_briol·
The UCL IMSS Annual Lecture will take place on the 27th April with a keynote from @LesterMackey. The theme is 'Computational Statistics and Machine Learning', and we will have talks from Alessandro Barp, Paula Cordero Encinar & Po-Ling Loh. imss2026.github.io @stats_UCL
English
1
7
23
2.9K
Zhuo Sun retweetledi
OpenAI
OpenAI@OpenAI·
Introducing Prism, a free workspace for scientists to write and collaborate on research, powered by GPT-5.2. Available today to anyone with a ChatGPT personal account: prism.openai.com
English
1.1K
2.3K
16.2K
5.9M
Zhuo Sun
Zhuo Sun@JasonSun10·
@iclr_conf It is getting noticed that review scores are undergoing significant and rapid changes within a very short time frame. Although score adjustments are normal during discussions, the current level of fluctuation seems highly unusual under the present circumstances.
ICLR@iclr_conf

English
0
0
3
900
Zhuo Sun retweetledi
Akshay 🚀
Akshay 🚀@akshay_pachaar·
Transformer vs. Mixture of Experts in LLMs, clearly explained (with visuals):
English
14
125
1.3K
236.5K