Ning Ding

372 posts

Ning Ding banner
Ning Ding

Ning Ding

@stingning

Researcher of AI/LM. Assistant Professor @Tsinghua_Uni. Working on scalable methods of language models.

Earth Katılım Mayıs 2015
358 Takip Edilen5.3K Takipçiler
Sabitlenmiş Tweet
Ning Ding
Ning Ding@stingning·
Building upon SimpleVLA-RL, we have implemented real-world RL on long-horizon dexterous tasks and witnessed a non-trivial (~relatively 300%) performance improvement over the SFT model, along with surprising capabilities on auto-recovery. Blog coming soon. The entire process uses very little data and training compute—basically costing no more than a single robotic arm—hinting that real-world generality for machines is actually within sight.
English
16
86
606
87.1K
Ning Ding retweetledi
Junyang Lin
Junyang Lin@JustinLin610·
working on distillation
Junyang Lin tweet media
English
28
15
494
16.1K
Ning Ding retweetledi
Bingxiang He
Bingxiang He@HBX_hbx·
✨ [ICLR 2026] How Far Can Unsupervised RLVR Scale LLM Training? The dream: models can improve themselves without human supervision. The reality: sometimes they can only sharpen what they already believe. Intrinsic rewards struggle to scale LLM training because they follow a rise-then-fall pattern that makes collapse mathematically inevitable. But that's not the end of the story. We find unsupervised RLVR (URLVR) is particularly well-suited for test-time training and quantifying model priors. The full picture 👇 📄 Paper: arxiv.org/abs/2603.08660 🧪 GitHub: github.com/PRIME-RL/TTRL (1/n)
Bingxiang He tweet media
English
1
13
64
4.6K
Ning Ding retweetledi
青龍聖者
青龍聖者@bdsqlsz·
it is coming.
青龍聖者 tweet media
English
109
152
1.6K
517K
Ning Ding
Ning Ding@stingning·
I increasingly believe that all the things — AI, robotics, controlled fusion, space, quantum computing, and more — will converge into one thing.
English
8
1
20
1.7K
Ning Ding
Ning Ding@stingning·
GPT-5.3-Codex-Spark on @cerebras is something very different.
English
1
0
9
1.1K
Ning Ding
Ning Ding@stingning·
Today I heard a line that stuck with me: "the real moat is the organizational structure."
English
4
5
51
6.4K
Ning Ding
Ning Ding@stingning·
I’m not sure how many people have noticed this, but there are already very, very few humans left in @openclaw's PR queue: AIs open the pull requests, other AIs review and score them, and (partially maybe) decide what gets merged. If you want to see “self-evolving AI” in the wild, it’s right here.
English
0
0
29
5K
Ning Ding retweetledi
jietang
jietang@jietang·
@geekbb 能,相信ds
日本語
26
16
325
105.9K