Linyang He

18 posts

@LinyangNeuroAI

PhD Student @ZuckermanBrain and @EE_ColumbiaSEAS with @NimaMesgarani. Interested in Human&Machine Intelligence

NYC · Joined October 2025
118 Following · 28 Followers
Linyang He reposted
Siyuan @siyuansong_
🚀 Announcing the Chinese BabyLM Challenge: the first shared task on data-efficient pretraining for Chinese. 📍 Co-located with NLPCC 2026 (Nov 3–5, Macau🇨🇳🇲🇴) Can you train a strong Chinese LM on just ~100M words? chinese-babylm.github.io 🧵 👇(1/6)
Linyang He reposted
Hanze Dong @hendrydong
Between theorem recognition and theorem proving lies theorem understanding. We introduce LiveMathematicianBench: a live, contamination-resistant testbed for research-level mathematical reasoning, built from post-cutoff arXiv theorems. It probes a capability that existing benchmarks rarely isolate: whether models can understand theorem statements, track delicate assumptions, reason over logical structure, and leverage proof-level guidance. livemathematicianbench.github.io
Hanze Dong @hendrydong
Burned some after-work hours wiring Copilot CLI into WeChat, with full multi-session support. WeChat ←→ copilot-wechat ←→ Copilot CLI (ACP) ←→ GPT / Claude / Gemini. That lets me work from anywhere with my phone. We're moving past static apps. The new meta is Dynamic Personal Interfaces: spin up a bespoke workspace in minutes, throw it into your favorite chat app, and ship from anywhere. New vibes are incoming! github.com/hendrydong/cop…
Linyang He reposted
Hanze Dong @hendrydong
SFT curates responses. RL curates sampling.   RL improves by curating what the model experiences: condition; distribution; weighting of what gets learned from.   Better signal curation shifts the performance-compute curve upward.   Full write-up below 👇 hendrydong.github.io/blogs/pages/rl…
Linyang He reposted
Hokin Deng @DengHokin
#VideoReason We are open-sourcing the entire VBVR stack to speed up the arrival of video reasoning as the next fundamental paradigm of intelligence:
- 150+ synthetic generators
- 1 million training clips
- Cloud-scale data factory
- Unified EvalKit
- 100 rule-based evaluators
- Strong baseline model
Check it out at video-reason.com
Linyang He reposted
Yinghao Ma @nicolaus625
[1/n] 🧠 Can LLMs understand viral meme clips like Rickrolling, Leekspin, Nyan Cat, and “愛♡スクリ~ム”? 🎉 Happy to share AVMeme Exam, the funniest audio/video understanding benchmark ever! We evaluate multimodal LLMs on the meme clips you hear and see daily on YouTube, TikTok & Bilibili
Kanishka Misra 🌊 @kanishkamisra
@LinyangNeuroAI presenting a classifier-free representational analysis on BLIMP and COMPS @unireps. Enjoying how creatively he’s been using (and extending) minimal pair datasets!
Linyang He @LinyangNeuroAI
5️⃣ Takeaway:
- Raw LLM embeddings = biased toward shallow linguistic features.
- Residual disentanglement exposes the deeper, reasoning-specific representations shared by brains and models.