Han Fang

1.4K posts


@Han_Fang_

AI Research @ Meta SuperIntelligence Labs

United States · Joined August 2011
196 Following · 1.7K Followers
Pinned Tweet
Han Fang@Han_Fang_·
R-Zero (ICLR 2026): a self-evolving LLM trained from zero external data. One base model plays two roles: a Challenger that generates hard problems and a Solver that solves them. The Challenger is rewarded when the Solver fails, and the two co-evolve with GRPO, so the Challenger learns to probe for weaknesses, not just to generate hard problems. +6.49 on math and +7.54 on general reasoning for Qwen3-4B-Base, after three iterations and no human data. arxiv.org/abs/2508.05004
5 replies · 55 retweets · 395 likes · 23.5K views
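A minimal sketch of the Challenger reward loop described in the tweet above, assuming hypothetical `solver` and `checker` callables in place of real model and verifier calls; the actual R-Zero objective may shape the reward differently (a common self-play variant that peaks near a 50% solve rate is shown as an alternative):

```python
from typing import Callable

def challenger_reward(problem: str,
                      solver: Callable[[str], str],
                      checker: Callable[[str, str], bool],
                      n_samples: int = 8) -> float:
    """Reward the Challenger by how often the Solver fails on its problem.

    Hypothetical sketch: `solver` samples an answer, `checker` verifies it.
    """
    attempts = [solver(problem) for _ in range(n_samples)]
    return sum(not checker(problem, a) for a in attempts) / n_samples

def frontier_reward(fail_rate: float) -> float:
    # Alternative shaping (an assumption, not from the tweet): reward is
    # maximal at a 50% solve rate, so problems stay hard but learnable,
    # which is one way a Challenger "probes for weaknesses".
    return 1.0 - 2.0 * abs(fail_rate - 0.5)
```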
Han Fang@Han_Fang_·
@XFreeze @elonmusk Instruction following on easy-to-verify instructions isn't sufficient; that's what IFBench measures. The real test is the instructions users actually ask. For example, do you think the response below is in your tone?
1 reply · 0 retweets · 1 like · 273 views
X Freeze@XFreeze·
Grok 4.20 just claimed #1 on IFBench (Artificial Analysis), the gold standard for instruction following, with an 81% score, outranking every other model. And here is what that actually means: when you ask Grok to do something, it doesn't give you a close-enough answer. It doesn't approximate. It doesn't go off-script. It follows your instructions. Precisely. Every time. xAI is not just racing to build the most intelligent AI; they are also building the most reliable one. An AI that actually listens to you...
193 replies · 610 retweets · 1.9K likes · 523.5K views
Alexandr Wang@alexandr_wang·
Okay, this is too exciting :) Meta AI is now #2 in the App Store, the top AI app! We are so back!
259 replies · 117 retweets · 2.2K likes · 321.8K views
Han Fang Retweeted
Jason Weston@jaseweston·
🏋️Thinking Mid-training: RL of Interleaved Reasoning🎗️ We address the gap between pretraining (no explicit reasoning) and post-training (reasoning-heavy) with an intermediate SFT+RL mid-training phase that teaches models how to think.
- Annotate pretraining data with interleaved thoughts
- SFT mid-training to learn when and what to think alongside the original content
- RL mid-training to optimize reasoning generation, with a grounded reward from future-token prediction
Result: a 3.2x improvement on reasoning benchmarks compared to direct RL post-training on base Llama-3-8B, with gains over SFT-only mid-training as well. Introducing reasoning earlier makes models better prepared for post-training! Read more in the blog post: facebookresearch.github.io/RAM/blogs/thin…
9 replies · 71 retweets · 553 likes · 65.9K views
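A hedged sketch of the "grounded reward from future-token prediction" mentioned in the post above: a generated thought is scored by how much it improves a frozen model's log-likelihood of the next span of real text. The `logprob` callable is a hypothetical stand-in for an actual LM scoring call, and the `<think>` delimiters are an assumption, not the paper's format:

```python
from typing import Callable

# (context, continuation) -> average log-prob of continuation under a frozen LM
LogProbFn = Callable[[str, str], float]

def thought_reward(context: str, thought: str,
                   future_text: str, logprob: LogProbFn) -> float:
    """Reward = gain in future-token log-likelihood from inserting the thought."""
    with_thought = logprob(context + "\n<think>" + thought + "</think>\n",
                           future_text)
    without_thought = logprob(context, future_text)
    # Positive iff the thought actually helps predict the original content.
    return with_thought - without_thought
```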
Han Fang Retweeted
Shengjia Zhao@shengjia_zhao·
Excited to share what we’ve been building at Meta Superintelligence Labs! We just released Muse Spark, our first AI model. It's a natively multimodal reasoning model and the first step on our path to personal superintelligence. We've overhauled our entire stack to support scaling, and this is just the beginning. ai.meta.com/blog/introduci…
74 replies · 173 retweets · 1.7K likes · 224.8K views
Han Fang Retweeted
Hongyu Ren@ren_hongyu·
Check out Muse Spark, our first milestone in the quest for personal superintelligence! Scaling this with the team has been a total blast. Give it a spin and let us know what you think! 🥑
18 replies · 59 retweets · 315 likes · 63.7K views
Han Fang Retweeted
AI at Meta@AIatMeta·
Today we're introducing TRIBE v2 (Trimodal Brain Encoder), a foundation model trained to predict how the human brain responds to almost any sight or sound. Building on our Algonauts 2025 award-winning architecture, TRIBE v2 draws on 500+ hours of fMRI recordings from 700+ people to create a digital twin of neural activity and enable zero-shot predictions for new subjects, languages, and tasks. Try the demo and learn more here: go.meta.me/tribe2
736 replies · 2.5K retweets · 16K likes · 6.8M views
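Purely illustrative: one way a trimodal encoder like the one announced above could be wired, regressing per-parcel fMRI responses from video, audio, and text features. All dimensions, the fusion scheme, and module names here are assumptions, not the actual TRIBE v2 architecture:

```python
import torch
import torch.nn as nn

class TrimodalBrainEncoder(nn.Module):
    """Sketch: fuse three modality streams, predict one value per brain parcel."""

    def __init__(self, d_video=768, d_audio=512, d_text=768,
                 d_model=512, n_parcels=1000):
        super().__init__()
        # Project each modality's pretrained features into a shared space.
        self.proj = nn.ModuleDict({
            "video": nn.Linear(d_video, d_model),
            "audio": nn.Linear(d_audio, d_model),
            "text":  nn.Linear(d_text, d_model),
        })
        self.fuse = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True),
            num_layers=2)
        self.head = nn.Linear(d_model, n_parcels)  # one output per parcel

    def forward(self, video, audio, text):
        # Each input: (batch, time, d_modality), aligned to the fMRI TRs.
        x = torch.cat([self.proj["video"](video),
                       self.proj["audio"](audio),
                       self.proj["text"](text)], dim=1)
        return self.head(self.fuse(x).mean(dim=1))  # (batch, n_parcels)
```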
Karthik A Sankararaman 🇮🇳🇺🇸
@Yuchenj_UW Even (and especially) assuming AGI, this doesn't logically follow at all. The set of things a frontier lab can do is finite (bounded by compute). The set of things a business can be built on is infinite. So a finite set of companies doing a finite set of things can't cover an infinite one.
3 replies · 0 retweets · 4 likes · 364 views
Yuchen Jin@Yuchenj_UW·
Some people at frontier AI labs told me they believe startups are over. OpenAI, Anthropic, Google, xAI will absorb every industry as AGI nears. Coding today, science, medicine, and finance next. Then everything else. If they’re right, that’s a pretty boring end of the world.
540 replies · 160 retweets · 3K likes · 944.5K views
Han Fang Retweeted
Karina Nguyen@karinanguyen·
Excited to release PostTrainBench v1.0! This benchmark evaluates the ability of frontier AI agents to post-train language models in a simplified setting. We believe this is a first step toward tracking progress in recursive self-improvement 🧵:
45 replies · 90 retweets · 677 likes · 148.3K views
Han Fang@Han_Fang_·
I’ve been thinking about something lately. Every mature science has its central dogma — a foundational claim so deeply embedded that practitioners forget it’s even there. Biology has DNA → RNA → Protein. Thermodynamics has entropy. Does AI have one? Here I want to share a thought exercise of mine: what if we treated the compression-intelligence connection not as a useful intuition, but as our field’s central dogma? See my thoughts here: tokens-for-thoughts.notion.site/the-central-do…
1 reply · 0 retweets · 1 like · 262 views
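A toy illustration of the compression-intelligence connection the post above refers to: under arithmetic coding, a model's average negative log2-probability per symbol is exactly its code length in bits, so a better predictor is a better compressor. Here gzip and a unigram entropy bound stand in as weak "models" for comparison; the sample text is made up:

```python
import gzip
import math
from collections import Counter

def bits_per_byte_gzip(data: bytes) -> float:
    # Achieved code length of a real (weak) compressor.
    return 8 * len(gzip.compress(data)) / len(data)

def bits_per_byte_unigram(data: bytes) -> float:
    # Optimal code length under a unigram byte model: H = -sum p log2 p.
    counts = Counter(data)
    n = len(data)
    return -sum(c / n * math.log2(c / n) for c in counts.values())

text = b"the cat sat on the mat " * 100
# gzip exploits repetition that the unigram model cannot see, so it
# achieves fewer bits per byte: better prediction <=> better compression.
print(bits_per_byte_gzip(text), bits_per_byte_unigram(text))
```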
Han Fang Retweeted
Kimi.ai@Kimi_Moonshot·
Kimi K2.5 tech report just dropped! Quick hits:
- Joint text-vision training: pretrained with 15T vision-text tokens, zero-vision SFT (text-only) to activate visual reasoning
- Agent Swarm + PARL: dynamically orchestrated parallel sub-agents, up to 4.5× lower latency, 78.4% on BrowseComp
- MoonViT-3D: a unified image-video encoder with 4× temporal compression, enabling 4× longer videos in the same context
- Toggle: token-efficient RL, 25–30% fewer tokens with no accuracy drop
Here's our work toward scalable, real-world agentic intelligence. More details in the report 👉 github.com/MoonshotAI/Kim…
54 replies · 279 retweets · 1.9K likes · 313.5K views
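A minimal sketch of the "dynamically orchestrated parallel sub-agents" idea from the tweet above: an orchestrator fans a task out to sub-agents concurrently and merges their results, trading extra tokens for lower wall-clock latency. `run_subagent` is a hypothetical stand-in for a real model or browsing call; this is not Kimi's Agent Swarm API:

```python
import asyncio

async def run_subagent(name: str, subtask: str) -> str:
    # Placeholder for an LLM / tool / browsing call that takes real time.
    await asyncio.sleep(0.1)
    return f"[{name}] result for: {subtask}"

async def orchestrate(task: str, n_agents: int = 4) -> str:
    # Fan out: shard the task and run all sub-agents concurrently, so
    # latency is roughly one call instead of n_agents sequential calls.
    subtasks = [f"{task} (shard {i})" for i in range(n_agents)]
    results = await asyncio.gather(*(
        run_subagent(f"agent-{i}", s) for i, s in enumerate(subtasks)))
    # A real orchestrator would synthesize, not just concatenate.
    return "\n".join(results)

print(asyncio.run(orchestrate("survey recent RL papers")))
```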