Lichang Chen

336 posts

@LichangChen2

Coding Agent RL & Harness of 🥑 @Meta MSL | Previously: GenAI unit @GoogleDeepmind | PhD’25 @umdcs BS’20 @zju_china

Menlo Park, CA · Joined September 2021
753 Following · 940 Followers
Pinned Tweet
Lichang Chen@LichangChen2·
Try Contemplating Mode in Meta.AI! It uses parallel thinking to deliver more well-rounded answers. We’d love your feedback! I feel very fortunate to have contributed to this project and to keep learning through it. It has been both fun and rewarding to grow alongside one of the best reasoning teams on the planet.
Lijie(Derrick) Yang@LijieyYang·
Excited to share that LessIsMore has been accepted to ICML 2026! 🚀 LessIsMore is a training-free sparse attention for efficient long-horizon reasoning. By enforcing cross-head unified token selection, it brings up to 1.6x E2E speedup while preserving reasoning accuracy under practical workloads. Huge thanks to my amazing co-authors and mentors @Jackfram2, @JiaZhihao, Ravi! Paper: arxiv.org/abs/2508.07101 Code: github.com/DerrickYLJ/Les… #ICML2026 #LLM #EfficientAI
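The "cross-head unified token selection" mentioned in the tweet above can be illustrated with a rough sketch: aggregate attention mass across heads, pick one shared top-k token set, and let every head attend only over that set. This is a minimal NumPy illustration of the general idea and not the paper's actual algorithm; the function names, the sum-pooling aggregation, and the tensor shapes are all assumptions.

```python
import numpy as np

def unified_topk_tokens(scores, k):
    # scores: (num_heads, seq_len) attention mass each head assigns to each token.
    # Pool across heads so every head shares ONE token subset (cross-head unified).
    pooled = scores.sum(axis=0)
    keep = np.argsort(pooled)[-k:]          # indices of the k highest-mass tokens
    return np.sort(keep)

def sparse_attend(q, K, V, keep):
    # q: (H, d) per-head queries; K, V: (H, T, d). Attend only over the shared subset.
    Ks, Vs = K[:, keep, :], V[:, keep, :]   # (H, k, d)
    logits = np.einsum('hd,hkd->hk', q, Ks)
    w = np.exp(logits - logits.max(axis=1, keepdims=True))
    w /= w.sum(axis=1, keepdims=True)       # per-head softmax over kept tokens only
    return np.einsum('hk,hkd->hd', w, Vs)   # (H, d) attention outputs
```

With k much smaller than the sequence length T, this shrinks the key/value working set for every head at once, which is where an end-to-end speedup would come from.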
Arena.ai@arena·
GPT-5.5 by @OpenAI is now live in the Arena, landing across multiple leaderboards. Here’s how it ranks by modality:
- Code Arena (agentic web dev): #9, a strong +50pt jump over GPT-5.4
- Document Arena (analysis & long-content reasoning): #6, on par with Sonnet 4.6
- Text Arena: #7, Math #3, Instruction Following: #8
- Expert Arena: #5
- Search Arena: #2
- Vision Arena: #5
Strong, well-rounded performance, especially in Code (+50 pts vs GPT-5.4). Congrats to @OpenAI on the release. Full category breakdowns by modality in the thread.
OpenAI@OpenAI

Introducing GPT-5.5 A new class of intelligence for real work and powering agents, built to understand complex goals, use tools, check its work, and carry more tasks through to completion. It marks a new way of getting computer work done. Now available in ChatGPT and Codex.

Lichang Chen@LichangChen2·
I like the idea, but I would rephrase it as: coding agents will be the new foundation for the whole tech industry. They can take on any role by writing code and docs, e.g., if one works on a research project/folder, it becomes an automated researcher, as long as you give it an excellent harness and let it iterate to improve the code and docs.
Jacob Effron@jacobeffron

.@swyx current thesis: "2025 was coding agents. 2026 is coding agents breaking containment to do everything else."

Lichang Chen@LichangChen2·
Let’s see where we get by the end of this year.
Arena.ai@arena

Muse Spark debuts at #7 in the Code Arena - making @AIatMeta the #3 lab right behind @AnthropicAI’s Claude Sonnet 4.6 and @Zai_org’s GLM-5.1, surpassing Gemini-3.1-Pro and GPT-5.4. Code Arena evaluates agentic coding on real-world tasks - building live websites and apps, ranked by users on real workflows. Huge congrats to @AIatMeta on this impressive milestone!

Lichang Chen retweeted
Ronit Pereira@Ronitper·
“It takes a lot of hard work to make something simple.” - Steve Jobs
Pranav Shyam@recurseparadox·
life goes by quickly when you’re fixing errors and relaunching jobs
Lichang Chen@LichangChen2·
@suchenzang Beautiful women also only have eyes for other beautiful women and we call that the culture of SF?
Susan Zhang@suchenzang·
finally joined an equinox sf gym just to ogle all the beautiful men who only have eyes for other beautiful men
Intrinsical AI@IntrinsicalAI·
@LichangChen2 This is a completely nonsensical test. You're measuring how well an LLM approaches Nash equilibria. LLMs are not deterministic, but they are in fact very useful for profiling/data-mining and micro-trends. What about letting an LLM configure the solver tree + read the results before acting?
Lichang Chen@LichangChen2·
I am wondering how we can close the gap between neural-based test-time scaling methods and language-based ones. Up till now, it seems no one has successfully reproduced AlphaGo or poker solvers on top of LLMs, even though LLMs have a stronger prior than purely neural/numerical approaches.
GTOWizard@GTOWizard

We benchmarked every major AI model at poker. GPT-5.4, Claude Opus 4.6, Gemini 3.1 Pro, Grok 4 and more. All played 5,000 hands of heads-up no-limit against our state-of-the-art poker agent. Every single one lost. Here's the full breakdown 🧵

Lichang Chen@LichangChen2·
The CEO of DeepMind asked: “In AGI systems, will LLMs be only the key component, or the total system?” I think startup founders will agree with the former, but model trainers will agree with the latter 😊
Milk Road AI@MilkRoadAI

The CEO of Google DeepMind just went on record saying he disagrees with one of the most respected AI researchers in the world.

Demis Hassabis, the man behind AlphaFold, AlphaGo, and Google's entire AI operation, publicly pushed back against Yann LeCun's claim that large language models are a dead end for artificial intelligence. LeCun, who left Meta earlier this year to start his own AI lab, has been saying for years that LLMs cannot reason, cannot plan, and will never get us to human-level intelligence.

Hassabis disagrees, and he said so directly. His position is that scaling laws are still working, foundation models are still getting more capable, and whatever AGI ends up looking like, LLMs will be a central part of it, not something that gets replaced. He does say there is roughly a 50/50 chance that one or two additional breakthroughs will be needed beyond scaling alone: things like better memory, long-term planning, and world models. But the core disagreement with LeCun is clear: Hassabis believes the current architecture is sound and the current path leads somewhere real.

Two Nobel-recognized researchers, two founding figures of modern AI, now publicly on opposite sides of the most important technical question in the industry.

Lichang Chen@LichangChen2·
@alexandr_wang It has pretty good generalization ability😆 we will get used to seeing amazing things from 🥑!
Lichang Chen@LichangChen2·
@dynemetis It highly depends on how you define continual learning. If it’s human-level continual learning, then 90% of the learning is in the model layer; it’s like receiving real-world signals and updating the neurons in your brain. 🧠