Ethan Chern

265 posts

@ethanchern

PhD student @sjtu1896

Joined July 2023
301 Following, 183 Followers
Pinned Tweet
Ethan Chern
Ethan Chern@ethanchern·
"Failure is just iteration. No explosion, no innovation. Keep going."🚀 You vent to @elonmusk—he looks you in the eye and replies instantly, like a video call. Introducing LiveTalk: real-time video gen system on a GPU that sees you, reads emotion, and responds in real time.🧵👇
English
1
8
17
11.1K
Ethan Chern retweeted
John Wu
John Wu@JohnWu2048·
@fudayuan @whistom25 @Eudaemonia279 @stefan_fee 👋 Introducing daVinci-Env: the largest fully transparent framework for SWE environment synthesis in Python at scale. We open-source 45,320 environments, and 32B/72B models trained on them reach 62.4%/66.0% on SWE-Bench Verified.
English
1
8
42
13.9K
Ethan Chern retweeted
Shiqi Chen
Shiqi Chen@shiqi_chen17·
📍 Can LLMs discover, abstract, and reuse higher-level tool skills across tasks? Existing tool-use benchmarks test solving tasks with fixed tools. But real workflows contain recurring structures where efficiency comes from reusable tool compositions, not isolated calls.
We introduce SkillCraft: 126 tasks across 6 domains designed to test whether LLM agents can acquire compositional skills, not just call atomic tools. We also propose Skill Mode, a lightweight protocol with four MCP primitives that let agents compose, verify, cache, and reuse tool chains at test time.
Key findings from evaluating 8 SOTA models:
⚡ Skill Mode enables agents to self-discover and reuse skills, leading to higher success and efficiency than agents without it. The gains are larger for stronger models.
🧠 Stronger models (e.g., Claude) discover more generalizable skills, which transfer across tasks and even across models.
🔍 Deeper composition ≠ better: shallow, well-tested skills generalize best.
🔗 Paper: arxiv.org/abs/2603.00718
💻 Code: github.com/shiqichen17/Sk…
🏠 Page: skillcraft-website.github.io/page
(1/7)
English
9
40
202
67.1K
Ethan Chern retweeted
Yi wei Qin
Yi wei Qin@QinYi88814·
Should data "evolve"? 🧬 Scaling is not enough. Model performance is bounded by data, but its value is defined by processing depth. We introduce Data Darwinism, a 10-level hierarchy (L0-L9) redefining data as an eternal co-evolutionary process. (1/n) huggingface.co/papers/2602.07…
English
2
10
19
8.3K
Ethan Chern retweeted
马东锡 NLP
马东锡 NLP@dongxi_nlp·
My three favorite Coding Agent papers this week: daVinci-Agency, daVinci-Dev, and ProjDevBench.
Chinese
6
34
237
19.3K
Ethan Chern retweeted
Dongrui Liu
Dongrui Liu@dong_rui39501·
[1/8] 🐶 Introducing "AgentDoG": A Diagnostic Guardrail Framework for AI Agent Safety. It achieves SOTA performance, diagnosing root causes (e.g., prompt injection, tool misuse) with 82% accuracy, far surpassing general LLMs. 📄 Paper: arxiv.org/abs/2601.18491
English
9
12
20
982
Ethan Chern retweeted
Mohan Jiang
Mohan Jiang@mohan_jian12240·
🎨 AI agents are excellent "Sprinters" solving single functions in seconds, but they fail at "Marathons" like long-horizon tasks. They lose context, drift, or give up. Why? They lack endurance training. We introduce daVinci-Agency: The FIRST automatic data synthesis pipeline to achieve project evolution level agency! With just 239 samples, we beat baselines trained on 66k samples. 🚀 Paper: huggingface.co/papers/2602.02…
English
5
4
9
2K
Ethan Chern retweeted
Ji Zeng
Ji Zeng@stargazer4096·
Can we instill foundational agent behaviors before SFT/RL? Introducing daVinci-Dev: First systematic study of Agentic Mid-Training for SWE. 🎨🤖 We lift Qwen-2.5 to the level of Qwen3, and we release billions of tokens! 📄 github.com/GAIR-NLP/daVin… 🤗 huggingface.co/collections/GA…
English
2
9
18
3.5K
Ethan Chern retweeted
马东锡 NLP
马东锡 NLP@dongxi_nlp·
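[Benchmark: AgencyBench] I really like this benchmark, which measures how agents perform on long-horizon, ultra-long-context, high-complexity tasks. The paper's analysis of each model's preferences when working on tasks is fascinating: Claude and GPT favor shell commands; Gemini favors memory tools; Qwen favors file operations; Grok and GLM favor web search.
Chinese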
3
11
43
5.6K
Ethan Chern retweeted
Keyu Li
Keyu Li@chlorophyllwzh·
The boundary of evaluation determines the upper limit of intelligence. 🚀 NEW PAPER: "AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts"
English
1
9
10
697
Ethan Chern retweeted
DailyPapers
DailyPapers@HuggingPapers·
LiveTalk: a real-time multimodal interactive video diffusion system that achieves a 20× speedup through improved on-policy distillation. It generates avatar videos from text, image, and audio inputs with sub-second latency.
English
2
9
65
5K
Ethan Chern retweeted
Pengfei Liu
Pengfei Liu@stefan_fee·
🚀 LiveTalk 1.0 is NOW OPEN SOURCE! Pushing multimodal (video, audio) generation from “offline rendering” to “real-time interaction,” enabling AI to truly “show and tell”
🎯 Core Innovation:
• Diffusion handles rendering
• Next-token-prediction handles cognitive agency
• On-policy distillation + system optimization → real-time interaction
• Open-sourced
• 20× speedup
• 0.33s first-frame latency
📄 Paper: arxiv.org/pdf/2512.23576
💻 Code: github.com/GAIR-NLP/LiveT…
🤗 Model: huggingface.co/GAIR/LiveTalk-…
Cognitively Agentic World Model: Bringing cognitive companions from the physical world into the digital realm 🌟
Ethan Chern@ethanchern

"Failure is just iteration. No explosion, no innovation. Keep going."🚀 You vent to @elonmusk—he looks you in the eye and replies instantly, like a video call. Introducing LiveTalk: real-time video gen system on a GPU that sees you, reads emotion, and responds in real time.🧵👇

English
0
2
7
938
Ethan Chern retweeted
Wildminder
Wildminder@wildmindai·
LiveTalk: A real-time multimodal avatar that streams video at ~25 FPS (OmniAvatar-1.3B + Qwen3-Omni). github.com/GAIR-NLP/LiveT…
English
7
39
310
29.2K
Ethan Chern retweeted
Aphelios Tang
Aphelios Tang@Aphelios_Tang·
We’re pushing Video Gen beyond offline rendering into the era of Real-Time Interaction! 🎬📷 If you like what we’re building, please give us a ⭐ on GitHub and an ↑ on Hugging Face! It means a lot to the team. 🙏 x.com/ethanchern/sta…
Ethan Chern@ethanchern

LiveTalk is ready for the real world, as we’re moving from "generating video clips" to "building true relationships"!! Try out our model as we keep improving it! Great collaboration with Zhulin Hu, @Aphelios_Tang , @jiadisu7 , @steffichern , @SJTUDengLab, and @stefan_fee!!

English
0
1
1
182
Ethan Chern
Ethan Chern@ethanchern·
Chat with your favorite idol through LiveTalk!!
English
1
0
2
201