eugene
49 posts


👕 perhaps my favorite swag of all time: black sweater with the @arxiv link for Attention is All You Need

San Francisco, CA 🇺🇸 English
eugene retweetledi
eugene retweetledi
eugene retweetledi
eugene retweetledi

From my observation of friends around me, those who’ve worked at frontier AI labs experience exponential growth.
It’s not just technical. It’s a deeper shift in how they view the world, trends, and themselves. Being immersed in an environment full of other exceptional people led to exponential growth.
There’s a clear lesson here for startups: hire the very best, put them together, and you get compounding effects.
And for each individual, find the environment that puts you on an exponential curve.
English

@deepseek_ai @AlibabaGroup @crystalsssup Okay okay, due to reasonable feedback we added:
@cohere for their non commercial models
@ServiceNow with Apriel, I like folks there (and pipeline rl)
Motif
@tngtech as a shout out for awesome hacks and merges of big MoEs
This is DEFINITELY right, no take backs

English

Open models year in review
What a year! We're back with an updated open model builder tier list, our top models of the year, and our predictions for 2026.
First, the winning models:
1. DeepSeek R1 (@deepseek_ai): Transformed the AI world
2. Qwen 3 Family (@AlibabaGroup): The new default open models
3. Kimi K2 Family (@Kimi_Moonshot): Models that convinced the world that DeepSeek wasn't special and China would produce numerous leading models.
Runner up models: MiniMax M2 (@minimax_ai), GLM 4.5 (@Zai_org), GPT-OSS (@OpenAI), Gemma 3 (@GoogleAI), Olmo 3 (@allen_ai)
Honorable Mentions: Nvidia's (@nvidia) Parakeet speech-to-text model & Nemotron 2 LLM, Moondream 3 VLM (@moondreamai), Granite 4 LLMs (@IBMResearch), and HuggingFace's (@huggingface) SmolLM3.
Updated Tier list:
Frontier open labs: DeepSeek (@deepseek_ai), Qwen (@AlibabaGroup), and Kimi Moonshot (@Kimi_Moonshot)
Close behind: Z.ai (@Zai_org) & MiniMax AI (@minimax_ai) (notably none from the U.S. here and up)
Noteworthy (a mix of US & China): StepFun AI (@StepFun_ai), Ant Group's (@AntGroup/ @TheInclusionAI Inclusion AI, Meituan (@Meituan_LongCat), Tencent (@TencentHunyuan), IBM (@IBMResearch), Nvidia (@nvidia), Google (@GoogleAI), & Mistral (@MistralAI)
Then a bunch more below that, which we detail.
Predictions for 2026:
1. Scaling will continue with open models.
2. No substantive changes in the open model safety narrative.
3. Participation will continue to grow.
4. Ongoing general trends will continue w/ MoEs, hybrid attention, dense for fine-tuning.
5. The open and closed frontier gap will stay roughly the same on any public benchmarks.
6. No Llama-branded open model releases from Meta in 2026.
Read the full post on @interconnectsai -- link below.

English















