eugene

49 posts

eugene

@e_chx4

post-training @cohere

Toronto, Ontario Katılım Ekim 2021

102 Takip Edilen119 Takipçiler

eugene@e_chx4·2d

never felt more sus (start up school)

English

1.4K

eugene@e_chx4·4d

@beffjezos @1vnzh

QAM

161

Beff (e/acc)@beffjezos·4d

If your ML engineers don't come from the same school as Ilya (UToronto), they're just sparkling SWEs

English

177

13.6K

eugene@e_chx4·5d

cohemon cards are all u need

English

1.2K

eugene@e_chx4·5 Nis

living on a high floor in a condo and riding the elevator at peak hours gotta be one of the craziest rage baits

English

257

eugene@e_chx4·5 Nis

@DynamicWebPaige @arxiv @aidangomez

QAM

👩‍💻 Paige Bailey@DynamicWebPaige·4 Nis

👕 perhaps my favorite swag of all time: black sweater with the @arxiv link for Attention is All You Need

San Francisco, CA 🇺🇸 English

244

12.3K

eugene@e_chx4·4 Nis

#04 #keshi #asian #boba

299

eugene@e_chx4·1 Nis

I shared this message with the team today.

English

1.3K

eugene retweetledi

Ivan Zhang@1vnzh·27 Mar

rest now brother, we have the watch. we'll see you in Ottawa

English

156

21.8K

eugene@e_chx4·25 Mar

ZXX

907

eugene@e_chx4·18 Mar

@1vnzh bros cosplaying as a member of technical staff

English

126

Ivan Zhang@1vnzh·18 Mar

"PTAL"

Español

2.3K

eugene@e_chx4·18 Mar

@1vnzh not one original tweet unc 👎

English

160

eugene retweetledi

Achilles@Xhej__·4 Mar

ZXX

1.4K

7.8K

186.2K

eugene retweetledi

zombie@marsfairyy·26 Şub

#watdatmean

QHT

18.9K

258.5K

eugene@e_chx4·7 Şub

who knew coasters could be tuff

English

3.8K

eugene retweetledi

Yuchen Jin@Yuchenj_UW·24 Oca

From my observation of friends around me, those who’ve worked at frontier AI labs experience exponential growth. It’s not just technical. It’s a deeper shift in how they view the world, trends, and themselves. Being immersed in an environment full of other exceptional people led to exponential growth. There’s a clear lesson here for startups: hire the very best, put them together, and you get compounding effects. And for each individual, find the environment that puts you on an exponential curve.

English

668

50.3K

eugene@e_chx4·6 Oca

@aarush @nvidia if only varun was here to see this

English

Aarush Sah@aarush·5 Oca

Joined @NVIDIA today! Time to cook

English

777

262

11.4K

343.7K

eugene@e_chx4·31 Ara

i need to stop day training

English

428

eugene@e_chx4·15 Ara

@natolambert @deepseek_ai @AlibabaGroup @crystalsssup @cohere @ServiceNow @tngtech ❤️

QME

405

Nathan Lambert@natolambert·15 Ara

@deepseek_ai @AlibabaGroup @crystalsssup Okay okay, due to reasonable feedback we added: @cohere for their non commercial models @ServiceNow with Apriel, I like folks there (and pipeline rl) Motif @tngtech as a shout out for awesome hacks and merges of big MoEs This is DEFINITELY right, no take backs

English

24.5K

Nathan Lambert@natolambert·14 Ara

Open models year in review What a year! We're back with an updated open model builder tier list, our top models of the year, and our predictions for 2026. First, the winning models: 1. DeepSeek R1 (@deepseek_ai): Transformed the AI world 2. Qwen 3 Family (@AlibabaGroup): The new default open models 3. Kimi K2 Family (@Kimi_Moonshot): Models that convinced the world that DeepSeek wasn't special and China would produce numerous leading models. Runner up models: MiniMax M2 (@minimax_ai), GLM 4.5 (@Zai_org), GPT-OSS (@OpenAI), Gemma 3 (@GoogleAI), Olmo 3 (@allen_ai) Honorable Mentions: Nvidia's (@nvidia) Parakeet speech-to-text model & Nemotron 2 LLM, Moondream 3 VLM (@moondreamai), Granite 4 LLMs (@IBMResearch), and HuggingFace's (@huggingface) SmolLM3. Updated Tier list: Frontier open labs: DeepSeek (@deepseek_ai), Qwen (@AlibabaGroup), and Kimi Moonshot (@Kimi_Moonshot) Close behind: Z.ai (@Zai_org) & MiniMax AI (@minimax_ai) (notably none from the U.S. here and up) Noteworthy (a mix of US & China): StepFun AI (@StepFun_ai), Ant Group's (@AntGroup/ @TheInclusionAI Inclusion AI, Meituan (@Meituan_LongCat), Tencent (@TencentHunyuan), IBM (@IBMResearch), Nvidia (@nvidia), Google (@GoogleAI), & Mistral (@MistralAI) Then a bunch more below that, which we detail. Predictions for 2026: 1. Scaling will continue with open models. 2. No substantive changes in the open model safety narrative. 3. Participation will continue to grow. 4. Ongoing general trends will continue w/ MoEs, hybrid attention, dense for fine-tuning. 5. The open and closed frontier gap will stay roughly the same on any public benchmarks. 6. No Llama-branded open model releases from Meta in 2026. Read the full post on @interconnectsai -- link below.