eugene

49 posts

eugene banner
eugene

eugene

@e_chx4

post-training @cohere

Toronto, Ontario Katılım Ekim 2021
102 Takip Edilen119 Takipçiler
eugene
eugene@e_chx4·
never felt more sus (start up school)
eugene tweet media
English
5
0
44
1.4K
Beff (e/acc)
Beff (e/acc)@beffjezos·
If your ML engineers don't come from the same school as Ilya (UToronto), they're just sparkling SWEs
English
18
10
177
13.6K
eugene
eugene@e_chx4·
cohemon cards are all u need
eugene tweet media
English
1
0
20
1.2K
eugene
eugene@e_chx4·
living on a high floor in a condo and riding the elevator at peak hours gotta be one of the craziest rage baits
English
0
0
5
257
👩‍💻 Paige Bailey
👩‍💻 Paige Bailey@DynamicWebPaige·
👕 perhaps my favorite swag of all time: black sweater with the @arxiv link for Attention is All You Need
👩‍💻 Paige Bailey tweet media
San Francisco, CA 🇺🇸 English
15
6
244
12.3K
eugene
eugene@e_chx4·
I shared this message with the team today.
eugene tweet media
English
0
0
12
1.3K
eugene retweetledi
Ivan Zhang
Ivan Zhang@1vnzh·
rest now brother, we have the watch. we'll see you in Ottawa
English
3
6
156
21.8K
eugene
eugene@e_chx4·
eugene tweet media
ZXX
1
1
18
907
eugene
eugene@e_chx4·
@1vnzh bros cosplaying as a member of technical staff
English
0
0
5
126
eugene
eugene@e_chx4·
@1vnzh not one original tweet unc 👎
eugene tweet media
English
0
0
8
160
eugene retweetledi
Achilles
Achilles@Xhej__·
Achilles tweet media
ZXX
22
1.4K
7.8K
186.2K
eugene
eugene@e_chx4·
who knew coasters could be tuff
eugene tweet media
English
3
3
27
3.8K
eugene retweetledi
Yuchen Jin
Yuchen Jin@Yuchenj_UW·
From my observation of friends around me, those who’ve worked at frontier AI labs experience exponential growth. It’s not just technical. It’s a deeper shift in how they view the world, trends, and themselves. Being immersed in an environment full of other exceptional people led to exponential growth. There’s a clear lesson here for startups: hire the very best, put them together, and you get compounding effects. And for each individual, find the environment that puts you on an exponential curve.
English
36
28
668
50.3K
eugene
eugene@e_chx4·
i need to stop day training
English
0
0
0
428
Nathan Lambert
Nathan Lambert@natolambert·
Open models year in review What a year! We're back with an updated open model builder tier list, our top models of the year, and our predictions for 2026. First, the winning models: 1. DeepSeek R1 (@deepseek_ai): Transformed the AI world 2. Qwen 3 Family (@AlibabaGroup): The new default open models 3. Kimi K2 Family (@Kimi_Moonshot): Models that convinced the world that DeepSeek wasn't special and China would produce numerous leading models. Runner up models: MiniMax M2 (@minimax_ai), GLM 4.5 (@Zai_org), GPT-OSS (@OpenAI), Gemma 3 (@GoogleAI), Olmo 3 (@allen_ai) Honorable Mentions: Nvidia's (@nvidia) Parakeet speech-to-text model & Nemotron 2 LLM, Moondream 3 VLM (@moondreamai), Granite 4 LLMs (@IBMResearch), and HuggingFace's (@huggingface) SmolLM3. Updated Tier list: Frontier open labs: DeepSeek (@deepseek_ai), Qwen (@AlibabaGroup), and Kimi Moonshot (@Kimi_Moonshot) Close behind: Z.ai (@Zai_org) & MiniMax AI (@minimax_ai) (notably none from the U.S. here and up) Noteworthy (a mix of US & China): StepFun AI (@StepFun_ai), Ant Group's (@AntGroup/ @TheInclusionAI Inclusion AI, Meituan (@Meituan_LongCat), Tencent (@TencentHunyuan), IBM (@IBMResearch), Nvidia (@nvidia), Google (@GoogleAI), & Mistral (@MistralAI) Then a bunch more below that, which we detail. Predictions for 2026: 1. Scaling will continue with open models. 2. No substantive changes in the open model safety narrative. 3. Participation will continue to grow. 4. Ongoing general trends will continue w/ MoEs, hybrid attention, dense for fine-tuning. 5. The open and closed frontier gap will stay roughly the same on any public benchmarks. 6. No Llama-branded open model releases from Meta in 2026. Read the full post on @interconnectsai -- link below.
Nathan Lambert tweet media
English
71
261
1.5K
351.3K
seb
seb@internetsebo·
half the time you see someone who claims they worked at Cohere they were just doing data annotation
English
1
0
5
353