Kyle Wong (@ewveggies) - Twitter-Profil | Zamantika Mersobahis Locabet

Angehefteter Tweet

Kyle Wong@ewveggies·3 Ara

Excited to share what our lab has been baking: Amazon Nova Act! Trained with large scale RL on diverse web gyms, Nova Act achieves SOTA on multiple public web agent benchmarks. Check it out!🚀 labs.amazon.science/blog/amazon-no…

English

0

1

12

10.6K

Kyle Wong@ewveggies·7h

@27upon2 nah bro we pretraining on it

English

0

1

17

Sriraam@27upon2·8h

@ewveggies that hint is going in the OPD run

English

1

0

1

123

Kyle Wong@ewveggies·11h

Good code, but missing print(“=” * 80) debug statements

Sriraam@27upon2

a masterpiece of python by codex

English

2

0

5

749

Kyle Wong@ewveggies·19h

@_Suresh2 Their collapse doesn’t seem reward model driven. Fig 15 shows instability on aime and lcb, and the report mentions for stem/code they only do RLVR

English

0

53

Suresh@_Suresh2·1d

@ewveggies the on-policy bit usually falls apart once you scale the reward model updates past a few thousand steps

English

1

0

1

110

Kyle Wong@ewveggies·1d

Beautiful tech report, perhaps the best western model report I’ve ever read. Lots of great insights: no synthetic data in midtrain, teacher models are RLed directly on top of midtrain, and adaptive clip higher. But still seems like they didn’t fully nail true on-policy as they admit their RL stage is unstable, leading to a hacky self-distillation stage (imo)

elie@eliebakouch

WOW microsoft new "MAI Thinking 1" model comes with a 109 page tech report that looks REALLY detailed, this is amazing

English

5

8

112

12.3K

Kyle Wong@ewveggies·19h

@TechnologyPat ironically msft dropped 😭

English

0

51

TechPat@TechnologyPat·21h

@ewveggies Best Western is making AI now? Pump the stonk

English

1

0

2

63

Kyle Wong@ewveggies·1d

Idk if Claudes/GPTs use synthetic data since they don't share anything. But if you want to train the BEST model, and you are confident that your model has the 'best pre-training' (most world knowledge) and 'best mid-training' (best domain specific capabilities and behaviors), then it doesn't make sense to distill off-policy data from other models, since: 1. Those models have less world and domain knowledge 2. Lots of SFT on off-policy synthetic data pulls your model into a more narrow distribution

English

1

2

249

Manoj 🪐@SaturnKit·1d

@ewveggies Can someone explain "zero synthetic data or distillation from previous models"... What are the advantages? Are other models like Claude /Sonnet/Gpts use synthetic? 🤔

English

1

0

2

262

Kyle Wong@ewveggies·1d

@truthixifi taking notes indeed

English

1

0

1

241

truthixify@truthixifi·1d

@ewveggies 📝

QME

1

0

1

280

Kyle Wong@ewveggies·2d

Internally we have a model scoring 4 points on ARC-AGI-3 But we won’t release it out of respect for Chet Holmgren’s legendary game 7 performance

ARC Prize@arcprize

Anthropic Opus 4.8 is new SOTA on ARC-AGI-3 Score: 1.5%, ~$10K ARC-AGI-3 analysis notes: * Opus 4.8 read the environment an abstraction *above* Opus 4.7, as objects & systems, not pictures * Opus 4.8 succeeded on early levels, but still committed to a wrong sub-goal

English

0

3

633

Kyle Wong@ewveggies·3d

Gotta love SF poker: Some absolute degen pre-flop 5 bet jams with 34 offsuit, gets called by pocket queens and pocket aces. Flop comes 2 5 6, flopped the absolute nuts and won $500 pot. Sickest hand I’ve ever seen.

Corgi@UseCorgi

So much fun hosting a poker night with our friends @SignalFire, packed with founders and operators from the ecosystem. Reminder that we have a space in the heart of SF for community events like this. The whole point is creating room for people to meet and for serendipity to do its thing.

English

0

4

756

Kyle Wong@ewveggies·4d

@sanjayramesh64 Yooo howd you find my twitter

English

0

44

sanjay@sanjayramesh64·4d

@ewveggies This is incredible

English

1

0

2

52

Kyle Wong@ewveggies·4d

To my grand total of 3 followers who are gonna see this, the 1000 Leetcode streak has been achieved

English

2

0

14

471

Kyle Wong@ewveggies·4d

@lukas_hellesch @UCSB @McDonalds @BurgerKing dropout mentality

English

0

1

170

Lukáš Hellesch@lukas_hellesch·4d

@ewveggies @UCSB @McDonalds @BurgerKing Turned down McDonald's AND Burger King. The grind is real

English

1

0

2

185

Kyle Wong@ewveggies·5d

hey everyone! i’m kyle - new grad 2025 @UCSB - no prev internships - no prev research - no employment ever - bottom 5th percentile in math/coding contests - 2 stars on github class projects - turned down competitve offers @McDonalds and @BurgerKing to hustle on my own - got kicked out of parents basement yesterday; disowned - staying in sf for a few days! looking to raise for my neolab hmu if interested!

Samuel Zhang@samuelxzhang

hey everyone! i'm samuel - 2nd year cs @uwaterloo - prev eng @memories_ai, ai research @uwaterloo - 99th percentile in multiple national math/coding contests - prev national level fencer; 275lbs max bench - 1.2k+ stars on github projects - turned down swe offers @Gemini @openart_ai and @ yc startups this summer to build smth of my own - got flown out for yc s26 interview yesterday; rejected - staying in sf for a few more days; looking to raise from other investors hmu if interested!

English

44

35

1.3K

98.7K

Kyle Wong@ewveggies·4d

@ICE257_ cool portfolio!

English

1

0

2

75

ICE@ICE257_·4d

Hey everyone! I’m King but you can call me ICE (fuck how do we do this) - AI/ML dev. - Heavy Codexoor - I build cool af stuff - idk check my portfolio it’s a bit more cohesive kingsleyaremu.vercel.app

Kyle Wong@ewveggies

hey everyone! i’m kyle - new grad 2025 @UCSB - no prev internships - no prev research - no employment ever - bottom 5th percentile in math/coding contests - 2 stars on github class projects - turned down competitve offers @McDonalds and @BurgerKing to hustle on my own - got kicked out of parents basement yesterday; disowned - staying in sf for a few days! looking to raise for my neolab hmu if interested!

English

1

5

251

Kyle Wong@ewveggies·4d

@DJLougen @UCSB @McDonalds @BurgerKing Blended seed round incoming

English

0

2

204