Jerry Tworek

1.8K posts

Jerry Tworek

@MillionInt

ex-VP of RL @ OpenAI | o3, o1, GPT4, ChatGPT, Codex, Solved Rubik’s cube with robotic hand | cautious AI optimist

San Francisco, CA Beigetreten Ocak 2013

1K Folgt34.6K Follower

Angehefteter Tweet

Jerry Tworek@MillionInt·12 Eyl

We trained a model and it is good in some things

OpenAI@OpenAI

We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond. These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. openai.com/index/introduc…

English

1.4K

1.6M

Jerry Tworek@MillionInt·2h

@chatgpt21 Haven't seen it yet, but I'll go on Sunday. Love the book though

English

331

Chris@chatgpt21·2h

@MillionInt Did you like the movie? Inspiring?

English

297

Jerry Tworek@MillionInt·2h

Every team has its own project Hail Mary. I have one very close to my heart

English

3.1K

Jerry Tworek@MillionInt·5h

Coding is the new chat

English

3.2K

Jerry Tworek@MillionInt·14h

@CarinaLHong This is so awesome

English

823

Carina Hong@CarinaLHong·14h

This looks AI generated but it’s real! Good morning NYC!

Axiom@axiommathai

Thank you to @Nasdaq for this shoutout at the Nasdaq Tower yesterday!

English

5.2K

Jerry Tworek@MillionInt·14h

@austinvhuang

QME

1.6K

Austin Huang@austinvhuang·16h

@MillionInt Isn’t this basically SSI?

English

2.1K

Jerry Tworek@MillionInt·17h

AI labs need a wallfacer project. AI researcher not having to explain themselves to anyone. performing seemingly random actions with hidden inscrutable agenda to create a SOTA model in a way no one would deem possible

English

344

29.6K

Jerry Tworek@MillionInt·1d

If you’re trying to solve software problems with hardware you’re going to have a bad time If they’re trying to solve hardware problems with software, you’re going to have a good time

English

213

17.5K

Jerry Tworek@MillionInt·1d

Lots of inconvenience comes from the fact that we’re just very bad at measuring intelligence And we think we’re not

English

220

11.9K

Jerry Tworek@MillionInt·1d

Imagine future of work AI is not here to "destroy" anything, but surely will change how we work, forever

English

188

9.7K

Jerry Tworek@MillionInt·2d

We have continual learning at home

Rohan Pandey@khoomeik

we already have continual learning it’s just not very good and called context compaction

English

265

20.9K

Jerry Tworek@MillionInt·2d

@FrostForger Interlinked

English

326

Frosty40@FrostForger·2d

@MillionInt cells within cells

English

313

Jerry Tworek@MillionInt·2d

Sparks of Agency Hidden deep in the datacenter beating heart of optimization It’s the universe conspiring to improve itself

English

162

5.8K

Jerry Tworek@MillionInt·3d

Ethan Mollick@emollick

VC investments typically take 5-8 years to exit. That means almost every AI VC investment right now is essentially a bet against the vision Anthropic, OpenAI, and Gemini have laid out.

ZXX

155

40.8K

Jerry Tworek@MillionInt·3d

Drop the "verifiable". Just rewards. It’s cleaner

English

340

20.8K

Jerry Tworek@MillionInt·3d

🫡

Lucas Atkins@latkins

.@MillionInt is aura farming here.

ART

135

12.5K

Jerry Tworek@MillionInt·3d

@papasmurfffs @ypatil125 @lindensli New lore drop

English

164

Papa Smurf@papasmurfffs·3d

@ypatil125 @lindensli had no idea @MillionInt was rocking tats

English

241

Yash Patil@ypatil125·3d

Check out my co-founder @lindensli's talk at GTC! nvidia.com/gtc/session-ca…

English

3.3K

Jerry Tworek@MillionInt·4d

Rethink everything. deep leaning 2.0 is approaching

Kimi.ai@Kimi_Moonshot

Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation. Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dependent attention over preceding layers. 🔹 Enables networks to selectively retrieve past representations, naturally mitigating dilution and hidden-state growth. 🔹 Introduces Block AttnRes, partitioning layers into compressed blocks to make cross-layer attention practical at scale. 🔹 Serves as an efficient drop-in replacement, demonstrating a 1.25x compute advantage with negligible (<2%) inference latency overhead. 🔹 Validated on the Kimi Linear architecture (48B total, 3B activated parameters), delivering consistent downstream performance gains. 🔗Full report: github.com/MoonshotAI/Att…

English

1.4K

187.4K

Jerry Tworek@MillionInt·4d

@aidan_mclau @mattshumer_ My dear friend you've strayed from god

English

1.4K

Aidan McLaughlin@aidan_mclau·4d

@mattshumer_ would it surprise you to know that i use auto/instant for 70% of turns?

English

289

31.1K

Matt Shumer@mattshumer_·4d

Sitting next to a woman on a plane using ChatGPT on Auto mode. I need someone to physically restrain me from telling her to turn on Thinking mode at the very least.

English

165

275.1K

Jerry Tworek@MillionInt·4d

There are markets and exchanges everywhere for those with eyes to see

English

150

9.2K

Jerry Tworek@MillionInt·5d

@akyurekekin That’s probably true because I generally think they’re incredibly noisy

English

3.4K

Ekin Akyürek@akyurekekin·5d

gradients are less noisy than you think

English

10.2K

Entdecken

@chatgpt21 @CarinaLHong @austinvhuang @FrostForger @papasmurfffs @ypatil125 @lindensli @elonmusk