will depue @willdepue
8.7K posts
dei ex machina @openai, past: sora 1 & 2, posttraining o3/4o, applied research
san francisco · Joined May 2018
2.3K Following · 57.4K Followers

will depue retweeted
Alex Zhao @cocohearts
first tranche of @runpod credits should be rolling out
2 replies · 1 repost · 42 likes · 3K views
will depue @willdepue
@shawnbuilds @OpenAI it’s really not that hard! try grabbing the train gpt starter script and have it explain to you how it works + suggest an experiment to run. the MLX script should run on your mac and you can configure it to run in about a minute
0 replies · 0 reposts · 16 likes · 732 views
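The starter script itself isn't reproduced in this thread, so as a stand-in for the "train a tiny language model on your laptop in about a minute" idea, here is a zero-dependency toy: a count-based character bigram model with a held-out loss. Everything in it (`train_bigram`, `avg_nll`, the sample text) is illustrative and not part of OpenAI's actual script.

```python
# Toy illustration only: a count-based character bigram language model,
# NOT the "train gpt" starter script referenced in the tweet. It trains
# in well under a minute on any laptop and shows the basic shape of
# fit-on-train, measure-loss-on-held-out.
import math
from collections import defaultdict

def train_bigram(text, alpha=0.5):
    """Count character bigrams; alpha is add-alpha smoothing."""
    counts = defaultdict(lambda: defaultdict(float))
    for a, b in zip(text, text[1:]):
        counts[a][b] += 1.0
    vocab = sorted(set(text))
    return counts, vocab, alpha

def avg_nll(model, text):
    """Average negative log-likelihood (nats/char) of text under the model."""
    counts, vocab, alpha = model
    total, n = 0.0, 0
    for a, b in zip(text, text[1:]):
        row = counts[a]
        denom = sum(row.values()) + alpha * len(vocab)
        p = (row.get(b, 0.0) + alpha) / denom
        total += -math.log(p)
        n += 1
    return total / n

train_text = "hello world, hello there, hello again" * 50
heldout = "hello world"
model = train_bigram(train_text)
print(f"held-out loss: {avg_nll(model, heldout):.3f} nats/char")
```

Swapping the count table for a small neural net (e.g. in MLX or PyTorch) is the natural next experiment the tweet is pointing at.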
shawn @shawnbuilds
@OpenAI genuine question - how does someone with zero knowledge in this space get started attempting a problem like this? i'd imagine you'd need some serious ai fundamentals
3 replies · 0 reposts · 9 likes · 6.9K views
will depue @willdepue
@industriaalist i’d be surprised if the search space in the limit of L(N) wasn’t equally rich as L(D)? why do you say so?
1 reply · 0 reposts · 12 likes · 2K views
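For context, L(N) and L(D) here refer to the parameter-limited and data-limited terms of a neural scaling law. Assuming both tweets have the standard Chinchilla-style decomposition (Hoffmann et al., 2022) in mind, which is my reading rather than anything stated in the thread:

```latex
% Chinchilla-style loss decomposition:
% N = parameter count, D = training tokens, E = irreducible loss.
L(N, D) = E + \frac{A}{N^{\alpha}} + \frac{B}{D^{\beta}}
% "Parameter golf" attacks the A/N^alpha term (shrink N at fixed loss);
% a data-efficiency "slowrun" attacks the B/D^beta term (shrink D).
```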
Samip @industriaalist
few thoughts on openai's parameter golf:
- first, you'd be surprised how many researchers at big labs (not just openai) are interested in our slowrun
- i'd expect openai to be already automating parameter golf *entirely* with agents. and i'd also expect agents to be better than humans at this already.
- for slowrun, we've deliberately kept it less gamified. the search space over learning algorithms for data efficiency is much larger than for compute/parameter efficiency. so slowrun is less of a competition and more of an open research effort toward interesting, new learning algorithms
9 replies · 14 reposts · 316 likes · 22.7K views
will depue @willdepue
@artificialguybr i would be surprised if it wasn’t at least somewhat of an AI <> human collaboration!
0 replies · 0 reposts · 5 likes · 719 views
will depue @willdepue
@itsandrewgao ok, will see if we can get more. if you run the 1xh100 baseline just make a PR with the log and the submission and i'll add it to the non record submissions for iteration
1 reply · 1 repost · 42 likes · 4.4K views
andrew gao @itsandrewgao
nooo how am i supposed to parameter golf when there are no 8xh100s help @willdepue
13 replies · 2 reposts · 149 likes · 17.2K views
bilal @bilaltwovec
@typedfemale who remembers openai requests for research 1 and 2
2 replies · 0 reposts · 18 likes · 2K views
will depue @willdepue
Remembering Mike Lanning, who passed today, leader of the greatest Boy Scout troop in America: Troop 233. Mike was an incredible person, leader & mentor to many. He maintains the record for most Eagle Scouts from one Scoutmaster: 1000+ with Troop 223. He will be deeply missed.
2 replies · 1 repost · 87 likes · 20.2K views
will depue @willdepue
@Yuchenj_UW thanks for sharing! i expect the best submissions to look pretty different than the nanogpt speedrun models, given parameter constraints
2 replies · 0 reposts · 38 likes · 6.1K views
Yuchen Jin @Yuchenj_UW
OpenAI just dropped a training challenge: Train a <16MB language model in 10 minutes on 8×H100s and minimize held-out loss on a fixed FineWeb dataset. Basically NanoGPT Speedrun. They’re sponsoring $1M in compute. I can summon my autoresearch army to win it… if I have time.
49 replies · 72 reposts · 1.2K likes · 106.1K views
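The <16MB cap implies a parameter budget that depends on checkpoint precision. A quick back-of-envelope, assuming the cap applies to raw weight bytes with no serialization overhead (my reading, not a stated rule of the challenge):

```python
# Back-of-envelope: how many parameters fit in a 16 MB checkpoint,
# assuming the cap is on raw weight bytes (an assumption, not the
# official rule) and ignoring serialization overhead.
BUDGET_BYTES = 16 * 1024 * 1024

def max_params(bytes_per_param):
    return BUDGET_BYTES // bytes_per_param

for fmt, bpp in [("fp32", 4), ("bf16", 2), ("int8", 1)]:
    print(f"{fmt}: ~{max_params(bpp) / 1e6:.1f}M params")
# fp32: ~4.2M params, bf16: ~8.4M params, int8: ~16.8M params
```

So the submissions live in the single-digit-millions-of-parameters regime, far below the nanogpt speedrun models, which is consistent with will's point above about the best submissions looking quite different.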
will depue @willdepue
@test_tm7873 @Yuchenj_UW feel free to train and test on whatever! we just require final leaderboard submissions to be on h100s. you can always share a github repo if you’re not submitting to the leaderboard (see non record submissions), just make sure it still follows the rules of fixed dataset and eval
1 reply · 0 reposts · 1 like · 83 views
testtm @test_tm7873
@Yuchenj_UW man, it's epic, i love small language models. but all the experience i have with 'em is on tpus. 😭 and they want H100s. nnnnnooooooo.
2 replies · 0 reposts · 21 likes · 4.9K views
Andrej Karpathy @karpathy
The signature is alluding to NVIDIA GTC 2015, where Jensen excitedly told an audience of, at the time, mostly gamers and scientific computing professionals that Deep Learning is The Next Big Thing, citing among other examples my PhD thesis (one of the first image captioning systems that coupled an image recognition ConvNet to an autoregressive RNN language model, trained end to end). This was back when most people were still unaware and somewhat skeptical, but of course - Jensen was 1000% correct, highly prescient and locked in very early.
27 replies · 48 reposts · 1.2K likes · 68.4K views
Andrej Karpathy @karpathy
Thank you Jensen and NVIDIA! She’s a real beauty! I was told I’d be getting a secret gift, with a hint that it requires 20 amps. (So I knew it had to be good). She’ll make for a beautiful, spacious home for my Dobby the House Elf claw, among lots of other tinkering, thank you!!

Quoting NVIDIA AI Developer @NVIDIAAIDev:
🙌 Andrej Karpathy’s lab has received the first DGX Station GB300 -- a Dell Pro Max with GB300. 💚 We can't wait to see what you’ll create @karpathy! 🔗 blogs.nvidia.com/blog/gtc-2026-… @DellTech

495 replies · 777 reposts · 17.8K likes · 878.2K views