astroButter

4.3K posts

astroButter banner
astroButter

astroButter

@astroButter

Most zionist pytorch user (maybe)

🇺🇸 de bay Katılım Mart 2024
545 Takip Edilen1.4K Takipçiler
Sabitlenmiş Tweet
astroButter
astroButter@astroButter·
So long as my mom still has to work I cannot stop grinding.
English
5
7
86
22.1K
astroButter
astroButter@astroButter·
Fast takeoff is upon us Also love skypilot
Zhanghao Wu@Michaelvll1

Autoresearch from @karpathy runs 1 experiment at a time. We gave it 16 GPUs and let it run them in parallel. 8 hours. 910 experiments. 9× faster to the same best result. The most surprising part: the agent had access to both H100s and H200s. Without being told, it noticed H200s scored better (more training steps in the same 5-min budget) and started screening ideas on H100s, then promoting winners to H200s for validation. That strategy just emerged on its own. A human researcher can grab a cluster and run experiments in parallel. The agent couldn’t. It was stuck with 1 GPU, greedy hill-climbing, ~10 experiments/hour. We built a @skypilot_org agent skill that teaches coding agents to manage their own GPU clusters. The agent reads the skill, then launches clusters, submits jobs, checks logs, and pipelines experiments on its own. With that, Claude Code provisioned 16 GPUs on Kubernetes, ran factorial grids of 10-13 experiments per wave, and covered in one 5-minute round what sequential search takes six rounds to do. The biggest finding: scaling model width mattered more than every hyperparameter trick combined. The agent tested 6 width configs in a single parallel wave and found the winner immediately. Sequential search might have missed that entirely. Total cost: ~$300 compute + $9 in Claude API.

English
0
0
1
74
Mad ML scientist
Mad ML scientist@HououinTyouma·
I'm stuffing the last 3 month of llm psychosis into parameter golf
English
1
0
8
184
Sean
Sean@seanrobins_·
I met with a VC in SF that didn’t know what the Residency was… Anyone hiring? Asking for a friend
English
1
0
6
571
astroButter
astroButter@astroButter·
I WILL NOT USE CURSOR I REFUSE TO LEAVE MY TERMINAL
English
1
0
6
225
aizk ✡️
aizk ✡️@Aizkmusic·
When Claude one shots the prompt I gave
aizk ✡️ tweet media
English
1
0
25
1K
astroButter
astroButter@astroButter·
Took me until today to realize the purpose of neural link is to make the dummy plug from evangelion
astroButter tweet media
English
0
0
4
89
astroButter
astroButter@astroButter·
@helscom Nah this must be where hamas is training their frontier reasoning LLM
English
0
0
2
28
Helscom
Helscom@helscom·
i bet this is where they make those damn bombs
Helscom tweet media
English
4
0
17
250
astroButter
astroButter@astroButter·
San Francisco lives on Twitter
English
0
0
1
84
astroButter retweetledi
kache
kache@yacineMTB·
The strait of hummus
English
21
9
235
9.3K
Mad ML scientist
Mad ML scientist@HououinTyouma·
what a week I can't believe tomorrow is finally friday
English
2
0
3
89
milkman
milkman@_rabbi·
walk today
milkman tweet mediamilkman tweet media
English
4
0
26
787
astroButter
astroButter@astroButter·
Feeling FOMO that I’m not working at Lockheed Martin or רפאל at perhaps the most patriotic time in my life.
English
0
0
3
120
ludwig
ludwig@ludwigABAP·
@WhiteHouse it is so incredibly cringe and sad that the white house posts shit like this
English
13
8
711
21.3K