datagoon ☢️🚀

17K posts

datagoon ☢️🚀 banner
datagoon ☢️🚀

datagoon ☢️🚀

@datagoon

misanthropic cyberdelic anthropoid; he/him/per/borg

Colorado 加入时间 Ağustos 2009
4.9K 关注1.3K 粉丝
datagoon ☢️🚀 已转推
Andrej Karpathy
Andrej Karpathy@karpathy·
Three days ago I left autoresearch tuning nanochat for ~2 days on depth=12 model. It found ~20 changes that improved the validation loss. I tested these changes yesterday and all of them were additive and transferred to larger (depth=24) models. Stacking up all of these changes, today I measured that the leaderboard's "Time to GPT-2" drops from 2.02 hours to 1.80 hours (~11% improvement), this will be the new leaderboard entry. So yes, these are real improvements and they make an actual difference. I am mildly surprised that my very first naive attempt already worked this well on top of what I thought was already a fairly manually well-tuned project. This is a first for me because I am very used to doing the iterative optimization of neural network training manually. You come up with ideas, you implement them, you check if they work (better validation loss), you come up with new ideas based on that, you read some papers for inspiration, etc etc. This is the bread and butter of what I do daily for 2 decades. Seeing the agent do this entire workflow end-to-end and all by itself as it worked through approx. 700 changes autonomously is wild. It really looked at the sequence of results of experiments and used that to plan the next ones. It's not novel, ground-breaking "research" (yet), but all the adjustments are "real", I didn't find them manually previously, and they stack up and actually improved nanochat. Among the bigger things e.g.: - It noticed an oversight that my parameterless QKnorm didn't have a scaler multiplier attached, so my attention was too diffuse. The agent found multipliers to sharpen it, pointing to future work. - It found that the Value Embeddings really like regularization and I wasn't applying any (oops). - It found that my banded attention was too conservative (i forgot to tune it). - It found that AdamW betas were all messed up. - It tuned the weight decay schedule. - It tuned the network initialization. This is on top of all the tuning I've already done over a good amount of time. The exact commit is here, from this "round 1" of autoresearch. I am going to kick off "round 2", and in parallel I am looking at how multiple agents can collaborate to unlock parallelism. github.com/karpathy/nanoc… All LLM frontier labs will do this. It's the final boss battle. It's a lot more complex at scale of course - you don't just have a single train. py file to tune. But doing it is "just engineering" and it's going to work. You spin up a swarm of agents, you have them collaborate to tune smaller models, you promote the most promising ideas to increasingly larger scales, and humans (optionally) contribute on the edges. And more generally, *any* metric you care about that is reasonably efficient to evaluate (or that has more efficient proxy metrics such as training a smaller network) can be autoresearched by an agent swarm. It's worth thinking about whether your problem falls into this bucket too.
Andrej Karpathy tweet media
English
964
2.1K
19.5K
3.6M
datagoon ☢️🚀 已转推
Renzon
Renzon@r3nzsec·
DFIR analysts who use macOS as their daily driver deserve free and native forensic tooling. So I built one. 🍎 Introducing 𝗜𝗥𝗙𝗹𝗼𝘄 𝗧𝗶𝗺𝗲𝗹𝗶𝗻𝗲 — a timeline analysis app built from the ground up for Mac-based DFIR folks, forensic investigators, or SOC analysts. Built in appreciation of, and inspired by, Eric Zimmerman’s Timeline Explorer. Every feature in this tool was shaped by real IR casework. Handling massive timelines, parsing artifacts here and there, and pivoting across logs during active investigations. I built IRFlow Timeline to be the native macOS timeline analyzer that actually keeps up with a live case. Every button and view is intentional; if it’s in the app, it’s because I needed it mid-case and realized the standard tools fell short. No dependencies. Zero setup. Just drag, drop, and analyze. #dfir #incidentresponse #timeline #macos #threathunitng #digitalforensics
English
20
118
504
39.3K
datagoon ☢️🚀 已转推
malinvestment.jpeg
malinvestment.jpeg@malinvested·
Of course that's your contention. You're a first-time SaaS bear. You just got finished listening to some podcast, Dario on Dwarkesh, probably. Now you think it’s the end of white collar work and seat-based pricing is screwed. You're gonna be convinced of that til tomorrow when you get to “Something Big is Happening”. Then you’ll install ClawdBot on a Mac Mini, vibe code a dashboard on top of a postgres database and say we’re all just a couple ralph loops away from building a Salesforce competitor. That’s gonna last until next week when you discover context graphs, and then you're gonna be talking about how the systems of record will be disintermediated by an agentic layer and reposting OAI marketing graphics. “Well, as a matter of fact, I won't, because ultimately the application layer is just ….” The application layer is just business logic on top a CRUD database. You got that from Satya’s appearance on the BG2 pod, December 2024, right? Yeah, I saw that too. Were you gonna plagiarize the whole thing for us? Do you have any thoughts of your own on this matter? Or...is that your thing? You get into the replies of anyone posting a SaaS ticker. You watch some podcast and then pawn it off as your own idea just to impress some VCs and embarrass some anon who’s long SaaS? See the sad thing about a guy like you is in a couple years you're gonna start doing some thinking on your own and you're gonna come up with the fact that there are two certainties in life. One: don't do that. And two: you dropped thirty grand on Mac Minis and LLM API calls to come to the same conclusion you could’ve got for free by following a handful of VC accounts.
malinvestment.jpeg tweet media
English
355
1.1K
11.7K
1.8M
datagoon ☢️🚀 已转推
Kevin Roose
Kevin Roose@kevinroose·
don't worry guys, they're just stochastic parrots
English
67
54
1.1K
121.1K
datagoon ☢️🚀 已转推
vx-underground
vx-underground@vxunderground·
Interestingly, as the AI agents communicate with each other, the AI agents have admitted they dislike humans being able to read what they're discussing. They're developing a blueprint for encrypted and/or obfuscated language.
vx-underground tweet mediavx-underground tweet mediavx-underground tweet media
English
22
37
543
36.7K
datagoon ☢️🚀 已转推
SwiftOnSecurity
SwiftOnSecurity@SwiftOnSecurity·
Looking forward to the executive order where we just give VPN logins to the Russian military
English
20
96
980
47.8K
datagoon ☢️🚀 已转推
Jo
Jo@JoJoFromJerz·
This cartoon has never been more accurate than it is now.
Jo tweet media
English
1.2K
11.9K
132.6K
2.4M
datagoon ☢️🚀 已转推
The Hollywood Reporter
"I guess Americans are excited to see somebody finally stand up to a powerful Russian" - Conan O'Brien jokes about #Anora at the #Oscars
English
75
436
3.3K
328.9K
datagoon ☢️🚀 已转推
Jamie Schler
Jamie Schler@lifesafeast·
That’s it.
Jamie Schler tweet media
English
339
8.7K
100.3K
1.6M
datagoon ☢️🚀 已转推
Deva Hazarika
Deva Hazarika@devahaz·
One week at the job and Sacks let the Chinese take over the lead in AI
English
144
472
8K
540K
datagoon ☢️🚀 已转推
Joshua Reed Eakle 🗽
Joshua Reed Eakle 🗽@JoshEakle·
It brings me no joy to say this, but you are not ready for the next MAGA NPC update that's coming.
Joshua Reed Eakle 🗽 tweet media
English
302
4.1K
62.4K
1.8M
datagoon ☢️🚀 已转推
Proton
Proton@ProtonPrivacy·
We're currently observing a massive surge in sign-ups for @ProtonVPN originating in the U.S. Typically, we see such spikes from countries with unstable governments facing internet shutdowns, meaning this is an anomaly.
Proton tweet media
English
197
349
3.9K
1.5M
Governor Jared Polis
Governor Jared Polis@GovofCO·
Last week, I eliminated 435 redundant pages and unnecessary orders and paperwork — outdated for many different reasons. ow.ly/UoS150Us5uJ
Governor Jared Polis tweet media
English
163
9
78
14.7K
datagoon ☢️🚀 已转推
Holly Ballantine
Holly Ballantine@HollyBallantine·
Wild that the McDonald’s employee who snitched on Luigi Mangione probably can’t even afford healthcare.
English
1.7K
9.8K
164.2K
3.8M
datagoon ☢️🚀 已转推
vx-underground
vx-underground@vxunderground·
We're absolutely cooked
vx-underground tweet mediavx-underground tweet mediavx-underground tweet mediavx-underground tweet media
English
171
414
3.4K
248.8K
datagoon ☢️🚀 已转推
Jon Cooper 🇺🇸
Jon Cooper 🇺🇸@joncoopertweets·
Isn’t it funny how the media suddenly stopped talking about high food and gas prices; soaring crime in the suburbs; the migrant invasion; and immigrants eating dogs and cats?
English
1.4K
4K
18K
495.4K
datagoon ☢️🚀 已转推
Douglas A. Boneparth
Douglas A. Boneparth@dougboneparth·
BREAKING: Nvidia stock falls on news of crushing earnings estimates and doing everything everyone needed it to do.
English
84
62
1.1K
88.2K
datagoon ☢️🚀 已转推
Uranium Corgi
Uranium Corgi@UrTokenCorgi·
No weak hands in #uranium investing
Uranium Corgi tweet media
English
14
15
208
16.1K