Deedy
16.2K posts

Deedy
@deedydas
Partner @MenloVentures. Formerly founding team @glean, @google Search. @cornell CS. Investor @AnthropicAI, @GoodfireAI, @OpenRouter, @WisprFlow, @_inception_ai
San Francisco. Joined August 2011
5.7K Following, 230.2K Followers

@adam_jesion @karpathy Very cool! I'd love to battle them but how much time per move are you allowing it?

After optimizing the parameters with autoresearch (@karpathy), I got training time for the 9.5M-parameter model down to just a few hours on an RTX 4090.
I didn’t spend anything on compute, because as I already mentioned, it’s just my gaming GPU. Optimization level: turbo. Claude Code keeps telling me that what we “came up with together” around “thought tokens” in a CNN deserves a paper :)))
Right now, I have a 48-hour training run going with an added transformer (50M parameters). The goal is to consistently crush Stockfish at 2800 Elo (100%) without using any traditional search methods like MCTS.
Here's the progress I've made over the past week working on this model.


@adam_jesion My model's not good enough yet and uses a different set of techniques! So not ready to play you yet.
Can I ask
- what's your time per ply?
- and how much $$ did you spend training / inference?

@deedydas Take the challenge. Want to compare our models?
I’ve been working on something similar for the past week, but focused on true innovation in the model itself - not design. I came up with an additional layer called "thought tokens" that completely change the model's characteristics.
My model crushes Stockfish 2600 and wins 40% of games against 2800. Right now, I’m training a transformer-based model with 50M parameters.
So, what do you say - shall we do a face-off? :)
ChessArena for vibe-coded models?
The current version you can test has 9.5M parameters, is 100MB in size, and runs at 2ms per move. This is a killer model :)
games.jesion.pl - You can take him on here.

@5rb6jj7wtx @deedydas For sure - but still interesting. The fact that it is written directly means e.g. you could chuck autoresearch at it 👀

@hehehe52318711 @___4o____ Maybe you're smarter than me. I probably would not have been able to roll this in a weekend before.

@deedydas @___4o____ It was always trivial. You probably need to spend ~1 day (recreationally) reading stuff and another one implementing it (the engine itself, not the UI). It's a weekend project

@deedydas The agent actually wrote a novel engine? Or using a library/existing engine?

@thomasahle It's spinning already! Goal is to eliminate heuristic board evaluation and get to grandmaster

@deedydas Now do autoresearch and see how strong you can make it

@___4o____ To be clear, def trivial with coding agents. Wouldn't call it trivial prior

@___4o____ you share the same definition of trivial as many a college professor

@iamdamianb Could definitely build it, but how would you test the Elo? For traditional chess, I used Stockfish's calibration.

@deedydas Chess engines and the algos are totally in its training set.
At MIT, I took 6.172, Performance Engineering, where we had to hyper-optimize Leiserchess.
I would be curious how much Elo it could get on that, because perf = Elo.
ocw.mit.edu/courses/6-172-…

> Anthropic acquires Bun
> "Hey Charlie wouldn't it be cool if they went after you too"
> Mfw

OpenAI Newsroom@OpenAINewsroom
We've reached an agreement to acquire Astral. After we close, OpenAI plans for @astral_sh to join our Codex team, with a continued focus on building great tools and advancing the shared mission of making developers more productive. openai.com/index/openai-t…

@jaysen_158 i dunno if you'd call it a competitor to Nvidia, they're a customer

Looks great! I have not read the Chinmayananda version, but I have the Easwaran translation of the Gita and it seems more approachable. Not sure if it's public domain though.
Chinmayananda: What did the sons of Pandu and also my people do when, desirous to fight, they assembled together on the holy plain of Kurukshetra, O Sanjaya?
Easwaran: O Sanjaya, tell me what happened at Kurukshetra, the field of dharma, where my family and the Pandavas gathered to fight.

@ahdeetya007 It was the year of that specific translation, sorry. Fixed!

@deedydas Might want to fix the YEAR of the GITA. When we make little things, we also have to be sure that they're accurate.







