Robert

217 posts

Robert banner
Robert

Robert

@robertirv1

AI | Startups | Thinking out loud

Palo Alto Katılım Temmuz 2023
1.6K Takip Edilen196 Takipçiler
Sabitlenmiş Tweet
Robert
Robert@robertirv1·
In a reward model I found you could turn off 90% the activations in the second half of the NN with no loss in performance. Just need to keep the final N position activations and pass through the residual for all positions greater than N.
Robert tweet media
English
0
0
1
214
Robert retweetledi
Barrett
Barrett@SledgeDev·
How it felt to be a software developer before ai.
English
83
289
3.7K
328.6K
Robert
Robert@robertirv1·
By forcing myself to only launch and review training runs in 9-11 and 5-7 windows. It means I’m forced to automate, notice where I wasting time before. Cuts burnout through constant experiment context switching and reviewing experiments which finished late.
English
0
0
0
26
Robert
Robert@robertirv1·
The performance of an LLM as a judge is a function of compute just like many other AI task. Thinking longer makes a better judge. Best of N. Larger models. Judging one example at a time rather than a batch.
English
0
0
0
34
Robert
Robert@robertirv1·
Going for a long jog can turn a bad day into a good one.
English
0
0
0
32
Robert
Robert@robertirv1·
At a first guess I would say AI has 4x my coding output but only 1.2x my actually output (business impact). Mainly because there are so many other bottlenecks which limit output.
English
0
0
0
31
Robert retweetledi
lusso
lusso@luusssso·
When was the last time you saw a corporate space of any kind that blew you away Time to take chances again
lusso tweet medialusso tweet medialusso tweet medialusso tweet media
English
30
363
4.2K
407.1K
Robert
Robert@robertirv1·
@PaulSkallas It’s kind of like grip strength. It’s the first thing to go.
English
4
0
92
4.4K
LindyMan
LindyMan@PaulSkallas·
They stopped making good movies. Can't believe it. imagine telling someone this 20 years ago. "they don't make good movies in the future anymore" what?
English
188
155
4.2K
143K
Robert
Robert@robertirv1·
I viewed my AI work as being a “scientist” which meant an emphasis on understanding. It’s going very poorly this year. I think a better model is “esports”. Emphasis is on clicks per minute, volume of experiments, experiment strategy, psychology, racing to the goal.
English
0
0
1
95
Robert
Robert@robertirv1·
A lot of my RL failures have come from thinking “as long as KL is in this range it’s ok.” But this is wrong, the optimal range depends on the context, the base model, the target, the algorithm. It’s just not a good way to measure overfitting.
English
0
0
0
98
Robert
Robert@robertirv1·
Always get my best run times in the Stanford quad. Knocks a couple mins off my 5km.
English
0
0
3
87
Robert retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
(I still have the bigger cousin running on prod nanochat, working a bigger model and on 8XH100, which looks like this now. I'll just leave this running for a while...)
Andrej Karpathy tweet media
English
72
61
2.1K
440.1K
Robert
Robert@robertirv1·
Trying to get more retention on a product been working on for years. Feels like late stage California gold rush. The surface gold is gone, the valley is overcrowded.
English
0
0
2
58
Robert
Robert@robertirv1·
An LLM should be watching my work at all times to immediately help with debugging with all the context.
English
0
0
0
63