Robert

217 posts

Robert

@robertirv1

AI | Startups | Thinking out loud

Palo Alto Katılım Temmuz 2023

1.6K Takip Edilen196 Takipçiler

Sabitlenmiş Tweet

Robert@robertirv1·14 Mar

In a reward model I found you could turn off 90% the activations in the second half of the NN with no loss in performance. Just need to keep the final N position activations and pass through the residual for all positions greater than N.

English

214

Robert retweetledi

Barrett@SledgeDev·6 May

How it felt to be a software developer before ai.

English

289

3.7K

328.6K

Robert@robertirv1·29 Nis

By forcing myself to only launch and review training runs in 9-11 and 5-7 windows. It means I’m forced to automate, notice where I wasting time before. Cuts burnout through constant experiment context switching and reviewing experiments which finished late.

English

Robert@robertirv1·24 Nis

The performance of an LLM as a judge is a function of compute just like many other AI task. Thinking longer makes a better judge. Best of N. Larger models. Judging one example at a time rather than a batch.

English

Robert@robertirv1·23 Nis

Going for a long jog can turn a bad day into a good one.

English

Robert@robertirv1·22 Nis

At a first guess I would say AI has 4x my coding output but only 1.2x my actually output (business impact). Mainly because there are so many other bottlenecks which limit output.

English

Robert retweetledi

lusso@luusssso·21 Nis

When was the last time you saw a corporate space of any kind that blew you away Time to take chances again

English

363

4.2K

407.1K

Robert@robertirv1·7 Nis

@PaulSkallas It’s kind of like grip strength. It’s the first thing to go.

English

4.4K

LindyMan@PaulSkallas·7 Nis

They stopped making good movies. Can't believe it. imagine telling someone this 20 years ago. "they don't make good movies in the future anymore" what?

English

188

155

4.2K

143K

Robert@robertirv1·7 Nis

I viewed my AI work as being a “scientist” which meant an emphasis on understanding. It’s going very poorly this year. I think a better model is “esports”. Emphasis is on clicks per minute, volume of experiments, experiment strategy, psychology, racing to the goal.

English

Robert@robertirv1·25 Mar

A lot of my RL failures have come from thinking “as long as KL is in this range it’s ok.” But this is wrong, the optimal range depends on the context, the base model, the target, the algorithm. It’s just not a good way to measure overfitting.

English

Robert@robertirv1·19 Mar

Always get my best run times in the Stanford quad. Knocks a couple mins off my 5km.

English

Robert retweetledi

Rokko🇯🇵🌐💻 ✈︎@Msamalam·14 Mar

The best part of Japanese high streets is adding a third dimension. In Europe/US all the interesting stuff is on the ground floor. A 2D world, limited. In Japan you add verticality. Random buildings with 12 floors of niche bars, bordels, and karaokes.

M. Nolan Gray 🥑@mnolangray

Western high streets need more signage. Promenading through East Asian commercial districts is much more thrilling.

English

691

67.2K

Robert retweetledi

Zachary Horvitz@zachary_horvitz·9 Mar

Update: it's pretty clear now that *the* thing to do is 1) figure out how to make your research as verifiable as possible 2) spin up some agents

Zachary Horvitz@zachary_horvitz

Highly recommend NOT vibe coding for your research experiments. Model outputs are least verifiable on the frontier of human knowledge

English

849

Robert retweetledi

Andrej Karpathy@karpathy·7 Mar

(I still have the bigger cousin running on prod nanochat, working a bigger model and on 8XH100, which looks like this now. I'll just leave this running for a while...)

English

2.1K

440.1K

Robert@robertirv1·27 Şub

Trying to get more retention on a product been working on for years. Feels like late stage California gold rush. The surface gold is gone, the valley is overcrowded.

English

Robert@robertirv1·23 Şub

An LLM should be watching my work at all times to immediately help with debugging with all the context.

English

Keşfet

@PaulSkallas @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA @nikifrancismediavine