Avi Singh

432 posts

Avi Singh banner
Avi Singh

Avi Singh

@avisingh599

Gemini RL @GoogleDeepMind. Previously worked on RL for robots. Ask for my strava and goodreads :)

SF Katılım Mayıs 2013
1.4K Takip Edilen2.7K Takipçiler
Sabitlenmiş Tweet
Avi Singh
Avi Singh@avisingh599·
Excited to announce our new work on using synthetic data for improving mathematical problem solving and code generation in LLMs! arxiv: arxiv.org/abs/2312.06585 A small amount of fine-tuning can lead to large gains (>6% on Hendrycks MATH with Palm-2)
Avi Singh tweet media
English
14
67
309
67K
Avi Singh
Avi Singh@avisingh599·
Enjoyed watching this talk by @DrJimFan on how robotics might follow a parallel path to frontier modeling. He’s a funny guy.
Avi Singh tweet media
English
3
5
40
7.4K
Avi Singh retweetledi
Aaron Levie
Aaron Levie@levie·
Gemini 3.5 Flash is out, and it's a major jump over Gemini 3 Flash in model capability for knowledge work. We've been evaluating it on our Box AI Complex Work Eval in early release, and the model delivers a 12 percentage point jump on complex document tasks. For testing this model, we give the Box AI Agent (using Gemini 3.5) complex problems to solve that represent common but difficult knowledge worker tasks in banking, consulting, public sector, healthcare, and other industries. These tasks can be things like drafting reports, doing due diligence, and more, given a set of relevant documents. In our tests, Gemini 3.5 Flash delivered jumps across every industry, including: * Financial services: 81% vs 73% (+8pp) * Public sector: 76% vs 59%, (+17pp) * Healthcare: 73% vs 51%, (+22pp) * Life Sciences: 67% vs 47%, (+20pp) Incredible to see the continued performance gains. Gemini 3.5 Flash will be available soon in Box AI Studio and through the Box API. The Box MCP Server will soon be available in the Gemini app with more details to come.
Aaron Levie tweet media
English
29
23
211
39.7K
Avi Singh retweetledi
Ekin Zorer
Ekin Zorer@ekinomicss·
Misaligned robot exhibiting malicious behavior spotted in #ICLR2026
English
7
31
297
33.3K
Swaroop Mishra
Swaroop Mishra@Swarooprm7·
Personal Update: I am back to @GoogleDeepMind. I will continue working on LLM research and product.
Swaroop Mishra tweet media
English
50
18
1.2K
56.8K
Avi Singh
Avi Singh@avisingh599·
Founders Fund bought about half of DeepMind with a 2.3M investment. Simpler times!
English
0
0
1
128
Avi Singh
Avi Singh@avisingh599·
Mustafa Suleyman origin story is pretty insane. Born to a cab driver and abandoned by parents at age 15, he provided for himself and his younger brother by flipping candy bars, cellphones, and eventually cars. Not having a place to live, he often stayed with the Hassabis family.
English
1
0
1
195
Avi Singh
Avi Singh@avisingh599·
Reading The Infinity Machine from @scmallaby, and it has been hard to put down
English
1
0
7
954
Ker Lee Yap
Ker Lee Yap@klyap_·
I made sheep toast today!
Ker Lee Yap tweet media
English
6
2
63
3K
Avi Singh
Avi Singh@avisingh599·
While it's difficult to increase the base intelligence of a model without pouring in significant resources (mostly in the form of data and compute), persistence is something that's easier to recognize and reward, all in post-training.
English
0
0
2
163
Avi Singh
Avi Singh@avisingh599·
As we start using AI for long-running tasks, model intelligence remains important, but behavioral traits like persistence (or, to use a stronger word, relentlessness) become at least as important, if not more so.
English
1
1
4
560