Nakul Shenoy

34 posts

Nakul Shenoy

Nakul Shenoy

@nakulshn

Katılım Temmuz 2023
407 Takip Edilen24 Takipçiler
Nakul Shenoy retweetledi
James Pethokoukis ⏩️⤴️
Data centers didn’t raise electric bills nationally from 2015–24. Surprise! Actually, they modestly lowered them. That's because big fixed grid costs get spread over more kilowatt-hours, and new demand can unlock economies of scale. @arxiv
James Pethokoukis ⏩️⤴️ tweet media
English
65
240
973
493.3K
Nakul Shenoy
Nakul Shenoy@nakulshn·
@tenobrus If we have the infinity machine capable of curing all diseases and more… how much longer does money and patents even stay relevant?
English
0
0
3
225
Nakul Shenoy retweetledi
bayes
bayes@bayeslord·
(thread) Intelligence We are in early takeoff. AI improving AI may end up being one of the most consequential steps of history. This isn’t certain because we don’t know how far from the physical and computational limits of intelligence we are, though I would bet it’s quite far from where we are today (e.g. ~5-10 OOMs more intelligence output per unit of scale seems possible).
English
8
19
224
16.6K
Nakul Shenoy retweetledi
Anthropic
Anthropic@AnthropicAI·
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…
English
1.8K
4.6K
28.6K
18.6M
Nakul Shenoy retweetledi
roon
roon@tszzl·
the frontier labs don’t have “comms problems”. reality right now has a comms problem. what is happening is a little scary and there’s no nice words anyone could say, especially not those profiting from it, that’ll make it feel that much better
English
191
183
3.5K
371.4K
Devin Shah
Devin Shah@DevinShah16·
If you give a frontier model the complete ruleset for a strategy game, can it derive a winning strategy from first principles? I wanted to test @claudeai Sonnet 4.6's ability to play the 2009 strategy game Small World. Three identical instances with the same instructions and compute budget played against each other. The games surfaced a reasoning pattern around action bias and locality that I think applies broadly to long-horizon software engineering and knowledge work beyond just strategy games. Full blogpost: dshah.dev/blog/smallworld
English
4
0
12
425
Nakul Shenoy retweetledi
François Fleuret
François Fleuret@francoisfleuret·
The very idea that we are now in the phase of AI self improvement with models working autonomously 24/24 7/7 to optimize models, and that we are about to enter the EGI "Era of General Improvement" ("design better batteries", "design better drugs against X", etc.) is surreal.
English
34
43
524
42.3K
Nakul Shenoy
Nakul Shenoy@nakulshn·
Maybe timelines are fast enough that any public backlash does not end up having material effect?
English
1
0
0
27
Nakul Shenoy
Nakul Shenoy@nakulshn·
@gfodor Possibly just survivorship bias, hard to come up with as many startup ideas if you think capabilities are accelerating vs if they're plateauing. And hedging on the part of the VCs who fund these startups
English
0
0
0
46
gfodor.id
gfodor.id@gfodor·
Every startup being funded around AI has an implicit thesis on the capabilities curve. Very few seem to be taking seriously the prospect that capabilities accelerate and don't level off anytime soon. Anyone building "AI but for Business Thing" assumes we level off this year.
English
14
5
90
5.1K
Nakul Shenoy retweetledi
Dean W. Ball
Dean W. Ball@deanwball·
I walk away from this summit convinced that much of the world, in the U.S. and abroad, is simply delusional with respect to what this technology is, what it can do today, what it will be able to do soon, and what it means their countries should do.
English
6
34
329
24.7K
Nakul Shenoy retweetledi
METR
METR@METR_Evals·
We estimate that Claude Opus 4.6 has a 50%-time-horizon of around 14.5 hours (95% CI of 6 hrs to 98 hrs) on software tasks. While this is the highest point estimate we’ve reported, this measurement is extremely noisy because our current task suite is nearly saturated.
METR tweet media
English
225
448
4K
3.5M
Nakul Shenoy
Nakul Shenoy@nakulshn·
I think this would fit with a general trend in AI of warm starts making impractical problems practical. eg pretraining enabling RL on math since you hit the correct answer at least sometimes
English
1
0
0
103