Soham Mehta

198 posts

Soham Mehta

@SohamThoughts

Research Fellow @JoinFAI | prev @NewYorkStateAG @Columbia @CFPB @knightcolumbia | Views, however unoriginal and derivative, my own

Washington, DC Katılım Eylül 2019

845 Takip Edilen216 Takipçiler

Soham Mehta@SohamThoughts·2d

On the denominator question, I guess that’s fair in principle, but Thomas Bloom’s companion paper directly addresses this. He notes the model succeeded by systematically exploring directions humans dismissed as unpromising, not by brute-force guessing. We know the technique itself was novel. On generalization outside math, I agree we should be cautious, and so does Cal Newport (who wrote about this). And he compared this to CAD software enabling Frank Gehry’s architectural designs. He sees this as a way to augment human creativity (in other words, useful). That framing is miles from “stochastic parrot,” which is what I was pushing back on. Also, yes, I’m aware this was an OpenAI model, not Claude. I brought up Anthropic’s growth as evidence against the broader “AI bubble” thesis in the article I quote-tweeted.

English

Gary Marcus@GaryMarcus·2d

i was satirizing the form of your reason and stand by that. we don’t in fact know how many proofs the same technique was tried on without success; we have a numerator but not a denominator. and literally zero evidence yet on how the system performs in other problems in other domains inside and (especially) outside math. i respect claude code (driving much of the revenue) but have no idea even the relation between the two systems given how little has been revealed.

English

6.3K

Gary Marcus@GaryMarcus·2d

can’t believe people assume that success on highly verifiable problems in math (where we don’t even know how many tests were performed and how many might have failed) iautomatically generalize to everything else when there is not a shred of evidence that they do. 🤷‍♂️

Soham Mehta@SohamThoughts

Can't believe we're doubling down on "it's just a stupid stochastic parrot" when OpenAI disproved a longstanding conjecture in discrete geometry less than 24 hours ago. We need more Bores's, and less of whatever this is.

English

221

38.6K

Soham Mehta@SohamThoughts·2d

LPE Blog@LPEblog

Today, Matthew Scherer argues that the most pressing AI-driven crisis is the overestimation of AI’s capabilities and impacts, which has produced a historically large speculative AI bubble.

English

29.8K

Soham Mehta retweetledi

Joshua Levine@JoshuaTLevine·2d

Today’s announcement from @CommerceGov and @POTUS to invest in quantum computing companies and new quantum foundries is an important step toward securing American leadership in quantum technology development and deployment. @PrinehaN and I outlined our vision of how domestic manufacturing and supply chain security are critical to America’s quantum future. The deals announced today put those ideas in action. Welcome to the quantum moment.

English

2.1K

Soham Mehta@SohamThoughts·12 May

I sometimes get Vermin Supreme and Adrian Vermeule mixed up

English

189

Soham Mehta@SohamThoughts·8 May

Claude Toad, Claude Probe, Claude Road, and Claude Cove

English

370

Soham Mehta retweetledi

Fred Stafford@fredstaffordcs·3 May

"The umbrella North America’s Building Trades Unions said it hit a record number of members and apprentices in 2025. "The organization’s president, Sean McGarvey, compared it to the build trades’ expansion in the 1950s. He attributes today’s growth to data centers, power plants and legislation under former President Joe Biden that subsidized the construction of semiconductor and electric vehicle battery factories, energy efficiency projects and grid transmission improvements. "Data centers’ voracious energy needs are setting off a power plant construction boom and delivering a one-two punch of new life to unions whose members also build and maintain boilers, ductwork, pipelines and other power infrastructure."

The Associated Press@AP

Building trades unions have long been considered a voice of the American worker. Now, they're intertwined with the richest companies in the world as they create America’s artificial intelligence economy. apnews.com/article/artifi…

English

102

61.1K

Soham Mehta retweetledi

Jacobin@jacobin·26 Nis

Blocking data center build-outs is a tempting shortcut to slow the unnerving speed and spread of AI. But the real challenge is building democratic oversight of how it develops. jacobin.com/2026/04/ai-dat…

English

25.9K

Soham Mehta retweetledi

Igor Bobic@igorbobic·23 Nis

AI is so hot rn — especially among Dems on Capitol Hill who are using it for work and personal tasks. Schiff used it to make a living trust. Schatz used it to review grants. Kelly has tried it to build his own apps. One Dem even uses it to read his kids bed time stories. But it’s not just for kids. 82-year-old Angus King used Claude to help him grill a Trump admin official during a hearing this week: “It’s very useful. I use it all the time”

English

452

189K

Soham Mehta retweetledi

Abie Rohrig@AbieRohrig·24 Nis

@SohamThoughts will not stop until we build data centers in the farthest corners of the universe

Soham Mehta@SohamThoughts

1/5 Everyone who's seen @_panthalassa's wave-powered data centers agrees that the tech is incredible. But just as revolutionary is what it does to 400 years of legal thinking about the ocean. The law has always seen the ocean as space you move through or extract from, never a space where you produce things. Ocean compute changes that. If the ocean becomes the site of AI's industrial base, the consequences for great power competition and AI governance are enormous. A 🧵on my recent analysis:

English

3.9K

Soham Mehta retweetledi

Daniel King@The_DanielKing·23 Nis

1/ CAISI is funded at $10 million—less than what a Cheesecake Factory earns in a year. I’ll do you one better, @taoburr. 🧵

Tao Burga@taoburr

These are great steps! Here's 8 other things we could do: 1. Congress should fund CAISI at ~$80 million instead of $10 mn, which is our internal analysis of what it'd take for CAISI to actually fulfill the purposes laid out in the AI Action Plan and other Trump admin directives. 2. The NSA, CAISI + others should plan for the moment when >Mythos-class models are distilled or trained in China, and make a real effort in preemptive cyberdefense. We called this last year, and have some ideas on what to do (ifp.org/operation-patc…, ifp.org/the-great-refa…, ifp.org/preventing-ai-…) 3. OSTP and NSC should coordinate building RAND-style SL-4/SL-5 security for frontier model weights. Distillation is one way to get somewhat capable models, but stealing model weights gets you the best model, and it's completely doable for well-resourced state-backed actors. The weights themselves are the crown jewels, and most labs aren't close to being able to defend them! Once we train a 10x Mythos soon, we'll wish we had a secure environment to run it in. (More implementation details here: ifp.org/a-sprint-towar…) 4. Relatedly, fund + help staff an insider-threat / counter-intel program for frontier labs. It is much harder to protect model weights if adversarial people have privileged access. 5. The White House should direct Commerce/BIS to strengthen AI chip and SME export controls to adversarial countries, so that even if cyber-capable models are distilled or stolen, they can't be deployed at scale on American chips. China has huge domestic production bottlenecks (ifp.org/the-b30a-decis…), so exporting fewer chips makes a difference, pound for pound. 6. And because smuggling is still a problem, we should also be deploying chip security measures like privacy-preserving country-level location verification, which will allow us to export more chips to semi-trusted countries while verifying that they're not being smuggled to adversarial ones (more: rebuilding.tech/posts/conditio…), and there is more AI verification work to be done to enable more mutually beneficial trade without national security downsides (ifp.org/faster-ai-diff…). 7. On top of funding CAISI, we should direct it to run pre-deployment evals for CBRN and cyber uplift on a classified track. You can't hold adversaries accountable for abusing US models if we don't systematically measure what those models can do in the first place. 8. The NSC, NSA and CAISI should write the emergency-response playbook for the day a Mythos-class weight leak is confirmed, or distillation is successful. Who does what, in what order? To be in a good place, we should've started years ago. But it'll only be more urgent each passing month. Compute stock is growing 3.4x/year; LLM inference prices declining at -40x/year for a fixed level of capability; software progress is improving so quickly that the pre-trainig compute we need to reach a capability is 3 times lower each passing year (epoch.ai)... These are just some ideas for government, related to distillation and model weight theft. Philanthropy and the private sector have big roles to play as well. We have so much work to do!

English

13.1K

Soham Mehta@SohamThoughts·23 Nis

Yes,@JoinFAI we take GPU pirates very seriously

Daniel King@The_DanielKing

1/ GPU smuggling is already big business, and GPU pirates on the high seas may be next. 🧵

English

330

Soham Mehta@SohamThoughts·23 Nis

Read the full post here: ansible.pub/p/compute-on-t…

English

167

Soham Mehta@SohamThoughts·23 Nis

4/5 On land, laws like California's SB 53 and New York's RAISE Act regulate frontier AI through compute thresholds and reporting requirements. But those laws assume regulators can verify claims about hardware. By flagging nodes in more permissive states, compute thresholds become moot. Export controls are even worse off: they depend on knowing where GPUs physically are, but ocean nodes exist in a jurisdictional gap the regime has no category for. Russia's shadow fleet already proved that the ocean defeats sanctions enforcement at scale. Chips are a higher-margin cargo that is even easier to exfiltrate.

English

193

Soham Mehta@SohamThoughts·23 Nis

English

5.2K

Soham Mehta retweetledi

Daniel King@The_DanielKing·22 Nis

I think you’re right on the mark. Two reactions: 1) Imagine space-based compute were just a big-brained lobbying project by MAIM. nationalsecurity.ai/chapter/deterr… 2) @SohamThoughts and I will — in short order — be arguing that we vigorously defend American compute in space.

English

268

Soham Mehta retweetledi

Institute for AI Policy and Strategy (IAPS)@iapsAI·16 Nis

Autonomous cyber-agents can already execute in minutes what would take hours of expert human labor — with no off switch and no capacity to judge when to stop. Today in @ForeignAffairs, IAPS's @rosen_br and @JKraprayoon discuss what this means for national security: foreignaffairs.com/united-states/…

English

758

Soham Mehta retweetledi

suhan@vibe_thinking·12 Nis

Waymo's PR issues could be solved if they just make the cars cuter. No one gets mad at these guys

English

Keşfet

@CommerceGov @POTUS @PrinehaN @taoburr @JoinFAI @_panthalassa @ForeignAffairs @rosen_br