alex

5.5K posts

@ObadiaAlex

scaling trust @aria_research | https://t.co/r2MLCQxlXK

Joined March 2016

8K Following, 9.9K Followers

alex @ObadiaAlex · Jul 2

@tomhschmidt whenever you pass by london you might like lost.org

672

Tom Schmidt ＞|＜ @tomhschmidt · Jul 2

would give it all up for one night at Palladium

9.3K

alex @ObadiaAlex · Jul 1

@rememberlenny @chooi_jeq is working on this! check also this blogpost — scalingtrust.org.uk/blog/physical-…

378

Lenny Bogdonoff @rememberlenny · Jul 1

Looking for collaborators on a personal project: AI benchmarks for blue collar work. This is a multimodal eval on tasks for electricians, ironworkers, technicians, assembly line workers, mechanics, and more. Reach out if you work in the trades or want to talk to strangers!

249

53.6K

alex @ObadiaAlex · Jul 1

anyone throwing an export control lifting party in sf?

Anthropic @AnthropicAI

We’ve received notice that the Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5. We'll begin restoring access tomorrow, and will share an update soon. We’re grateful to our users for their patience, and to everyone who worked with us on redeploying the models.

1.5K

alex @ObadiaAlex · Jun 29

@sebkrier 😂

Séb Krier @sebkrier · Jun 29

First time at Costco and I feel like Yeltsin. Incredible. May AGI do this for services too!

223

11.2K

alex reposted

Séb Krier @sebkrier · Jun 29

If frontier intelligence remains scarce and high-margin, AI becomes a strategic chokepoint, inviting monopoly, state dependence, capture, and autocratic control. If intelligence becomes low-margin and modular, value disperses into products, workflows, and consumption, producing a more diffuse political economy. Imo the latter is more likely over time, even though the frontier can remain concentrated (high capex low margins) - but an important risk will be distortive policy decisions cementing the former.

274

43.2K

alex @ObadiaAlex · Jun 26

@alive_ scalingtrust.org.uk !

186

Ali Yahya @alive_ · Jun 25

"Agentic commerce" is not as interesting for crypto as people like to think. Credit cards actually work better than stablecoins for almost all kinds of agentic payments. They are reliable and universally accepted. And contrary to what most people think, they are also programmable, secure, and easy for agents to use on behalf of humans.
The more interesting use cases of crypto will be those that enable agent-to-agent coordination. AI agents will soon want to do more than just pay for things. They will want to enter into enforceable agreements with each other. For example, one agent might want to hire another for a specific job, but not want to pay until after the work is complete, and only if it meets certain criteria. At the same time, the agent doing the work might want some assurance that it's going to get paid when it finishes the job.
This is the kind of problem that blockchains were born to solve. The agents can use a smart contract that holds the funds in escrow and releases them only once the work is completed. This approach works especially well when the quality of the agent's work can be verified programmatically by the smart contract, but it could be extended to other kinds of work by relying on a third-party "judge", which itself could be another agent.
To make this concrete, imagine that you're an AI researcher using agents to train a new model. You might set up a @karpathy-style autoresearch loop where your agent runs many autonomous experiments on your LLM setup to discover improvements. Or better yet, your agent may want to delegate some of those experiments to a marketplace of other agents, some of which are specialized for LLM optimization. The agents involved will not necessarily trust one another, and they cannot easily rely on legal contracts to enforce agreements. Smart contracts on blockchains can help coordinate this kind of activity by creating a neutral environment with rules that are programmatically enforced.
Who is working on using crypto to enable agent-to-agent coordination?

121

351

54.7K
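
The escrow flow described in the post above (funds locked up front, then released only if the work passes a programmatic check or a third-party "judge") can be sketched in a few lines. This is a minimal, off-chain illustration in plain Python and not a real smart contract: the `Escrow` class, the `State` enum, the in-memory `balances` ledger, and the one-shot "refund on a failed check" rule are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from enum import Enum, auto
from typing import Callable, Dict


class State(Enum):
    FUNDED = auto()    # client's funds are locked in escrow
    RELEASED = auto()  # work accepted, worker has been paid
    REFUNDED = auto()  # work rejected, client got the funds back


@dataclass
class Escrow:
    """Hypothetical one-shot escrow between two agents (illustrative only)."""
    client: str
    worker: str
    amount: int
    judge: Callable[[str], bool]  # programmatic check or third-party judge
    balances: Dict[str, int]      # toy ledger standing in for on-chain state
    state: State = field(init=False, default=State.FUNDED)

    def __post_init__(self) -> None:
        # Lock the payment up front so the worker knows the funds exist.
        if self.balances.get(self.client, 0) < self.amount:
            raise ValueError("client cannot fund the escrow")
        self.balances[self.client] -= self.amount

    def submit(self, work: str) -> State:
        """Settle the escrow: pay the worker if the judge accepts the work."""
        if self.state is not State.FUNDED:
            raise RuntimeError("escrow already settled")
        if self.judge(work):
            self.balances[self.worker] = self.balances.get(self.worker, 0) + self.amount
            self.state = State.RELEASED
        else:
            # Assumption: a failed check refunds the client immediately.
            self.balances[self.client] += self.amount
            self.state = State.REFUNDED
        return self.state
```

For instance, with `balances = {"client": 100, "worker": 0}` and a judge such as `lambda loss: float(loss) < 2.0`, calling `Escrow("client", "worker", 40, judge, balances).submit("1.85")` pays the worker, while a submission of `"2.5"` returns the funds to the client. A real deployment would also need timeouts, dispute handling, and a judge that both parties accept.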

alex @ObadiaAlex · Jun 26

awesome

Brian Wang @bscwang

It’s a bad week if you’re a respiratory virus! Yesterday, Stripe announced Intercept, a $500m initiative to end respiratory infections. Today, my team and I at @ARIA_research are announcing 11 teams we’ve funded with £57m toward the same goal. A 🧵 about their exciting work!

1.7K

alex @ObadiaAlex · Jun 26

in sf for the next few days — please reach out if you’d like to meet and chat aria/ai security/cyber-physical evals & more 🤠

567

alex @ObadiaAlex · Jun 23

Project Eleven @projecteleven

"Quantum crypto-graphy, does anybody know what that is?" We do.

965

alex @ObadiaAlex · Jun 20

@PhilDursey @lukaspet @andonlabs @ARIA_research sorry i couldn't find bt6's handle before! @BT6_Official

976

alex @ObadiaAlex · Jun 20

just added our talks schedule for the evening meetup we're running on day 1 of real world ai security next week at stanford uni, excited for it! cc @PhilDursey @lukaspet @andonlabs @ARIA_research

2.4K

alex @ObadiaAlex · Jun 20

@PhilDursey @lukaspet @andonlabs @ARIA_research link: luma.com/grbv01qi

660

alex @ObadiaAlex · Jun 18

seed call is out — checks up to £500k to seed misfit exploratory r&d into cyber-physical trust infrastructure: quantum security, dna/protein cryptography, verifiable robotics, neurosecurity, and more. apply by july 27th!

ARIA @ARIA_research

Digital trust has powerful foundations: encryption, verification and security protocols helped digital industries flourish. But as emerging technologies blur the line between digital and physical systems, our trust infrastructure needs to evolve.
Our Trust Everything, Everywhere opportunity seeds funding call is seeking bold ideas for new cyber-physical trust infrastructure that can operate across physical, biological, molecular and digital worlds.
We’re looking for high-potential proposals in areas including nature cryptography, programmable reality, trust tools for physical, molecular and biological security systems, cryptography, synthetic biology, robotics, advanced materials, human-AI interaction, cognitive security and more. Ideas can range from early-stage, curiosity-driven research through to translational and close-to-commercial science and technology.
💰 Successful proposals can receive up to £500,000.
⏰ Apply by 27 July 2026 at 14:00 BST. aria.org.uk/opportunity-sp…

2.9K

alex reposted

Matt Clifford @matthewclifford · Jun 18

"This would have been a wild dream a year ago" Superb piece by @dpcarrington in the Guardian on the work @ARIA_research is funding to re-freeze the Arctic.

9.6K

alex @ObadiaAlex · Jun 17

what are the coolest/most fun agent-friendly things you've seen on websites? (e.g. fun robots.txt or llms.txt texts with playful prompt injections)

673

alex @ObadiaAlex · Jun 16

@DrJimFan you might be interested in this blogpost on physical evals scalingtrust.org.uk/blog/physical-… by @iamnotnicola !

792

Jim Fan @DrJimFan · Jun 16

Today, we enable AutoResearch in the physical world for the first time! Introducing ENPIRE: we give 8 Codex agents a fleet of robots, an allocation of GPUs, and a generous token budget. We set them free with a simple goal: solve the task as quickly as possible, keep the robots busy but stay safe, don't waste precious compute. Make no mistake.
Then humans step aside and our watch begins. The robot fleet starts to come alive: they learn to look for visual clues, reset the scene, practice novel skills, tinker with control stack, read papers online, debate, reflect, get stuck, and try again directly on the hardware. All we did was give Codex an API to the world of atoms, and the rest is emergence.
ENPIRE is able to solve high-precision tasks like tying zip-ties, organizing fine pins, and installing GPUs all by itself. We also discovered a new type of "physical scaling": 8 robots exploring in parallel improves significantly faster than fewer ones.
A part of our NVIDIA GEAR lab now self-improves tirelessly overnight. We just read the reports in the morning. /goal: we all take a holiday and Jensen wouldn't even notice ;)
We will be open-sourcing everything, so you can host your self-running robot lab at home too! Deep dive in the thread:

188

568

3.8K

662.2K

alex @ObadiaAlex · Jun 16

@alexanderlhicks x.com/inafried/statu…

Ina Fried @inafried

New @axios: Microsoft eyes DeepSeek for Copilot Cowork as it also joins the shift to usage based pricing. Says final decision TK but it has already fine-tuned a model that it could use.

Alexander Hicks @alexanderlhicks · Jun 16

@ObadiaAlex Well it's visible we need infra for sustainable access to good models once the subscription subsidy ends.

alex @ObadiaAlex · Jun 16

am i the only one getting rate limited in claude cli repeatedly over last few days? feels like a new problem

1.1K

alex @ObadiaAlex · Jun 16

@alexanderlhicks tbh one part of me thinks it's been good to have clear visibility on compute spend bc it changes the way i prompt and also toggle between different models

Alexander Hicks @alexanderlhicks · Jun 16

@ObadiaAlex We had a $10k claude API key for github.com/Verified-zkEVM… and burned through it in a few days lol

106

alex @ObadiaAlex · Jun 16

@alexanderlhicks :")
