
James Fox
120 posts

James Fox
@James_D_Fox
Senior Science Associate at Schmidt Sciences (AI Institute)







What are the largest software engineering tasks AI can perform? In our new benchmark, MirrorCode, Claude Opus 4.6 reimplemented a 16,000-line bioinformatics toolkit — a task we believe would take a human engineer weeks. Co-developed with @METR_Evals. Details in thread.


In related news… I’m building out a tiger team to pursue this mission with me! 🦸 I’m looking for people who are mission-driven, technically deep, and comfortable moving between formal methods, programming languages, AI, AI safety, and cybersecurity.



The British Government is a complicated beast. Dozens of departments, hundreds of public bodies, more corporations than one can count... Such is its complexity that there isn't an org chart for it. Well, there wasn't... Introducing ⚙️Machinery of Government⚙️

Today we’re launching a £100m call to double down on ARIA's AI Scientist and Activation Partners initiatives We're looking for partners to bring advanced AI capabilities to R&D alongside translational expertise to help us turn speculative ideas into world-changing capabilities🧵













