Christian Schroeder de Witt

245 posts

Christian Schroeder de Witt

@casdewitt

EPSRC Open Fellow (incoming) + RAEng RF + Schmidt AI2050 ECF, University of Oxford. Agentic Safety & Security / Multi-Agent Security.

Oxford, England Katılım Temmuz 2017

1K Takip Edilen1.6K Takipçiler

Christian Schroeder de Witt@casdewitt·3d

While we cannot always detect steganography directly, sometimes the effects of sharing information secretly can be observed relative to the subsequent behaviour of the agents - an important decision-theoretic approach to steganography detection in CoT settings pioneered by @usmananwar391 @j_piskorz_

Usman Anwar@usmananwar391

✨New AI Safety work on Steganography and LLM monitoring✨ We propose ‘steganographic gap’: the first principled metric for detecting and quantifying encoded reasoning in LLMs, which can reveal hard-to-detect forms of steganography, e.g., paraphrasing-resistant steganography.

English

1.4K

Christian Schroeder de Witt retweetledi

Xander Davies@alxndrdavies·6 Mar

The Red Team at @AISecurityInst is hiring! We work with frontier AI companies to red team their misuse safeguards, control measures, and alignment techniques. As the stakes rise, we need much stronger red teaming and many more talented researchers working within gov 🧵

English

225

65.4K

Christian Schroeder de Witt@casdewitt·21 Şub

🚀 I am recruiting MSc, undergraduate, and CDT/PhD students to join wittlab.ai at Oxford. Projects span autonomous agents, multi-agent security, interpretability, and evaluation science - ambitious, publication-oriented research at the frontier of AI capability & safety. Details: wittlab.ai/student_projec… 📩 christian.schroeder@eng.ox.ac.uk

English

457

29.5K

Christian Schroeder de Witt retweetledi

Sumeet Motwani@sumeetrm·9 Ara

Some thoughts on the current synthetic environment scaling paradigm

SAIL Media@readsail

Thoughts on long horizon reasoning via @sumeetrm in the SAIL podcast booth at NeurIPS

English

6.5K

Christian Schroeder de Witt retweetledi

Oxford Torr Vision Group@OxfordTVG·6 Kas

🤩🤩Congratulations to @philiptorr & @casdewitt both have been awarded 2025 Schmidt Sciences AI2050 Research Fellowships. Read more here: tinyurl.com/24x3e7rs & here ai2050.schmidtsciences.org

English

1.1K

Christian Schroeder de Witt retweetledi

alex@ObadiaAlex·31 Eki

1. Introduction to ARIA by jenny read 2. Why are we here? by yours truly 3. Security Primitives: New Advances & State of the Art by @iamnotnicola 4. Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents by @casdewitt 5. Embodied AI: What’s happening and how fast are things progressing? by @rowstron 6. Hardness in Silicon by @0xquintus 7. Challenges in Securing Ultra-Large-Scale Cyber Physical Infrastructures by Awais Rashid 8. Verification in Physical Systems Enable Autonomous Engineering by Eder Medina 9. Trust Robots, Everywhere by @engineerEdith 10. Consumable Quantum Data by Dar Gilboa 11. Cryptographic Sensing by Yuval Ishai 12. Mathematical Formalization of Cognition as an Attack Surface by @babagley 13. Cryptographically-Verifiable Sustainability x AI: A Powerful Future Tool for Our Planet? by Jessica Man

English

824

Christian Schroeder de Witt@casdewitt·21 Eki

Huge congrats, Tim @frtimlive - joining David Silver's RL team at DeepMind is epic. Looking back fondly at our ICLR spotlight on Illusory Attacks. Onward! 🚀🥳

Tim Franzmeyer@frtimlive

I recently joined @GoogleDeepMind in London. Excited to be part of David Silver's RL team to work on Gemini, Reinforcement Learning and Agents. It’s been amazing speaking with so many fascinating people in the first weeks and learning from them!

English

1.8K

Christian Schroeder de Witt@casdewitt·10 Eki

Emerging from presenting MALT: Improving reasoning with multi-agent LLM training @COLM2025 to share the next work on reasoning: this time, showing that long-horizon reasoning can be significantky improved by curriculum training on chained tasks. Fantastic efforts led by @sumeetrm Alesia Ivanova @CharlieLondon02

Sumeet Motwani@sumeetrm

🚨How do we improve long-horizon reasoning capabilities by scaling RL with only existing data? Introducing our new paper: "h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning"🫡 > RL on existing datasets saturates very quickly > Reasoning over complex interdependent problems is incredibly important, but we currently lack enough long-horizon reasoning data > Long-horizon problems are hard, which means training signal is sparse. We’d need a way to provide dense supervision Our solution composes existing short-horizon data to form a synthetic curriculum that keeps growing in complexity! This allows us to scale RL on the same dataset while avoiding saturation, with curriculum acting as dense rewards. At a small scale, we see massive in-domain long-horizon improvements, which transfer to significantly harder benchmarks. Training on composed 6th grade math problems leads to strong gains on AIME! 1/N🤿🧵

English

2.1K

Christian Schroeder de Witt retweetledi

Cooperative AI Foundation@coop_ai·12 Tem

Thank you to ❇️Christian Schroeder de Witt @casdewitt (Open challenges in multi-agent security) and ❇️Nora Ammann @AmmannNora (Gradual Disempowerment) for their fantastic talks and office hours at the Cooperative AI Summer School today.

English

984

Christian Schroeder de Witt retweetledi

Karim Abdel Sadek@Karim_abdelll·8 Tem

The paper, "Mitigating goal misgeneralization via minimax regret" will appear at @RL_Conference 2025! Joint work with the great @MatthewFdashR @usmananwar391, @hannaherlebach , @casdewitt , @DavidSKrueger , and @MichaelD1729 . ArXiv: arxiv.org/abs/2507.03068

English

1.6K

Christian Schroeder de Witt@casdewitt·14 May

Huge thanks to @coop_ai (shout out to the amazing @lrhammond), @foresightinst, and Prof. Philip Torr @philiptorr/@OxfordTVG for their funding and support of our growing research programme in Multi-Agent Security. We’re just getting started.

English

294

Christian Schroeder de Witt@casdewitt·14 May

The roots of this work go back to our NeurIPS 2023 workshop on Multi-Agent Security, co-organized with brilliant colleagues @krawiecka_kl @HawraMilani @swapneel_mehta Zoe Cremer & @MasorX. It led to a chapter I lead-authored in @coop_ai’s Multi-Agent Risks from Advanced AI report: 📄 arxiv.org/abs/2502.14143

English

448

Christian Schroeder de Witt@casdewitt·14 May

🔐 TL;DR: AI security must be anticipatory, not reactive. We can't just defend what's already been exploited - we must prepare for what is mathematically possible.

English

1.9K

Christian Schroeder de Witt@casdewitt·18 Nis

@divgarg @agi_inc x.com/the_agi_compan…

AGI, Inc.@agi_inc

🚀 INTRODUCING REAL Bench: Our New Standard for Web AI Agent Evaluation We're thrilled to announce the release of REAL Bench - our groundbreaking benchmark to transform how web AI agents are evaluated! Why we created REAL Bench: ✅ We built functional replicas of popular websites to test what agents can REALLY do ✅ We wanted to measure ACTUAL performance, not academic abstractions ✅ We compared leading frameworks including BrowserUse (31%) and StageHand (19%) What web tasks would YOU like to see AI agents tackle? Join our community to be part of the agentic revolution reshaping AI! ⚡ 👉 Explore REAL Bench → [realevals.xyz] 🛠️ Try REAL Bench and get your REAL score today → [github.com/agi-inc/agisdk]

QME

342

Christian Schroeder de Witt@casdewitt·18 Nis

Very excited to announce new work from @divgarg and the team at @agi_inc on REAL Bench - a benchmark designed to evaluate frontier web agents on realistic tasks!

GIF

English

878

Keşfet

@usmananwar391 @j_piskorz_ @AISecurityInst @philiptorr @iamnotnicola @rowstron @0xquintus @engineerEdith