Luis Cosio

5.2K posts

Luis Cosio banner
Luis Cosio

Luis Cosio

@luiscosio

Hardening frontier AI systems against nation-state adversaries and loss-of-control failure modes.

가입일 Kasım 2009
1.1K 팔로잉3.4K 팔로워
고정된 트윗
Luis Cosio
Luis Cosio@luiscosio·
Proud to share I'll be a co-mentor at @MATSprogram for the Autumn 2026 cohort, working alongside @LisaThiergart on the @SL5TaskForce . MATS is one of the strongest talent pipelines in AI safety. Three months, real research with a mentor, on a problem that ships. Many scholars join their mentor's org afterward or spin out their own. Our stream is building SL5: security against priority nation-state attacks for frontier AI infrastructure. This year we're prototyping a real datacenter with frontier labs. The work needs people who can lead. Who we want: 3+ years security or infrastructure engineering Previously led a project with 2+ people on novel technical ground Comfortable in highly automated workflows (Claude Code, etc.) Strong Python or Rust, or excellent technical communication Bonus: TEMPEST, SCIF construction, or datacenter physical security experience. Apply by June 7. matsprogram.org/apply
English
0
5
32
1.8K
Luis Cosio 리트윗함
Luis Cosio 리트윗함
Claude
Claude@claudeai·
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.
English
4.8K
14.2K
101.8K
50.5M
Luis Cosio 리트윗함
Shann³
Shann³@shannholmberg·
what is agent looping for the last two years we prompted agents one task at a time. that is starting to change instead of asking an agent to build the landing page and then driving every step yourself, you set up a loop that handles discovery, planning, the work, checking, and iterating until the goal is met looping is a setup you build. almost any agent harness can run it, it just depends on how you wire it up at its simplest, looping is one agent working on itself: > researches > drafts > checks the draft against a goal > fixes what is weak > runs that cycle again until the work clears the requirements you are not prompting each step anymore. the agent repeats the cycle for you the bigger version is a fleet looping. you give an orchestrator agent a goal, it breaks the goal into pieces, hands each piece to a specialist agent, and those specialists hand smaller jobs to their own subagents the whole tree keeps looping through discovery, planning, execution, and verification until the goal is met one agent looping is like a person redoing their own draft. a fleet looping is a whole team running a project end-to-end you create a goal, and the system runs the loop until it finishes within the reqs you set open and closed looping: OPEN LOOPING is exploratory. it still has conditions and a goal, but you give the agent or the fleet a wide space to move in. it can try different paths, discover things, build something you did not fully spec out this is the exciting end, it is what Peter and others are doing, and tbh it is where I want to spend more time the catch is cost, an open loop with real room to explore burns an insane amount of tokens. for the 90 percent of people without an unlimited budget it is not runnable yet, and pointed at projects with a loose standard it turns into a slop machine CLOSED LOOPING is bounded. a human designs the end-to-end path first: > clear goal > defined steps > an eval at each step > a point where it stops or hands back to you (and feeds back performance data) the agents still loop, but inside framework you built. it gets better every run because each pass feeds the next, and it runs on a normal budget because the path is tight. for most marketing work, closed is the one that pays off today. > the orchestrator owns the goal > the specialists own the steps > the subagents do the narrow work > an eval gate make sure its not slop
Shann³ tweet media
Peter Steinberger 🦞@steipete

Here’s your monthly reminder that you shouldn’t be prompting coding agents anymore. You should be designing loops that prompt your agents.

English
195
694
6K
729.2K
Luis Cosio 리트윗함
dany
dany@danywander·
a reminder that branding matters.
dany tweet mediadany tweet media
English
359
1.1K
22.4K
4.1M
Luis Cosio 리트윗함
Seb Johnson
Seb Johnson@SebJohnsonUK·
The UK is getting its own Sovereign Frontier AI Model! It is being trained on Isambard-AI and will run with no dependence on foreign infrastructure. The model, Lumen Sovereign, is being built by @CosineAI, one of the companies selected by the UK Government for its £500m Sovereign AI programme. The startup is working with leading big companies (such as Babcock, BT, Lloyds LSEG, NatWest Group, PwC) to help design it. Cosine was founded by @AlistairPullen and @yangli_ and its models have outperformed OpenAI, Anthropic, Mistral, and DeepSeek on independent coding benchmarks for two consecutive years. The model will be trained using compute provided by @UKSovereignAI! Amazing stuff @Jameswise, @SuzanneAshman, @KanishkaNarayan.
English
49
67
477
62.4K
Luis Cosio 리트윗함
MATS Research
MATS Research@MATSprogram·
🚨 Applications for MATS Autumn 2026 close tonight (June 7 AoE)! Spend 10 weeks fully funded working with mentors from Anthropic, DeepMind, OpenAI, Redwood Research, SecureBio, and more. New this cohort: 🧬 Biosecurity 🚀 Founding & Field-Building Apply now: matsprogram.org/apply
MATS Research tweet media
English
7
11
152
56.4K
Luis Cosio 리트윗함
Agencia de Transformación Digital
Hoy durante la instalación del Comité Técnico de la Supercomputadora Coatlicue anunciamos que se construirá en la unidad Zacatenco del @IPN_MX. Con esta infraestructura se dará respuestas a problemas públicos y fortalecerá la soberanía tecnológica 🇲🇽. #BoletínDePrensa 👇 gob.mx/atdt/comunicac…
Agencia de Transformación Digital tweet mediaAgencia de Transformación Digital tweet media
Español
6
40
104
4.6K
Luis Cosio 리트윗함
Tao Burga
Tao Burga@taoburr·
Today: 1. AI CEOs call for mandatory DNA synthesis screening in a letter led by @IFP and @JoinFAI - screendna.org 2. @JanikaSchmitt and @jtmonrad published the grand strategy for actually securing the DNA supply chain -- the main barrier to bioweapons -- with a prioritized list of concrete interventions - ifp.org/how-to-secure-… Not only does the piece specify what to do to solve the problem, but it also includes a funding opportunity focused on it: form.typeform.com/to/lqql7lsP I can't imagine a higher calling than protecting humanity from biological weapons. If you feel similarly, check out their Launch Sequence piece above.
Tao Burga tweet mediaTao Burga tweet media
Janika Schmitt@JanikaSchmitt

It’s great to see AI leaders like Sam Altman, Dario Amodei, and Demis Hassabis calling for mandatory DNA synthesis screening, which is a no-brainer policy for preventing (AI-enabled) bioterrorism. But fewer than 50 people in the world currently work on DNA security full-time. We need a comprehensive plan and at least 5x as many people to secure the DNA supply chain before AI and biotech outpace us. @jtmonrad and I spent the past two years developing a field strategy for how to do it. Successfully defending against this risk (while still capturing innovation benefits) requires four things: 1. Coverage: More than 80% of synthetic DNA providers screen both orders and customers 2. Strategic ambiguity: a bad actor can’t easily tell which providers will screen their order 3. Access: legitimate customers can still order DNA cheaply and easily 4. Effectiveness: 90% of providers reliably catch dangerous sequences when red-teamed We’re already seeing real momentum. Many DNA providers screen voluntarily, and governments in several countries are moving toward mandates. But that doesn’t mean the problem will be solved in time by default. Our guide lays out exactly which projects we need to launch. We’re looking for founders, operators, and technical experts to own pieces of the solution. We’re also hiring a Senior Program Officer at Sentinel to drive this work. Get in touch if you or someone you know would be a strong fit! (links for EOI form and JD below) Read our full field strategy in @IFP's Launch Sequence: ifp.org/how-to-secure-…

English
1
20
98
10.3K
Luis Cosio 리트윗함
Anthropic
Anthropic@AnthropicAI·
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…
English
1.8K
4.7K
28.6K
18.4M
Luis Cosio 리트윗함
Lukasz Olejnik
Lukasz Olejnik@lukOlejnik·
AI-powered computer worm, a self-replicating agent that reasons its way through a network instead of carrying a fixed exploit list. It steals compute from compromised GPU machines to run its own open-weight LLM, then uses weaker machines as relays for reach. In trials on a corporate testbed, it identified vulnerabilities, exploited systems, and launched replicas across Linux, Windows, and IoT targets. Every new infection can add more infrastructure while costing the attacker almost nothing. Patching one flaw no longer ends the threat, because the worm can operationalise fresh advisories, generate new attack logic, and keep adapting without a human operator. It is not a WannaCry-style worm with one baked exploit and one baked ransomware payload. It can adapt across many vulnerability classes it can discover and operationalise arxiv.org/pdf/2606.03811
Lukasz Olejnik tweet mediaLukasz Olejnik tweet media
English
22
84
260
16.9K
Luis Cosio 리트윗함
Seva Ustinov
Seva Ustinov@sevaustinov·
Seva Ustinov tweet media
ZXX
236
1.9K
31.9K
1.2M
Luis Cosio 리트윗함
Anthropic
Anthropic@AnthropicAI·
This Executive Order is an important step in strengthening America’s leadership in AI. We look forward to collaborating with the White House to support its implementation. whitehouse.gov/presidential-a…
English
266
317
2.5K
323.9K
Luis Cosio 리트윗함
Dylan Malyasov | 🧐
Dylan Malyasov | 🧐@DylanMalyasov·
A Chinese company is now selling spray-on coating that makes drones harder for radar to detect, available in buckets and applied with a spray gun. What was once a classified military technology is now a commercial product sold by the kilogram. defence-blog.com/chinese-firm-s…
Dylan Malyasov | 🧐 tweet mediaDylan Malyasov | 🧐 tweet mediaDylan Malyasov | 🧐 tweet media
English
95
442
3.4K
546.2K
Luis Cosio 리트윗함
MATS Research
MATS Research@MATSprogram·
Three paths: → Founders: launch a new initiative (your idea or one of ours) → Field-Builders: own talent development and deployment within AI safety → Amplifiers: join an existing AI safety org to drive impact as it scales
English
1
1
28
2.2K
Luis Cosio 리트윗함
MATS Research
MATS Research@MATSprogram·
AI safety needs to scale fast. MATS has trained 10 cohorts of top researchers, but high-impact orgs keep hitting the same bottleneck: more promising ideas than people to champion them, and funding outpacing deployment. This track exists to close that gap.
English
1
2
34
2.7K
Luis Cosio 리트윗함
MATS Research
MATS Research@MATSprogram·
🚨 New for MATS Autumn 2026: the Founding & Field-Building track. A fully-funded track for founders, field-builders and amplifiers ready to launch and scale new AI safety initiatives. Apply by June 7 AoE ↓
English
2
26
262
48.6K
Luis Cosio 리트윗함
METR
METR@METR_Evals·
Could an AI company lose control of its own agents? To find out, Anthropic, Google, Meta, and OpenAI let us (1) test their best internal models with CoT access, (2) review non-public info about capabilities, alignment, and control. The result: our first Frontier Risk Report.
METR tweet media
English
31
193
918
346.6K