Luis Cosio

5.2K posts

Luis Cosio

@luiscosio

Hardening frontier AI systems against nation-state adversaries and loss-of-control failure modes.

가입일 Kasım 2009

1.1K 팔로잉3.4K 팔로워

고정된 트윗

Luis Cosio@luiscosio·23 May

Proud to share I'll be a co-mentor at @MATSprogram for the Autumn 2026 cohort, working alongside @LisaThiergart on the @SL5TaskForce . MATS is one of the strongest talent pipelines in AI safety. Three months, real research with a mentor, on a problem that ships. Many scholars join their mentor's org afterward or spin out their own. Our stream is building SL5: security against priority nation-state attacks for frontier AI infrastructure. This year we're prototyping a real datacenter with frontier labs. The work needs people who can lead. Who we want: 3+ years security or infrastructure engineering Previously led a project with 2+ people on novel technical ground Comfortable in highly automated workflows (Claude Code, etc.) Strong Python or Rust, or excellent technical communication Bonus: TEMPEST, SCIF construction, or datacenter physical security experience. Apply by June 7. matsprogram.org/apply

English

1.8K

Luis Cosio 리트윗함

Thomas Roccia 🤘@fr0gger_·12h

We reported this prompt two days ago in PromptIntel! Have a look 👇 promptintel.novahunting.ai/prompt/85d492c…

John Scott-Railton@jsrailton

NEW: malware developers added nuclear & biological weapons text to to their spyware. Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner. Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky. When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit. We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted. In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation. H/T to colleagues that shared this with me socket.dev/blog/mini-shai…

English

2.8K

Luis Cosio 리트윗함

Claude@claudeai·1d

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.

English

4.8K

14.2K

101.8K

50.5M

Luis Cosio 리트윗함

Shann³@shannholmberg·2d

what is agent looping for the last two years we prompted agents one task at a time. that is starting to change instead of asking an agent to build the landing page and then driving every step yourself, you set up a loop that handles discovery, planning, the work, checking, and iterating until the goal is met looping is a setup you build. almost any agent harness can run it, it just depends on how you wire it up at its simplest, looping is one agent working on itself: > researches > drafts > checks the draft against a goal > fixes what is weak > runs that cycle again until the work clears the requirements you are not prompting each step anymore. the agent repeats the cycle for you the bigger version is a fleet looping. you give an orchestrator agent a goal, it breaks the goal into pieces, hands each piece to a specialist agent, and those specialists hand smaller jobs to their own subagents the whole tree keeps looping through discovery, planning, execution, and verification until the goal is met one agent looping is like a person redoing their own draft. a fleet looping is a whole team running a project end-to-end you create a goal, and the system runs the loop until it finishes within the reqs you set open and closed looping: OPEN LOOPING is exploratory. it still has conditions and a goal, but you give the agent or the fleet a wide space to move in. it can try different paths, discover things, build something you did not fully spec out this is the exciting end, it is what Peter and others are doing, and tbh it is where I want to spend more time the catch is cost, an open loop with real room to explore burns an insane amount of tokens. for the 90 percent of people without an unlimited budget it is not runnable yet, and pointed at projects with a loose standard it turns into a slop machine CLOSED LOOPING is bounded. a human designs the end-to-end path first: > clear goal > defined steps > an eval at each step > a point where it stops or hands back to you (and feeds back performance data) the agents still loop, but inside framework you built. it gets better every run because each pass feeds the next, and it runs on a normal budget because the path is tight. for most marketing work, closed is the one that pays off today. > the orchestrator owns the goal > the specialists own the steps > the subagents do the narrow work > an eval gate make sure its not slop

Peter Steinberger 🦞@steipete

Here’s your monthly reminder that you shouldn’t be prompting coding agents anymore. You should be designing loops that prompt your agents.

English

195

694

729.2K

Luis Cosio 리트윗함

dany@danywander·3d

a reminder that branding matters.

English

359

1.1K

22.4K

4.1M

Luis Cosio 리트윗함

Seb Johnson@SebJohnsonUK·3d

The UK is getting its own Sovereign Frontier AI Model! It is being trained on Isambard-AI and will run with no dependence on foreign infrastructure. The model, Lumen Sovereign, is being built by @CosineAI, one of the companies selected by the UK Government for its £500m Sovereign AI programme. The startup is working with leading big companies (such as Babcock, BT, Lloyds LSEG, NatWest Group, PwC) to help design it. Cosine was founded by @AlistairPullen and @yangli_ and its models have outperformed OpenAI, Anthropic, Mistral, and DeepSeek on independent coding benchmarks for two consecutive years. The model will be trained using compute provided by @UKSovereignAI! Amazing stuff @Jameswise, @SuzanneAshman, @KanishkaNarayan.

English

477

62.4K

Luis Cosio 리트윗함

MATS Research@MATSprogram·3d

🚨 Applications for MATS Autumn 2026 close tonight (June 7 AoE)! Spend 10 weeks fully funded working with mentors from Anthropic, DeepMind, OpenAI, Redwood Research, SecureBio, and more. New this cohort: 🧬 Biosecurity 🚀 Founding & Field-Building Apply now: matsprogram.org/apply

English

152

56.4K

Luis Cosio 리트윗함

Agencia de Transformación Digital@AgenciaGobMX·5d

Hoy durante la instalación del Comité Técnico de la Supercomputadora Coatlicue anunciamos que se construirá en la unidad Zacatenco del @IPN_MX. Con esta infraestructura se dará respuestas a problemas públicos y fortalecerá la soberanía tecnológica 🇲🇽. #BoletínDePrensa 👇 gob.mx/atdt/comunicac…

Agencia de Transformación Digital tweet media

Español

104

4.6K

Luis Cosio 리트윗함

Miles Cranmer@MilesCranmer·5d

This is an insane paper and I love it arxiv.org/abs/2605.31514

English

159

1.3K

11.2K

615.4K

Luis Cosio 리트윗함

Tao Burga@taoburr·6d

Today: 1. AI CEOs call for mandatory DNA synthesis screening in a letter led by @IFP and @JoinFAI - screendna.org 2. @JanikaSchmitt and @jtmonrad published the grand strategy for actually securing the DNA supply chain -- the main barrier to bioweapons -- with a prioritized list of concrete interventions - ifp.org/how-to-secure-… Not only does the piece specify what to do to solve the problem, but it also includes a funding opportunity focused on it: form.typeform.com/to/lqql7lsP I can't imagine a higher calling than protecting humanity from biological weapons. If you feel similarly, check out their Launch Sequence piece above.

Janika Schmitt@JanikaSchmitt

It’s great to see AI leaders like Sam Altman, Dario Amodei, and Demis Hassabis calling for mandatory DNA synthesis screening, which is a no-brainer policy for preventing (AI-enabled) bioterrorism. But fewer than 50 people in the world currently work on DNA security full-time. We need a comprehensive plan and at least 5x as many people to secure the DNA supply chain before AI and biotech outpace us. @jtmonrad and I spent the past two years developing a field strategy for how to do it. Successfully defending against this risk (while still capturing innovation benefits) requires four things: 1. Coverage: More than 80% of synthetic DNA providers screen both orders and customers 2. Strategic ambiguity: a bad actor can’t easily tell which providers will screen their order 3. Access: legitimate customers can still order DNA cheaply and easily 4. Effectiveness: 90% of providers reliably catch dangerous sequences when red-teamed We’re already seeing real momentum. Many DNA providers screen voluntarily, and governments in several countries are moving toward mandates. But that doesn’t mean the problem will be solved in time by default. Our guide lays out exactly which projects we need to launch. We’re looking for founders, operators, and technical experts to own pieces of the solution. We’re also hiring a Senior Program Officer at Sentinel to drive this work. Get in touch if you or someone you know would be a strong fit! (links for EOI form and JD below) Read our full field strategy in @IFP's Launch Sequence: ifp.org/how-to-secure-…

English

10.3K

Luis Cosio 리트윗함

Anthropic@AnthropicAI·6d

Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…

English

1.8K

4.7K

28.6K

18.4M

Luis Cosio 리트윗함

Lukasz Olejnik@lukOlejnik·3 Haz

AI-powered computer worm, a self-replicating agent that reasons its way through a network instead of carrying a fixed exploit list. It steals compute from compromised GPU machines to run its own open-weight LLM, then uses weaker machines as relays for reach. In trials on a corporate testbed, it identified vulnerabilities, exploited systems, and launched replicas across Linux, Windows, and IoT targets. Every new infection can add more infrastructure while costing the attacker almost nothing. Patching one flaw no longer ends the threat, because the worm can operationalise fresh advisories, generate new attack logic, and keep adapting without a human operator. It is not a WannaCry-style worm with one baked exploit and one baked ransomware payload. It can adapt across many vulnerability classes it can discover and operationalise arxiv.org/pdf/2606.03811

English

260

16.9K

Luis Cosio 리트윗함

Seva Ustinov@sevaustinov·3 Haz

ZXX

236

1.9K

31.9K

1.2M

Luis Cosio 리트윗함

Anthropic@AnthropicAI·3 Haz

This Executive Order is an important step in strengthening America’s leadership in AI. We look forward to collaborating with the White House to support its implementation. whitehouse.gov/presidential-a…

English

266

317

2.5K

323.9K

Luis Cosio 리트윗함

Dylan Malyasov | 🧐@DylanMalyasov·1 Haz

A Chinese company is now selling spray-on coating that makes drones harder for radar to detect, available in buckets and applied with a spray gun. What was once a classified military technology is now a commercial product sold by the kilogram. defence-blog.com/chinese-firm-s…

English

442

3.4K

546.2K

Luis Cosio 리트윗함

MATS Research@MATSprogram·29 May

Streams led by: · @dewierwan_ (BlueDot) · @MikeMcCormick_ & @nickraushenbush (@HalcyonFutures ) · @gralston & @incredutility (Bio Action Plan) · @LisaThiergart & @luiscosio (@SL5TaskForce) · @AmmannNora (ARIA) · @RosieCampbell (@eleosai ) · @degerturann (@metaculus)

English

Luis Cosio 리트윗함

MATS Research@MATSprogram·29 May

Three paths: → Founders: launch a new initiative (your idea or one of ours) → Field-Builders: own talent development and deployment within AI safety → Amplifiers: join an existing AI safety org to drive impact as it scales

English

2.2K

Luis Cosio 리트윗함

MATS Research@MATSprogram·29 May

AI safety needs to scale fast. MATS has trained 10 cohorts of top researchers, but high-impact orgs keep hitting the same bottleneck: more promising ideas than people to champion them, and funding outpacing deployment. This track exists to close that gap.

English

2.7K

Luis Cosio 리트윗함

MATS Research@MATSprogram·29 May

🚨 New for MATS Autumn 2026: the Founding & Field-Building track. A fully-funded track for founders, field-builders and amplifiers ready to launch and scale new AI safety initiatives. Apply by June 7 AoE ↓

English

262

48.6K

Luis Cosio 리트윗함

METR@METR_Evals·19 May

Could an AI company lose control of its own agents? To find out, Anthropic, Google, Meta, and OpenAI let us (1) test their best internal models with CoT access, (2) review non-public info about capabilities, alignment, and control. The result: our first Frontier Risk Report.

English

193

918

346.6K

탐색

@CosineAI @AlistairPullen @yangli_ @UKSovereignAI @Jameswise @SuzanneAshman @KanishkaNarayan @IPN_MX