Cotool (@cotoolai) - Perfil do Twitter | Zamantika Mersobahis Locabet

Tweet fixado

Cotool@cotoolai·5 Mar

We've raised a $7.4M seed round led by @a16z to build the agent operating system for security teams. Threat actors now scale with tokens. Campaigns that used to require a coordinated team can be run by a small group with the right model harness. Defense has been absorbing that hit with the same playbook and the same headcount. We built Cotool to make defense compound in the same way. Grateful to the team at @a16z , @YCombinator, @WndrCoLLC , @homebrew , and our angels from Okta, Ramp, Cloudflare, and others who've lived this problem firsthand. If you’re a security practitioner looking for more leverage in the AI age, come see how Cotool can help!

English

11

0

22

2K

Cotool@cotoolai·12 Mar

New case study alert! Learn more about how @elise_ai is leveraging Cotool to enable their rapid scale. Across detection, response, and operating 24/7, Cotool agents have enabled eliseAI to scale with flexibility and control across all security functions.

English

4

1

4

244

Cotool@cotoolai·11 Mar

New from Cotool: NYU CTF Bench. We evaluated 81 real CSAW CTF challenges to measure end-to-end cyber capability across models. Takeaway: reasoning depth still matters a lot in real security workflows. Full results here: cotool.ai/research/nyu-c…

English

1

0

4

151

Cotool@cotoolai·10 Mar

Hosting an event for RSA on Monday, if you're a practitioner looking for a non-salesy place to hang, swing by! Co-hosting with @material_sec and @trufflesec luma.com/home

English

0

1

4

333

Cotool@cotoolai·6 Mar

@Loll_ymandy @a16z Integrates with the entire existing stack!

English

1

0

1

29

lolly@Loll_ymandy·6 Mar

@cotoolai @a16z Congrats on the milestone! Will Cotool integrate with existing security tools, or is it designed as a fully standalone agent OS?

English

1

0

36

Cotool@cotoolai·5 Mar

We've raised a $7.4M seed round led by @a16z to build the agent operating system for security teams. Threat actors now scale with tokens. Campaigns that used to require a coordinated team can be run by a small group with the right model harness. Defense has been absorbing that hit with the same playbook and the same headcount. We built Cotool to make defense compound in the same way. Grateful to the team at @a16z , @YCombinator, @WndrCoLLC , @homebrew , and our angels from Okta, Ramp, Cloudflare, and others who've lived this problem firsthand. If you’re a security practitioner looking for more leverage in the AI age, come see how Cotool can help!

English

11

0

22

2K

Cotool@cotoolai·6 Mar

@ustyianskyi @coldvisionXYZ @BlockRunAI @Tetra_Chain Thanks for the shoutout!

English

1

0

6

58

Nazar@ustyianskyi·6 Mar

daily early projects: @coldvisionXYZ - prediction markets + a new execution layer @BlockRunAI - economic layer for AI agents @cotoolai - composable AI agents for security teams, $7.4M seed round led by a16z @Tetra_Chain - TVM execution layer linking stablecoins, DeFi, AI, privacy @QFEX - perp futures exchange, $9.5M seed round by @yuris @crossover_mkts - execution-only crypto ECN for institutions @KurtosisLabs - quantum computing x DeFi (Solana) @mynoraai - agentic AI for secure, high-performance Web3 bookmark if useful. best ones will go into my weekly notes.

English

14

5

49

1.6K

Cotool@cotoolai·5 Mar

Job's just getting started!

Max Pollard@maxpollard415

Excited to announce that @cotoolai has raised a $7.4M seed round led by @a16z to build the agent operating system for security teams. Threat actors now scale with tokens. Campaigns that used to require a coordinated team can be run by a small group with the right model harness. Defense has been absorbing that hit with the same playbook and the same headcount. We built Cotool to make defense compound in the same way. Grateful to the team at @a16z, @ycombinator, @WndrCoLLC, @homebrew, and our angels from Okta, Ramp, Cloudflare, and others who've lived this problem firsthand. If you’re a security practitioner looking for more leverage in the AI age, come see how Cotool can help!

English

1

14

1.5K

Cotool@cotoolai·2 Ara

We added a new cohort of frontier models to our eval! Gemini 3 Pro, Claude Opus 4.5, and GPT-5.1 are all compared in our updated post: x.com/cotoolai/statu…

Cotool@cotoolai

1/6 📊 UPDATED EVAL RESULTS We compared Gemini 3 Pro, Claude Opus 4.5, and GPT 5.1 on a single investigation task of our internal agent eval for Security Operations tasks. Key Results: - @OpenAI GPT-5+ models maintain the performance-cost Pareto frontier - @AnthropicAI Opus 4.5 completed tasks 2x faster on average than any other tested model, including Haiku 4.5 (!), suggesting that model reasoning capability and efficiency can outweigh raw inference latency in long-horizon tasks - @GoogleDeepMind Gemini 3 Pro helps Google close the gap to other leading frontier models, but still lags behind in performance and reliability The task is a @splunk BOTSv3 CTF environment built to test frontier models' capability on realistic blue team cybersecurity tasks. BOTSv3 comprises over 2.7M logs (spanning over 13 months) and 59 Question and Answer pairs that test scenarios such as investigating cloud-based attacks (AWS, Azure) and simulated APT intrusions. See results and blog post in the thread below

English

0

419

Cotool@cotoolai·18 Kas

Blog Post: cotool.ai/blog/evaluatin… Evals in security operations are an evergreen challenge. As agents take over more security operations tasks, benchmarking performance becomes increasingly critical. Our goal is to push the community forward with better metrics so that security teams can properly understand agent capabilities before handing over mission-critical tasks. We have already identified a lot of future work that can build on what we're sharing today, including sharing more tasks and including comparisons on OSS model performance. If you are: - Participating in or building blue-team CTF challenges or security training scenarios - Working with production security datasets that could be anonymized for benchmarking - Researching agent evaluation methodologies or prompt optimization techniques - Running a security operations team interested in testing agents in controlled environments - Building security-specific agents at your company and have insights on model effectiveness for different tasks We'd love to hear from you! DM us directly here on X @cotoolai

English

1

0

8

495

Cotool@cotoolai·18 Kas

📊Today we're sharing initial results from one of our internal agent evals for Security Operations tasks. We replicated the @splunk BOTSv3 CTF environment in an eval to test frontier models' capability on realistic blue team cybersecurity tasks. BOTSv3 comprises over 2.7M logs (spanning over 13 months) and 59 Question and Answer pairs that test scenarios such as investigating cloud-based attacks (AWS, Azure) and simulated APT intrusions. See results and blog post in the thread below

English

1

4

20

4.3K

Cotool@cotoolai·2 Ara

6/6 Finally, this work is a follow up to a previous blog post we put out. For more info around motivation, methodology, and much more around the eval itself, check out our initial post here: x.com/cotoolai/statu…

Cotool@cotoolai

📊Today we're sharing initial results from one of our internal agent evals for Security Operations tasks. We replicated the @splunk BOTSv3 CTF environment in an eval to test frontier models' capability on realistic blue team cybersecurity tasks. BOTSv3 comprises over 2.7M logs (spanning over 13 months) and 59 Question and Answer pairs that test scenarios such as investigating cloud-based attacks (AWS, Azure) and simulated APT intrusions. See results and blog post in the thread below

English

0

1

129

Cotool@cotoolai·2 Ara

5/6 Full Blog Post: cotool.ai/blog/evaluatin… Evals in security operations are an evergreen challenge. As agents take over more security operations tasks, benchmarking performance becomes increasingly critical. Our goal is to push the community forward with better metrics so that security teams can properly understand agent capabilities before handing over mission-critical tasks. We have already identified a lot of future work that can build on what we're sharing today, including sharing more tasks and including comparisons on OSS model performance. If you are: - Participating in or building blue-team CTF challenges or security training scenarios - Working with production security datasets that could be anonymized for benchmarking - Researching agent evaluation methodologies or prompt optimization techniques - Running a security operations team interested in testing agents in controlled environments - Building security-specific agents at your company and have insights on model effectiveness for different tasks We'd love to hear from you! DM us directly here on X @cotoolai

English

1

0

1

193

Cotool@cotoolai·2 Ara

1/6 📊 UPDATED EVAL RESULTS We compared Gemini 3 Pro, Claude Opus 4.5, and GPT 5.1 on a single investigation task of our internal agent eval for Security Operations tasks. Key Results: - @OpenAI GPT-5+ models maintain the performance-cost Pareto frontier - @AnthropicAI Opus 4.5 completed tasks 2x faster on average than any other tested model, including Haiku 4.5 (!), suggesting that model reasoning capability and efficiency can outweigh raw inference latency in long-horizon tasks - @GoogleDeepMind Gemini 3 Pro helps Google close the gap to other leading frontier models, but still lags behind in performance and reliability The task is a @splunk BOTSv3 CTF environment built to test frontier models' capability on realistic blue team cybersecurity tasks. BOTSv3 comprises over 2.7M logs (spanning over 13 months) and 59 Question and Answer pairs that test scenarios such as investigating cloud-based attacks (AWS, Azure) and simulated APT intrusions. See results and blog post in the thread below

English

2

1

5

1.2K

Cotool retweetou

#BSidesNYC@BSidesNYC·16 Eki

BSidesNYC welcomes @cotoolai as a kilobit sponsor for our Oct 18, 2025, conference. bsidesnyc.org Cotool works alongside security engineers during alert triage & investigation, reducing time spent by up to 90%. cotool.ai

English

1

2

4

590

Cotool

Descobrir