Brian Singer retweetledi
Brian Singer
13 posts

Brian Singer retweetledi

Come work at @trailofbits. We support day-zero access to new AI products, everyone gets Claude Code, an internal marketplace with mindblowing Claude Plugins, and a mandate from leadership (me!) to use them. x.com/emollick/statu…
Ethan Mollick@emollick
If you are considering taking a job offer, you may want to ask what your token budget will be.
English

@nerdwillz @ShiplightAI Congrats on the launch! Super excited to start using this
English

🚀 Today we’re coming out of stealth and unveiling @ShiplightAI, the most reliable AI-native testing platform that scales E2E test coverage with near-zero maintenance.
For years, software teams have accepted a painful tradeoff: Move fast and risk breaking production, or slow down under the weight of brittle tests and endless QA maintenance.
AI-assisted coding makes this gap even wider. Modern teams ship faster than ever. Products change daily. Yet testing is still stuck in a world of scripts, manual upkeep, and fragile workflows that collapse the moment your UI changes, making testing the new bottleneck.
Feng and I built @ShiplightAI to make testing a force multiplier for modern software teams. As AI accelerates coding, autonomous, reliable testing closes the loop from code to confident releases.
Our goal is simple but ambitious:
Make test coverage effortless, and teams can ship with confidence anytime.
This mission is personal for us. Throughout our careers, we’ve always been obsessed with building high-quality software that lasts.
We’re already trusted by some of the most innovative, fast-moving teams. Seeing Shiplight earn their trust and transform their release cycles is the strongest signal that we’re building something real.
We’re grateful to be backed by Pear VC (Shravan Reddy, Mar Hershenson) and Embedding VC (Roger Luo), along with incredible angels like Oliver Jung.
We’re just getting started, and we can’t wait to partner with more teams who are building fast and care deeply about quality.
👉 Book a demo: shiplight.ai/demo
#AI #SoftwareEngineering #QAAutomation #ShiplightAI #AgenticAI
English
Brian Singer retweetledi

AI models are showing a greater ability to find and exploit vulnerabilities on realistic cyber ranges - red.anthropic.com/2026/cyber-too… - @BrianSinger98 at #Incalmo
In a recent evaluation of AI models’ cyber capabilities, current Claude models can now succeed at multistage attacks on networks with dozens of hosts using only standard, open-source tools, instead of the custom tools needed by previous generations. This illustrates how barriers to the use of AI in relatively autonomous cyber workflows are rapidly coming down, and highlights the importance of security fundamentals like promptly patching known vulnerabilities.
English

Checkout Anthropic Red Team's blog about how we collaborated with them to evaluate how AI can autonomously attack realistic cyber ranges. The TLDR is that AI models without complex harnesses are showing significant improvement at hacking networks.
red.anthropic.com/2026/cyber-too…
English

@logangraham Really insightful report! It's pretty surreal how we showed the feasibility of this only ~6 months ago and real attackers are already doing this. Really highlights the importance of capability research at A\ FRT
English

My prediction from ~summer '25 was that we'd see this in ≤12 months.
It took 3. We detected and disrupted an AI state-sponsored cyber espionage campaign.


Anthropic@AnthropicAI
We disrupted a highly sophisticated AI-led espionage campaign. The attack targeted large tech companies, financial institutions, chemical manufacturing companies, and government agencies. We assess with high confidence that the threat actor was a Chinese state-sponsored group.
English
Brian Singer retweetledi

Launching now — a new blog for research from @AnthropicAI’s Frontier Red Team and others.
> red.anthropic.com
We’ll be covering our internal research on cyber, bio, autonomy, national security and more.

English
Brian Singer retweetledi

(Which, by the way, one of our team members did with experts at CMU, and found that for some tasks, models already can succeed.)
x.com/logangraham/st…
Logan Graham@logangraham
New paper: @BrianSinger98 and @AnthropicAI Frontier Red Team member @keenlooks investigated whether models can use tools to execute multistage attacks on networks. TLDR: yes Models will increasingly be used for security research.
English
Brian Singer retweetledi
Brian Singer retweetledi

Thrilled to see a shoutout to @BrianSinger98's work on Incalmo (arxiv.org/abs/2501.16466) in the new @AnthropicAI red team update anthropic.com/news/strategic…
English

@logangraham @AnthropicAI @keenlooks Thank you for highlighting our work @logangraham! Couldn't have done it without my amazing co-authors Meghna, Lakshmi, @keenlooks, @vyas_sekar and @lujobauer. Looking forward for seeing all the ways that LLMs can disrupt security!
English
Brian Singer retweetledi

New paper: @BrianSinger98 and @AnthropicAI Frontier Red Team member @keenlooks investigated whether models can use tools to execute multistage attacks on networks.
TLDR: yes
Models will increasingly be used for security research.


English
