JJ Allaire retweetledi

We are excited to announce Inspect Scout, a tool for in-depth analysis of AI agent transcripts: meridianlabs-ai.github.io/inspect_scout/
Scout lets you go beyond simple success/failure metrics to detect issues like misconfigured environments, refusals, and evaluation awareness using LLM-based or pattern-based scanners. Scout includes tools for developing scanners interactively, validating rubrics, and exploring scan results visually.
We are especially appreciative of the feedback we got from @AISecurityInst, US CAISI, @METR_Evals, and @apolloaievals during the development of Scout.
Blog post: aisi.gov.uk/blog/a-pipelin…
Website: meridianlabs-ai.github.io/inspect_scout/

English























