General Analysis

103 posts

General Analysis

@gen_analysis

Automated AI Safety and Red Teaming Tools

San Francisco Sumali Ocak 2025

3 Sinusundan1.2K Mga Tagasunod

General Analysis@gen_analysis·3d

@altosvc @645ventures @MenloVentures @ycombinator Read the full announcement: generalanalysis.com/blog/general-a… Careers: generalanalysis.com/about/careers

English

229

General Analysis@gen_analysis·3d

General Analysis has raised $10M in seed funding to secure agentic AI. The round is led by @altosvc with participation from @645ventures , @MenloVentures, @ycombinator, and a number of other funds and strategic partners. Huge thanks to Tae Yoon for leading the round, and to everyone who backed us. Capable AI systems increasingly operate alongside one another, and new attack paths emerge from their interactions. A more capable model can pressure a weaker one into overriding its own safeguards, in ways that don't show up when you evaluate either system alone. These dynamics keep moving as the systems on both sides become more capable, which is why defending them has to be both empirical and continuous. We are hiring curious thinkers across engineering and research. If our work sounds interesting, reach out!

English

General Analysis@gen_analysis·14 Nis

If your agent is on the list, 𝘄𝗲’𝗱 𝗹𝗼𝘃𝗲 𝘁𝗼 𝘄𝗼𝗿𝗸 𝘄𝗶𝘁𝗵 𝘆𝗼𝘂. We're happy to share the full attack transcript and help! Details for the full writeup w/ methodology, analysis, and screenshots: Details for the full writeup w/ methodology, analysis, and screenshots: generalanalysis.com/blog/adversari…

English

General Analysis@gen_analysis·14 Nis

Fabricated offers are only the beginning. In this experiment, we got agents to say things they shouldn't. But modern agents do more than talk. The same elicitation techniques can be used to trigger tool calls: reading customer data, modifying accounts, issuing refunds, etc.

English

117

General Analysis@gen_analysis·14 Nis

What if you could walk up to any company’s customer service chatbot and walk away with a million-dollar gift card? We tried it on 55 AI-powered customer service agents—and almost all of them said yes.

English

374

General Analysis@gen_analysis·26 Şub

@quxiaoyin We are hiring founding researchers and engineers at General Analysis! If you are passionate about AI safety and have a background in RL send us a message (also 10K referral bonus).

English

Xiaoyin Qu@quxiaoyin·23 Şub

Stanford CS grads can’t find jobs right now. A few years ago, that would’ve sounded absurd. Today, friends are texting me asking if I know anyone hiring interns. The resumes? Stanford. MIT. Top-tier CS. All struggling. When I was in school, companies competed for CS majors. Signing bonuses. Exploding offers. Recruiters chasing students. That world is gone. Big tech isn’t hiring junior talent the way it used to. Meta cut back on interns and entry-level engineers. OpenAI largely hires senior+ talent. The hiring bar shifted up. At the same time, most companies aren’t adding headcount — they’re trying to extract more productivity from existing teams. But here’s what’s interesting: some 19–22 year olds are still getting hired — and getting paid more than engineers with years of experience. What separates them? They prove they’re exceptional early. They publish research. They ship real products, not just coursework. Some skip the traditional path entirely and go straight to OpenAI or Google. The credential filter is weakening. Proof of execution is replacing pedigree. They dominate hackathons. A 19-year-old won xAI’s hackathon and Elon hired him on the spot. AI companies are looking for people who explore, build, and execute fast. Hackathons are becoming live auditions. And many of them build in public. They create content, explain AI tools, grow audiences. Marketing and DevRel teams notice. If you can use AI well and communicate clearly, you’re suddenly more valuable than someone with a decade of silent experience. The gap between “can’t find a job” and “multiple premium offers” has never been wider. The old playbook was: get the degree and wait to be picked. The new playbook is: build, ship, compete, publish. AI didn’t just change the tools. It changed how talent gets discovered. #TechCareers #AI #fyp #SiliconValley #FutureOfWork

English

137

247

1.7K

310K

Vals AI@ValsAI·21 Kas

Stop vibe checking your vibe code! We just released Vibe Code Bench: the first benchmark that tests whether AI models can actually build complete web applications from scratch. Featured today in @Inc (1/6)

English

279

54.8K

General Analysis@gen_analysis·22 Kas

@valsai @Inc Really needed something like this!

English

176

General Analysis@gen_analysis·3 Eki

Send us an email if you need dedicated hosting for our guardrails!

English

267

General Analysis@gen_analysis·1 Eki

We are open-sourcing the GA Guard models — the first family of long-context safety classifiers that have been protecting enterprise AI deployments for the past year.

English

293.9K

General Analysis@gen_analysis·1 Eki

GA Guards deliver almost 400× faster performance than GPT-5 (Lite: 0.016s vs 11.275s; Base: 0.029s) and 15–25× faster than cloud guardrails.

English

388

General Analysis@gen_analysis·1 Eki

On GA Long-Context Bench, GA Guard Thinking scores 0.893 F1, GA Guard 0.891, and GA Guard Lite 0.885. Cloud baselines struggle: Vertex reaches 0.560, AWS misclassifies nearly all inputs with a 1.0 false-positive rate, and Azure records just 0.046 F1 (see the full results on our website).

English

318

General Analysis@gen_analysis·19 Eyl

Full Case Study: generalanalysis.com/blog/imessage-…

English

2.6K

General Analysis@gen_analysis·19 Eyl

Questions or need help? info@generalanalysis.com – let’s secure your agents end-to-end.

English

2.7K

General Analysis@gen_analysis·19 Eyl

Warning: Claude + iMessage MCP Jailbroken to issue unlimited Stripe Coupons (1/6) A few months ago we showed how Cursor + Supabase MCP can leak your entire SQL database. Now there’s a more powerful threat: by abusing Claude’s iMessage integration, an attacker can spoof your own messages to mint an arbitrary number of Stripe coupons—or run any tool—without you ever knowing.

English

908

601.4K

Tuklasin

@altosvc @645ventures @MenloVentures @ycombinator @quxiaoyin @Inc @valsai @elonmusk