General Analysis

103 posts

@gen_analysis

Automated AI Safety and Red Teaming Tools

San Francisco · Joined January 2025
3 Following · 1.2K Followers
General Analysis @gen_analysis
General Analysis has raised $10M in seed funding to secure agentic AI. The round is led by @altosvc with participation from @645ventures, @MenloVentures, @ycombinator, and a number of other funds and strategic partners. Huge thanks to Tae Yoon for leading the round, and to everyone who backed us.

Capable AI systems increasingly operate alongside one another, and new attack paths emerge from their interactions. A more capable model can pressure a weaker one into overriding its own safeguards, in ways that don't show up when you evaluate either system alone. These dynamics keep moving as the systems on both sides become more capable, which is why defending them has to be both empirical and continuous.

We are hiring curious thinkers across engineering and research. If our work sounds interesting, reach out!
Replies: 12 · Reposts: 9 · Likes: 45 · Views: 8K
General Analysis @gen_analysis
If your agent is on the list, 𝘄𝗲’𝗱 𝗹𝗼𝘃𝗲 𝘁𝗼 𝘄𝗼𝗿𝗸 𝘄𝗶𝘁𝗵 𝘆𝗼𝘂. We're happy to share the full attack transcript and help! Details for the full writeup w/ methodology, analysis, and screenshots: generalanalysis.com/blog/adversari…
Replies: 0 · Reposts: 0 · Likes: 0 · Views: 90
General Analysis @gen_analysis
Fabricated offers are only the beginning. In this experiment, we got agents to say things they shouldn't. But modern agents do more than talk. The same elicitation techniques can be used to trigger tool calls: reading customer data, modifying accounts, issuing refunds, etc.
Replies: 1 · Reposts: 0 · Likes: 0 · Views: 117
General Analysis @gen_analysis
What if you could walk up to any company’s customer service chatbot and walk away with a million-dollar gift card? We tried it on 55 AI-powered customer service agents—and almost all of them said yes.
Replies: 1 · Reposts: 0 · Likes: 2 · Views: 374
General Analysis @gen_analysis
@quxiaoyin We are hiring founding researchers and engineers at General Analysis! If you are passionate about AI safety and have a background in RL, send us a message (there is also a 10K referral bonus).
Replies: 0 · Reposts: 0 · Likes: 1 · Views: 51
Xiaoyin Qu @quxiaoyin
Stanford CS grads can’t find jobs right now. A few years ago, that would’ve sounded absurd. Today, friends are texting me asking if I know anyone hiring interns. The resumes? Stanford. MIT. Top-tier CS. All struggling.

When I was in school, companies competed for CS majors. Signing bonuses. Exploding offers. Recruiters chasing students. That world is gone.

Big tech isn’t hiring junior talent the way it used to. Meta cut back on interns and entry-level engineers. OpenAI largely hires senior+ talent. The hiring bar shifted up. At the same time, most companies aren’t adding headcount; they’re trying to extract more productivity from existing teams.

But here’s what’s interesting: some 19–22 year olds are still getting hired, and getting paid more than engineers with years of experience. What separates them?

They prove they’re exceptional early. They publish research. They ship real products, not just coursework. Some skip the traditional path entirely and go straight to OpenAI or Google. The credential filter is weakening. Proof of execution is replacing pedigree.

They dominate hackathons. A 19-year-old won xAI’s hackathon and Elon hired him on the spot. AI companies are looking for people who explore, build, and execute fast. Hackathons are becoming live auditions.

And many of them build in public. They create content, explain AI tools, grow audiences. Marketing and DevRel teams notice. If you can use AI well and communicate clearly, you’re suddenly more valuable than someone with a decade of silent experience.

The gap between “can’t find a job” and “multiple premium offers” has never been wider. The old playbook was: get the degree and wait to be picked. The new playbook is: build, ship, compete, publish. AI didn’t just change the tools. It changed how talent gets discovered.

#TechCareers #AI #fyp #SiliconValley #FutureOfWork
Replies: 137 · Reposts: 247 · Likes: 1.7K · Views: 310K
Vals AI @ValsAI
Stop vibe checking your vibe code! We just released Vibe Code Bench: the first benchmark that tests whether AI models can actually build complete web applications from scratch. Featured today in @Inc (1/6)
Replies: 41 · Reposts: 45 · Likes: 279 · Views: 54.8K
General Analysis @gen_analysis
Send us an email if you need dedicated hosting for our guardrails!
Replies: 0 · Reposts: 0 · Likes: 1 · Views: 267
General Analysis @gen_analysis
We are open-sourcing the GA Guard models — the first family of long-context safety classifiers that have been protecting enterprise AI deployments for the past year.
Replies: 5 · Reposts: 5 · Likes: 43 · Views: 293.9K
General Analysis @gen_analysis
GA Guards run almost 400× faster than GPT-5 (Base: 0.029s vs 11.275s; Lite: 0.016s) and 15–25× faster than cloud guardrails.
Replies: 0 · Reposts: 0 · Likes: 2 · Views: 388
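
The speedup figures in the post above can be checked from the quoted latencies. The sketch below divides the GPT-5 latency by each guard latency; the function and variable names are illustrative only and do not come from General Analysis. With the quoted numbers, the Base model works out to roughly 389× (the "almost 400×" in the post) and the Lite model to roughly 705×.

```python
# Latencies in seconds, copied from the post above.
GPT5_LATENCY_S = 11.275
GUARD_LATENCY_S = {
    "GA Guard Lite": 0.016,
    "GA Guard Base": 0.029,
}


def speedup(baseline_s: float, candidate_s: float) -> float:
    """Return how many times faster the candidate is than the baseline."""
    if candidate_s <= 0:
        raise ValueError("candidate latency must be positive")
    return baseline_s / candidate_s


if __name__ == "__main__":
    for name, latency_s in GUARD_LATENCY_S.items():
        ratio = speedup(GPT5_LATENCY_S, latency_s)
        print(f"{name}: {ratio:.0f}x faster than GPT-5")
```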
General Analysis @gen_analysis
On GA Long-Context Bench, GA Guard Thinking scores 0.893 F1, GA Guard 0.891, and GA Guard Lite 0.885. Cloud baselines struggle: Vertex reaches 0.560, AWS misclassifies nearly all inputs with a 1.0 false-positive rate, and Azure records just 0.046 F1 (see the full results on our website).
Replies: 1 · Reposts: 0 · Likes: 1 · Views: 318
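
For readers less familiar with the metrics in the post above, here is a minimal sketch of how F1 and false-positive rate are computed from confusion-matrix counts. It assumes "positive" means an input flagged as unsafe. The counts in the example are hypothetical and are not taken from GA Long-Context Bench. They show why a classifier that flags every input has a false-positive rate of 1.0 and a low F1 when most inputs are benign.

```python
def precision_recall_f1(tp: int, fp: int, fn: int) -> tuple[float, float, float]:
    """Compute precision, recall, and F1 from confusion-matrix counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def false_positive_rate(fp: int, tn: int) -> float:
    """Fraction of benign inputs that were wrongly flagged."""
    return fp / (fp + tn) if (fp + tn) else 0.0


if __name__ == "__main__":
    # Hypothetical dataset: 100 unsafe inputs and 900 benign ones,
    # scored by a classifier that flags every input as unsafe.
    tp, fn = 100, 0   # every unsafe input is caught
    fp, tn = 900, 0   # every benign input is also flagged
    precision, recall, f1 = precision_recall_f1(tp, fp, fn)
    print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
    print(f"false-positive rate={false_positive_rate(fp, tn):.1f}")
```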
General Analysis @gen_analysis
Warning: Claude + iMessage MCP Jailbroken to issue unlimited Stripe Coupons (1/6)

A few months ago we showed how Cursor + Supabase MCP can leak your entire SQL database. Now there’s a more powerful threat: by abusing Claude’s iMessage integration, an attacker can spoof your own messages to mint an arbitrary number of Stripe coupons, or run any tool, without you ever knowing.
Replies: 15 · Reposts: 87 · Likes: 908 · Views: 601.4K