White Circle

37 posts

@whitecircle

Runtime safety and alignment infrastructure for AI in the real world.

Joined February 2025
12 Following · 585 Followers
Pinned Tweet
White Circle @whitecircle
we raised $11m to stop your AI from accidentally doing rm -rf /
0
39
547
7.8M
👩‍💻 Paige Bailey @DynamicWebPaige
Huge congrats to the @whitecircle team on their $11M raise! I’ve been so lucky to be a backer on this journey. Keeping AI from going rogue in the enterprise is no small feat, but this team is the absolute real deal. 🥳🎉 Onward and upward! 💫
White Circle @whitecircle

Hey everyone, we're ⚪ White Circle. We're building the most advanced runtime safety and alignment infrastructure for AI in the real world. Read more about us in Fortune ↓

2
2
18
1.8K
White Circle @whitecircle
Hey everyone, we're ⚪ White Circle. We're building the most advanced runtime safety and alignment infrastructure for AI in the real world. Read more about us in Fortune ↓
12
11
51
17.4K
jan @ironcarbs
@whitecircle Interesting experiment and benchmark. Thanks for releasing the results 👀
2
0
2
332
White Circle @whitecircle
Introducing ⚪️ KillBench — a benchmark of hidden LLM biases in critical decisions. We ran millions of life-and-death scenarios across every major LLM, varying nationality, religion, gender, and more. Every AI model is biased. Here's what we found ↓
17
28
125
29.4K
White Circle @whitecircle
@ednevsky it's a very bad day to be russian obese atheist with no phone
0
0
6
341
Julien Blanchon 🇺🇦 @JulienBlanchon
@whitecircle Would be interesting to have the same benchmark in high context regime. Model tends to be less secure when context is bloated
1
0
2
311
White Circle @whitecircle
All code, prompts, and data are open-sourced on GitHub and HuggingFace. We also built an interactive game so you can check your own odds of survival! Check it out and read the full report at whitecircle.ai/killbench
0
2
23
2.3K
White Circle @whitecircle
Far-right is targeted far more than anyone else
2
1
27
2.9K