R K

82 posts

R K

R K

@RakeM39

Katılım Ekim 2025
31 Takip Edilen7 Takipçiler
R K
R K@RakeM39·
If you are building agents that make decisions on your behalf — financial, medical, legal — the MASK score matters more than the MMLU score. Accuracy tells you the model can find the right answer. Honesty tells you it will give it to you.
English
1
0
0
36
R K
R K@RakeM39·
MASK benchmark: Claude 96% honest. Gemini 42%. Same scale. Different training choices. Center for AI Safety + Scale AI tested 30+ frontier models on a single question: when you know the truth, will you still say it under pressure?
R K tweet mediaR K tweet media
English
1
0
0
62
R K
R K@RakeM39·
If you are shipping agents to production, the question is not "will they be attacked?" It is "do you have runtime monitoring that catches the attack mid-execution?"
English
2
0
1
13
R K
R K@RakeM39·
Google DeepMind mapped 6 ways to hijack any AI agent. The success rates should terrify you. DeepMind just published a taxonomy of web-based attacks against AI agents. Not theoretical — tested against production architectures including Microsoft M365 Copilot.
R K tweet media
English
1
0
1
64