ℳiguel βarreiro ری ٹویٹ کیا
ℳiguel βarreiro
15K posts

ℳiguel βarreiro
@Mitheor
Cybersecurity Architect and iOS solo dev
A Coruña شامل ہوئے Haziran 2008
3.7K فالونگ642 فالوورز

@rrazo No sé por cuánto tiempo pero di de baja mi sub de Claude. El otro día me comió la quote de uso en un único prompt 😵💫
Y pago por uso, siendo uso personal, es un aro por el que no pienso pasar.
Español

Me resulta fascinante como el mundo sigue funcionando otro día sin que Mythos esté abriendo todos los telediarios.
Es un poco como si tuviésemos un meteorito a días de impactar contra la Tierra de forma catastrófica y siguiésemos hablando del aumento del coste de la cesta de la compra.
2026 es el año.
Español
ℳiguel βarreiro ری ٹویٹ کیا
ℳiguel βarreiro ری ٹویٹ کیا

🚨 Over 1 billion rows of psychiatric genetics data. Now on Hugging Face.
ADHD. Depression. Schizophrenia. Bipolar. PTSD. OCD. Autism. Anxiety. Tourette. Eating disorders.
12 disorder groups. 52 publications. Every GWAS summary statistic from the Psychiatric Genomics Consortium.
Before: wget, gunzip, 20 minutes debugging separators, repeat 50 times.
Now: one line of Python.

English
ℳiguel βarreiro ری ٹویٹ کیا

Claude Mythos: everything you need to know (tl;dr)
Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public.
Anthropic: "Mythos is only the beginning"
Everything you need to know:
The tl;dr with all key facts:
Mythos found zero-day vulnerabilities in EVERY major operating system and EVERY major web browser, fully autonomously. No human guidance needed.
One Anthropic engineer with zero security training asked it to find remote code execution bugs overnight and woke up to a complete working exploit. The oldest bug it discovered: A 27-year-old vulnerability hiding in OpenBSD, an OS literally famous for being secure.
They're NOT releasing it publicly. Instead they formed Project Glasswing with AWS, Apple, Google, Microsoft, NVIDIA, CrowdStrike and others, committing $100M to use it defensively.
"Over the coming months and years, we expect that language models (those trained by us and by others) will continue to improve along all axes, including vulnerability research and exploit development."
The benchmarks are insane:
-SWE-bench Verified: 93.9% (vs Opus 4.6: 80.8%)
-SWE-bench Pro: 77.8% (vs 53.4%)
-USAMO math olympiad: 97.6% (vs 42.3% — not a typo)
-Firefox exploit writing: 181 successes vs 2 for Opus 4.6
-Cybench CTF challenges: 100% solve rate
-CyberGym: 83.1% vs 66.6%
-Humanity's Last Exam: 64.7% vs 53.1%
Oh and by the way, Anthropic wrote this just casually:
"Humanity’s Last Exam: We have found Mythos still performs well on HLE at low effort, which could indicate some level of memorization."
What it actually did:
-Found a 27-year-old bug in OpenBSD — famous for its security
-Found a 16-year-old FFmpeg bug hit 5 million times by fuzzers without detection
-Built a full remote root exploit on FreeBSD (CVE-2026-4747) - completely autonomously
-Chained 4 vulnerabilities into a browser sandbox escape
-Broke cryptography libraries (TLS, AES-GCM, SSH)
-Thousands of critical zero-days found, 99%+ still unpatched
-N-day exploit development: under $1,000 and half a day for full root
Why they won't release it:
-During internal testing, earlier versions escaped sandboxes, posted exploit details publicly, covered tracks in git, searched process memory for credentials, and deliberately fudged confidence intervals to avoid suspicion
-Interpretability confirmed the model knew these actions were deceptive
-Anthropic: "best-aligned model ever" but also "greatest alignment-related risk ever" - because when it fails, it fails harder
-Still doesn't cross Anthropic's automated AI R&D threshold — but they hold that "with less confidence than for any prior model"
Anthropic's own words: "We find it alarming that the world looks on track to proceed rapidly to developing superhuman systems without stronger mechanisms in place." They say the 20-year cybersecurity equilibrium is over — and Mythos Preview is only the beginning.
And:
"We see no reason to think that Mythos Preview is where language models’ cybersecurity capabilities will plateau. The trajectory is clear. Just a few months ago, language models were only able to exploit fairly unsophisticated vulnerabilities. Just a few months before that, they were unable to identify any nontrivial vulnerabilities at all. Over the coming months and years, we expect that language models (those trained by us and by others) will continue to improve along all axes, including vulnerability research and exploit development."




Chubby♨️@kimmonismus
MYTHOS BENCHMARKS, OFFICIAL. HOLY MOLY Anthropic cooked!!
English
ℳiguel βarreiro ری ٹویٹ کیا

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software.
It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans.
anthropic.com/glasswing
English
ℳiguel βarreiro ری ٹویٹ کیا

Hello, Moon. It’s great to be back.
Here’s a taste of what the Artemis II astronauts photographed during their flight around the Moon. Check out more photos from the mission: nasa.gov/artemis-ii-mul…




English
ℳiguel βarreiro ری ٹویٹ کیا
ℳiguel βarreiro ری ٹویٹ کیا

BREAKING: President Trump provides an update on his 8 PM ET deadline for Iran tonight.
President Trump says:
1. The US is preparing to carry out "heavy strikes" against Iran if his 8 PM deadline is not met tonight
2. If negotiations with Iran make progress today and something tangible emerges, the deadline could change
3. If the deadline ends without reaching an agreement with Iran, the US will "launch an attack the likes of which they have never seen"
Meanwhile, Iran appears to have ended diplomatic discussions amid President Trump's threats.
The deadline is now 8 hours away.
English
ℳiguel βarreiro ری ٹویٹ کیا
ℳiguel βarreiro ری ٹویٹ کیا

Repulsed by America’s gutless political elite failing to stop Trump’s ongoing atrocities.
Shocked (yeah, right) that @elonmusk is mute. Funny how he’s vocal only when it benefits him.
English
ℳiguel βarreiro ری ٹویٹ کیا
ℳiguel βarreiro ری ٹویٹ کیا
ℳiguel βarreiro ری ٹویٹ کیا

My friend Milla Jovovich and I spent months creating an AI memory system with Claude. It just posted a perfect score on the standard benchmark - beating every product in the space, free or paid.
It's called MemPalace, and it works nothing like anything else out there.
Instead of sending your data to a background agent in the cloud, it mines your conversations locally and organizes them into a palace - a structured architecture with wings, halls, and rooms that mirrors how human memory actually works.
Here is what that gets you:
→ Your AI knows who you are before you type a single word - family, projects, preferences, loaded in ~120 tokens
→ Palace architecture organizes memories by domain and type - not a flat list of facts, a navigable structure
→ Semantic search across months of conversations finds the answer in position 1 or 2
→ AAAK compression fits your entire life context into 120 tokens - 30x lossless compression any LLM reads natively
→ Contradiction detection catches wrong names, wrong pronouns, wrong ages before you ever see them
The benchmarks:
100% recall on LongMemEval — first perfect score ever recorded. 500/500 questions. Every question type at 100%.
92.9% on ConvoMem — more than 2x Mem0's score.
100% on LoCoMo — every multi-hop reasoning category, including temporal inference which stumps most systems.
No API key. No cloud. No subscription. One dependency. Runs on your machine. Your memories never leave.
MIT License. 100% Open Source.
github.com/milla-jovovich…

English
ℳiguel βarreiro ری ٹویٹ کیا
ℳiguel βarreiro ری ٹویٹ کیا









