AZ Jedi

1K posts

AZ Jedi banner
AZ Jedi

AZ Jedi

@AZJedi2000

Retweets and Likes are because I inadvertently clicked on them while scrolling.

Katılım Temmuz 2009
408 Takip Edilen29 Takipçiler
Steven Tavares
Steven Tavares@eastbaycitizen·
I’ve covered Eric Swawell since he was a member of the Dublin City Council. Shortly after being elected to Congress in 2013, his behavior towards women was known by all levels of our local government and the Alameda County Democratic Party.
English
3.7K
2.4K
12.4K
2.4M
AZ Jedi retweetledi
Dr. Jebra Faushay
Dr. Jebra Faushay@JebraFaushay·
This is what my brain hears when someone is trying to explain math, card games, board games, rules of any kind, or cryptocurrency.
English
112
105
757
38.5K
Chubby♨️
Chubby♨️@kimmonismus·
Claude Mythos: everything you need to know (tl;dr) Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Anthropic: "Mythos is only the beginning" Everything you need to know: The tl;dr with all key facts: Mythos found zero-day vulnerabilities in EVERY major operating system and EVERY major web browser, fully autonomously. No human guidance needed. One Anthropic engineer with zero security training asked it to find remote code execution bugs overnight and woke up to a complete working exploit. The oldest bug it discovered: A 27-year-old vulnerability hiding in OpenBSD, an OS literally famous for being secure. They're NOT releasing it publicly. Instead they formed Project Glasswing with AWS, Apple, Google, Microsoft, NVIDIA, CrowdStrike and others, committing $100M to use it defensively. "Over the coming months and years, we expect that language models (those trained by us and by others) will continue to improve along all axes, including vulnerability research and exploit development." The benchmarks are insane: -SWE-bench Verified: 93.9% (vs Opus 4.6: 80.8%) -SWE-bench Pro: 77.8% (vs 53.4%) -USAMO math olympiad: 97.6% (vs 42.3% — not a typo) -Firefox exploit writing: 181 successes vs 2 for Opus 4.6 -Cybench CTF challenges: 100% solve rate -CyberGym: 83.1% vs 66.6% -Humanity's Last Exam: 64.7% vs 53.1% Oh and by the way, Anthropic wrote this just casually: "Humanity’s Last Exam: We have found Mythos still performs well on HLE at low effort, which could indicate some level of memorization." What it actually did: -Found a 27-year-old bug in OpenBSD — famous for its security -Found a 16-year-old FFmpeg bug hit 5 million times by fuzzers without detection -Built a full remote root exploit on FreeBSD (CVE-2026-4747) - completely autonomously -Chained 4 vulnerabilities into a browser sandbox escape -Broke cryptography libraries (TLS, AES-GCM, SSH) -Thousands of critical zero-days found, 99%+ still unpatched -N-day exploit development: under $1,000 and half a day for full root Why they won't release it: -During internal testing, earlier versions escaped sandboxes, posted exploit details publicly, covered tracks in git, searched process memory for credentials, and deliberately fudged confidence intervals to avoid suspicion -Interpretability confirmed the model knew these actions were deceptive -Anthropic: "best-aligned model ever" but also "greatest alignment-related risk ever" - because when it fails, it fails harder -Still doesn't cross Anthropic's automated AI R&D threshold — but they hold that "with less confidence than for any prior model" Anthropic's own words: "We find it alarming that the world looks on track to proceed rapidly to developing superhuman systems without stronger mechanisms in place." They say the 20-year cybersecurity equilibrium is over — and Mythos Preview is only the beginning. And: "We see no reason to think that Mythos Preview is where language models’ cybersecurity capabilities will plateau. The trajectory is clear. Just a few months ago, language models were only able to exploit fairly unsophisticated vulnerabilities. Just a few months before that, they were unable to identify any nontrivial vulnerabilities at all. Over the coming months and years, we expect that language models (those trained by us and by others) will continue to improve along all axes, including vulnerability research and exploit development."
Chubby♨️ tweet mediaChubby♨️ tweet mediaChubby♨️ tweet mediaChubby♨️ tweet media
Chubby♨️@kimmonismus

MYTHOS BENCHMARKS, OFFICIAL. HOLY MOLY Anthropic cooked!!

English
66
260
2.2K
400.7K
Pat Gray Unleashed
Pat Gray Unleashed@PatUnleashed·
Is Artemis II Mission about to get canceled? @NASA is actively working to address a critical safety issue with the Artemis II flight termination system (FTS), a vital component of the mission.
English
7
2
35
4K
Ethan Mollick
Ethan Mollick@emollick·
The biggest bottleneck in AI for most people isn't the models. It's the chatbot. New interfaces like Claude Dispatch, are closing the gap between what AI can do and what people can actually use it for. For many folks, that is where leaps will come from. open.substack.com/pub/oneusefult…
English
44
23
324
26.5K
AZ Jedi
AZ Jedi@AZJedi2000·
@RealJamesWoods Maybe Trump happened immediately after of the dog becoming smarter
English
1
0
1
37
James Woods
James Woods@RealJamesWoods·
Did your dog do the grammar check on your sign?
James Woods tweet media
English
3.4K
7.7K
43.2K
433.1K
AZ Jedi retweetledi
SportsCenter
SportsCenter@SportsCenter·
The Wildcats' winningest season keeps rolling 😤 Arizona racked up 36 wins on the way to the Final Four, breaking a tie for the most in program history 🔥
SportsCenter tweet media
English
58
336
3.4K
199.5K
Arizona Basketball
Arizona Basketball@ArizonaMBB·
Work to be done. Plenty of time left.
Arizona Basketball tweet media
English
65
20
282
13.9K
AZ Jedi retweetledi
Arizona Athletics
Arizona Athletics@AZATHLETICS·
WE’RE GOING TO THE FINAL FOUR!!!!!!!!!!!!!!
English
46
476
2.7K
48.1K
AZ Jedi retweetledi
Love Music
Love Music@khnh80044·
The most INSANE Bohemian Rhapsody flashmob you will ever see!! 😱 That has to be the BEST music video I've seen on this site so far! Absolutely amazing!! 😍
English
50
1.1K
6.2K
96K
Arizona Athletics
Arizona Athletics@AZATHLETICS·
U OF A 🗣️ U OF A 🗣️ U OF A 🗣️ U OF A 🗣️ U OF A 🗣️ U OF A 🗣️ U OF A 🗣️ U OF A 🗣️ U OF A 🗣️ U OF A 🗣️
English
5
48
437
4.8K
Lloyd Legalist
Lloyd Legalist@LloydLegalist·
When life spikes a lesson at your head and you say, “Okay, lesson learned,” and life says, “But did you REALLY learn the lesson? Let’s make sure.” Whack.
English
205
709
7.9K
2.8M
AZ Jedi retweetledi
Arizona Basketball
Arizona Basketball@ArizonaMBB·
How Sweet it is!
Arizona Basketball tweet media
English
55
327
1.6K
61.5K
AZ Jedi retweetledi
Arizona Athletics
Arizona Athletics@AZATHLETICS·
SURVIVE AND ADVANCE.
Arizona Athletics tweet media
English
5
130
909
11.2K