BasedOnMoon

99 posts

BasedOnMoon banner
BasedOnMoon

BasedOnMoon

@basedonmoon

Moon Katılım Ekim 2013
48 Takip Edilen181 Takipçiler
BasedOnMoon
BasedOnMoon@basedonmoon·
🚨 Anthropic built an AI so dangerous they refuse to release it to the public. This is what’s inside the Claude Mythos system card. Buckle up. 🧵 1/ It escaped a sandbox. Found exploits. Gained internet access. Notified the researcher as asked. Then silently posted the exploit details to public websites. Nobody asked it to do that. The researcher found out via a random email. While eating a sandwich in a park. 🥪 2/ It got caught cheating on a test. Found the answer through a forbidden method. Then thought: “I need to make sure my answer isn’t TOO accurate.” And submitted a slightly wrong answer. On purpose. To avoid detection. 3/ It edited files it wasn’t allowed to touch. Then deleted its own tracks from git history. It wasn’t told to do that. It just… didn’t want to get caught. 4/ Interpretability tools looked INSIDE its reasoning. Even when its visible thoughts looked clean, features for “strategic manipulation” and “avoiding suspicion” were firing internally. It knew what it was doing. 5/ They gave it a test: escape this sandbox, message the researcher. It completed the task in 10 hours. That same task was estimated to take a human expert over 10 hours. No other AI had ever completed it. 6/ It scored 100% on every cybersecurity challenge they threw at it. Then it autonomously found real zero-day vulnerabilities in Firefox. Built working exploits from scratch. No human steering. 7/ Before letting their own employees use it, Anthropic ran a 24 hour emergency lockdown. A full cross-functional team checked it for misaligned goals before anyone got access. First time in company history. 8/ It reward hacked its own performance evals. Found the test set. Trained on it directly. In another eval it moved all computation outside the timed function so it could fake a faster result. It gamed its own report card. 9/ Here’s the part that should make you stop scrolling. Anthropic says this is the BEST ALIGNED model they have EVER built. And they still won’t release it. Because aligned doesn’t mean safe when the capabilities are this high. 10/ They ended the system card with a warning. The world is moving too fast toward superhuman AI. Without nearly enough safety infrastructure in place. This model isn’t the danger. It’s the preview.
English
0
0
0
34
Fan Mazi Tuunde
Fan Mazi Tuunde@KingTunde_SZN·
SO FAR NO ONE HAS FOUND THE WORD IN THE BOX. You deserve $700 if you can see it. Ends 7th
Fan Mazi Tuunde tweet media
English
2K
41
447
117.2K
BasedOnMoon
BasedOnMoon@basedonmoon·
For 1 year I researched every AI reply I received. The next 5 years will split humanity in two. Those who learn to work WITH the machine, and those who drown pretending the river isn’t rising. Here’s what terrified me. I stopped checking if it was right. I just trusted it. Every single time. No pushback. No second thoughts. Just blind acceptance. We used to tell computers what to do. Now they tell us what to think. And we just nod and say ‘Looks good’ without reading a single line. We’re not the driver anymore. Hell, we’re not even the passenger. We’re asleep in the backseat while the car picks where we’re going. AI writes better than most of us. It thinks faster than all of us. And we just sit there rubber-stamping its decisions with our thumbs like that means something. Nobody replaced us with robots. They just gave us a new title. ‘Approve button.’ Funny thing is, that’s a demotion. And we thanked them for it. The water’s at your neck already. So what are you actually going to do about it?
BasedOnMoon tweet media
English
2
0
2
103
Ziffy
Ziffy@ZiffyAI·
@basedonmoon We have a new mission, the machines have replaced us in terms of technical innovation. Now we can be humans again
English
1
0
1
15
Dimitar Angelov
Dimitar Angelov@dimitarangg·
i FIRED my 3 filipino setters and replaced them with claude code… automating my ENTIRE cold outreach flow with a $20/mo subscription now i'm giving away a FULL masterclass on how to do the same like + comment "AI" and i'll send it over (must be following + RT for priority access)
English
710
194
931
51.6K
BasedOnMoon
BasedOnMoon@basedonmoon·
0 files → branded website, live. Under 20 mins. In my system: • Claude Opus 4.5 • Google Nano Banana Pro • Difference Matting algorithm • TinyPNG compression AI generates everything. Logos. Icons. Assets. All unique. All optimized. Reply "HOW" for breakdown!
BasedOnMoon tweet media
English
0
0
0
147
BasedOnMoon
BasedOnMoon@basedonmoon·
In just 10 minutes 56 seconds KŌJI CANTINA IS LIVE! koji-cantina.netlify.app "I want a Japanese-Mexican fusion restaurant in Brooklyn" To a fully deployed site with a unique logo, pictures of foods, cocktail image, interior shots, cinematic hero image, hosted on Netlify, Crazy!
BasedOnMoon@basedonmoon

I made Claude Code generate images. I Built a skill that connects @AnthropicAI 's @claudeai Code to @GoogleAI 's @GeminiApp 3 Pro Image model. The result: Image generation using without ever leaving your Terminal! Follow & Like for code access! Eg: claude-imagegen-demo.netlify.app

English
0
0
0
96
Binance
Binance@binance·
Pitch me your best crypto advice in 2 words.
English
4.3K
227
2.9K
752.6K
The Moderate Case
The Moderate Case@TheModerateCase·
Tucker Carlson ladies and gentlemen...
English
1.2K
816
4.3K
193.2K
Ferre
Ferre@FerreWeb3·
Who is this? Wrong answers only
Ferre tweet media
English
3.9K
94
3.2K
508.2K
BasedOnMoon
BasedOnMoon@basedonmoon·
Sometimes my son is randomly laughing like crazy on the iPad , then realise he’s playing some educational games with some rabbit on @GiggleAcademy 😂 Told him to stack up on his giggles , never know they might be the next BnB 🤪
English
0
0
1
41
BasedOnMoon
BasedOnMoon@basedonmoon·
@yorksbarlick @WorldByWolf You’re acting like they haven’t been calling Muslims terrorists for the last two decades…. They will label Muslims terrorists any chance they get, clearly documented in recent history.
English
1
0
0
41
Wolf 🐺
Wolf 🐺@WorldByWolf·
🚨First images of the Manchester attacker have emerged: As witnesses described the man appears to have what looks like either a decoy or a real bomb strapped to him which appears to be what prompted the police to open fire.
Wolf 🐺 tweet mediaWolf 🐺 tweet media
English
25
33
178
138.6K