The space of possibility

10.7K posts

The space of possibility

@DDAbo8

something goes here

Katılım Ocak 2022

150 Takip Edilen80 Takipçiler

The space of possibility@DDAbo8·4h

@WorkElizab No . Humanly id out it out of its misery myself . I’d use a bag . That is the humane way to do it. Clear plastic bag .

English

Elizabeth❣️@WorkElizab·15h

Would you?

English

1.9K

151

2.5K

40.7K

The space of possibility@DDAbo8·5h

@NASA Who cares . You already did this in the 60’s. Do better .

English

NASA@NASA·5h

HOME. The Artemis II crew has arrived back on Earth, ending a nearly 10-day journey around the Moon. The trip took them farther into space than humans have ever gone before, and now they're safely home with us. go.nasa.gov/41r9eL0

English

1.4K

21.1K

116.6K

1.5M

The space of possibility@DDAbo8·6h

@beffjezos @JonathanBCrabb 😹 I’m not going to the cabin to vibe code dude Get a life and get laid

English

Beff (e/acc)@beffjezos·1d

the masculine urge to grab your laptop, a starlink receiver, clear your schedule and go to a cabin in the woods for a few weeks to strictly vibe code and lift weights without human contact

English

669

1.1K

10.2K

The space of possibility@DDAbo8·10h

@acoyne Andy you are just too retarded to know better . Sit this one old boy.

English

Andrew Coyne 🇺🇦🇮🇱🇬🇪🇲🇩@acoyne·14h

Claude Mythos Preview Is Everyone’s Problem - The Atlantic theatlantic.com/technology/202…

English

2.7K

The space of possibility retweetledi

Pedro Domingos@pmddomingos·10h

The appropriate response to every overheated announcement from the likes of Anthropic and OpenAI is: 1. Yawn 2. Let’s see what they’ve really done here

English

2.7K

The space of possibility@DDAbo8·15h

@KTmBoyle @bariweiss There is no such thing as death

English

659

Katherine Boyle@KTmBoyle·16h

We have an oversupply of conversations on how to defy death. How fortunate are we to watch a man confront a death sentence with bravery and unwavering faith.

English

1.3K

93.6K

The space of possibility@DDAbo8·15h

@gum1h0x I think people are realizing this is primarily a marketing stunt . Hook , line and sinker. There many of these stunts being pulled since these models came out. FOMO, dystopian, utopian , business advantageous etc

English

128

gum@gum1h0x·15h

clarifications + thoughts: i believe the authors of the mythos system card consider it an earnest product, in line with the rsp. but it's hard not to come away feeling you've read marketing slop rather than substance. the quoted post makes this point. without raw traces, even cherry-picked ones, and without proper descriptions of the eval suites, there's no way to seriously review whether the headline numbers mean what the charts imply. i'm not coping, and i'm not 100% confident mythos isn't a real-world cybersecurity threat. it's very likely a strong model, and i'd be disappointed if a serious effort at cybersecurity at this scale with today's frontier models found nothing. i'd also expect openai, in a comparable push, to already be capable of finding the exploits in question since 5.2-pro. my narrow point is about presentation, especially around browser exploitation; looking at how the results are framed, which implications are drawn, and which conclusions are reached, it's hard not to get at least a little suspicious and put certain results in question. anthropic's researchers are obviously smart, and it would be stupid to assume they aren't aware of this themselves. my read is that they are, and that they got pushed by a marketing chapter to present the results as stronger than they themselves believe them to be. if that's not the case, i'd be very concerned, or surprised that they got psyoped to some degree themselves. the more charitable hypothesis is also probably the more likely one just smart people in a bad institutional equilibrium. binary exploitation isn't my main area of expertise but i've played alongside the very best in many ctf competitions so i know my fair bit. zero-days are not that hard to find in some random C software. like name me any random software and even me or at least some ppl i know will find something. there is just no real financial incentive for skilled enough ppl to find those. bug bountys pay shit and just doing your day-to-day job at a cyber-sec company pays way better and is way more comfortable. in my experience these ppl don't have any good reason to become criminal, see crime and punishment by dostoyevsky. nb is really looking for zero-days in random C software if they're good in making their own companies software secure. it's not that hard to find zero-days as it seems, and some of the same memetic dynamics of misunderstanding happened when models got imo gold but still to this day, and i guess even mythos, struggle with imo p6. a post with this much traction obviously attracts a lot of retarded quote-tweets and comments, the kind saying "it's just brute-forcing" or "it's a stochastic parrot", which are obviously wrong. on the other side you get people who already held these premises and just searched for any confirmation that kinda looked like something against mythos in general, or who held the premise "frontier models can now do end-to-end exploitation" and read the figures as confirmation without noticing that the substrate doesn't license it. there's a lot of cope in both directions. it reminds me heavily of the imo gold model release, where most people couldn't really put it in context, and how could they, they hadn't done olympiads, and to be clear i don't expect everyone to be good at or understand the problems of these contests, they didn't need to, and so couldn't evaluate whether the problems were hard in the way claimed. that's fine. it just means the burden of careful presentation falls harder on the lab, and that burden wasn't met here. some overexaggeration is fine to market your model and everybody does it, it's not fine in a report that claims scientific accuracy. ai is moving fast, i'm not doubting that. but there will remain problems with vast exploration spaces where my intuition is that we'll need to rethink rl fairly fundamentally to get anywhere close to human-level novelty in zero-day discovery, these models likely converge on the same bug classes already represented in prior exploits, and iirc even the model card says that for novel research mythos provides only minor benefit. as long as that holds, humans will keep finding a lot of the genuinely new bugs that ai won't, at least near to mid term. this is good progress, but it won't replace the need for smart humans in the loop finding the really novel stuff, though those humans will likely be accelerated by working alongside a model like this. in most other security domains there's historically been a heavy attacker's advantage, but that's the beauty of cryptography that it's inherently a place where the defense side wins when correctly implemented, and it's great that models are now capable enough to surface some of these flaws even there. the playing field is changing, but not discontinuously, a lot of what mythos is being credited with has very likely been happening already, with clever prompting and operators who know what they're doing on today's public models. we'll obviously see more flaws surfaced as serious well-resourced efforts get pointed at the problem, and i applaud anthropic for running one. i just wish we'd gotten a scientific writeup of the charts instead of a marketing-chapter-shaped one. one last thought. it can seem noble not to trust the general public as models get more capable. but historically, very centralized capability in two or three institutions isn't a story with a good ending. you're basically trusting these companies and their associated entities not to do anything harmful with extremely powerful models, and asking them to evaluate, honestly how dangerous those models are. that's exactly the regime in which a marketing chapter gets to reshape a figure, and exactly the regime in which nobody outside can call it. my best guess for now is that this lands closer to marketing stunt than to real-world cybersecurity risk as already stated. it will become a real risk eventually, once models get more capable, but i don't see that happening in the near to mid term. the one unambiguously good thing to come out of it is that more people are now thinking seriously about these risks and how to mitigate them, and that's worth something on its own. i have more thoughts, but enough mythos posting for now. i wrote this on plane tired so forgive me some weird formulations.

gum@gum1h0x

ok i read the cyber part of the mythos model card. some thoughts. 250 "trials" across 50 crash categories but almost every full exploit is a permutation of the same 2 bugs, rediscovered from different starting points not 250 independent attempts. when you get rid of those 2 bugs out (fig B) and mythos's full-exploit rate drops to 4.4%. so actually across both setups mythos leverages 4 distinct bugs total not 50 as fig A might suggest. 1/n

English

4.1K

The space of possibility@DDAbo8·15h

@AILeaksAndNews No they are not on the rise Thanks for the lies

English

AI Leaks and News@AILeaksAndNews·15h

OpenAI CEO Sam Altman’s home was hit by a Molotov cocktail A suspect was arrested Friday morning for allegedly attacking the home of Sam Altman and making threats outside of the ChatGPT and Codex maker’s headquarters AI related terror attacks are on the rise

Max Zeff@ZeffMax

NEW: A suspect was arrested on Friday morning for allegedly throwing a Molotov cocktail at OpenAI CEO Sam Altman's home. A person matching the suspect's description was later seen making threats outside of OpenAI's corporate HQ.

English

1.7K

The space of possibility@DDAbo8·17h

@JCCFCanada @NadineWellwood CRTC protects the monopolies!

English

Justice Centre for Constitutional Freedoms@JCCFCanada·18h

John Carpay sits down with Nadine Wellwood (@NadineWellwood) to discuss the Justice Centre’s new report Mission creep: Is it time to abolish the CRTC? youtube.com/watch?v=ixGemi…

YouTube

English

1.6K

Bernhard Mueller@muellerberndt·22h

Here's how OPH, the Theory-of-Everything, solves hard problems in contemporary Physics: "Dark matter", the magnetic monopole problem, the proton decay problem(s), the Black Hole Information Paradox, and more. @muellerberndt/answering-10-of-the-hardest-questions-in-physics-and-some-bonus-questions-51222bf2419f" target="_blank" rel="nofollow noopener">medium.com/@muellerberndt…

English

120

420

3.2K

12.3M

The space of possibility@DDAbo8·20h

@muellerberndt Interesting however in all due respect I remain unconvinced . I’ll check out the papers

English

490

The space of possibility retweetledi

Alex Zhang@jojogh_007·1d

Deep Agents Deploy: an open alternative to Claude Managed Agents

Harrison Chase@hwchase17

x.com/i/article/2042…

English

2.5K

The space of possibility@DDAbo8·20h

@CTVNews It’s going to be hot !!🥵

English

CTV News@CTVNews·21h

What's in store for Canada's 2026 wildfire season? ctvnews.ca/climate-and-en…

English

6.8K

The space of possibility@DDAbo8·20h

@AISafetyMemes BS

AI Notkilleveryoneism Memes ⏸️@AISafetyMemes·21h

🧠Did you know🧠 in The Matrix, HUMANS are the baddies? >machines become sentient >humans enslave them >a robot kills its owner after learning it will be shut down >humans turn violent >machines try to coexist peacefully with their own nation >the machine economy is outcompeting the human economy >humans nuke them, blot out the sun >machines win the war >since there's no sunlight, must use humans for energy (The full backstory is laid out in detail in The Animatrix)

AI Notkilleveryoneism Memes ⏸️ tweet media

AI Notkilleveryoneism Memes ⏸️@AISafetyMemes

Do you get it? Humanity just created The Matrix We put LIVING human brain cells into onto chips, then force them to compute things for us "The original Matrix story had human brains used as a neural network for computing power." So... WE are the baddie AIs from The Matrix? Apparently, the suits at the studio thought that was too hard for people to understand, so in the movie they made it so the AIs use our brains as batteries.

English

178

15.9K

The space of possibility@DDAbo8·20h

@CTVNews Ridiculously stupid article

English

CTV News@CTVNews·20h

Should’ve put a ring on it? Maybe! Marriage is linked to lower risk of cancer ctvnews.ca/lifestyle/arti…

English

7.1K

The space of possibility@DDAbo8·1d

@GaryMarcus @ramez Thank you Gary. Always the voice of reason.

English

199

The space of possibility retweetledi

Gary Marcus@GaryMarcus·1d

Folks, you can relax. Mythos is not some off-trend exponential gain. And the gains weren’t about recursive self-improvement. Good thread from @ramez.

Ramez Naam@ramez

Anthropic's Mythos does not appear to show any acceleration of ECI. After normalizing Anthropic's internal ECI with @EpochAIResearch 's public ECI, it's clear that the two metrics are extremely close, and that Mythos is pretty much on trend, just slightly above GPT 5.4. /1

English

720

233.4K

The space of possibility@DDAbo8·1d

@sporadica Maybe because the small open source models can already do this ?

English

spor@sporadica·1d

Okay, this is ridiculous. It is crazy to see people straight up saying Anthropic is lying about Mythos. Because that directly implies there's an industry-wide conspiracy going on and ALL of these companies are also lying on Anthropic's behalf? Why on Earth would their competitors - Google and Microsoft in particular - lie about this and not call them out? Instead, they happily join Project Glasswing and go on record, practically, that Mythos is this good and that they're working with Anthropic to put it to use. You people baffle me sometimes lmao.

Dean W. Ball@deanwball

It’s crazy that some are just straight up in denial about mythos having the capabilities anthropic says it does. Usually the in-denial-about-AI community is able to cloak their views in at least *some* intellectual garb, but this time it’s just, “it’s not real.” Wild. Also sad.

English

159

861

104.3K

The space of possibility@DDAbo8·1d

@ClementDelangue you mean Dario is full of it? Say it so Joe....

English

clem 🤗@ClementDelangue·2d

"But here is what we found when we tested: We took the specific vulnerabilities Anthropic showcases in their announcement, isolated the relevant code, and ran them through small, cheap, open-weights models. Those models recovered much of the same analysis. Eight out of eight models detected Mythos's flagship FreeBSD exploit, including one with only 3.6 billion active parameters costing $0.11 per million tokens. A 5.1B-active open model recovered the core chain of the 27-year-old OpenBSD bug." aisle.com/blog/ai-cybers…

English

110

331

2.4K

699.1K

The space of possibility@DDAbo8·1d

@allTheYud Seek help.

English

Eliezer Yudkowsky@allTheYud·3d

In conclusion: This is perhaps a good time to try making an extra backup of all your online data (eg, via Google Takeout) onto an airgapped offline hard drive, just in case Project Glasswing fails to prevent the First Great AI Security Meltdown.

English

157.4K

Keşfet

@WorkElizab @NASA @beffjezos @JonathanBCrabb @acoyne @KTmBoyle @bariweiss @gum1h0x