Закреплённый твит
Azermesh
527 posts

Azermesh
@Azermesh
Muslim. Roblox game developer. Interested in AI, religion, and the end of the world. I know nothing and should not be taken seriously :)
Присоединился Şubat 2024
391 Подписки10 Подписчики
Azermesh ретвитнул

seems like the "mythos" panic was mostly anthropic marketing
AISI found gpt-5.5 performs nearly on par with, or better than, mythos in several cases — completing TLO end-to-end in 2/10 attempts, while mythos preview did it in 3/10
on expert-level tasks:
gpt-5.5 scored 71.4%
mythos scored 68.6%

English
Azermesh ретвитнул
Azermesh ретвитнул

@iruletheworldmo Clearly a highly intelligent model
Much more creative than its predecessor (5.4)
More reliable than Opus 4.7 for most tasks
Image gen integration makes it much more valuable than Claude Code
Clearly and obviously makes less mistakes than its predecessors
Overall: reliable
English
Azermesh ретвитнул
Azermesh ретвитнул

There has been a great deal of speculation about why Anthropic is keeping Mythos in restricted release. One of the least-discussed reasons is cost. Not the cost to Anthropic of serving the model, but the downstream effects that cost will have on the industry, and on the world.
Mythos is now being served to a small group of about 50 major companies. For organizations like these, token budgets are effectively unlimited, and the opportunity cost of not using as much of the model as possible is too high. I think you can already see the downstream effects even in this limited release. Claude users complain about hitting caps faster. They complain about degraded performance. For months now almost everyone I know has been continuously hitting the cap on Claude or Codex. The existence of Mythos pressures not just the amount of usage available to smaller subscribers, but also the pricing of these plans themselves, which are already subsidized. Smaller users will get hit twice.
The compute cost of serving Mythos exerts pressure all the way down the line. Inference will get cheaper over time, but demand is already ahead of that curve and continues to expand. Mythos is not the end of this chain. As long as scale keeps rewarding larger runs, larger models will keep being trained. The next model that makes a Mythos-like jump may be dramatically larger again, and much more expensive to serve. If the cost of serving frontier models continues to outpace attempts to reduce it, then smaller players and public use get squeezed out. We end up with vast models, served at immense cost, available only to the richest corporations on earth. Those firms then use that access to outcompete smaller rivals, become richer still, and widen the gap again. If this continues, a small number of giant companies end up holding the only passports to the Country of Geniuses in a datacenter.
For Anthropic, culturally, this is not a desirable world. Part of their reluctance to serve Mythos more broadly comes from a reluctance to help bring this world into being. There may be no way to serve a model like Mythos at scale right now without beginning this feedback loop. And as that loop accelerates, it will generate great resentment. If they serve it to lower-tier subscribers, those users get a handful of exchanges before hitting the cap. Seeing how capable the model is only deepens the resentment, because access is visibly rationed. The labs will be forced to make a trickle-down argument: let the largest firms use the models first, and the abundance will eventually spread to everyone else. The public is unlikely to buy this argument. The hostility and pushback against the industry will spiral. Eventually it may not remain merely political.
It is not only Dario who has seen this world, but Sam as well. That is part of why OpenAI has started talking about mechanisms that would give ordinary citizens a direct stake in the upside of the industry, like the Public Wealth Fund. In my opinion the original use case of Worldcoin was a global UBI in a future where OpenAI won the race. Not only is that future no longer certain, but the trust and solidarity required to support a UBI no longer seem to me to exist in the West. The only path then is simply to scale everything as quickly as possible and hope abundance eventually arrives in a cascade strong enough that it reaches everyone on earth.
To my friends who are in the safety camp, I understand this argument is hard to accept. Please consider that there is a level of capability beyond which, unless your p(doom) is literally 100, stopping becomes more dangerous than continuing. I think we passed that threshold even before Mythos. Even if stopping were possible - and I personally do not believe it has been for years - stopping here would lock in a dystopia. This dynamic is incentive-driven, just like the race itself, and just as hard to coordinate against. We must not stop inside this tunnel. The only way out is through.
English

Azermesh ретвитнул
Azermesh ретвитнул

@scaling01 Sonnet 4.6 is pretty good, I just never use it because Opus 4.6 is better
English

it's a shame i think so poorly of Sonnet nowadays
I would never speak of my baby Sonnet 3.5 that way
Lisan al Gaib@scaling01
if Mythos really is 10-12T a naive forecast just based on pricing implies: - Opus: 2-2.4T - Sonnet: 1.2-1.4T - Haiku: 400-466B (but i don't like the Sonnet and Haiku estimate here, I think they are smaller)
English
Azermesh ретвитнул

3 donors have already given $115 to MSF. Can you be donor #4?
If you can spare $10 / $25 / $50 today, every donation goes directly to Doctors Without Borders via Every.org:
every.org/doctors-withou…
English

Anthropic just suspended OpenClaw's creator access to Claude, then reinstated
What game are they playing?🤔

Peter Steinberger 🦞@steipete
Yeah folks, it's gonna be harder in the future to ensure OpenClaw still works with Anthropic models.
English










