Rollback 1984
2K posts

@rollback1984
It was supposed to be a warning not a guideline.
Joined February 2025
316 Following, 75 Followers
Rollback 1984 reposted

@VictorTaelin This is obvious. It’s the reason why GPT only works properly with the Codex harness and Claude Code works best with third-party harnesses.
English

My final thoughts on Opus 4.6: why this model is so good, why I underestimated it, and why I'm so obsessed about Mythos.
When I first tested GPT 5.4 vs Opus 4.6 - both launched at roughly the same time - I was initially convinced that GPT 5.4 was vastly superior, because it did better on my logical tests. That's still true: given the same prompt, by default, GPT will be more competent, careful, and produce a more reliable output, while Opus will give you a half-assed, buggy solution, and call it a day.
Now, here's what I failed to realize: Opus's bad outputs are not because it is dumb. They're because it is a lazy cheater. And you can tell because, if you just go ahead and tell it:
"you did X in a lazy way, do it in the right way now"
And if you show that this is serious, it will proceed to do a flawless job. That doesn't happen with dumber models. And, the more I work with Opus, the more I realize that, if you just keep pushing it, its intelligence ceiling is much, much higher than it seems. It IS there, you just need to be patient and push it. GPT, on the other hand, has already done its best when it fails, so pushing it further will give you no added results.
That is also one of the reasons that benchmarks lie. When Claude and GPT score the same on a given benchmark, it is likely that Claude is actually smarter, because it puts in less effort. Now, consider that for a moment, and remember that Mythos is outperforming GPT 5.4 *Pro* on benchmarks. How insane is that?
Remember that Sonnet 3.5 lagged behind on benchmarks, yet everyone knew that it was superior to 4o. I think it is this effect at play: for whatever reason, Claude-series models "try less hard" on the first shot.
Because of that, even if Spud gets close to Mythos on benchmarks (which I predict will be the case), I suppose Mythos will still be superior. This also leads me to wonder if perhaps Anthropic actually has a real lead over OpenAI that will only get larger. I could totally see a timeline where Anthropic's models become so good that OpenAI simply fails to catch up as the recursive improvement unfolds.
Just my silly thoughts though, what do I know
As always I could be wrong, and I hope I am!!
English

@Geniustechw no one told us the war had ended and that we don't need to eat like this anymore.
English
Rollback 1984 reposted

🚨RUPERT LOWE PLEDGES MAJOR RESTRICTIONS ON ISLAM IN THE UK
- Halal slaughter BANNED
- Cousin marriage BANNED
- Burqa/Niqab BANNED
- Sharia courts BANNED
- Foreigners promoting Islamism DEPORTED
- Mass Islamic public prayer BANNED
- Public Call to Prayer BANNED
"This is a Christian country. Under a Restore Britain Government, it would stay that way" - Rupert Lowe
We're taking our country back


English

@Xaviololo @PatronesHumanos When you go to the supermarket do you pay less if you earn less? You socialist retard.
English

@PatronesHumanos €600 out of €1,400 is 43% of their salary
€600 out of €5,500 is 11% of yours
“going halves” doesn't exist when the salaries aren't equal
Spanish
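A quick check of the two percentages in the post above, as a minimal sketch. The function name `share_of_salary` is made up for illustration; only the figures (600, 1,400 and 5,500 euros) come from the post.

```python
def share_of_salary(cost: float, salary: float) -> float:
    """Return cost as a percentage of salary."""
    return 100 * cost / salary


# Figures from the post: the same 600 EUR share against two different salaries.
lower = share_of_salary(600, 1400)   # about 42.9
higher = share_of_salary(600, 5500)  # about 10.9

print(f"{lower:.0f}% vs {higher:.0f}%")  # prints "43% vs 11%"
```

Rounded to whole numbers, this gives the 43% and 11% quoted in the post.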

@pb551221 @John_F_kJr Are you paid or just retarded for free?
English


🚨 BOMBSHELL ALERT: First-in-the-World IVERMECTIN, Mebendazole, and Fenbendazole Protocol for CANCER Has Been Peer-Reviewed and Published – BIG PHARMA TREMBLES
🚨 EXPOSED: A peer-reviewed study confirms that the banned drugs Ivermectin, Fenbendazole, and Mebendazole obliterate cancer cells — and Big Pharma is LOSING ITS MIND. The truth is out. Their billion-dollar lie is collapsing, and the panic is very, very real.
🚨 FIRST-IN-THE-WORLD #IVERMECTIN, #MEBENDAZOLE, AND #FENBENDAZOLE PROTOCOL IN CANCER
FOLLOW ME, THE NEXT DROP WILL BE SHOCKING.


English

SENIOR ENGINEERS ARE QUIETLY SWITCHING FROM CLAUDE CODE TO CODEX AND HERE'S THE BRUTAL BREAKDOWN
a 14-year principal engineer spent ~120 hours co-developing (not vibe coding) across both tools on an 80k LOC python/typescript project.
here's what he found:
Claude feels like an engineer on a time crunch:
> speeds toward getting things working
> ignores CLAUDE.md at least once per session
> leaves tasks half-done mid-migration
> changes tests to match what IT thinks the goal is
> almost never creates new files — just bloats existing ones
Codex feels like a 5-6 year senior:
> stops mid-task to rethink and refactor unprompted
> never once ignored AGENTS.md
> doesn't extend god classes — it factors them out
> does things you hadn't thought of that are actually additive
> you can fire it off and come back when it's done
the raw numbers:
> Claude: more done per session, more cleanup every few days
> Codex: 3-4x slower, but the work is just better
> Codex Pro x5 ≈ Claude Max x20 in usage caps
the real difference:
> Claude needs a skilled, focused driver or it goes off the rails
> Codex demonstrates competence and earns autonomy
his verdict:
> vibe coding a weekend project? Claude wins
> building enterprise software? Codex wins
"Claude requires a skilled, focused driver more than Codex does"
both give crap output if you don't know SWE. the tool isn't the skill.

English

@stats_feed Now put it next to the minimum wage of each country.
English

🛢️ Diesel price per litre across Europe as of April 9th 2026
🇳🇱 Netherlands: €2.58
🇩🇰 Denmark: €2.56
🇩🇪 Germany: €2.43
🇧🇪 Belgium: €2.31
🇫🇮 Finland: €2.26
🇫🇷 France: €2.23
🇦🇹 Austria: €2.23
🇸🇪 Sweden: €2.20
🇱🇺 Luxembourg: €2.19
🇱🇹 Lithuania: €2.18
🇵🇹 Portugal: €2.13
🇮🇪 Ireland: €2.10
🇮🇹 Italy: €2.09
🇬🇷 Greece: €2.07
🇱🇻 Latvia: €2.06
🇪🇪 Estonia: €2.03
🇷🇴 Romania: €2.02
🇨🇿 Czechia: €1.98
🇭🇷 Croatia: €1.87
🇨🇾 Cyprus: €1.84
🇪🇸 Spain: €1.81
🇵🇱 Poland: €1.80
🇸🇮 Slovenia: €1.77
🇸🇰 Slovakia: €1.75
🇧🇬 Bulgaria: €1.70
🇭🇺 Hungary: €1.67
The most expensive diesel? Netherlands 🇳🇱
The cheapest? Hungary 🇭🇺
Source: Eurostat
English

🇨🇳🇮🇷🇺🇲 CHINA has issued a strong warning to the US, reiterating that China has an energy agreement with IRAN and that its ships will not be intercepted.
Chinese Defense Ministry:
‘Chinese ships continue to move in and out of the waters of the Strait of Hormuz. We have trade and energy agreements with Iran, which we will respect and abide by.
We expect others not to interfere in our affairs. Iran controls the Strait of Hormuz, and has opened it to us.’
English

@VicctorianChad x.com/eugeniovalle64… how about this one
𝗗𝗼𝗻 𝗘𝘂𝗴𝗲𝗻𝗶𝗼 🇪🇸@EugenioValle64
🚨 The Guardia Civil confirms: the Adamuz track broke 22 hours earlier. The system detected it. No alert was triggered. 46 Spaniards dead who could be alive. Minister Óscar Puente faces 3 years in prison for removing 42 meters of track without judicial authorization. He remains in office. 🔁 Retweet so Spain doesn't forget #Adamuz #ÓscarPuente #ADIF #Negligencia #España
English

@RoundtableSpace question is whether it improves output or not. opus for any serious coding is unusable.
English

@marcowenjones The EU? With what? A GDPR compliant fishing boat?
English

@tmaiaroto @javitm @plainionist The amount of times you interview candidates and they seem decent only to be completely useless.
English

@javitm @plainionist How much time did you spend interviewing and reviewing the AI? How many years of experience did the AI model have? Did you check its references?
English
Rollback 1984 reposted

@Portuga62717995 @jessicatuga It's obvious you're a socialist parrot
Portuguese

@jessicatuga It's obvious you didn't even watch the video; he said "political troops on the ground"
Portuguese
Rollback 1984 reposted

🚨 Read this slowly.
• Wife lives in the U.S. 🇺🇸
• Four kids live & study in the U.S. 🇺🇸
• ~91% of his portfolio in the U.S. 🇺🇸
• Home in the U.S. 🇺🇸
• Brookfield moved HQ to the U.S. 📍
Yet he tells Canadians: 🇨🇦
“We can’t depend on America.” 🇺🇸
Do you see the contradiction?
#cdnpoli #Canada #US #Reality
English

@RupertLowe10 @BayeOA He only understands that he gets paid to exist at the end of the month.
English

@BayeOA You clearly understand nothing about how the economy works.
English

Lmfao nah this country is so finished, look at what they’re campaigning on. Is no one concerned about the actual economy?
Rupert Lowe MP@RupertLowe10
A Restore Britain Government would raise the speed limit on British motorways to 80mph, and slash back 20mph zones other than around schools. Too much petty regulation has held Britain back for far too long - Restore Britain would gladly burn that all away.
English
