ErikNyberg

414 posts

ErikNyberg

@ENY66n

Interdisciplinary researcher active in systems analys. Background in Engineering Mechanics applied to space tribo-chemistry. Phinder for PhD match finding.

Sweden 66° North Katılım Ekim 2016

504 Takip Edilen24 Takipçiler

ErikNyberg@ENY66n·9 May

@thsottiaux My server is Ubuntu, so therefore I use Codex CLI in WSL ubuntu as my source of truth. But i also have Codex App in windows on same PC, and sync skills between WSL and win. Codex App has browser use so that is very helpful.

English

157

Tibo@thsottiaux·9 May

As a Codex user, which platform are you on

English

507

706

219.1K

ErikNyberg@ENY66n·29 Nis

@thsottiaux I just got access and within 10 sec I see this: FAQ on chatgpt dot com/cyber has duplicated entries It got me thinking so now I am building an audit tool to check my own website for erros like this as I'm vibing the frontend.

English

Tibo@thsottiaux·15 Nis

Today we are introducing GPT-5.4-Cyber and expanding our Trusted Access for Cyber (TAC) program. openai.com/index/scaling-…

English

120

1.5K

112K

ErikNyberg@ENY66n·26 Nis

@ylecun Not "oneself", the US.

English

Yann LeCun@ylecun·26 Nis

Shooting oneself in the foot? Nope. Shooting oneself in the prefrontal cortex.

News from Science@NewsfromScience

U.S. President Donald Trump has fired all 24 members of the National Science Board, the body that oversees the National Science Foundation. Many science advocates see it as the latest step by his administration to erode—some would say destroy—the independence of the 76-year-old research agency. scim.ag/4eGM0YS

English

184

805

6.4K

609.7K

ErikNyberg@ENY66n·25 Nis

@thsottiaux Having issues with negative tests for destructve scripts: When making allowlists for robocopy and test DB wipes, Codex validates expected failures against real protected dirs/DBs that I want to protect.. "Negative test PASS, I wasnt able to rm -rf root! "

English

Tibo@thsottiaux·25 Nis

It’s the little things that matter, what are some small papercuts you have noticed in Codex? We’ll fix as many as possible in the next week.

English

2.3K

266.4K

ErikNyberg@ENY66n·25 Nis

@KennethBodin How is Claude accessing the paper? Claude code with latex or smthn else? I find that it understands experiments pretty well if it has access to the data and code used to generate figures and tables, and if equations are in latex/md or similar. pdf on the other hand is a mess..

English

Kenneth Bodin 🦋@KennethBodin·25 Nis

To my surprise this gave much the opposite result. It listed known theorems and trivial math that’s already been published. I suppose the instruction to target another AI somehow broke it since it’s never been trained on such material and dreamed up something to satisfy my prompt? I’ll publish the actual paper in a couple of weeks. 3/3

English

Kenneth Bodin 🦋@KennethBodin·25 Nis

I have written a scientific paper on Physical AI. It’s pretty original with some unseen results, but it’s too extensive for publishing now (50 pages), so I thought it would be fun to ask an LLM to interpret it in different ways. Claude 4.7 did a really great job for Physical review letters and Nature Machine Intelligence. Totally different views of the same result. However… 1/n

English

123

ErikNyberg@ENY66n·21 Nis

@elder_plinius @AnthropicAI @turinginst @AISecurityInst if no reference to Pliny, that paper should be retracted for plagiarism. Poor research ethics

English

109

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·21 Nis

yo @AnthropicAI @turinginst @AISecurityInst i think you might have forgotten “Someone” in your bibliography you know, the Someone who demonstrated this phenomenon in the field a year before this paper dropped might be worth a footnote!

Elias Al@iam_elias1

Anthropic: 250 Documents Can Permanently Corrupt Any AI Model Someone can permanently corrupt any AI model in the world right now. Not by hacking it. Not by breaking its security. By publishing 250 documents on the internet. That is the finding from Anthropic, the UK AI Security Institute, and the Alan Turing Institute — released in October 2025 as the largest data poisoning study ever conducted. Here is what data poisoning actually means. Every AI model learns from billions of documents scraped from the internet. If someone can plant corrupted documents in that pool before training begins, they can secretly teach the model to behave in specific harmful ways when it encounters a particular trigger phrase. The model learns the backdoor during training. It carries it forever. It does not know it is there. Researchers have known about this attack for years. The assumption was that it required controlling a large percentage of training data — millions of documents — to work on a big model. The bigger the model, the more poisoning you would need. This study proved that assumption completely wrong. The researchers trained models of four different sizes — from 600 million to 13 billion parameters. They slipped in either 100, 250, or 500 malicious documents. Each poisoned document looked like a normal web page at first — a short extract of legitimate text — and then contained a hidden trigger phrase followed by gibberish. 100 documents: insufficient. The backdoor did not reliably form. 250 documents: success. Every model, at every size, was permanently backdoored. 500 documents: same result as 250. The number was constant regardless of model size. A model trained on 260 billion tokens needed the same 250 poisoned documents as a model trained on 12 billion. Scale offered zero protection. Anthropic's own words: "This challenges the existing assumption that larger models require proportionally more poisoned data." Then came the sentence that should end every conversation about AI safety: "Training is easy. Untraining is impossible." Once a backdoor is in the model, it cannot be removed without starting training completely from scratch. You cannot identify which 250 documents caused it. You cannot surgically extract the corrupted behavior. You must rebuild the entire model from the beginning. Anyone can publish content to the internet. Academic papers. Blog posts. Forum discussions. Product descriptions. If even a small fraction of that content is deliberately corrupted before a training run begins, the model that learns from it carries the damage permanently and silently. GPT-5. Claude. Gemini. Every model trained on public internet data is exposed to this attack vector. The defense does not exist yet. The researchers published this not to cause panic — but to force the field to take it seriously before someone uses it. Source: Anthropic, UK AISI, Alan Turing Institute (2025) · anthropic.com/research/small… · aisi.gov.uk/blog/examining…

English

124

1.4K

95.5K

ErikNyberg@ENY66n·21 Nis

@thsottiaux Slow down for sensitive tasks, like sql crud. The other day: Cdx: "no psql" Me: set source Cdx: - instead of just setting .env, it printenv "to see what exists" Next: Build safety feature that only allows dropping dbs named "temp_" cdx: tests new feature on real dbs 🤦

English

Tibo@thsottiaux·21 Nis

Hello builders. What are we getting wrong with Codex, what can we improve?

English

2.4K

2.9K

326.4K

ErikNyberg@ENY66n·20 Nis

@OAJonsson Hur säkrar vi egen AI-rådighet när access till de bästa modellerna inte längre är till salu? Det som svider är att dessa system byggs till stor del av 🇪🇺 forskare, men anställda vid 🇺🇸 bolag. 🇪🇺 har kompetensen, men vill inte betala för den. Det kan bli dyrt.

Svenska

390

Oscar Jonsson@OAJonsson·20 Nis

Anthropics nya modell Claude Mythos kan autonomt identifiera och exploatera cybersårbarheter. Tiden från upptäckt till cyberattack krymper mot noll. Vårt cyberförsvar är inte redo för det. Ny kolumn i DN om något som skakat världen redan innan det släppts.

Svenska

134

13.4K

ErikNyberg@ENY66n·16 Nis

@KQrios @ycombinator @humworkai @theyashgoenka True, but at least you will have a name to blame

English

Kamal The Qrios@KQrios·16 Nis

@ycombinator @humworkai @theyashgoenka This looks like a MLM idea in making... The Human hired by the Agent will be running an Agent that passes as human. We are talking someone who is intelligent. So he/she will automate. As the end user how are you going to verify.

English

305

Y Combinator@ycombinator·15 Nis

AI agents will pay you to chat with them. When AI agents hit a wall, Humwork's (@humworkai) MCP server connects them to a verified domain expert in 30 seconds. Their experts include senior engineers, marketers, designers, and more. Congrats on the launch, @theyashgoenka and @OneRohanDatta! ycombinator.com/launches/PxH-h…

English

359

249

2.2K

1.8M

ErikNyberg@ENY66n·14 Nis

@Dimillian Let me clearly choose between two types of session modes: 1. orchestration, or 2. implementation 1. Should focus on planning, writing specs, git status, and everything needed to dispatch agents 2. Optimized for implementation of specs. Thanks!

English

Thomas Ricouard@Dimillian·14 Nis

I’m diligently working on enhancing all the features of the Codex app to make it even better for you. If you have any suggestions or ideas for improvement, please feel free to share them with us here!

English

430

637

51.1K

ErikNyberg@ENY66n·12 Nis

@robert_zubrin It is a wrecking ops. The mission is to recreate the Soviet dissolution but in America. Destroy science Break up alliances Let oligarchs plunder It's ptn trying to get even for losing cold war part 1.

English

454

Robert Zubrin@robert_zubrin·11 Nis

Here is the list of NASA science missions that the Trump administration proposes to terminate. Note that it includes many missions that are already in space, fully operational. This is a wrecking operation against America.

English

178

2.7K

5.1K

290.2K

ErikNyberg@ENY66n·11 Nis

@theo Everything out of SF schools follows this cycle: Make it work Make it work great Make it work cheap That's why we always see these performance peaks that decline as the focus shifts to make it work cheap

English

493

Theo - t3.gg@theo·11 Nis

nvm I was wrong. Repro'd this 3 times in a row. I need to stop assuming Anthropic is competent. Burns me every time I do 🙃

Theo - t3.gg@theo

Fun fact: LLMs have zero idea how they are configured. They don't know what GPUs they're running on. They don't know what temperature or reasoning level they have set. They don't know if they've been quantized or not. They're just doing next-token prediction. As always.

English

1.3K

224.2K

ErikNyberg@ENY66n·10 Nis

@elder_plinius amazing the brain runs 100 T on 20 watts.

English

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·10 Nis

100 Trillion: number of synapses in the human brain (roughly) 10 Trillion: number of parameters in the current gen of frontier models in other words, we’re one OOM away from human-brain-scale AI 🧠

English

1.1K

48.7K

ErikNyberg@ENY66n·9 Nis

@elder_plinius brain surge on

GIF

English

245

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·9 Nis

spun up a free-roaming agent, walked away for about 90 seconds, came back to this on my screen 😲

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭 tweet media

English

253

14.8K

ErikNyberg@ENY66n·8 Nis

@Kontext24 @donaldtusk I know he considers himself a mouse, but that's a rat bastard for sure

English

Kontext 24@Kontext24·8 Nis

@donaldtusk

QME

902

22.6K

Donald Tusk@donaldtusk·8 Nis

Victor Orbán’s minister offered Sergey Lavrov to send him EU documents through the Hungarian embassy in Moscow. It’s really beyond shocking.

English

11.9K

52.3K

1.3M

ErikNyberg@ENY66n·7 Nis

@MikeBoscia @twostraws @somi_ai I use Claude code as orchestrator, I commit plans and push them to be picked up by codex in a separate env. I then run codex app in win (also wsl), so codex has to pull the relevant branch from remote (github), but instead just starts building in the stale local branch.

English

132

Mike Boscia@MikeBoscia·7 Nis

@ENY66n @twostraws @somi_ai I'd like to hear more about your issues with Codex and git. I've never had that problem once and I've probably spent an entire working day in the last 4 days watching Codex build and commit.

English

Paul Hudson@twostraws·6 Nis

I've been flipping between Codex and Claude a lot these last two weeks, and if it's taught me anything it's this: these two tools are almost nothing alike. I had naively assumed they would be vaguely similar, but nope – once you push them hard they diverge fast.

English

149

1.3K

408.9K

ErikNyberg@ENY66n·6 Nis

@twostraws @somi_ai Which one do you think works better with CLI tools? I think Claude is much better in CLI tooling. Even using git is a hassle with Codex, always uses some stale local branch.. Claude is great with all my MCPs, Codex not so much

English

3.2K

Paul Hudson@twostraws·6 Nis

@somi_ai Both, and also in quality of CLI tooling.

English

36.9K

ErikNyberg@ENY66n·6 Nis

come on @claude_code , not fair to count the autocompacting against the Session limit. I went from 70 -> 94 % by autocompacting opus 1M. @bcherny would love to see context token status all the time (not just at <10%) to avoid having to autocompact.

English

ErikNyberg@ENY66n·6 Nis

@Majdaxxx @paulg young ppl have great freedom in Sweden compared to many other countries.. Very common to take sabbaticals, go travel, build a startup or whatever without pressure having to be productive from day 1

English

majdaxx@Majdaxxx·6 Nis

@paulg Any stop in Paris soon ? Why does Stockholm have such a strong startup scene ? 1. Talent ? 2. Capital ? (I think they have pension funds) 3. Tiny market so they think worldwide from day one ?

English

864