
delaniac 🌹🌱
62.3K posts

delaniac 🌹🌱
@ChadNotChud
G.A. Cohen stan | democratic socialist | vegan



🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵


Jensen Huang says every company will need an OpenClaw agentic system strategy by calling it “the new computer.” He claims OpenClaw became the most popular open-source project in $NVDA history within weeks and comparing its impact to Linux reshaping the software stack.




🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵

2/ Our method: test them on esoteric programming languages. Brainfuck. Befunge-98. Whitespace. Unlambda. Shakespeare. All Turing-complete. All requiring identical reasoning to Python. All with 1,000-100,000x fewer GitHub repos than mainstream languages. Same problems. Radically less training data.


Remember the first rule of memory safety in C is to have fun



You cannot write a computer program that looks at any other program and correctly determines whether it will eventually finish or run forever. This isn't a hardware limitation. Alan Turing proved in 1936 that no such program can logically exist, by showing any attempt creates a paradox, which is essentially a formal version of “this sentence is false.” What's unsettling is the proof technique: he didn't find a hard case. He proved the question destroys itself when you try to answer it.





Complexity visualiser

Nick Fuentes is DONE with the goyim superchatters "Can goyim be sent to d!e?"



