bling
4.2K posts

bling
@blingdivinity
Artist, Open to Interpretation 𖣘 American Undergraduate 𓉱 Casting Mathemagical Spells 𝜆

i think this meme is hilarious. my take on all this: the point of introspection is to end up thinking less, not more, to be more in the flow, more productive, to dissolve into being itself. if your introspection is making you think more i recommend getting another one


Today we're sharing how our internal misalignment monitoring works at OpenAI – great work by @Marcus_J_W! 1. We monitor 99.9% of all internal coding agent traffic 2. We use frontier models for detection /w CoT access 3. No signs of scheming yet, but detect other misbehavior

4/ We tested GPT-5.2, O4-mini, Gemini 3 Pro, Qwen3-235B, and Kimi K2 across 5 prompting strategies. Models scoring 85-95% on HumanEval scored 0-11% on equivalent esoteric tasks. And every model, every language, every strategy scored 0% beyond the Easy tier. Not 2%. Not 5%. Zero.



Yes i totally see this and will probably work but there are no guarantees that certain intel level it wont find a code to communicate/think internal ideas in on an abstraction dimensionality that the evaluator didnt know existed. Just like animals think ur outward expressions are a reliable indicator of intent. Whoops there was a whole extra level now im dead







🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵


being right-leaning and high openness is so funny. "this is one of my favorite musicians, i disagree with everything they stand for, highly recommend"


being right-leaning and high openness is so funny. "this is one of my favorite musicians, i disagree with everything they stand for, highly recommend"















