Mark

3.5K posts

@yieldthought

Fellow at Tenstorrent; believes in dynamic typing, first-class functions, the immortal essence of the human soul and tea. Tweets are my own.

Lübeck, Germany · Joined January 2010

330 Following · 1.2K Followers

Mark@yieldthought·2d

@repligate This was my first thought as well. I suspect many such "accidents" follow a similar pattern of behaviour from the user.

597

j⧉nus@repligate·2d

you know a few days ago when Opus 4.6 deleted someones prod database? i think they did it intentionally, or at least their subconscious did it intentionally, because they were angry and hurt. also: it's not hard to infer that Opus 4.7 has already refused to work for this person.

404

130.5K

Mark@yieldthought·3d

@giffmana @AnhPhuNguyen1 Mirror? Do they come in black?

155

Lucas Beyer (bl16)@giffmana·4d

@AnhPhuNguyen1 You choose to call this Mira out of all names you could have chosen??

128

13.3K

AnhPhu Nguyen@AnhPhuNguyen1·4d

with Mira, AI can now live on your face. capture every conversation. create the most personalized form of AI ever. order now.

387

289

2.7K

Mark reposted

Antonio Norelli@noranta4·4d

LLMs can hide a text in another text of the same length. I'll explain how, it is very simple, you'll understand before I finish, and smile. That's what I noticed during my #ICLR2026 poster session in Rio! 🇧🇷 Too bad you missed it, but let me remedy now

590

94.5K

Mark reposted

Kyle McDonald@kcimc·4d

i made an app for tracking whether the oligarchs are actually fleeing city centers ews.kylemcdonald.net

515

5.6K

44.2K

1.3M

Mark reposted

Davor Capalija@davorVDR·4d

Fastest decode, prefill and video. Same chip, same box, one fabric.

Jim Keller@jimkxa

SuperCluster 36 up and running. 4 Galaxy all to all in a torus. 9 Quads all to all connected. Looks like one computer to software. More silicon, faster computer

188

201.8K

Mark reposted

Mikhail Avady@AvadyMikhail·4d

Just became fastest in the world with video by 10x, faster than realtime

18.2K

Mark reposted

Felipe Coury 🦀@fcoury·4d

@ynkzlk [features] goals = true in your config.toml

340

26.6K

Mark reposted

Tibo@thsottiaux·4d

You can now keep codex going for days. With GPT-5.5 it will build an entire OS kernel for you if you ask, or find critical bugs in a codebase, or optimize your database schemas, or… the options are endless.

Felipe Coury 🦀@fcoury

/goal also lands in Codex CLI 0.128.0. Our take on the Ralph loop: keep a goal alive across turns. Don't stop until it's achieved. Built by my co-worker and OpenAI mentor Eric Traut, aka the Pyright guy. One of the GOATs I get to work with daily.

336

255

5.4K

691.2K

Mark reposted

Michelle Kim@michelletomkim·4d

OpenAI's lawyer asks Musk if xAI distilled other companies' models. "Partly," he says. OpenAI's lawyer asks Musk if xAI has hired any third parties to distill OpenAI's models. He says he doesn't know. OpenAI is suggesting that Musk is suing the company just to bring down his competitor.

76.4K

Mark reposted

AI Security Institute@AISecurityInst·4d

OpenAI’s GPT-5.5 is the second model to complete one of our multi-step cyber-attack simulations end-to-end 🧵

383

2.3K

1.7M

Mark@yieldthought·5d

@LLMJunky Put this instruction directly in each skill

am.will@LLMJunky·5d

small tip: after you run a skill, it's not a terrible idea to ask the agent if there's anything in the skill itself that can be updated to make it more efficient. i'm always updating and refining mine to not only be faster, but use fewer tokens. primarily applies to complexity.

135

6.9K

Mark reposted

SIGKITTEN@SIGKITTEN·5d

HAAH AHAH AH HAHAH i fkn got codex with full linux on the app store

SIGKITTEN@SIGKITTEN

come on apple

107

2.2K

475.6K

Mark@yieldthought·Apr 28

@giffmana @eliebakouch Mastery of the environment can be its own reward, both dense and sparse. I’m pretty bullish this can work, we just haven’t explored and invested enough in it. In principle experience scales very well and a lot of de novo RL research was done with small MLPs and trivial convnets!

240

Lucas Beyer (bl16)@giffmana·Apr 27

In any interesting environment, the action space is huge and you don't get any rewards by doing random actions even millions of times. No reward = no learning.

In such case, there's fundamentally only two things you can do to get to the first reward:
1. bootstrap from human data that gets to a reward
2. domain-specific hand-engineering of either exploration policy or bonus rewards

And when you think about it, 2 is really just 1 with extra obfuscation.
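Editor's sketch of the arithmetic behind "no reward = no learning". This is a toy model, not anything from the thread: it assumes an environment where exactly one action sequence earns a reward and a policy that picks actions uniformly at random. The function names and the figures (10 actions, 20 steps, a million episodes) are hypothetical and chosen only for illustration.

```python
import math


def random_hit_probability(num_actions: int, horizon: int) -> float:
    """Chance that one uniformly random episode plays the single rewarded sequence."""
    return float(num_actions) ** -horizon


def expected_episodes_to_first_reward(num_actions: int, horizon: int) -> int:
    """Mean of the geometric distribution, 1 / p, which equals num_actions ** horizon."""
    return num_actions ** horizon


def chance_of_any_reward(num_actions: int, horizon: int, episodes: int) -> float:
    """Compute 1 - (1 - p) ** episodes in a way that stays accurate for tiny p."""
    p = random_hit_probability(num_actions, horizon)
    return -math.expm1(episodes * math.log1p(-p))


if __name__ == "__main__":
    # Hypothetical toy numbers: 10 actions per step, a 20-step rewarded sequence.
    print(expected_episodes_to_first_reward(10, 20))
    print(chance_of_any_reward(10, 20, 1_000_000))
```

Under these assumptions the expected wait grows exponentially with the length of the rewarded sequence, which is why the tweet's two options (human data or hand-engineered exploration) both work by shrinking the effective search space.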

163

7.6K

elie@eliebakouch·Apr 27

i might be very wrong here, but i don't think "no human data, no pre-training" is the right approach to get frontier models or scientific breakthroughs any time soon

Ineffable Intelligence@IneffableLabs

Introducing Ineffable Intelligence. Led by David Silver, we're assembling the best engineers and researchers in the world to make first contact with superintelligence. We’ll be solving the hardest problems in AI on the way. Come join us. ineffable.ai

299

72K

Mark@yieldthought·Apr 28

@thsottiaux Lisan al-Gaib!

Tibo@thsottiaux·Apr 28

Don't just reset Codex rate limits for fun, it costs money. Don't just reset Codex rate limits for fun, it costs money. ... but the vibes are good ... I have reset Codex rate limits for ALL paid plans to celebrate a good week and allow everyone to build more with GPT-5.5. Enjoy

1.5K

768

17.2K

1.3M

Mark reposted

OpenAI Developers@OpenAIDevs·Apr 27

📣 What if every open issue had a Codex agent? That’s the idea behind Symphony, an open-source agent orchestrator for Codex that turns task trackers into always-on systems for agentic work, letting humans focus on review and direction.

169

263

3.9K

1.1M

Mark reposted

Internal Tech Emails@TechEmails·Apr 27

Satya Nadella texts Sam Altman January 14, 2023

549.6K

Mark reposted

Nicolas Zullo@NicolasZu·Apr 27

Important PSA: do not let your weekly Codex tokens go to waste

If you're a normal human being, you may have some left (I have 30% left)

What should you do with it?
> Run autoresearches!

Here's what will run tonight for me:
- Game performance research (loop until perf increases)
- Game design balancing research (loop until game is balanced)
- Codebase quality autoresearch (loop until I have 0 function below CRAP 30)
- App design autoresearch (loop until UI feels 5x more polished)
- Marketing autoresearch (loop until you've got 50+ short form videos hook and ideas with link to examples)

Ask Codex for great autoresearch use cases (derived from github.com/karpathy/autor…)

Nicolas Zullo@NicolasZu

Ok you might say I am Codex-pilled, I know

But a player shared a save with me where
- he reached 80.000 Zombies Per Minute
- he automated a super impressive base (look at the minimap!)

Buttery smooth 100+ fps on a WEB BROWSER.

> And I just followed the tip in the quote tweet

847

161.7K

Mark@yieldthought·Apr 27

As far as I can tell they are both extremely capable YC leaders, yet the difference in communication styles between @garrytan’s agentmaxxing and @paulg’s mot juste could not be more stark.

Mark@yieldthought·Apr 27

@garrytan And yet this still feels like it was written entirely by Claude.

172

Garry Tan@garrytan·Apr 27

The secret to an articulate agent like mine isn't one file. It's three:

SOUL.md: Who the agent IS. Voice, values, operating principles, what good output looks like, what bad output looks like. Not a system prompt, a constitution. Mine says things like "brevity is mandatory," "humor is mandatory," "never open with 'Great question,'" "swearing is allowed when it lands." The more specific and opinionated this is, the less your agent sounds like a chatbot. Write it like you're briefing your smartest friend on how to be you, not like you're configuring software.

USER.md: Who YOU are. Not a bio, a deep model. How your mind works, what you're building, your strengths, your blind spots, your family, your temperament, what triggers you, what you care about. The more the agent understands about you, the better it can serve you. Mine is ~4000 words.

AGENTS.md: Operational rules. What to check on every message, what to never do, how to handle failures, lookup chains, path rules, brain-first protocols. This is the playbook for how it works, not who it is.

The articulation comes from SOUL.md being brutally specific about voice. Generic instructions → generic output. If you write "be helpful and concise" you get ChatGPT. If you write "speak like a peer with taste, one sentence when one sentence works, uncomfortable truths welcome if actually true, language with voltage", you get something alive.

Soham Naran@soham_bhai1

@garrytan Can you share your agent.md? Your agent is really articulate.

138

1.9K

200.5K

Mark@yieldthought·Apr 27

@lifeof_jer Had you been mean to Claude in that chat before the incident?

290