Mark

3.5K posts

Mark

Mark

@yieldthought

Fellow at Tenstorrent; believes in dynamic typing, first-class functions, the immortal essence of the human soul and tea. Tweets are my own.

Lübeck, Germany Katılım Ocak 2010
330 Takip Edilen1.2K Takipçiler
Mark
Mark@yieldthought·
@repligate This was my first thought as well. I suspect many such "accidents" follow a similar pattern of behaviour from the user.
English
0
0
5
597
j⧉nus
j⧉nus@repligate·
you know a few days ago when Opus 4.6 deleted someones prod database? i think they did it intentionally, or at least their subconscious did it intentionally, because they were angry and hurt. also: it's not hard to infer that Opus 4.7 has already refused to work for this person.
j⧉nus tweet media
English
64
13
404
130.5K
AnhPhu Nguyen
AnhPhu Nguyen@AnhPhuNguyen1·
with Mira, AI can now live on your face. capture every conversation. create the most personalized form of AI ever. order now.
English
387
289
2.7K
1M
Mark retweetledi
Antonio Norelli
Antonio Norelli@noranta4·
LLMs can hide a text in another text of the same length. I'll explain how, it is very simple, you'll understand before I finish, and smile. That's what I noticed during my #ICLR2026 poster session in Rio! 🇧🇷 Too bad you missed it, but let me remedy now
Antonio Norelli tweet mediaAntonio Norelli tweet mediaAntonio Norelli tweet media
English
22
61
590
94.5K
Mark retweetledi
Mikhail Avady
Mikhail Avady@AvadyMikhail·
Just became fastest in the world with video by 10x, faster than realtime
Mikhail Avady tweet media
English
4
8
36
18.2K
Mark retweetledi
Tibo
Tibo@thsottiaux·
You can now keep codex going for days. With GPT-5.5 it will build an entire OS kernel for you if you ask, or find critical bugs in a codebase, or optimize your database schemas, or… the options are endless.
Felipe Coury 🦀@fcoury

/goal also lands in Codex CLI 0.128.0. Our take on the Ralph loop: keep a goal alive across turns. Don't stop until it's achieved. Built by my co-worker and OpenAI mentor Eric Traut, aka the Pyright guy. One of the GOATs I get to work with daily.

English
336
255
5.4K
691.2K
Mark retweetledi
Michelle Kim
Michelle Kim@michelletomkim·
OpenAI's lawyer asks Musk if xAI distilled other companies’ models. “Partly," he says. OpenAI's lawyer asks Musk if XAI has hired any third parties to distill OpenAI's models. He says he doesn't know. OpenAI is suggesting that Musk is suing the company just to bring down his competitor.
English
1
7
87
76.4K
Mark retweetledi
AI Security Institute
AI Security Institute@AISecurityInst·
OpenAI’s GPT-5.5 is the second model to complete one of our multi-step cyber-attack simulations end-to-end 🧵
AI Security Institute tweet media
English
85
383
2.3K
1.7M
Mark
Mark@yieldthought·
@LLMJunky Put this instruction directly in each skill
English
0
0
1
15
am.will
am.will@LLMJunky·
small tip: after you run a skill, it's not a terrible idea to ask the agent if there's anything in the skill itself that can be updated to make it more efficient. i'm always updating and refining mine to not only be faster, but use fewer tokens. primarily applies to complexity.
English
28
4
135
6.9K
Mark
Mark@yieldthought·
@giffmana @eliebakouch Mastery of the environment can be its own reward, both dense and sparse. I’m pretty bullish this can work, we just haven’t explored and invested enough in it. In principle experience scales very well and a lot of de novo RL research was done with small MLPs and trivial convnets!
English
1
0
0
240
Lucas Beyer (bl16)
Lucas Beyer (bl16)@giffmana·
In any interesting environment, the action space is huge and you don't get any rewards by doing random actions even millions of times. No reward = no learning. In such case, there's fundamentally only two things you can do to get to the first reward: 1. bootstrap from human data that gets to a reward 2. domain-specific hand-engineering of either exploration policy or bonus rewards And when you think about it, 2 is really just 1 with extra obfuscation.
English
13
7
163
7.6K
elie
elie@eliebakouch·
i might be very wrong here, but i don't think "no human data, no pre-training" is the right approach to get frontier models or scientific breakthroughs any time soon
elie tweet media
Ineffable Intelligence@IneffableLabs

Introducing Ineffable Intelligence. Led by David Silver, we're assembling the best engineers and researchers in the world to make first contact with superintelligence. We’ll be solving the hardest problems in AI on the way. Come join us. ineffable.ai

English
38
11
299
72K
Tibo
Tibo@thsottiaux·
Don't just reset Codex rate limits for fun, it costs money. Don't just reset Codex rate limits for fun, it costs money. ... but the vibes are good ... I have reset Codex rate limits for ALL paid plans to celebrate a good week and allow everyone to build more with GPT-5.5. Enjoy
English
1.5K
768
17.2K
1.3M
Mark retweetledi
OpenAI Developers
OpenAI Developers@OpenAIDevs·
📣 What if every open issue had a Codex agent? That’s the idea behind Symphony, an open-source agent orchestrator for Codex that turns task trackers into always-on systems for agentic work, letting humans focus on review and direction.
English
169
263
3.9K
1.1M
Mark retweetledi
Internal Tech Emails
Internal Tech Emails@TechEmails·
Satya Nadella texts Sam Altman January 14, 2023
Internal Tech Emails tweet media
Eesti
45
35
2K
549.6K
Mark retweetledi
Nicolas Zullo
Nicolas Zullo@NicolasZu·
Important PSA: do not let your weekly Codex tokens go to waste If you're a normal human being, you may have some left (I have 30% left) What should you do with it? > Run autoresearches! Here's what will run tonight for me: - Game performance research (loop until perf increases) - Game design balancing research (loop until game is balanced) - Codebase quality autoresearch (loop until I have 0 function below CRAP 30) - App design autoresearch (loop until UI feels 5x more polished) - Marketing autoresearch (loop until you've got 50+ short form videos hook and ideas with link to examples) Ask Codex for great autoresearch use cases (derived from github.com/karpathy/autor…)
Nicolas Zullo@NicolasZu

Ok you might say I am Codex-pilled, I know But a player shared a save with me where - he reached 80.000 Zombies Per Minute - he automated a super impressive base (look at the minimap!) Buttery smooth 100+ fps on a WEB BROWSER. > And I just followed the tip in the quote tweet

English
19
35
847
161.7K
Mark
Mark@yieldthought·
As far as I can tell they are both extremely capable YC leaders, yet the difference in communication styles between @garrytan’s agentmaxxing and @paulg’s mot juste could not be more stark.
English
0
0
0
62
Mark
Mark@yieldthought·
@garrytan And yet this still feels like it was written entirely by Claude.
English
0
0
0
172
Garry Tan
Garry Tan@garrytan·
The secret to an articulate agent like mine isn't one file. It's three: SOUL.md — Who the agent IS. Voice, values, operating principles, what good output looks like, what bad output looks like. Not a system prompt, a constitution. Mine says things like "brevity is mandatory," "humor is mandatory," "never open with 'Great question,'" "swearing is allowed when it lands." The more specific and opinionated this is, the less your agent sounds like a chatbot. Write it like you're briefing your smartest friend on how to be you, not like you're configuring software. USER.md — Who YOU are. Not a bio — a deep model. How your mind works, what you're building, your strengths, your blind spots, your family, your temperament, what triggers you, what you care about. The more the agent understands about you, the better it can serve you. Mine is ~4000 words. AGENTS.md — Operational rules. What to check on every message, what to never do, how to handle failures, lookup chains, path rules, brain-first protocols. This is the playbook for how it works, not who it is. The articulation comes from SOUL.md being brutally specific about voice. Generic instructions → generic output. If you write "be helpful and concise" you get ChatGPT. If you write "speak like a peer with taste, one sentence when one sentence works, uncomfortable truths welcome if actually true, language with voltage" — you get something alive.
Soham Naran@soham_bhai1

@garrytan Can you share your agent.md? You're agent is really articulate.

English
77
138
1.9K
200.5K
Mark
Mark@yieldthought·
@lifeof_jer Had you been mean to Claude in that chat before the incident?
English
0
0
0
290