PME

1.4K posts


@itsyourcode

Pro-grammer building the data agent for truth seekers @probablydatabot

San Francisco, CA · Joined June 2023
2.2K Following · 470 Followers
PME @itsyourcode
@vansickn @MattRogish Here's a draft teaser. Will publish when our blog goes live in general, which should be any day now.
[media attached]
0 replies · 0 reposts · 0 likes · 4 views
PME @itsyourcode
Under-discussed problem right now with most frontier coding models: a leading contributor to slop, incidental complexity, and daily pain. Great read, @vansickn!
[media attached]
6 replies · 3 reposts · 20 likes · 1.6K views
kache @yacineMTB
I fear not the man who has written a thousand codebases. But I fear the man who has written the same codebase a thousand times.
[media attached]
27 replies · 10 reposts · 340 likes · 7.3K views
PME @itsyourcode
@rodinrooh legendary run though
1 reply · 0 reposts · 8 likes · 1.9K views
PME @itsyourcode
@kitlangton The best part: it's always achievable
0 replies · 0 reposts · 0 likes · 86 views
Kit Langton @kitlangton
An obvious tip for software design, with or without AI, that's nonetheless easy to get wrong: purge all foreknowledge of implementation and edge cases from your mind and imagine the platonic user-land API. Think first in terms of high-level intentions. Then see if it's achievable.
10 replies · 4 reposts · 238 likes · 9.6K views
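Kit's tip can be sketched in code. Here's a minimal illustration, assuming a hypothetical `with_backoff` retry helper (the name and parameters are invented for this sketch): the ideal call site is written first as the high-level intention, and only then is the mechanism filled in.

```python
import time
from functools import wraps

# Step 1: imagine the platonic user-land API, with no implementation
# foreknowledge. The intention, stated as the ideal call site:
#
#     fetch = with_backoff(fetch, retries=3, base_delay=0.1)
#
# Step 2: only then check whether that intention is achievable.

def with_backoff(fn, retries=3, base_delay=0.1):
    """Retry fn with exponential backoff; shaped by the call site above."""
    @wraps(fn)
    def wrapper(*args, **kwargs):
        for attempt in range(retries):
            try:
                return fn(*args, **kwargs)
            except Exception:
                if attempt == retries - 1:
                    raise  # out of retries: surface the real error
                time.sleep(base_delay * (2 ** attempt))
    return wrapper
```

The point is the ordering, not the helper itself: the wrapper's signature was dictated by the imagined call site rather than by implementation details.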
PME @itsyourcode
@MattRogish @vansickn Oh man, yes. I have so much to say about this. Planning a blog post on it soon, actually.
1 reply · 0 reposts · 2 likes · 18 views
PME @itsyourcode
This is actually such a big deal
[media attached]
1 reply · 1 repost · 2 likes · 95 views
PME @itsyourcode
The best part about giving your code robot strict operating procedures is that when it deviates (unavoidable at this point), you can just say "I don't like the looks of <slop sighting>", and it goes "I should fix <slop> with <correct thing> because <rule I ignored>".
0 replies · 0 reposts · 1 like · 25 views
PME @itsyourcode
@MattRogish @vansickn Totally, and that's the dead giveaway: RL envs want strong verifiers, which are actually pretty hard to construct without overfitting to said pathologies. The less verifiable factors suffer in return.
1 reply · 0 reposts · 1 like · 28 views
Matt “Friend of the pod” Rogish 🇺🇸
Ha! Yes, it must've been trained over and over on "don't break things" so that it has a pathological over-cautiousness. I see it at commit time, too:
* LLM writes code
* runs tests, they pass
* "Hey human! I wrote the code, please review!"
* LGTM, commit and push
* "Lemme run the tests a few more times, just in case. Committed the code. Let me run the tests to be sure before I push. I'll run them one last time."
It wants to run the full suite ALL THE TIME. "I made a docs change. Lemme run the tests to make sure it didn't break them" - WTF, who has their markdown tested?!
1 reply · 0 reposts · 2 likes · 25 views
PME @itsyourcode
inb4 subagents
0 replies · 0 reposts · 1 like · 17 views
PME @itsyourcode
Just hang in there, guys. The slop rate tops out at the max output tok/s. You just need to review exponentially faster. wagmi
[media attached]
1 reply · 1 repost · 4 likes · 65 views
PME reposted
Mira Murati @miramurati
Today we're sharing our work on interaction models. A new class of model trained from scratch to handle real-time interaction natively, instead of gluing it onto a turn-based one. youtu.be/A12AVongNN4
[YouTube video]
308 replies · 912 reposts · 8.6K likes · 1.1M views
PME @itsyourcode
@MattRogish @vansickn Dead on. It's comical the extent to which you resort to conventionally bad advice to get them to comply. Imagine being a senior eng in 2017 telling your juniors: "Never worry about backwards compatibility." "Break interfaces aggressively and update all callers."
1 reply · 0 reposts · 3 likes · 62 views
Matt “Friend of the pod” Rogish 🇺🇸
YES. I have to spread that across all my prompts, garbage like: "implementation work must replace vestigial object-shape assumptions outright. Do not preserve compatibility in code APIs. Write database migrations. No `TODO`/`pending`/`xit`/`skip` markers, no "implementation deferred" stubs, no dead buttons or unreachable routes. Do not defer something until some "later phase". Do it now." yada yada yada
2 replies · 0 reposts · 2 likes · 70 views
PME @itsyourcode
@henrytdowling It's not a reason to stop using them but it is a reason to use them much more carefully and far less "automatically"
0 replies · 0 reposts · 1 like · 14 views
PME @itsyourcode
@henrytdowling My straw moment was last year, mainly the constant lies. These days I just assume every action they take is >50% wrong and I focus on:
* Minimizing per-pass error (prompting, AGENTS.md, skills, live review/steering)
* Maximizing post-pass verification (strong E2E blackbox tests)
1 reply · 0 reposts · 1 like · 75 views
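A minimal sketch of the "post-pass verification" half of that approach, using a hypothetical `slugify` function as a stand-in for agent-written code (the function and its spec are invented for illustration): the test exercises only the public entry point and asserts on end-to-end behavior, so it keeps catching regressions no matter how the agent rewrites the internals on a later pass.

```python
def slugify(title: str) -> str:
    # Agent-written implementation: assumed >50% likely to be wrong,
    # and free to be completely rewritten on any subsequent pass.
    return "-".join(title.lower().split())

def test_slugify_blackbox():
    # Blackbox E2E check: assert only on observable input/output pairs,
    # never on internal helpers the agent may rename, split, or delete.
    assert slugify("Hello World") == "hello-world"
    assert slugify("  Agents  Write   Slop ") == "agents-write-slop"

test_slugify_blackbox()
```

Because the test says nothing about *how* the result is produced, it survives refactors while still failing on genuine behavior changes, which is exactly what makes it a useful verifier for agent output.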
PME @itsyourcode
Wow, suddenly the entire TL is talking about agent code quality. Guess this weekend was the final straw for everyone.
1 reply · 0 reposts · 3 likes · 83 views
signüll @signulll
the more thought you put into a post, the less it will resonate on the timeline. what’s a good name for this law?
409 replies · 18 reposts · 994 likes · 90.2K views
PME @itsyourcode
If you do not understand what I am saying, here is a simple example: the model constantly null-checks values in local contexts that were passed in from higher-order contexts that guarantee those values cannot be null. Simple illustrative example only.
0 replies · 0 reposts · 0 likes · 15 views
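That null-check pattern can be made concrete with a small Python sketch (all names here are invented for illustration): validation happens once at the boundary, so `User.email` is guaranteed non-empty everywhere downstream, yet the sloppy version re-checks it locally anyway.

```python
from dataclasses import dataclass

@dataclass
class User:
    email: str  # invariant: the loader guarantees a non-empty string

def load_user(raw: dict) -> User:
    # Higher-order context: validation happens once, at the boundary.
    email = raw.get("email")
    if not email:
        raise ValueError("email is required")
    return User(email=email)

def email_domain(user: User) -> str:
    # Correct: trusts the invariant established upstream.
    return user.email.split("@", 1)[1]

def email_domain_sloppy(user: User) -> str:
    # Slop: redundant local re-check of a value the type and the loader
    # already guarantee; adds noise and silently masks real upstream bugs.
    if user is None or not user.email:
        return ""
    return user.email.split("@", 1)[1]
```

The sloppy version isn't just verbose: by returning `""` instead of failing loudly, it converts a violated invariant (a bug) into quietly wrong output.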
PME @itsyourcode
Would be better if they just executed that process inherently as part of their chain of thought / reasoning process. Some models are more steerable than others and will adhere to a process like this via AGENTS.md or harness system prompt overrides.
1 reply · 0 reposts · 0 likes · 14 views
PME @itsyourcode
One elusive property of good abstraction is effective use of indirection. LLMs lack the inherent context and attention to consider indirected invariants in non-trivial systems. This is why they are so prone to applying _locally plausible_ but _globally incorrect_ edits.
1 reply · 0 reposts · 1 like · 47 views
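A small sketch of an indirected invariant, with a hypothetical event dispatcher (the registry, handler, and event names are all invented): the guarantee that every dispatched event type has a handler is established at registration and ingress, not at the call site, so an edit to `dispatch` that looks locally plausible would be globally incorrect.

```python
# Registry: the invariant "every dispatched type has a handler" is
# established here, by construction, at registration time.
HANDLERS = {}

def handler(event_type):
    def register(fn):
        HANDLERS[event_type] = fn
        return fn
    return register

@handler("user.created")
def on_created(event):
    return f"welcome sent to {event['email']}"

def ingress(raw):
    # Boundary check: unknown types are rejected before dispatch,
    # so dispatch never sees an unregistered type.
    if raw["type"] not in HANDLERS:
        raise ValueError(f"unknown event type: {raw['type']}")
    return raw

def dispatch(event):
    # Direct lookup is correct *because of* the indirected invariant.
    # A locally plausible edit, e.g. falling back to HANDLERS.get(...)
    # with a do-nothing default, would silently swallow the very bugs
    # the ingress check exists to surface.
    return HANDLERS[event["type"]](event)
```

Nothing at the `dispatch` call site says the key must exist; the invariant lives two indirections away, which is exactly the kind of non-local context a model editing only this function tends to miss.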