Manuel Odendahl

3.3K posts

Manuel Odendahl

@ProgramWithAi

Real-world programming with AI. No filler, just killer! github: wesen

Boston, MA Katılım Kasım 2023

523 Takip Edilen1.7K Takipçiler

Sabitlenmiş Tweet

Manuel Odendahl@ProgramWithAi·5 Haz

*taps the sign*

English

Manuel Odendahl retweetledi

Nanda@thepeekpoker·21h

Inspired by the Master Control Program (MCP) from TRON, I'm building ALDRED, which uses Semantic Inter-Process Communication (SIPC) to orchestrate external programs. Instead of relying on rigid API contracts, the MCP 'mutates' to interact with other programs, mimicking human-like behavior. When it engages with a program—such as a hotel booking —it requires no prior knowledge of the target's implementation; it simply negotiates until the task is complete (it is immune if the program changes). If data is missing, it dynamically prompts the user for the necessary information. While human-like communication may seem inefficient, it is the key to building truly autonomous, complex systems. Since the MCP itself is a program, you can create a powerful orchestrator. Check out this TRON-inspired demo! :D

English

189

Manuel Odendahl@ProgramWithAi·23h

@mitchellh github.com/go-go-golems/s… ! (pilfer it, wouldn't recommend using it per se, heavy coding rn). But it has proper OIDC for claude / chatgpt, and it exposes a JS sandbox MCP to do IMAP / sieve. SMTP coming I guess as well.

English

843

Mitchell Hashimoto@mitchellh·1d

I'm dying for pi-mono-style minimal library that handles the hard parts of email (auth, syncing with local state, etc.) and gives me an opinionated way to add agentic loops on top of that. I want to build my own agents and logic and guardrails, I don't trust vendors right now.

English

747

56.7K

Manuel Odendahl@ProgramWithAi·23h

cozodb has kind of a frustrating playground. so... oh but wait what about syntax highlighting? (still need to add it to the macos1 looking thing) oh but wait what about interactive docs?

English

118

Mike Hostetler // Chief Agent Officer@mikehostetler·1d

@ProgramWithAi Monads

English

Manuel Odendahl@ProgramWithAi·1d

@devgerred Yeah it’s like medium vs high is just immediately clockable, high xhigh I just… don’t feel any tangible difference (yet)

English

gerred@devgerred·1d

@ProgramWithAi need to eval more rigorously on high vs xhigh for some of my more frontier tasks and update my evals a bit to better represent those these days but I think they're close.

English

gerred@devgerred·1d

sorry to everyone but if you're doing critical work and not using opus or 5.3 codex / 5.4 with max test time compute idk what you're getting at. only people I know who have a shot are chasing ttc behind the scenes. token maxxing still wins for any real work that moves the needle

English

1.1K

Manuel Odendahl@ProgramWithAi·1d

@mikehostetler just a monoid in the category of endofunctors. I managed to twist my brain enough to "get it" for a while, but it's like CSS and it has just dissolved.

English

Mike Hostetler // Chief Agent Officer@mikehostetler·1d

Everything is computer Everything is tree Algebra bro

octo@the_octobro

Why do people say functional programming is hard? It's very simple: Everything is tree

English

704

Manuel Odendahl@ProgramWithAi·1d

that feeling when chatgpt pro just solves a problem you broke your brain on for years, and it's like "why the fuck didn't i think of that".

English

126

Manuel Odendahl@ProgramWithAi·1d

@ibuildthecloud this is the way

English

Darren Shepherd@ibuildthecloud·1d

codex just lectured me on why factories are better than classes in typescript. I don't agree, but I've learned to not fight AI.

English

1.9K

Manuel Odendahl@ProgramWithAi·1d

my two biggest weapons with LLM coding: - JS sandboxes - monads it's always monads bro

English

120

Manuel Odendahl@ProgramWithAi·1d

@mikehostetler monads bro. (i have to admit i didn't even make it that far in real category theory. I kinda get it in code tho)

English

Manuel Odendahl@ProgramWithAi·1d

@mikehostetler every thing is a thing that relates to other things. or "object" and "morphism".

English

Manuel Odendahl@ProgramWithAi·1d

I've been doing this with screenshotting the routes and then diffing visually (if the goal is really to preserve looks). DOM comparisons don't work well for visual stuff and both DOM and CSS just eat a lot of tokens. If there are differences, I render subparts of the DOM until I narrow it down to certain elements. If the diff is all over, I do actually send dom + CSS comparisons to a bigger model, ask for a set of hypothesis on what might be off, then send the two screenshots _and_ the diff to haiku and the questions to be answered (like: is the left pane closed?) to the smaller model. this allows me to leverage the big intelligence in a tight loop after that, where it still operates in text mode. This was around the 3.7 times, so things might have changed, but basically, DOM + CSS is hard, and I want to milk the context window for as long as possible without blasting it with visual content.

English

David Cramer@zeeg·1d

Has anyone done anything impressive to make it faster and cheaper to verify large code changes to JS UIs? Practical issue: we need to do some large refactors in Sentry’s frontend but traditional integration tests don’t work well here so our coverage is never ideal.

English

11.7K

Manuel Odendahl@ProgramWithAi·1d

@thorstenball @badlogicgames it's all just inference and interpreting the results as code. we haven't explored a single percent of what that means. and yet it's just "hey write code, i'll execute it for you".

English

142

Thorsten Ball@thorstenball·1d

@badlogicgames I know. Hence why we're building something different. Hence our "coding agent is dead" post. All the skills/commands/subagents imho are distractions. Foot pedals for assembly IDEs. I use none of that and expect to use less still in the future.

English

Thorsten Ball@thorstenball·1d

Lately, whenever I open this app and see the latest tricks, and hacks, and notes, and workflows, and spec here and skill there, I can't help but think: All of this will be washed away by the models. Every Markdown file that's precious to you right now will be gone.

English

805

100.5K

Manuel Odendahl@ProgramWithAi·1d

you know what's still fucking hard? bubbling config down to the actual functional code. jfc.

English

139

Manuel Odendahl@ProgramWithAi·1d

@thdxr i've been doing this for a year now or so, it works.

English

dax@thdxr·2d

we've been experimenting with getting rid of the bash tool agents can write js fine which can do what bash can (though some gaps with things like git) and is more cross platform and then could run that in this

Rivet@rivet_dev

Introducing the Secure Exec SDK Secure Node.js execution without a sandbox ⚡ 17.9 ms coldstart, 3.4 MB mem, 56x cheaper 📦 Just a library – supports Node.js, Bun, & browsers 🔐 Powered by the same tech as Cloudflare Workers $ 𝚗𝚙𝚖 𝚒𝚗𝚜𝚝𝚊𝚕𝚕 𝚜𝚎𝚌𝚞𝚛𝚎-𝚎𝚡𝚎𝚌

English

207.5K

Manuel Odendahl@ProgramWithAi·1d

little random side idea, playwright browser.

English

175

Manuel Odendahl@ProgramWithAi·1d

I might have been a bit heavy handed on the verbosity 😭😂

English

Manuel Odendahl@ProgramWithAi·1d

How I review whatever is going on

English

156

Manuel Odendahl@ProgramWithAi·1d

@Byte0fCode Good names make for happy humans and happy agents! And good agents make good names. 🤖

English

jake cukjati@Byte0fCode·1d

@ProgramWithAi Yeah, I usually work out the data schema and naming because I work in a web application. I have to remember what values are what you know. But it’s like can I just allow the agent to make its own field names entirely. 100% of the time.

English

Manuel Odendahl@ProgramWithAi·2d

I think a spec, especially the more technical it gets, very poorly encodes user intent. that programmers think a spec should encode the program's behavior is IMO one of the reason why non-tech people are such better vibecoders. compare: "make me an app to manage my recipes" vs "make a react app with a node.js backend that uses mongodb to store objects with the schema xyz and uses 4 REST routes that go to rtk-query, using tailwind css, so that we have a menu hamburger blablablabla" ... which prompt is going to lead to an app to manage recipes? which prompt will make it easier to iterate?

English

171

Keşfet

@mitchellh @devgerred @mikehostetler @ibuildthecloud @thorstenball @badlogicgames @elonmusk @BarackObama