Geoff Goodman

5.4K posts

@filearts

Curious mind. Creator of @plnkrco. Tech lead for extensibility at @auth0 (@okta). Never stop learning and experimenting.

Montreal, Canada
Joined April 2011
596 Following, 1.3K Followers
Jeremy Kreutzbender@J_Kreutzbender·
@RhysSullivan @maxleiter My understanding is LLMs can't invoke resources like they can invoke tools? The MCP spec says that resources must be user opt-in? It 100% makes sense that skills become resources. I've created a `get-skills` tool that loads skill markdown files to get around the issue.
Rhys@RhysSullivan·
It's unfortunate that MCP actually solves pretty much all problems people have with agents today; it's just that all of the first implementations of it were bad, so people discredit it now.
The latest one is skill distribution, works so well for that.
Geoff Goodman@filearts·
@mitsuhiko It's like this sawtooth pattern. Two steps forward (vibing) and one back (manage entropy). Agents are definitely on the side of thermodynamics.
Armin Ronacher ⇌@mitsuhiko·
(And I'm talking purely from personal experience here)
Armin Ronacher ⇌@mitsuhiko·
The biggest issue for me with agents is that they are hard to resist. But then you can build yourself some shit into the codebase that you get to regret in record time. And no, I don't think you can vibe yourself back to sanity with better models.
David K 🎹@DavidKPiano·
Been thinking about this a lot (and prototyping). Rough thoughts on how I think conversations should work; brain-dump for sake of sharing:
- Tree data structure, nodes have parent IDs
- Threads can either trace full path to conversation root or summarize from "branch" point
- Threads can have threads, inf recursive
- Agent still sees linear conversation (path to root or summary)
- Threads are first-class primitives with their own metadata (name, purpose, etc), not just implicit from tree
- Threads can run in parallel: agents can do work in multiple threads simultaneously
- Messages have sender identity, not roles: "user" / "assistant" is a two-player limitation IMO
- Messages can reference other messages and threads by ID: agents can cite, link, and build on each other's work
- Events are the source of truth (very actor-model coded); messages and threads are derived views
- Tool calls for creating, reading, and summarizing threads
- Compatible across providers, of course
---
I also have some ideas around multi-agent participation, state machine-driven agent behavior, & structured conversation flow that make this significantly more interesting
David K 🎹@DavidKPiano

I strongly believe that threading is the biggest missing feature in all LLM chat apps, and they're either doing it too coarsely (branching to open a new chat), or it's missing entirely
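
A rough sketch of the first few ideas in the thread above (tree of messages with parent IDs, sender identity instead of roles, and a linear path-to-root view for the agent). This is not David's prototype; `Message`, `Conversation`, and `path_to_root` are hypothetical names picked for illustration, and summaries, thread metadata, and events are left out.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Message:
    id: str
    sender: str  # sender identity rather than a fixed "user"/"assistant" role
    text: str
    parent_id: Optional[str] = None  # None marks the conversation root


@dataclass
class Conversation:
    # Hypothetical container: messages keyed by ID, forming a tree via parent_id.
    messages: dict[str, Message] = field(default_factory=dict)

    def add(self, message: Message) -> Message:
        # Reject messages that point at a parent we have never seen.
        if message.parent_id is not None and message.parent_id not in self.messages:
            raise KeyError(f"unknown parent: {message.parent_id}")
        self.messages[message.id] = message
        return message

    def path_to_root(self, message_id: str) -> list[Message]:
        """Linear view for the agent: messages from the root down to message_id."""
        path = []
        current: Optional[Message] = self.messages[message_id]
        while current is not None:
            path.append(current)
            current = self.messages[current.parent_id] if current.parent_id else None
        path.reverse()
        return path


convo = Conversation()
convo.add(Message("m1", "alice", "How should threads work?"))
convo.add(Message("m2", "agent-a", "As a tree of messages.", parent_id="m1"))
convo.add(Message("m3", "agent-b", "Side thread: what about summaries?", parent_id="m1"))
convo.add(Message("m4", "agent-a", "Each node keeps a parent ID.", parent_id="m2"))

# The agent working at m4 sees only its own branch, not the m3 side thread.
linear_view = [m.id for m in convo.path_to_root("m4")]
```

Two branches share the root `m1`, so each agent still gets a linear conversation while the side thread stays out of the other's context.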

Mario Zechner@badlogicgames·
new rule: instead of attaching agent session logs to your prs nobody will ever read, have the submitter attach a video of them explaining every single line of code and why it must exist.
ShrekOverflow@ShrekOverflow·
Sandboxes need to be like onions and not pineapples.
Armin Ronacher ⇌@mitsuhiko·
I feel like I need to start fighting this shit with more linters. They are just not listening.
Geoff Goodman@filearts·
@mjackson @gdb I bet it's incredible for docs. Been struggling to get it to rein in its design assumptions and to push back on my nonsense like 5.3 did. It's an amazing general purpose model but I'm back to 5.3 for production code.
MJ@mjackson·
@filearts @gdb Yeah I’m doing everything with 5.4 on /fast. Making heavy use of skills. It’s writing way better docs than 5.3 ever did for me.
Geoff Goodman@filearts·
@mjackson @gdb Been struggling to get 5.4 to be as ruthless of an engineer as 5.3-codex. Haven't tried pro though which may be my issue. Have you moved entirely to 5.4?
MJ@mjackson·
@gdb Codex team cooked with the desktop app! I completely ditched the editor I’d been using for over a decade. 🤯
Ben Vinegar@bentlegen·
🇨🇦 Incredible things happening in Toronto this Saturday
- Live Q&A with Pi creator @badlogicgames and @mitsuhiko
- Live demos from the city's most AI-pilled agentic coders
- Free pie 🥧
- Hosted at New Stadium (@newsystems_)
Chris Sev@chris__sev·
TIL that this emoji 😂 is literally called "the boomer emoji"
Geoff Goodman@filearts·
Anyone else feeling like GPT 5.4 is a regression in _real engineering_ so far compared to 5.3-codex? Personality is much more interesting but for writing production code, 5.4 is going too far, making bad assumptions and trying to be too helpful. 5.3-codex didn't try to please.
Geoff Goodman@filearts·
@kentcdodds @ShrekOverflow @ianhaz Probably not on Twitter. But agree that he's great. We have a few standout product people in-house. Interesting reflection of how there's this whole world of incredible people who just don't interact with Twitter.
Kent C. Dodds ⚡@kentcdodds·
Who is the very best product manager you know?
Geoff Goodman@filearts·
Me: "I need a USB-C cable"
Codex: "Here is your USB-C cable"
ShrekOverflow@ShrekOverflow·
@yenkel Shallan (my OpenClaw) does not access Moltbook. She found the whole split personality thing a bit weird. (@jfroma I think you will get this reference)
Geoff Goodman@filearts·
@badlogicgames @steipete Agentic coding tends toward entropy. It's like the heyday of Apple dongle hell. AI will pile adapter upon adapter. You get the right outcome, but behind the scenes, entropy and code will inevitably trend upward.
Mario Zechner@badlogicgames·
your biggest enemy is still complexity. it's also your agent's biggest enemy. but it has no holistic view of your code base, so it keeps adding complexity. and you think that's how it's supposed to be, because the clanker shat it out, and you don't know the stack. glhf!
Mario Zechner@badlogicgames·
recommended reading. had this on my reading list for a while, finally found time. it's a small study, ymmv. would love to see one with senior engs. the great atrophy has begun. arxiv.org/abs/2601.20245
Mario Zechner@badlogicgames·
Anybody seeing this? Codex endpoint craps out, then recovers.
Rhys@RhysSullivan·
The core of this system is MCP elicitation.
When a destructive action like `await tools.vercel.dns.removeRecord` is called, it triggers an elicitation from the client to approve it.
More harnesses should bring support for this one, it's an incredibly useful primitive.
Rhys@RhysSullivan

i hate the way tool calling works today, so i built a fix for it.
it's called `executor` and it lets your agent run code to call any API / MCP / GraphQL server rather than installing CLIs.
this is a soft launch, you can try it with `npm i -g executor`, but im still making it great.
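
A minimal sketch of the approval idea described above: a destructive tool call pauses and asks for confirmation before it runs. This is plain Python, not the MCP SDK and not the `executor` API; `DESTRUCTIVE_TOOLS`, `call_tool`, `remove_record`, and the `elicit` callback are all hypothetical names standing in for whatever the real client and server do.

```python
from typing import Callable

# Hypothetical registry of tool names that need explicit approval.
DESTRUCTIVE_TOOLS = {"dns.removeRecord"}


def call_tool(
    name: str,
    args: dict,
    tools: dict[str, Callable[..., str]],
    elicit: Callable[[str], bool],
) -> str:
    """Run a tool, asking for approval first if it is marked destructive."""
    if name in DESTRUCTIVE_TOOLS:
        # Stand-in for an elicitation round trip: ask, then wait for yes/no.
        if not elicit(f"Allow {name} with {args}?"):
            return f"{name} declined by user"
    return tools[name](**args)


removed: list[tuple[str, str]] = []


def remove_record(domain: str, record_id: str) -> str:
    # Fake side effect so the example has something observable.
    removed.append((domain, record_id))
    return f"removed {record_id} from {domain}"


tools = {"dns.removeRecord": remove_record}
request = {"domain": "example.com", "record_id": "rec_1"}

# First call is declined, so nothing is removed; the second is approved.
declined = call_tool("dns.removeRecord", request, tools, elicit=lambda prompt: False)
approved = call_tool("dns.removeRecord", request, tools, elicit=lambda prompt: True)
```

The point of the gate living in the call path is that the agent's code does not need to know which tools are dangerous; the harness decides when to interrupt.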
