Ben Davis

2.6K posts

Ben Davis banner
Ben Davis

Ben Davis

@davis7

managing @theo's yt channel and building whatever I'm currently nerd sniped by

San Francisco Katılım Temmuz 2022
405 Takip Edilen8K Takipçiler
Ben Davis
Ben Davis@davis7·
@bayes yea once it's live. It's basically done I just haven't done enough testing to be comfortable yolo pushing it lol
English
0
0
1
4
bayes
bayes@bayes·
@davis7 will you share the svelte rewrite of lawn?
English
1
0
0
13
Ben Davis retweetledi
Micky
Micky@Rasmic·
guys im a thumbnail... what a time to be alive... go watch @davis7's video
Micky tweet media
English
7
2
92
4.2K
Peter Edmonds
Peter Edmonds@peteredm1·
@davis7 and also, this used opencode sdk last i checked - what were the reasons for switching?
English
1
0
0
97
Ben Davis
Ben Davis@davis7·
Working on a much more powerful version of btca web - more agent freedom - sandbox per thread instead of per user - web search tool - powered by pi agent sdk - model picker (default is 5.4 mini) Current version isn’t great. Can’t wait to get this out its so much better
Ben Davis tweet media
English
3
1
48
2.8K
Ben Davis
Ben Davis@davis7·
@peteredm1 daytona, it's actually not that bad since they spin up and down fast and I don't leave them on
English
0
0
0
52
Peter Edmonds
Peter Edmonds@peteredm1·
@davis7 what's your sandbox provider? 1 per thread would blow through limits so fast!
English
1
0
0
85
Ben Davis
Ben Davis@davis7·
Haiku 4.5 has finally been dethroned by GPT 5.4 mini: - faster - cheaper - smarter - better at tool calling Gemini 3.0 flash still has a place in “turn this video/audio/blob of text into json”, but for any small model task that requires an agent loop (sub agents, search, quick fixes, browser use, etc.) GPT-5.4 mini is the new king
OpenAI Developers@OpenAIDevs

We’re introducing GPT-5.4 mini and nano, our most capable small models yet. GPT-5.4 mini is more than 2x faster than GPT-5 mini. Optimized for coding, computer use, multimodal understanding, and subagents. For lighter-weight tasks, GPT-5.4 nano is our smallest and cheapest version of GPT-5.4. openai.com/index/introduc…

English
29
19
400
65.9K
Gonzalo
Gonzalo@zxnelli·
@davis7 these A/B title-thumbnail tests are getting really abstract
Gonzalo tweet media
English
1
0
1
23
Micky
Micky@Rasmic·
man im really fighting to not record a whole video on svelte
English
16
0
54
3.2K
Ben Davis
Ben Davis@davis7·
@uwukko the worst part for me is it makes the pr and issues tab so low signal I don't even bother checking them which then makes the problem worse 🙃
English
1
1
48
1.8K
wukko
wukko@uwukko·
we now get at least one ai slop PR per day, this is an open source pandemic
wukko@uwukko

@aritmiabattito the amount of slop prs is tolerable for now, so not yet at least

English
12
11
564
20.8K
Ben Davis
Ben Davis@davis7·
Ad spot is @agentuity, their platform's gotten really good. If ur building basically anything that uses an agent it's worth checking out
English
0
0
3
885
Ben Davis
Ben Davis@davis7·
Turned this into a vid:
Ben Davis@davis7

The current "MCP is dead discourse" is my current favorite example of the really annoying "simplicity" brainrot that's been plaguing tech for a while On paper "just use a cli agents already know bash" sounds really good. It makes sense and is true, but it misses the bigger picture: > how do they know about the commands + their shape? > just put it in your agents md file or make a skill > ok, so then what happens if the cli's shape changes > well then u just update the markdown file > fine let's pretend people will actually do that (they will not) and this actually works, how are you going to scope authentication and authorization? > use the cli auth tools like what AWS has > ok so we need eng's or agents to be manually making sure every project switches the auth to the correct scopes for each project so that prod can't get vibe killed and then to connect to our internal services over cli in cloud agents we need to run basic agents in sandboxes now b/c we're not using mcp and... You get the point. It sounds really simple, and it is in the toy case, but in the real world an external API gated through MCP is actually way simpler, more secure, and manageable in a lot of cases. This post & article from @GergelyOrosz is a very good example: x.com/GergelyOrosz/s… It's the same thing with the $5 VPS or htmx or postgres or whatever other "simple" enlightened solution that works great in dumb indie hacker demos while making zero sense in the real world. I hate to break it to you guys, but there's no conspiracy. If modern tech solutions really were over complex slop do u really think these companies wouldn't take the free win to just do it the "simple" way? There are tons of problems with the tech, but it exists for a reason. I like CLIs a lot, and in a lot of cases skills make sense, clis make sense, etc. There are a lot of ways to do things, and they all solve different problems. But no, MCP is not dead. It or something like it isn't going anywhere. And there are a lot of good new ideas of how to make it better! Dumping an MCP with 60+ random tools into context sucks I completely agree and needs to be fixed. A lot of the code run solutions are very compelling like @RhysSullivan 's executor, cloudflare's code mode, and others. There are ways to fix this, but a "god mode bash tool" isn't it.

English
4
1
76
11K
Ben Davis
Ben Davis@davis7·
Ik the help command is there and works fine, that's not the real problem. - u still have to tell the agent about the cli in a skill or agent md or prompt - u still have to handle auth somehow (probably authing the cli, but then u have to have a special auth setup for the agent's cli env b/c u do not want it running root commands on ur aws instance) - the agent then spends extra turns discovering what does and doesn't work in the cli It's solvable, but the point is it's just not that simple and for a lot of use cases, having a "function call" the agent can do that runs remotely with proper auth and shaping is really convenient. I think my point is really that both are useful, just in different places. (for example "gh" or the skills sh cli should not be mcps that's dumb, but managing ur db probably should be) See this from newsletter.pragmaticengineer.com/p/how-uber-use…
Ben Davis tweet media
Anthony Shew@anthonysheww

Any CLI worth its salt has a help flag that tells you what commands are available, and what args they accept. Are you and @RhysSullivan seeing these not be helpful enough? I see it answer “what’s the shape” all the time in my chats. (No, this doesn’t answer to anything and everything about potential CLI shortcomings, but I’ve been surprised to see both of you say this when the answer seems to be sitting in plain sight.)

English
4
0
25
4.4K
Ben Davis
Ben Davis@davis7·
@anthonysheww @RhysSullivan Well yea u have to tell it and u easily can lol. Agents md, skills, or prompts all work fine for that, it's just like Rhys said "lets say you have posthog, axiom, vercel, cloudflare, convex" do u really want to have all of those always in context?
English
1
0
0
282
Anthony Shew
Anthony Shew@anthonysheww·
@RhysSullivan @davis7 Sure, all that’s fine to question. I’ve just seen you both say “BUT HOW CAN WE KNOW WHATS IN IT THO” and been confused both times haha.
English
2
0
4
830
Ben Davis
Ben Davis@davis7·
The current "MCP is dead discourse" is my current favorite example of the really annoying "simplicity" brainrot that's been plaguing tech for a while On paper "just use a cli agents already know bash" sounds really good. It makes sense and is true, but it misses the bigger picture: > how do they know about the commands + their shape? > just put it in your agents md file or make a skill > ok, so then what happens if the cli's shape changes > well then u just update the markdown file > fine let's pretend people will actually do that (they will not) and this actually works, how are you going to scope authentication and authorization? > use the cli auth tools like what AWS has > ok so we need eng's or agents to be manually making sure every project switches the auth to the correct scopes for each project so that prod can't get vibe killed and then to connect to our internal services over cli in cloud agents we need to run basic agents in sandboxes now b/c we're not using mcp and... You get the point. It sounds really simple, and it is in the toy case, but in the real world an external API gated through MCP is actually way simpler, more secure, and manageable in a lot of cases. This post & article from @GergelyOrosz is a very good example: x.com/GergelyOrosz/s… It's the same thing with the $5 VPS or htmx or postgres or whatever other "simple" enlightened solution that works great in dumb indie hacker demos while making zero sense in the real world. I hate to break it to you guys, but there's no conspiracy. If modern tech solutions really were over complex slop do u really think these companies wouldn't take the free win to just do it the "simple" way? There are tons of problems with the tech, but it exists for a reason. I like CLIs a lot, and in a lot of cases skills make sense, clis make sense, etc. There are a lot of ways to do things, and they all solve different problems. But no, MCP is not dead. It or something like it isn't going anywhere. And there are a lot of good new ideas of how to make it better! Dumping an MCP with 60+ random tools into context sucks I completely agree and needs to be fixed. A lot of the code run solutions are very compelling like @RhysSullivan 's executor, cloudflare's code mode, and others. There are ways to fix this, but a "god mode bash tool" isn't it.
English
54
30
366
53.5K