Ben Bleikamp
377 posts

@bleikamp
elite information superhighway surfer 🏄 software design, currently @baseten. prev. co-founded cased, head of design @github, design tools @meta

@a_aMatrix @NotionHQ Sure. I just wonder how eg dogfooding would have not caught this... Back to writing in GDoc, pasting into Notion

Wrote up more on how to play with zclaw:
- helpful local dev env. zclaw ships with tools for debugging & building (agents like 'em). zclaw.dev/local-dev.html
- real use cases, serious & fun. and why they are interesting on a limited, physical board. zclaw.dev/use-cases.html



we're launching the new Sentry CLI. it's made for developers and agents, by developers and agents, with a focus on dev workflows. It has things baked in, like:



Photographer @mostafabassim1 photographed this boy walking home alone with a snack being "randomly" approached by DHS. "After he was unable to produce documentation proving his citizenship, agents informed him that he was under arrest." He said, "Can I just go home?" Answer: No.

One reason vibe coding is so addictive is that you are always *almost* there but not 100% there. The agent implements an amazing feature and got maybe 10% of the thing wrong, and you are like "hey I can fix this if i just prompt it for 5 more mins" And that was 5 hrs ago

Spent the weekend with Claude Code and Codex (5.2 xhigh, ofc) replacing Notion and Linear for our (tiny) 4-person team.
We were hacking together something simple with a few complex tools. So I just built the simple thing we wanted.
- Markdown files are the datastore, all on a Fly.io volume, YAML frontmatter for structured data
- SOTA model agent loop
- Manage everything via Slack commands
- Notion-style web UI for writing
- Auth via GitHub
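A rough sketch of the "Markdown files as the datastore, YAML frontmatter for structured data" idea, not the author's actual code. The field names (`title`, `status`, `assignee`) and the `split_frontmatter` helper are hypothetical. It handles only flat `key: value` pairs; a real setup would more likely use a full YAML parser.

```python
from typing import Dict, Tuple


def split_frontmatter(text: str) -> Tuple[Dict[str, str], str]:
    """Split a Markdown document into (frontmatter fields, body).

    Assumes the frontmatter is a block of flat `key: value` lines
    delimited by `---` lines at the very top of the file.
    """
    lines = text.splitlines()
    # No opening delimiter: treat the whole file as body.
    if not lines or lines[0].strip() != "---":
        return {}, text
    # Find the closing delimiter; if it is missing, treat the file as body.
    end = next((i for i in range(1, len(lines)) if lines[i].strip() == "---"), None)
    if end is None:
        return {}, text

    fields: Dict[str, str] = {}
    for line in lines[1:end]:
        # Skip blank lines and YAML comments.
        if not line.strip() or line.lstrip().startswith("#"):
            continue
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()

    body = "\n".join(lines[end + 1:]).lstrip("\n")
    return fields, body


# Hypothetical document, as it might sit on disk as a .md file.
doc = """---
title: Weekly planning
status: open
assignee: someone
---
# Weekly planning

- Ship the Slack command
"""

fields, body = split_frontmatter(doc)
print(fields)
print(body)
```

The appeal of this layout is that every record stays a plain file an agent or a human can read, grep, and edit, while the frontmatter gives just enough structure to filter on things like status.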





We've been exploring the value of letting agents use CLIs vs just navigating a REST API directly. The smartest models can do without a CLI, but take longer and cost more. Even small models can succeed when given CLIs. But the puck keeps moving!



This is a big deal. If AI is going to run everything, this is basically Mercedes admitting they can't build the engine and will use BMW’s instead. Siri had 15 years...

It genuinely feels to me like GPT-5.2 and Opus 4.5 in November represent an inflection point - one of those moments where the models get incrementally better in a way that tips across an invisible capability line where suddenly a whole bunch of much harder coding problems open up





