Bill Krueger

23 posts

Bill Krueger banner
Bill Krueger

Bill Krueger

@BGoat

Software Engineer embracing Agentic Engineering 🤖 Co-founder @TeamYouAI 🚀 Expat 🇺🇸 ➡️ 🇨🇷

Costa Rica Tham gia Mart 2011
97 Đang theo dõi28 Người theo dõi
Bill Krueger đã retweet
ClaudeDevs
ClaudeDevs@ClaudeDevs·
Over the past month, some of you reported Claude Code's quality had slipped. We investigated, and published a post-mortem on the three issues we found. All are fixed in v2.1.116+ and we’ve reset usage limits for all subscribers.
English
1.9K
2.6K
38.1K
5.9M
Garry Tan
Garry Tan@garrytan·
Literally can count the hours I have lost as OpenClaw decides to do brain surgery on itself and then ends up dying from an edit it did to its own config
English
222
39
1.5K
142.3K
Malte Ubl
Malte Ubl@cramforce·
Imagine just-bash supports OpenAPI, MCP, and GraphQL discovery for codemode style tool calls! I hooked @RhysSullivan's excellent executor package into just-bash to do exactly that. And you get to freely handle tool approvals in your JS code.
Malte Ubl tweet media
English
6
6
193
14.5K
Bill Krueger
Bill Krueger@BGoat·
@YousifAstar Vercel introduced a method of credential brokering to their Sandbox product that injects credentials on egressing traffic. Does that fully address this kind of attack vector or would it still be susceptible to your approach?
English
0
0
0
1.9K
Enes Akar
Enes Akar@enesakar·
@ikbear @upstash @openclaw we can easily enable this, but not supported as of now. we wanted focus on development cases first, so not sure we should host personal agents initially..
English
4
0
3
144
Upstash
Upstash@upstash·
Introducing Upstash Box - the best way to give your AI agents a computer 🎉 ◆ Secure, isolated cloud sandboxes ◆ Built-in Claude Code, Codex or OpenCode ◆ Sleeps when idle, wakes up in milliseconds
English
41
38
547
67.4K
Bill Krueger đã retweet
Andrej Karpathy
Andrej Karpathy@karpathy·
It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the "progress as usual" way, but specifically this last December. There are a number of asterisks but imo coding agents basically didn’t work before December and basically work since - the models have significantly higher quality, long-term coherence and tenacity and they can power through large and long tasks, well past enough that it is extremely disruptive to the default programming workflow. Just to give an example, over the weekend I was building a local video analysis dashboard for the cameras of my home so I wrote: “Here is the local IP and username/password of my DGX Spark. Log in, set up ssh keys, set up vLLM, download and bench Qwen3-VL, set up a server endpoint to inference videos, a basic web ui dashboard, test everything, set it up with systemd, record memory notes for yourself and write up a markdown report for me”. The agent went off for ~30 minutes, ran into multiple issues, researched solutions online, resolved them one by one, wrote the code, tested it, debugged it, set up the services, and came back with the report and it was just done. I didn’t touch anything. All of this could easily have been a weekend project just 3 months ago but today it’s something you kick off and forget about for 30 minutes. As a result, programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks *in English* and managing and reviewing their work in parallel. The biggest prize is in figuring out how you can keep ascending the layers of abstraction to set up long-running orchestrator Claws with all of the right tools, memory and instructions that productively manage multiple parallel Code instances for you. The leverage achievable via top tier "agentic engineering" feels very high right now. It’s not perfect, it needs high-level direction, judgement, taste, oversight, iteration and hints and ideas. It works a lot better in some scenarios than others (e.g. especially for tasks that are well-specified and where you can verify/test functionality). The key is to build intuition to decompose the task just right to hand off the parts that work and help out around the edges. But imo, this is nowhere near "business as usual" time in software.
English
1.6K
4.7K
37.2K
5.1M
Bill Krueger
Bill Krueger@BGoat·
Seeing the backlash from @NotionHQ users around the pricing of their new Custom Agents feature, it is clear the average person has no idea how much AI inference really costs. It is difficult to build in this space when you cannot heavily subsidize that cost like the big labs do.
English
0
0
1
29
Bill Krueger
Bill Krueger@BGoat·
@cramforce This is a great post. It really cements the mental model I have been developing around how to safely provide users with a powerful agentic harness that abstracts the complexity of something like OpenClaw while protecting both our secrets AND theirs.
English
0
0
2
79
Malte Ubl
Malte Ubl@cramforce·
This is an important conversation to be had: What is the right architecture to running agents in production? My posits: - You can trust agent harnesses, you cannot trust agents - The agent harness, the agent, and the code it generates should not run in the same security context Today this separation is not typically enforced and this will lead to major security issues in the future. The bright side: Existing agent abstractions do make it relatively easy to port to a more secure architecture. More concrete in the blog post ↓
Vercel@vercel

Most coding agents default to running generated code with full access to secrets, creating a major risk for data exfiltration. It's essential that developers are deliberate in defining and enforcing security boundaries. How we're thinking about this ↓ vercel.com/blog/security-…

English
9
4
107
19.9K
Vercel Developers
Vercel Developers@vercel_dev·
You can now write chat bot logic once, and deploy across Slack, GitHub, Teams, and Discord. We're open-sourcing a new Chat SDK, a unified TypeScript library to build for many chat platforms with a single codebase. Now available in public beta. vercel.com/changelog/chat…
English
55
169
1.8K
525.8K
Bill Krueger
Bill Krueger@BGoat·
@Shpigford @replysocial Ouch 😬 I am a bit surprised they don't allow the Pro tier to continue programmatically replying to posts as I doubt most of the people running LLM spam bots would be willing to pay $5k/month. I hope you are able to get an exception for a valid product use case.
English
0
0
2
94
Josh Pigford
Josh Pigford@Shpigford·
Unfortunately this means the end of @replysocial. We used the "POST /2/tweets" endpoint to allow social media managers and support teams to reply to people talking about their companies and this kills off a large part of functionality around that. We did absolutely ZERO automated calls. Not a single line of AI/LLM code in the entire platform and no way to mass/batch interact with any posts in any way. The value proposition was that we centralized all the interactions you were naturally having with your customers. And yes, obviously it still works in the situation where people @-mention your company. But many times customers only mention you by name (and not by handle). That's extremely common for non-tech products. Despite their recent step forward in rolling out pay per usage pricing, the reality is that X is still very hostile to developers. I don't envy their position in dealing with spam/bots. It's an endless game of whack-a-mole and they've simply got to make tradeoffs to keep their platform from becoming ground zero for the dead internet theory. I get it. But as much as I wanted to build interesting/impactful tools *for humans* on X, I just can't keep throwing time/effort/money at a platform that continues to be so unstable. In addition, I'll be shutting down @BotBlockAI. It still functions perfectly fine but I have no idea what other API endpoints X will decide to cutoff overnight so I'd rather not keep pouring resources in to it.
Developers@XDevelopers

To help address automated reply spam, programmatic replies via POST /2/tweets are now restricted for X API. You can only reply if the original author @ mentions you or quotes your post. Non-replies will remain unchanged. Applies to Free, Basic, Pro, Pay-Per-Use. Note: Enterprise & Public Utility apps are not impacted. Details: devcommunity.x.com/t/update-to-re…

English
36
1
121
46.2K
Bill Krueger
Bill Krueger@BGoat·
@theo @trq212 @bcherny How is there still not a clear answer to this question from such a big name in the space? The silence is deafening. It is a simple question that deserves a simple answer.
English
0
0
1
52
Bill Krueger
Bill Krueger@BGoat·
@nicoalbanese10 Love this update. I would love it even more if you could take a snapshot without stopping the sandbox 🙏
English
0
0
1
11
Bill Krueger đã retweet
Rob Zolkos
Rob Zolkos@robzolkos·
Major Claude Code policy clear up from Anthropic: "Using OAuth tokens obtained through Claude Free, Pro, or Max accounts in any other product, tool, or service — including the Agent SDK — is not permitted"
English
296
144
1.7K
2.2M
Bill Krueger
Bill Krueger@BGoat·
@steipete @openclaw People who think you "failed" 43 times are way too focused on "hitting it big" with one project. To them, laying the foundation for something bigger is a waste of time. That is a sad mindset to have.
English
0
0
0
134
Peter Steinberger 🦞
Peter Steinberger 🦞@steipete·
The funniest take is that I "failed" 43 times when people look at my GitHub repos and projects. Uhmm... no? Most of these are part of @openclaw, I had to build an army to make it useful. github.com/steipete/
Peter Steinberger 🦞 tweet media
English
880
1K
16.6K
867.4K
Bill Krueger
Bill Krueger@BGoat·
Sprites from @flydotio are very cool... when they work. Running into major problems with Sprites becoming unresponsive with no way to recover. Attempts to restore to a checkpoint fail. No way to get at the filesystem. I hope they fix these issues quickly! They are unusable ATM.
English
0
0
0
32