ALR

10.8K posts

ALR banner
ALR

ALR

@ALRubinger

You wouldn't let your agent do that. Now you can. Building the plane as we fly https://t.co/E1NdEX8WFS.

Calichusetts Katılım Ocak 2009
1.5K Takip Edilen5.5K Takipçiler
ALR
ALR@ALRubinger·
@mitchellh And legal indemnification. RHT basically absorbs risk. It’s brilliant and also as a lead of the open source project, watching the productization rebuild process up close was 🫠
English
0
0
2
854
Mitchell Hashimoto
Mitchell Hashimoto@mitchellh·
This is why RedHat makes like over $6 billion a year by the way. For anyone wondering what the fuck RedHat does, its basically: take the free chips on the side of the road, validate them, and sell the supply chain contract.
English
9
18
500
42K
Mitchell Hashimoto
Mitchell Hashimoto@mitchellh·
Supply chain attacks and OSS sustainability go hand in hand. I've semi-seriously joked for years that OSS upstreams should periodically purposely inject full vulns into their code and let downstreams fuck around and find out. Downstreams can pay to get the non-FAFO version. The not joke part is simply that OSS maintainers aren't a supply chain. OSS maintainers are not responsible for monitoring CVEs (because, they are not a supply chain). OSS maintainers are not at fault when bad shit happens to downstreams, because basically every OSS license (MIT, Apache, GPL, etc.) literally says: the software is provided "as-is, without warranty." You get what you pay for (that is to say: absolutely nothing!) Now, the joke part is that I do believe there is an ethical obligation to try to prevent harm downstream. But "try" is the key word. So, this isn't a serious proposal. But, if you're using OSS code and you're not paying for a license with a contract that promises some kind of warranty, you have no supply chain. You (the downstream user of an OSS lib) ARE the supply chain. To use a metaphor: physical goods have a real supply chain. Car manufacturers, chips, clothes, toys, etc. You have a signed commercial agreement with all your suppliers that promises quantity AND quality and blowback if either are missed. Thats a supply chain. If someone puts some chips on the side of the road with a "FREE" sign, then you integrate those into a product, then find out those chips are hacking customers, its your fault, not the person who dropped them on the side of the road.
English
46
158
1.5K
112.7K
ALR
ALR@ALRubinger·
@nbaschez Keep on keepin' on. I want this immediately.
English
0
0
1
125
Conrado Brenna
Conrado Brenna@cbr4444·
Square is announced as a co-developer of Universal Commerce Protocol for Food & Bev commerce. See it here: ucp.dev
Conrado Brenna tweet media
English
10
22
138
80.9K
ALR
ALR@ALRubinger·
@calvinnwq @garrytan Same. Agents express intent, control plane to own execution and identity.
English
0
0
0
260
Calvin Ng
Calvin Ng@calvinnwq·
Just have to repost this again as I'm aligned with this and going deeper into building the harnesses around these have seen positive results so far in terms of what we expect: "deterministic outcomes from undeterministic systems"
Garry Tan@garrytan

x.com/i/article/2042…

English
2
3
136
77.6K
ALR
ALR@ALRubinger·
Lots of dunking today. I’m not among them. Empathize with every customer who lost business and real dollars. At the same time: Running infra at this scale means relying on partners, and I saw Railway and @JustJake do everything to remediate and communicate. Google? Hard to reach, slow to respond to their customer.
Railway@Railway

Google Cloud has blocked our account, making some Railway services unavailable. We have escalated this directly with Google. The Railway Platform team has since confirmed access to Google Cloud and is working on restoring access to all workloads. We have access to some of our Google Cloud–hosted infrastructure and are working to restore the rest of the service. We apologize for the disruption.

English
0
0
1
364
ALR
ALR@ALRubinger·
@RhysSullivan This tracks. It’s the best approach I’d come up with, too. The only way I can surmise to reliably put the approval into the same terminal window as the agent is to own the shell.
English
0
0
0
108
Rhys
Rhys@RhysSullivan·
One approach for requiring approval of destructive actions is giving the user a URL to approve it at In this, the returned result of the execute tool tells the model to: - Give the user a URL to approve the action - Immediately call `resume` which waits on the approval
English
18
2
150
18.7K
ALR
ALR@ALRubinger·
Agents make mistakes every day. Here's one from today. The great mistake we accepted was to put rules into context rather than deterministic guardrails. Rules are suggestions.
ALR tweet media
JER@lifeof_jer

x.com/i/article/2048…

English
0
0
0
436
ALR
ALR@ALRubinger·
@mitchellh You can ignore first principles right up until they bite you.
English
0
0
8
495
Mitchell Hashimoto
Mitchell Hashimoto@mitchellh·
I strongly believe there are entire companies right now under heavy AI psychosis and its impossible to have rational conversations about it with them. I can't name any specific people because they include personal friends I deeply respect, but I worry about how this plays out. I lived through the great MTBF vs MTTR (mean-time-between-failure vs. mean-time-to-recovery) reckoning of infrastructure during the transition to cloud and cloud automation. All those arguments are rearing their ugly heads again but now its... the whole software development industry (maybe the whole world, really). It's frightening, because the psychosis folks operate under an almost absolute "MTTR is all you need" mentality: "its fine to ship bugs because the agents will fix them so quickly and at a scale humans can't do!" We learned in infrastructure that MTTR is great but you can't yeet resilient systems entirely. The main issue is I don't even know how to bring this up to people I know personally, because bringing this topic up leads to immediately dismissals like "no no, it has full test coverage" or "bug reports are going down" or something, which just don't paint the whole picture. We already learned this lesson once in infrastructure: you can automate yourself into a very resilient catastrophe machine. Systems can appear healthy by local metrics while globally becoming incomprehensible. Bug reports can go down while latent risk explodes. Test coverage can rise while semantic understanding falls. Changes happens so fast that nobody notices the underlying architecture decaying. I worry.
English
506
1.9K
15K
1.5M
ALR
ALR@ALRubinger·
@ibuildthecloud You seem hip to this kind of nerdery lately. Would love your thoughts before I burn a lake's worth of tokens prototyping it out.
English
0
0
0
74
ALR
ALR@ALRubinger·
I want my agents to fly and --dangerously-skip-permissions. I *think* if we capture the LLM Gateway and OS sandbox both, we can intercept everything we need for policy to let the agents off the leash. Anyone taking this approach?
ALR tweet media
English
1
0
0
137
Darren Shepherd
Darren Shepherd@ibuildthecloud·
Why not? You basically just give the agent it's own machine. It can do whatever it wants and can't screw up anything. It can then run your application, debug it, install random crap, do whatever. It only takes like 2 seconds to create a sandbox with a clone in it. That's what I do. I run tons of these in parallel. Each agent gets it's dedicated environment.
English
4
0
9
2K
Darren Shepherd
Darren Shepherd@ibuildthecloud·
I hate worktrees so much. What is wrong with you all.
English
72
3
411
120.9K
ALR
ALR@ALRubinger·
@brunoborges @techgirl1908 I'm happy we're finally recognizing the problem. For awhile it felt like the industry was so hopped up on what agents made possible, we let first principles for security and privacy fly right out the window.
English
0
0
1
25
Bruno Borges
Bruno Borges@brunoborges·
@techgirl1908 Everyone trying to build control planes and orchestration dashboards. Everyone else using IDEs and CLIs. I don't think we haven't docked the problem yet...
English
1
0
2
309
Angie Jones
Angie Jones@techgirl1908·
which agent control planes are folks using?
English
9
0
9
5K
ALR
ALR@ALRubinger·
@techgirl1908 Building one. withaileron.ai. It has sane controls and a way to add capabilities to your agents that I'm really enjoying. Working hard to get it a healthy set of connectors. If anyone wants to try the Google connectors - let me know and I'll get you on an allowlist.
English
1
0
1
137
ALR
ALR@ALRubinger·
Across Claude, Codex, Goose, and OpenCode. Configure once, use anywhere.
ALR tweet media
English
0
0
0
139
ALR
ALR@ALRubinger·
"Write a 1-sentence elevator pitch for how Aileron will accelerate a team's AI transformation. Send it to email@address.com"
ALR tweet media
English
0
0
1
103
ALR
ALR@ALRubinger·
@nicos_ai The final boss of vendor lock-in.
English
0
0
1
265
Nico
Nico@nicos_ai·
Anthropic acaba de lanzar el empleado más barato y eficaz del mundo. Se llama “Claude for Small Business”. Y esto es lo que puede hacer: • Gestionar facturas, pagos y finanzas • Crear campañas, diseños y contenido • Organizar ventas y clientes automáticamente • Leer, resumir y redactar documentos • Gestionar emails, calendarios y archivos • Ejecutar tareas entre múltiples apps Todo desde Claude. Cómo funciona: → Conectas las herramientas que ya usa tu empresa → Claude entiende el contexto de todo tu negocio → Ejecuta flujos de trabajo automáticamente → Incluye automatizaciones ya preparadas → Funciona con Microsoft 365, Google Workspace, Canva, DocuSign, QuickBooks y más Anthropic no quiere que Claude sea “otro chatbot”. Quiere convertirlo en el sistema operativo de millones de pequeñas empresas. La idea es simple: En vez de abrir 10 herramientas distintas, hablas con Claude y él hace el trabajo por ti.
Polymarket@Polymarket

NEW: Anthropic launches "Claude for Small Business"

Español
165
892
9.1K
1.7M
Omar Shaikh
Omar Shaikh@oshaikh13·
We upgraded Tabracadabra 🎉 to bring an entire context-aware assistant (not just tab to autocomplete!) to any textbox. It's pretty great if you hate switching between the chat interface and what you're working on. We're also open-sourcing, so you can try it out!🧵
English
13
36
172
38.5K
Matthew Yuan
Matthew Yuan@matthew_yuan8·
Yes. Per-repo memory is the first boundary that makes agent memory reviewable. The missing piece I keep wanting is a tiny receipt next to each memory write: - source file/path - task it came from - confidence / expiry - who may reuse it - how to replay or delete it Without that, repo-scoped memory still becomes invisible state.
English
1
0
0
31
ALR
ALR@ALRubinger·
When agent memory namespace is per-repo, that's a strong argument in favor of modular monorepos.
English
1
0
1
158
ALR
ALR@ALRubinger·
Yes. Yes. Yes. Yes. Yes. Yes. Yes. Yes. Yes. No no no, you did it all wrong! - Computer Programming, 2026.
English
0
0
0
94