kerrow

396 posts

kerrow banner
kerrow

kerrow

@itskerrow

software, self improvement, serendipity. https://t.co/GedfCrfhxG https://t.co/51PlgjxYys

Katılım Mart 2019
299 Takip Edilen38 Takipçiler
kerrow
kerrow@itskerrow·
every generative agent needs adversarial compute thrown at it to select, omit, synthesize, and route (which covers a lot: do more work, choose between paths, escalate...). we're all building this manually in use-case specific ways. the right primitives will emerge. i'm working on this problem here: stepwise.run - reach out if you want to talk about the problems
English
0
0
1
31
Leo Tavares
Leo Tavares@LeoTava8·
The escalation point is underrated. Most agent architectures handle the happy path fine — it's the moment the agent needs to say "I don't know" or hand off to a human or a different agent that exposes all the state management gaps. Routing is a solved-ish pattern (intent classification, tool dispatch). Escalation requires the agent to have a model of its own competence boundaries, plus enough session context to make the handoff useful instead of "start over." That's where orchestration gets interesting — it's not about calling the model, it's about everything around the model call.
English
2
1
19
7.1K
Aaron Levie
Aaron Levie@levie·
Another week on the road meeting with a couple dozen IT and AI leaders from large enterprises across banking, media, retail, healthcare, consulting, tech, and sports, to discuss agents in the enterprise. Some quick takeaways: * Clear that we’re moving from chat era of AI to agents that use tools, process data, and start to execute real work in the enterprise. Complementing this, enterprises are often evolving from “let a thousand flowers bloom” approach to adoption to targeted automation efforts applied to specific areas of work and workflow. * Change management still will remain one of the biggest topics for enterprises. Most workflows aren’t setup to just drop agents directly in, and enterprises will need a ton of help to drive these efforts (both internally and from partners). One company has a head of AI in every business unit that roles up to a central team, just to keep all the functions coordinated. * Tokenmaxxing! Most companies operate with very strict OpEx budgets get locked in for the year ahead, so they’re going through very real trade-off discussions right now on how to budget for tokens. One company recently had an idea for a “shark tank” style way of pitching for compute budget. Others are trying to figure out how to ration compute to the best use-cases internally through some hierarchy of needs (my words not theirs). * Fixing fragmented and legacy systems remain a huge priority right now. Most enterprises are dealing with decades of either on-prem systems or systems they moved to the cloud but that still haven’t been modernized in any meaningful way. This means agents can’t easily tap into these data sources in a unified way yet, so companies are focused on how they modernize these. * Most companies are *not* talking about replacing jobs due to agents. The major use-cases for agents are things that the company wasn’t able to do before or couldn’t prioritize. Software upgrades, automating back office processes that were constraining other workflows, processing large amounts of documents to get new business or client insights, and so on. More emphasis on ways to make money vs. cut costs. * Headless software dominated my conversations. Enterprises need to be able to ensure all of their software works across any set of agents they choose. They will kick out vendors that don’t make this technically or economically easy. * Clear sense that it can be hard to standardize on anything right now given how fast things are moving. Blessing and a curse of the innovation curve right now - no one wants to get stuck in a paradigm that locks them into the wrong architecture. One other result of this is that companies realize they’re in a multi-agent world, which means that interoperability becomes paramount across systems. * Unanimous sense that everyone is working more than ever before. AI is not causing anyone to do less work right now, and similar to Silicon Valley people feel their teams are the busiest they’ve ever been. One final meta observation not called out explicitly. It seems that despite Silicon Valley’s sense that AI has made hard things easy, the most powerful ways to use agents is more “technical” than prior eras of software. Skills, MCP, CLIs, etc. may be simple concepts for tech, but in the real world these are all esoteric concepts that will require technical people to help bring to life in the enterprise. This both means diffusion will take real work and time, but also everyone’s estimation of engineering jobs is totally off. Engineers may not be “writing” software, but they will certainly be the ones to setup and operate the systems that actually automate most work in the enterprise.
English
135
382
3.3K
776.8K
kerrow
kerrow@itskerrow·
@ElliotHyun harness engineering == llm in a loop == context engineering == prompt engineering
English
0
0
0
122
Elliot Hyun
Elliot Hyun@ElliotHyun·
harness engineering > prompt engineering
English
12
7
60
6.5K
kerrow
kerrow@itskerrow·
@MatthewBerman we build to live 3 months in the future. the only thing that's durable is practiced velocity.
English
0
0
0
68
Matthew Berman
Matthew Berman@MatthewBerman·
Betting against models getting better is foolish. So as we build out harnesses, memory systems, etc, how are the core models not just going to eat more of the scaffolding around them?
English
70
5
188
19K
kerrow
kerrow@itskerrow·
@garrytan where e2e testing doesn't make sense ive had a lot of success asking it to exercise manually to validate and fix any issues / dial in ergos (if cli)
English
0
0
1
107
Garry Tan
Garry Tan@garrytan·
Just because OpenClaw doesn't write tests on its own when it's creating software for you doesn't mean you shouldn't make it make tests for you...
Garry Tan tweet media
English
29
2
94
12.5K
kerrow
kerrow@itskerrow·
in that zone where you stop planning and just fire off endless small requests to claude that stack as it tries to keep up.
English
0
0
0
8
NIK
NIK@ns123abc·
What are we supposed to do now that anthropic completely nerfed claude?
English
181
21
773
66.5K
kerrow
kerrow@itskerrow·
n8n isn't wrong. there is a place for deterministic agent first orchestration. but to be useful you need an onramp that starts with adhoc claude sessions and skills. and you need to be able to easily fork and modify flows from claude.
English
0
0
0
16
kerrow
kerrow@itskerrow·
@brungarc subagents are just not /tree
English
0
0
0
347
Bruno Garcia
Bruno Garcia@brungarc·
Having to use Claude Code after Pi is such a huge bummer
English
19
4
165
12.7K
geoff
geoff@GeoffreyHuntley·
suspect folks not ready for time to last token at or < 200ms entire applications at 5 generations per sec
English
14
6
112
16.3K
kerrow
kerrow@itskerrow·
@dejavucoder better living through chemistry. but man the simple stuff is hard but works remarkably well
English
0
0
3
182
sankalp
sankalp@dejavucoder·
@itskerrow ofc having a healthy routine where you try to regulate cortisol is great. i like to think of l theanine as an improvement over that and not as a substitute.
English
1
0
7
1.3K
sankalp
sankalp@dejavucoder·
my hot take is 200 mg l theanine + coffee/tea (the appropriate amount of caffeine depending on your sensitivity) brings out the real you to yourself if that makes any sense
English
20
12
688
33.2K
kerrow
kerrow@itskerrow·
@RetentionAdam you can't change people. you can change the environment.
English
0
0
0
5
Adam Robinson
Adam Robinson@RetentionAdam·
The secret to a great team is understanding that you can't actually motivate people. You must hire motivated people and build a system around them where they can succeed.
English
17
12
98
2.6K
kerrow
kerrow@itskerrow·
stepwise.run there are cases where actual deterministic flows beat skills. also: observability, auditability, integration with llms and other systems - cost awareness for non max calls. better control over forking behavior. combining different vendors/harnesses (i.e. codex reviews claude). containment control. don't have to choose between ease of interactive agent and robustness of n8n. there is a narrow path to get both.
English
0
0
0
155
Klaas
Klaas@forgebitz·
remember workflows? i think everyone figured out that agentic setups work better and are more dynamic at this point the only argument i see for workflows is control, but you could let agents build fixed flows with a human in the loop
English
28
1
50
5.5K
kerrow
kerrow@itskerrow·
@signulll focus unlocks the models intelligence. orchestration is worthy of focus. see stepwise.run - agent driven workflow orchestration. it's fun and works really well
English
0
0
0
600
signüll
signüll@signulll·
for our first product, we built an agent that manages other agents. & it’s so god damn fucking beautiful cuz it works so well. the architecture is deceptively simple.. define responsibility sets, abilities to delegate, & orchestrate but what emerges is something surprisingly elegant. each layer builds on the others with an internal agent notification system (like you’d tap on someone’s shoulder when complete). there’s a reason org design is its own discipline. it turns out the same principles that make human organizations readable make agentic systems composable. programming this way is genuinely lovely, my lord. my life is so fuckin boring that this is what excites me.
English
30
12
494
31.3K
Sick
Sick@sickdotdev·
@itskerrow job mutation is the right word
English
1
0
0
167
Sick
Sick@sickdotdev·
AI succeeds → massive job losses AI fails → economy crashes AI is just hype → we wasted a decade which path are we actually on ?
English
152
9
204
12.5K
kerrow
kerrow@itskerrow·
my desktop looks like my desk top.
English
0
0
0
15
Sick
Sick@sickdotdev·
How productive was your weekend?
English
52
0
35
2K
Storm
Storm@AndersonAndrue·
Haven’t seen anyone announce a new Firecracker-based MicroVM startup in a few days.. is everything alright??
English
4
6
102
8.8K