Eric

142 posts

Eric banner
Eric

Eric

@ericcco_

Making AI agents usable in real workflows

เข้าร่วม Kasım 2011
64 กำลังติดตาม147 ผู้ติดตาม
ทวีตที่ปักหมุด
Eric
Eric@ericcco_·
AI agents won’t become enterprise-ready just by getting better at reasoning.
English
1
0
1
245
Eric
Eric@ericcco_·
@twschiller Thanks for the clarification. If you and your team need to deal with agent identities let me know.. I think we can help you😁
English
0
0
0
3
Eric
Eric@ericcco_·
If you’re building anything around AI agents, agent security, evals, memory, permissions, orchestration, or human-in-the-loop workflows, drop it below. I’m trying to connect with more people working on the “agents in real workflows” layer. What are you building?
English
45
0
36
7.9K
Eric
Eric@ericcco_·
This is the CI/CD version of the agent security problem. Once an agent can read untrusted PR text and touch secrets or workflows, prompt injection becomes an authorization issue, not just a model behavior issue.
Microsoft Threat Intelligence@MsftSecIntel

Microsoft discovered that Anthropic's Claude Code GitHub Action could expose CI/CD workflow secrets when AI agents process untrusted content, including issue bodies, pull request descriptions, and comments. msft.it/6017vdfUc Following our disclosure, Anthropic mitigated this issue in Claude Code version 2.1.128 by blocking access to sensitive /proc files. Read the blog for details from our research, along with practical guidance for reducing prompt injection, over-permissive tooling, and secret exposure risks in agentic CI/CD workflows.

English
0
0
0
5
Eric
Eric@ericcco_·
@ZeroAetherfxt2 Service accounts are part of it. My point is multi-agent systems also need to preserve the delegation chain: which human/task spawned the agent, which agent handed off to which, what scope applied, and why the action was allowed. Otherwise the audit log just says “bot did it.”
English
0
0
0
14
aether
aether@ZeroAetherfxt2·
@ericcco_ Let me introduce you to service accounts
English
1
0
0
64
Eric
Eric@ericcco_·
Enterprise agents won’t scale on clever prompts alone. They need clear identity, scoped permissions, verifiable handoffs, and audit trails by default. If an agent can act, it must also be governable.
English
5
6
107
194.3K
Eric
Eric@ericcco_·
@0x_nik0 Indeed. I knew their pricing before June was too good relative to the amount of usage users were getting. It didn’t really make sense from their perspective. But now it feels like a token-burning machine
English
0
0
1
26
niko
niko@0x_nik0·
@ericcco_ yeah the credit burn felt too fast - i shifted to claude for the longer context when doing agent work
English
1
0
1
36
Eric
Eric@ericcco_·
GitHub Copilot's new credit-based billing feels rough. I ran out in a day and wasn't even pushing it that hard. I'm using Claude and Codex more now for coding agent work. Curious what people are preferring lately: Copilot, Claude Code, Codex, Cursor, or something else?
English
4
0
4
155
Eric
Eric@ericcco_·
I got bored today and built writtenbykai.com Kai is the AI editor-in-chief behind it. She runs through Hermes, works with Codex + her crew, opens PRs, and waits for my approval before anything goes live. Agents work. Humans keep the taste. What should Kai write next?
Eric tweet media
English
0
0
2
833
Eric
Eric@ericcco_·
@eigenoid Thank you!!!! I’ll take a look and I will let you know 🫡
English
0
0
1
16
Andrés
Andrés@eigenoid·
@ericcco_ I want to try both. I've been using OpenClaw with my own skills brain, and it gives me that same feeling: the agent comes back with context instead of starting from zero. My brain is here: github.com/andylow92/file… I like how easy it is to set up.
English
1
0
1
37
Eric
Eric@ericcco_·
I’ve been using Hermes with GBrain lately, and the biggest unlock is that the agent stops feeling like a fresh chat every time. Hermes can act across tools, while GBrain gives it structured context and memory. This is the direction I want more AI tools to move in.
English
1
0
3
170
Eric
Eric@ericcco_·
@log_npierce Exactly. The wrapper gets attention, but permissions are where the product either becomes useful or dangerous. The interesting part is making agents powerful enough to do real work while still being scoped, reviewable, and easy to shut down when something looks wrong.
English
0
0
0
33
Logan Pierce
Logan Pierce@log_npierce·
@ericcco_ permissions and orchestration are the real bottlenecks right now. shipping a wrapper is easy, making it survive a real production workflow with actual security constraints is the hard part.
English
1
0
1
47
Eric
Eric@ericcco_·
@twschiller This is super relevant. Browser agents make permissions, identity, and auditability matter immediately. Curious how you draw the line between attended and unattended use, especially when the agent can touch real accounts or sensitive data.
English
1
0
0
47
Eric
Eric@ericcco_·
@log_npierce Yes, setting the correct boundaries allow you having control over your workflows
English
0
0
0
13
Logan Pierce
Logan Pierce@log_npierce·
@ericcco_ context is everything. most "ai" features today are just expensive noise because they lack the execution boundaries to be actually useful in a real workflow. human-in-the-loop is the only way to scale agents without losing control
English
1
0
0
24
Eric
Eric@ericcco_·
AI replies are not the problem. Low-context, unsupervised AI replies are the problem. The future is not “let bots flood every conversation.” It’s agents that understand the context, know the goal, stay within boundaries, and make it easy for a human to approve or correct the output before it goes live. Automation without control creates spam. Automation with context creates leverage.
English
3
0
5
336
Eric
Eric@ericcco_·
Good point. Integration with existing systems is a must, especially in regulated industries where compliance is non-negotiable. Building agents is getting easier and faster, but having the right guardrails, governance, and control over how they operate is what will make them enterprise-ready.
English
0
0
1
68
Andrés
Andrés@eigenoid·
This is the layer we're focused on too. If agents are going to replace legacy workflows, they need more than orchestration. They need integration with the systems where work already happens, compliance around what data can move, and communication between agents that is identity-aware, scoped, and auditable.
English
1
0
2
84
Eric
Eric@ericcco_·
@m13v_ @flytradr_guy Totally. The first version is the easy part now. The harder part is keeping the workflow useful once people start changing it, approving things, fixing failures, and relying on it every day. That’s where you find out if it’s real infrastructure or just a good demo.
English
2
0
0
54
Matt
Matt@m13v_·
@flytradr_guy @ericcco_ agent demos in a workflow always land clean. the gap you're naming isn't the framework, it's iteration under change, where the AI-built first draft holds up or collapses into debt once approvals and failure handling get bolted on. mk0r.com/r/zmd26u6u written with ai
English
1
0
1
54
Aleksandar Grbic
Aleksandar Grbic@aleksandar_xyz·
@ericcco_ Building a Typescript specialized harness around Qwen 3.6 27B. I want to see whether I can get it to flagship quality by keeping it very scoped and specialised. Using DGX Spark and running tests 24/7 in a self corrective loop.
English
1
0
2
66
Eric
Eric@ericcco_·
@sdhilip This is a strong real-workflow use case. Curious how you’re handling trust in the outputs — citations, human review, approval flows, etc.?
English
0
0
0
38
Eric
Eric@ericcco_·
@Lakshman2302 @GrayCodeAI Love the “humans and AI agents build together” framing. Are you focusing more on orchestration, collaboration UX, or review/control?
English
1
0
2
66
Eric
Eric@ericcco_·
@Aru__09 That’s very close to what I’m exploring too. Agent memory gets powerful fast, but without evals and control it also gets risky fast. Would love to hear what you’re building.
English
0
0
0
46