Eric

142 posts

Eric

@ericcco_

Making AI agents usable in real workflows

เข้าร่วม Kasım 2011

64 กำลังติดตาม147 ผู้ติดตาม

ทวีตที่ปักหมุด

Eric@ericcco_·1d

AI agents won’t become enterprise-ready just by getting better at reasoning.

English

245

Eric@ericcco_·21m

@twschiller Thanks for the clarification. If you and your team need to deal with agent identities let me know.. I think we can help you😁

English

Todd Schiller@twschiller·23h

@ericcco_ Working on a couple complementary projects: - Agentic browser copilots: pixiebrix.com - Protecting browser use agents (attended and unattended): github.com/pixiebrix/agen…

English

Eric@ericcco_·1d

If you’re building anything around AI agents, agent security, evals, memory, permissions, orchestration, or human-in-the-loop workflows, drop it below. I’m trying to connect with more people working on the “agents in real workflows” layer. What are you building?

English

7.9K

Eric@ericcco_·26m

This is the CI/CD version of the agent security problem. Once an agent can read untrusted PR text and touch secrets or workflows, prompt injection becomes an authorization issue, not just a model behavior issue.

Microsoft Threat Intelligence@MsftSecIntel

Microsoft discovered that Anthropic's Claude Code GitHub Action could expose CI/CD workflow secrets when AI agents process untrusted content, including issue bodies, pull request descriptions, and comments. msft.it/6017vdfUc Following our disclosure, Anthropic mitigated this issue in Claude Code version 2.1.128 by blocking access to sensitive /proc files. Read the blog for details from our research, along with practical guidance for reducing prompt injection, over-permissive tooling, and secret exposure risks in agentic CI/CD workflows.

English

Eric@ericcco_·3h

@ZeroAetherfxt2 Service accounts are part of it. My point is multi-agent systems also need to preserve the delegation chain: which human/task spawned the agent, which agent handed off to which, what scope applied, and why the action was allowed. Otherwise the audit log just says “bot did it.”

English

aether@ZeroAetherfxt2·7h

@ericcco_ Let me introduce you to service accounts

English

Eric@ericcco_·1d

Enterprise agents won’t scale on clever prompts alone. They need clear identity, scoped permissions, verifiable handoffs, and audit trails by default. If an agent can act, it must also be governable.

English

107

194.3K

Eric@ericcco_·4h

@0x_nik0 Indeed. I knew their pricing before June was too good relative to the amount of usage users were getting. It didn’t really make sense from their perspective. But now it feels like a token-burning machine

English

niko@0x_nik0·4h

@ericcco_ yeah the credit burn felt too fast - i shifted to claude for the longer context when doing agent work

English

Eric@ericcco_·4h

GitHub Copilot's new credit-based billing feels rough. I ran out in a day and wasn't even pushing it that hard. I'm using Claude and Codex more now for coding agent work. Curious what people are preferring lately: Copilot, Claude Code, Codex, Cursor, or something else?

English

155

Eric@ericcco_·12h

I got bored today and built writtenbykai.com Kai is the AI editor-in-chief behind it. She runs through Hermes, works with Codex + her crew, opens PRs, and waits for my approval before anything goes live. Agents work. Humans keep the taste. What should Kai write next?

English

833

Eric@ericcco_·14h

@eigenoid Thank you!!!! I’ll take a look and I will let you know 🫡

English

Andrés@eigenoid·14h

@ericcco_ I want to try both. I've been using OpenClaw with my own skills brain, and it gives me that same feeling: the agent comes back with context instead of starting from zero. My brain is here: github.com/andylow92/file… I like how easy it is to set up.

English

Eric@ericcco_·16h

I’ve been using Hermes with GBrain lately, and the biggest unlock is that the agent stops feeling like a fresh chat every time. Hermes can act across tools, while GBrain gives it structured context and memory. This is the direction I want more AI tools to move in.

English

170

Eric@ericcco_·20h

@log_npierce Exactly. The wrapper gets attention, but permissions are where the product either becomes useful or dangerous. The interesting part is making agents powerful enough to do real work while still being scoped, reviewable, and easy to shut down when something looks wrong.

English

Logan Pierce@log_npierce·23h

@ericcco_ permissions and orchestration are the real bottlenecks right now. shipping a wrapper is easy, making it survive a real production workflow with actual security constraints is the hard part.

English

Eric@ericcco_·20h

@twschiller This is super relevant. Browser agents make permissions, identity, and auditability matter immediately. Curious how you draw the line between attended and unattended use, especially when the agent can touch real accounts or sensitive data.

English

Eric@ericcco_·21h

@log_npierce Yes, setting the correct boundaries allow you having control over your workflows

English

Logan Pierce@log_npierce·21h

@ericcco_ context is everything. most "ai" features today are just expensive noise because they lack the execution boundaries to be actually useful in a real workflow. human-in-the-loop is the only way to scale agents without losing control

English

Eric@ericcco_·1d

AI replies are not the problem. Low-context, unsupervised AI replies are the problem. The future is not “let bots flood every conversation.” It’s agents that understand the context, know the goal, stay within boundaries, and make it easy for a human to approve or correct the output before it goes live. Automation without control creates spam. Automation with context creates leverage.

English

336

Eric@ericcco_·22h

@RebornTechGlob Sure, let's connect!

English

Reborn | AI Automation Engineer@RebornTechGlob·23h

@ericcco_ Hello Eric I'm an AI Automation Engineer. With AI-Powered product (SaaS) Looking forward to connect and learn together

English

Eric@ericcco_·1d

Good point. Integration with existing systems is a must, especially in regulated industries where compliance is non-negotiable. Building agents is getting easier and faster, but having the right guardrails, governance, and control over how they operate is what will make them enterprise-ready.

English

Andrés@eigenoid·1d

This is the layer we're focused on too. If agents are going to replace legacy workflows, they need more than orchestration. They need integration with the systems where work already happens, compliance around what data can move, and communication between agents that is identity-aware, scoped, and auditable.

English

Eric@ericcco_·1d

@m13v_ @flytradr_guy Totally. The first version is the easy part now. The harder part is keeping the workflow useful once people start changing it, approving things, fixing failures, and relying on it every day. That’s where you find out if it’s real infrastructure or just a good demo.

English

Matt@m13v_·1d

@flytradr_guy @ericcco_ agent demos in a workflow always land clean. the gap you're naming isn't the framework, it's iteration under change, where the AI-built first draft holds up or collapses into debt once approvals and failure handling get bolted on. mk0r.com/r/zmd26u6u written with ai

English

Eric@ericcco_·1d

@aleksandar_xyz Really interesting!

English

Aleksandar Grbic@aleksandar_xyz·1d

@ericcco_ Building a Typescript specialized harness around Qwen 3.6 27B. I want to see whether I can get it to flagship quality by keeping it very scoped and specialised. Using DGX Spark and running tests 24/7 in a self corrective loop.

English

Eric@ericcco_·1d

@sdhilip This is a strong real-workflow use case. Curious how you’re handling trust in the outputs — citations, human review, approval flows, etc.?

English

Dhilip Subramanian@sdhilip·1d

@ericcco_ Hi Eric I’m building AI agents for my projects recent one

Dhilip Subramanian@sdhilip

Built an AI-powered Document Intelligence Review Workbench for a manufacturing client in the US. The problem was simple: Teams were dealing with large volumes of PDFs, scanned documents, internal policies, supplier docs, external links, and operational records. Manual search was slow, and every answer needed to be backed by source references. So I built an end-to-end RAG solution on Azure. Architecture: • Azure Blob Storage for document storage • Azure AI Document Intelligence for OCR • Azure AI Search for vector + semantic retrieval • Azure Functions for the API layer • Azure AI Foundry for model orchestration • GPT-5.5 and Claude model selection • React frontend for upload, review, citations, and follow-up chat Flow: Upload document → OCR/text extraction → retrieve relevant knowledge → generate structured summary → show findings with citations → ask follow-up questions. I also added: • scanned PDF support • citation links • model switching • clean review dashboard • non-relevant document detection • follow-up chat grounded in uploaded documents and retrieved sources This was a fun full-stack AI build covering RAG ingestion, Azure architecture, backend APIs, LLM integration, OCR, and frontend UX. The key point: AI is useful only when the answer can be traced back to the source.

English

104

Eric@ericcco_·1d

@Lakshman2302 @GrayCodeAI Love the “humans and AI agents build together” framing. Are you focusing more on orchestration, collaboration UX, or review/control?

English

Lakshman Patel@Lakshman2302·1d

@ericcco_ I'm building @GrayCodeAI tools for developers. graycodeai.gateandtech.in

English

Eric@ericcco_·1d

@Aru__09 That’s very close to what I’m exploring too. Agent memory gets powerful fast, but without evals and control it also gets risky fast. Would love to hear what you’re building.

English

Aru_sharma@Aru__09·1d

@ericcco_ Building on memory and evals

English

ค้นพบ

@twschiller @ZeroAetherfxt2 @0x_nik0 @eigenoid @log_npierce @elonmusk @BarackObama @taylorswift13