
Andrew Nguonly
1.3K posts

Andrew Nguonly
@andrewnguonly
swe @LangChain


Deep Agents deploy gets you: - Deep Agents harness - Sandbox of your choice (@daytonaio , @modal , @RunloopDev ) - Short and long term memory - Agents exposed via MCP and A2A Production ready and open standards

🎙️Introducing Max Agency Max Agency is a new podcast where we go deep on how the best agents are actually being built: architecture decisions, tradeoffs, evals, and everything in between. Each episode, I sit down with engineering leaders who are doing this work in production. Our first episode features Izzy Miller (@isidoremiller), AI Engineer at Hex (@_hex_tech). Hex has been shipping data agents since before most teams were even thinking about them, starting with single-cell text-to-SQL and graduating to a full Notebook agent that can work autonomously for 20 minutes on a complex analysis. Izzy has a lot of perspective on what it actually takes to get agents working well in production, and what breaks along the way. A few takeaways from our conversation: - Keep your eval sets small enough to hold in your head: Izzy runs 30-50 handcrafted "traps" with multiple repetitions, rather than hundreds of variants. If you can't explain why your agent fails each one, your eval set is too big - Day zero performance is almost irrelevant: The more interesting question is how the agent compounds. Izzy is building a 90-day simulation where the warehouse evolves and the agent has to accumulate understanding - You can catch agent errors without seeing the raw outputs: By running an LLM-as-a-judge over production usage and clustering the results, you can surface places where something likely went wrong, without needing to read individual conversations Watch the full episode on: - Youtube: youtube.com/watch?v=Xyh1Eq… - Apple Podcasts: podcasts.apple.com/us/podcast/how… - Spotify: open.spotify.com/episode/1BJlg3…

One of the benefits of Deep Agents deploy is model optionality Choose from models from @OpenAI @GeminiApp @AnthropicAI @FireworksAI_HQ @baseten @OpenRouter @ollama @nvidia and many others Another is you can bring your own sandbox - @daytonaio @modal @RunloopDev

🔌 Deploy agents with A2A A2A is an agent-to-agent communication protocol, useful for building multi-agent systems. With LangSmith Deployments, you get A2A support out of the box! Watch how: youtu.be/SjGPXBNH614 Docs: #a2a-endpoint-in-agent-server" target="_blank" rel="nofollow noopener">docs.langchain.com/langsmith/serv…
A2A Protocol: a2a-protocol.org/latest/
.@TryArcade's 7,500+ agent-optimized MCP tools are now available in LangSmith Fleet. Create a gateway and your agents get secure access to Salesforce, GitHub, Zendesk, Asana, and many more. Read more: blog.langchain.com/arcade-dev-too…

You cannot back into operational excellence it's either there at the beginning culturally or it's not.

LangSmith Fleet now supports @Microsoft365 tools. Now you can: → Build an agent that triages your Outlook inbox and drafts replies → Access Sharepoint docs so your agent has the context it needs → Connect any agent to Teams so your whole team can use it Try Fleet: smith.langchain.com/agents?skipOnb…



