Andrew Nguonly

1.3K posts

@andrewnguonly

swe @LangChain

San Francisco, CA · Joined June 2021
866 Following · 375 Followers
Andrew Nguonly @andrewnguonly
Kaskade is the most reliable DJ
Andrew Nguonly @andrewnguonly
Conspiracy: Anyma —> Summit Premeditated 😈
Andrew Nguonly @andrewnguonly
Max volume for Max Agency 📣🗣️
Harrison Chase @hwchase17

🎙️ Introducing Max Agency

Max Agency is a new podcast where we go deep on how the best agents are actually being built: architecture decisions, tradeoffs, evals, and everything in between. Each episode, I sit down with engineering leaders who are doing this work in production.

Our first episode features Izzy Miller (@isidoremiller), AI Engineer at Hex (@_hex_tech). Hex has been shipping data agents since before most teams were even thinking about them, starting with single-cell text-to-SQL and graduating to a full Notebook agent that can work autonomously for 20 minutes on a complex analysis. Izzy has a lot of perspective on what it actually takes to get agents working well in production, and what breaks along the way.

A few takeaways from our conversation:
- Keep your eval sets small enough to hold in your head: Izzy runs 30-50 handcrafted "traps" with multiple repetitions, rather than hundreds of variants. If you can't explain why your agent fails each one, your eval set is too big
- Day zero performance is almost irrelevant: The more interesting question is how the agent compounds. Izzy is building a 90-day simulation where the warehouse evolves and the agent has to accumulate understanding
- You can catch agent errors without seeing the raw outputs: By running an LLM-as-a-judge over production usage and clustering the results, you can surface places where something likely went wrong, without needing to read individual conversations

Watch the full episode on:
- Youtube: youtube.com/watch?v=Xyh1Eq…
- Apple Podcasts: podcasts.apple.com/us/podcast/how…
- Spotify: open.spotify.com/episode/1BJlg3…

Andrew Nguonly @andrewnguonly
LLM usage of a senior SWE in a new system:
1️⃣ Vibe code small features, use LLM to understand how the system works. LLM directs you.
2️⃣ Implement features by hand. Test. Go on-call. You direct you.
3️⃣ Vibe code large features, use LLM to speed up implementation. You direct LLM.
Andrew Nguonly @andrewnguonly
SRE-bench > SWE-bench
Andrew Nguonly @andrewnguonly
Rust: "😑".to_string()
Andrew Nguonly @andrewnguonly
I like my software like I like my beer. Craft. 🍺👌
Andrew Nguonly @andrewnguonly
In the age of AI, there are only 2 roles: builder and operator.
💡🚀 Builders ideate, design, produce code, test, and verify.
💪🔒 Operators measure, scale, secure, mitigate issues, architect for resilience, and control costs.
Andrew Nguonly reposted
Sam Lambert @samlambert
You cannot back into operational excellence it's either there at the beginning culturally or it's not.
Andrew Nguonly @andrewnguonly
Fleet 95 Fleet 98 Fleet XP 🛫 Fleet 10 Fleet 11 LangSmith Fleet is the operating system for your work 👨🏻‍💻
LangChain @LangChain

LangSmith Fleet now supports @Microsoft365 tools.

Now you can:
→ Build an agent that triages your Outlook inbox and drafts replies
→ Access Sharepoint docs so your agent has the context it needs
→ Connect any agent to Teams so your whole team can use it

Try Fleet: smith.langchain.com/agents?skipOnb…

NVIDIA AI Developer @NVIDIAAIDev
👀 @LangChain is leveling up agentic workflows

Victor Moreira, a LangChain engineer, breaks down 2 essential tools for improving performance and reliability with @llm_wizard.

✅ Deep Agent Harness to manage complex, long-duration tasks and boost LLM performance.
✅ LangSmith for tracing agents in production and allowing for continuous improvement.

Catch more interviews from our #NVIDIAGTC developer livestream: youtube.com/live/MplaRtIZe…
Andrew Nguonly reposted
Sam Lambert @samlambert
literally anyone can ship quickly if they sacrifice reliability. it’s not in any way impressive.