WFH

2K posts

@WHinthorn

Dev/Research/Studies in ML/NLP @langchainAI Formerly Research @ @robustHQ & Microsoft & Princeton Haymaking. https://t.co/Evua9bcgbQ

Joined August 2015
2K Following, 1K Followers
WFH retweeted
Harrison Chase @hwchase17
🎙️Introducing Max Agency

Max Agency is a new podcast where we go deep on how the best agents are actually being built: architecture decisions, tradeoffs, evals, and everything in between. Each episode, I sit down with engineering leaders who are doing this work in production.

Our first episode features Izzy Miller (@isidoremiller), AI Engineer at Hex (@_hex_tech). Hex has been shipping data agents since before most teams were even thinking about them, starting with single-cell text-to-SQL and graduating to a full Notebook agent that can work autonomously for 20 minutes on a complex analysis. Izzy has a lot of perspective on what it actually takes to get agents working well in production, and what breaks along the way.

A few takeaways from our conversation:
- Keep your eval sets small enough to hold in your head: Izzy runs 30-50 handcrafted "traps" with multiple repetitions, rather than hundreds of variants. If you can't explain why your agent fails each one, your eval set is too big
- Day zero performance is almost irrelevant: The more interesting question is how the agent compounds. Izzy is building a 90-day simulation where the warehouse evolves and the agent has to accumulate understanding
- You can catch agent errors without seeing the raw outputs: By running an LLM-as-a-judge over production usage and clustering the results, you can surface places where something likely went wrong, without needing to read individual conversations

Watch the full episode on:
- Youtube: youtube.com/watch?v=Xyh1Eq…
- Apple Podcasts: podcasts.apple.com/us/podcast/how…
- Spotify: open.spotify.com/episode/1BJlg3…
14
42
222
31.5K
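
The first takeaway in the post above (a small set of handcrafted "traps", each run several times) can be sketched roughly as follows. This is only an illustration under stated assumptions, not how Hex or LangChain run their evals: `Trap`, `run_traps`, `toy_agent`, and the two example traps are all hypothetical names invented here, and the toy agent is a deterministic stand-in for a real agent call.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Trap:
    """One handcrafted eval case (hypothetical structure, for illustration)."""
    name: str
    prompt: str
    check: Callable[[str], bool]  # True when the output avoids the trap


def run_traps(agent: Callable[[str], str], traps: List[Trap],
              repetitions: int = 5) -> Dict[str, float]:
    """Run every trap `repetitions` times and return the pass rate per trap."""
    rates: Dict[str, float] = {}
    for trap in traps:
        passes = sum(trap.check(agent(trap.prompt)) for _ in range(repetitions))
        rates[trap.name] = passes / repetitions
    return rates


def toy_agent(prompt: str) -> str:
    # Assumption: stand-in for a real text-to-SQL agent; replace with a model call.
    if "revenue" in prompt:
        return "SELECT SUM(amount) FROM orders WHERE status = 'paid'"
    return "SELECT * FROM users"


# Two made-up traps; a real set would hold the 30-50 cases mentioned in the post.
TRAPS = [
    Trap("excludes_refunds", "total revenue last month",
         lambda out: "status = 'paid'" in out),
    Trap("no_select_star", "list active users",
         lambda out: "SELECT *" not in out),
]

rates = run_traps(toy_agent, TRAPS, repetitions=3)
# Traps that fail at least once are the ones to be able to explain by hand.
failing = [name for name, rate in rates.items() if rate < 1.0]
```

Keeping the set this small means every name in `failing` can be inspected individually, which is the point the post makes about being able to explain each failure.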
WFH retweeted
Jake Broekhuizen @jakebroekhuizen
The first episode of our 'Max Agency' podcast is now live on Youtube and podcast platforms! Was great to work with Izzy in the lead-up to this episode with @hwchase17 Check it out below 👇 youtube.com/watch?v=Xyh1Eq…
0
8
13
2.5K
Sarah Wooders @sarahwooders
Memory in the sense of recalling information is a solved problem, or at least as solved as it needs to be. That's why everyone is getting ~100% on all the meaningless "memory benchmarks". Memory in the sense of learning/improving over time is very much unsolved though.
47
19
224
17.6K
WFH retweeted
Mukil Loganathan @MukilLoganathan
Really excited to finally launch this in private preview. We have a lot more planned re memory, security, and monitoring to provide the most secure ephemeral environments for your agents. If you are interested would love to chat! We will be letting people off the waitlist incrementally but DM me if you want quicker access!
LangChain @LangChain

🚀 Today we're launching LangSmith Sandboxes

Agents get a lot more useful when they can run code: analyze data, call APIs, build entire applications. Sandboxes give them a safe place to do it with ephemeral, locked-down environments you control.

Now in Private Preview.

Learn more: blog.langchain.com/introducing-la…
Join the waitlist: langchain.com/langsmith-sand…
2
3
11
2.7K
Sherwood @shcallaway
For your consideration:
2025 - MCP
H1 2026 - CLI + Sandboxes
H2 2026 - RLM + SDKs
13
2
97
10.7K
WFH @WHinthorn
@giffmana Also, "belt-and-suspenders"
0
0
0
43
Lucas Beyer (bl16) @giffmana
Both codex-cli and claude code like to use "X is the smoking gun" way too much during investigations. Either OAI and Anthro use the exact same env provider company, or both use a public reasoning/agentic dataset that over-uses this phrase. Do any of my followers know by chance?
Lucas Beyer (bl16) @giffmana

@Must_af_a @thomascygn literally 15min after I read your reply, now in a codex-cli session:

37
2
244
39.1K
Aarno @TheGlobalMinima
@kmeanskaran yes, currently using pydantic ai + langgraph. working pretty well so far
1
0
2
361
Aarno @TheGlobalMinima
Ditch all agent frameworks. Pick Pydantic AI / DSPy. Singular agents that reason, but define your own orchestration.
8
1
123
5.2K
Jerry Liu @jerryjliu0
@swyx the big liu brothers
2
0
2
833
swyx 🐣 @swyx
i'm asian so it's ok to say this
Jerry Liu @jerryjliu0

Shoutout @latentspacepod for calling me a "Big Harness guy" in this article: latent.space/p/ainews-is-ha…

The biggest barrier to adopting AI is your ability to provide context and workflows to these models. We see ourselves as unlocking the highest quality context from all documents (PDFs, Word, Excel) so that agents can reason through them at scale.

7
2
88
18.9K
WFH @WHinthorn
@0xFxy I liked developing in dotnet previously. If you fill out the form, you could write "other" and mention the request!
1
0
0
12
f(x) @0xFxy
@WHinthorn Would love C#/.NET support. Huge enterprise ecosystem.
1
0
0
21
WFH @WHinthorn
@summery @LangChain_OSS Hi Summery! Seems like you're writing your own agent framework already, but lmk if you're interested in collaborating
1
0
0
20
LangChain OSS @LangChain_OSS
☕️ Building agents in Java or Go? We want to hear from you! Join the waitlist to get early access and help shape the future of LangGraph in your language. Sign up ➡️ airtable.com/appeUtF0AROUCZ…
7
7
49
6.6K
WFH retweeted
LangChain @LangChain
What? LangChain is evolving! Meet our final form ➡️ langchain.com
63
92
1.5K
96.5K