Chau Tran
@mr_cheu

Building AI @glean. Past: Machine Translation at FAIR, ML @Quora

New York, USA · Joined April 2009
943 Following · 1K Followers
1.1K posts
Chau Tran @mr_cheu:
What's the best open-source code-mode tool-execution framework that's lightweight and supports both MCP and non-MCP tools? I don't want dependencies on any agent/harness SDK.
Chau Tran @mr_cheu:
@giffmana Maybe crashes are penalized more than silent failures in post-training?
Lucas Beyer (bl16) @giffmana:
PLEASE Anthro and OpenAI, make sure this paper is in the training data. He'll put it in the context! I'm tired of the code you generate "failing silently" instead of loudly so I can investigate! Don't be robust to document.querySelector() not finding the thing we *need* to exist. Just fail loudly so we notice and fix, instead of going down a long hunt for "why nothing happens". (And Google too? Haven't tried gemini-cli yet.)
Quoting Dominik Tornow @DominikTornow:
assert() in production; crash on violation. An assertion violation means the component has already failed, it just hasn't crashed yet. Crash the component and let the rest of the system compensate. A crash is always a possible failure mode. Take advantage.
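Dominik's crash-on-violation principle can be sketched in a few lines. This is an illustrative sketch, not code from any of the quoted posts: the `require` helper, the `#submit-button` lookup, and the supervisor fallback are all hypothetical names chosen to echo the querySelector example above.

```python
class AssertionViolation(Exception):
    """Raised when an invariant the component *needs* is broken."""

def require(condition: bool, message: str) -> None:
    # Fail loudly: if the invariant is violated, the component has
    # already failed -- crash it now instead of limping on silently.
    # (A custom helper rather than `assert`, which `python -O` strips.)
    if not condition:
        raise AssertionViolation(message)

def render_widget(dom: dict) -> str:
    node = dom.get("#submit-button")
    # The silent-failure style would be `if node is None: return ""`,
    # leaving you hunting for "why nothing happens". Instead, treat a
    # missing required node as a crash-worthy violation.
    require(node is not None, "#submit-button must exist in the DOM")
    return f"rendering {node}"

def supervised(dom: dict) -> str:
    # "Let the rest of the system compensate": a supervisor catches the
    # crash and falls back, while the failure stays visible.
    try:
        return render_widget(dom)
    except AssertionViolation as e:
        return f"component crashed ({e}); falling back"
```

The point of the pattern is that the crash is the signal: the component stops at the first broken invariant, and recovery is handled one level up rather than papered over in place.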
Chau Tran @mr_cheu:
Claude Code/Codex should evolve to become a full terminal replacement, accepting both bash scripts and natural language instructions.
alex zhang @a1zhang:
What if scaling the context windows of frontier LLMs is much easier than it sounds? We’re excited to share our work on Recursive Language Models (RLMs), a new inference strategy where LLMs can decompose and recursively interact with input prompts of seemingly unbounded length, as a REPL environment. On the OOLONG benchmark, RLMs with GPT-5-mini outperform GPT-5 by over 110% (more than double!) on 132k-token sequences and are cheaper to query on average. On the BrowseComp-Plus benchmark, RLMs with GPT-5 can take in 10M+ tokens as their “prompt” and answer highly compositional queries without degradation, even better than explicit indexing/retrieval. We link our blogpost, (still very early!) experiments, and discussion below.
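A toy sketch of the recursive idea as I read the tweet (the actual RLM design is in the linked blog post): the root call never sees the whole prompt, only bounded chunks, and over-budget inputs are split, delegated, and recombined. `call_model` is a hypothetical stand-in for an LLM API, here just a deterministic truncating "summarizer".

```python
CONTEXT_BUDGET = 1000  # max characters one "model call" may see (toy value)

def call_model(text: str) -> str:
    # Hypothetical stand-in for an LLM call: "summarizes" its input by
    # keeping a bounded prefix. A real system would call a model here.
    assert len(text) <= CONTEXT_BUDGET
    return text[:80]

def recursive_call(prompt: str) -> str:
    # Small enough to fit the budget: answer directly.
    if len(prompt) <= CONTEXT_BUDGET:
        return call_model(prompt)
    # Too big: split, recurse on each half, then combine the two
    # sub-answers with one more bounded call. The full prompt is never
    # passed to a single model call.
    mid = len(prompt) // 2
    left = recursive_call(prompt[:mid])
    right = recursive_call(prompt[mid:])
    return call_model(left + " " + right)
```

The sketch only shows the control flow (bounded calls plus recursion over an unbounded input); it omits the REPL environment, where the model itself writes code to slice and inspect the prompt variable.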
Chau Tran @mr_cheu:
gpt-5-pro API wen?
Jo Kristian Bergum @jobergum:
Wonder what happened with the @LinkedIn recsys algo that now prioritizes 2-3w old posts that I have seen before. Congratulations!
Chau Tran @mr_cheu:
@headinthebox Probably because this model was post-trained with tools in that exact schema? A more generalized version should come later.
Erik Meijer @headinthebox:
OpenAI Deep Research MCP support: ... Your MCP server must implement two tools to work with deep research - search and fetch ... [0] This feels counter to the spirit of MCP, and I think using regular function calling is more appropriate.
Chau Tran @mr_cheu:
@walden_yan It depends on whether you’re building a vertical agent or a horizontal agent, though. Broad horizontal agents like ChatGPT or Glean benefit more from calling specialized subagents.
Walden @walden_yan:
I see a lot of people make the same mistakes building agents. So we shared a few of the principles we use: cognition.ai/blog/dont-buil…
Chau Tran @mr_cheu:
We need an LLM API that can iteratively call tools, see tool outputs, call tools again, and so on, until finally streaming back the response.
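The loop Chau is asking the API to absorb is what agent harnesses implement client-side today. A minimal sketch, where `model_step`, the message shapes, and the tool registry are all illustrative stand-ins (no real provider API is being reproduced here):

```python
import json

def model_step(messages: list[dict]) -> dict:
    # Hypothetical stand-in for one LLM chat-completion call. Returns
    # either {"tool": name, "args": {...}} or {"final": text}. This toy
    # policy calls the calculator once, then answers with its output.
    last = messages[-1]
    if last["role"] == "tool":
        return {"final": f"The answer is {last['content']}."}
    return {"tool": "calculator", "args": {"expression": "6*7"}}

TOOLS = {
    "calculator": lambda expression: str(eval(expression)),  # demo only
}

def run_agent_loop(user_prompt: str, max_steps: int = 8) -> str:
    # The loop the tweet wants server-side: model proposes a tool call,
    # we execute it, feed the output back, and repeat until the model
    # emits a final answer (or the step budget runs out).
    messages = [{"role": "user", "content": user_prompt}]
    for _ in range(max_steps):
        step = model_step(messages)
        if "final" in step:
            return step["final"]
        output = TOOLS[step["tool"]](**step["args"])
        messages.append({"role": "assistant", "content": json.dumps(step)})
        messages.append({"role": "tool", "content": output})
    return "stopped: step budget exhausted"
```

Moving this loop server-side would mean the provider runs the propose/execute/observe cycle and only streams back the final turn, instead of every client reimplementing it.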
Chau Tran @mr_cheu:
4/n What are the best memory management techniques for long-running agents, and what we need from future LLMs.
Chau Tran @mr_cheu:
3/n When to use finetuning vs dynamic prompting for training agents that scale and personalize.
Chau Tran @mr_cheu:
🚀 Excited to speak at @aiDotEngineer this Wednesday @ 2pm! Main topics: 1/n How to build enterprise-aware agents that are not just smart but also adaptable and aligned with your company’s processes.
Chau Tran @mr_cheu:
Just look at the system prompts from all the frontier labs if you don't believe me.
Chau Tran @mr_cheu:
Prompting is like writing software; finetuning is like building customized hardware. The first is more flexible: you can adapt to the requirements very quickly. The second is more optimized, but more costly, so you can only do it when the requirements don't change too quickly.

Quoting Aaron Levie @levie:
The more time you spend with AI the more you realize prompt engineering isn’t going away any time soon. For most knowledge work, there’s a very wide variance of what you can get out of AI by better understanding how you prompt it. This actually is a 21st century skill.