Jianbo Wu

87 posts

Jianbo Wu

@jwu323

training & inference / opinions are mine

San Francisco Katılım Ekim 2023

1.9K Takip Edilen131 Takipçiler

Jianbo Wu@jwu323·10h

FedEx appears to have lost my expensive package. It was one of three packages in the shipment; the other two were delivered, but this one has had no tracking updates for two weeks. Today, FedEx told me they have exhausted their search and still cannot locate it. What can I do at this point? @FedEx @FedExHelp

English

249

Jianbo Wu@jwu323·11h

@chowtato Nice, been there a lot times.

English

cato 😾@chowtato·12h

Weekend trip to Panther Beach - plan accordingly for Juneteenth and 4th of July

cato 😾@chowtato

Love glazing SF, but here are some iconic spots you have to hit in South Bay: - Industrial strength margaritas at Aqui’s in Campbell - Stan’s glazed donuts in San Jose - Boba at Pekoe in San Jose (iykyk) - Testarossa and Lima Prieta wineries in Los Gatos - West Wind Capitol Drive-In movie theater San Jose - Bahn Mi at Lee’s Sandwiches (multiple locations) - Panther Beach in Santa Cruz

English

3.5K

Jianbo Wu retweetledi

Z.ai@Zai_org·3d

Intelligence should be open, accessible, and ready to build with, empowering every developer, everywhere. GLM-5.2 is now available to all GLM Coding Plan users, including Lite, Pro, Max, and Team plans. docs.z.ai/devpack/latest… As our new flagship model, GLM-5.2 delivers powerful coding capabilities, usable 1M-context support, and continued strengths in long-horizon tasks. API and Chatbot services will launch next week. The model will also be officially open-sourced next week under the MIT License. The future of AI is open, and it belongs to the people.

English

360

993

8.3K

2.5M

Jianbo Wu@jwu323·6d

@xdotli Loop Fix-ture Engineer

English

Xiangyi Li@xdotli·6d

2023 prompt engineering 2024 context engineering 2025 harness engineering 2026 loop engineering what's next?

English

Jianbo Wu@jwu323·8 Haz

@MishraAmogh @a1zhang 👀

QME

Amogh Mishra@MishraAmogh·8 Haz

Are there any sub‑1B coding agents that are surprisingly powerful for RLM use cases? @a1zhang , you may know? Thanks

English

Jianbo Wu@jwu323·8 Haz

@harshbhatt7585 nice to learn.

English

Harsh Bhatt@harshbhatt7585·7 Haz

x.com/i/article/2063…

ZXX

4.1K

Jianbo Wu@jwu323·6 Haz

@jxnlco Maybe obsidian

English

jason@jxnlco·6 Haz

I need Google Docs but just for markdown files. Multiplayer comments. Syncing resolving comments. Suggestion mode Edit mode Edit history Maybe some sense of multi edits. Easy cli access.

English

288

1.8K

494.7K

Jianbo Wu@jwu323·6 Haz

@aiandcloud 打算试试

中文

Jintao Zhang 张晋涛@aiandcloud·6 Haz

前几天和朋友聊到在 coding agent 中进行上下文压缩的策略。我们可以把动态规划和经济学模型结合起来，不再简单按阈值触发，配合经济学模型进行精算，只有在收益比为正时再进行压缩。同时由于压缩后会有失真，再引入失真惩罚，缓存失效以及压缩成本等，最大化缓存利用率，尤其是用 DeepSeek API 时

中文

173

15.6K

Jianbo Wu@jwu323·5 Haz

x.com/i/article/2062…

ZXX

405

Jianbo Wu@jwu323·5 Haz

@middlefeng agentic ripgrep is all you need.

English

1.9K

FENG DONG@middlefeng·5 Haz

一直很惊讶为什么 agent 能很快洞悉 code base 一些全局的联系，因为它的上下文显然也读不了太多代码。今天突然想明白了。比如说用一个很 general 的 expression 去 grep 整个 code base，结果可能有几千行。这个结果对人来说就等于没用。但是对 agent 来说就相当于去读整个 code base 的指南。

中文

31.8K

Jianbo Wu@jwu323·4 Haz

@julien_c @badlogicgames Really cool.

English

Julien Chaumond@julien_c·4 Haz

Today I'm launching a new project called SynthTraces 🔥 It is a minimal codebase to generate synthetic coding agent session traces using Pi (from @badlogicgames) I wanted a large number of coding-agent traces, so I built a tiny harness where two models talk to each other: - an open model (served via HF Inference Providers) plays the coding agent. It gets read + bash access to a real open source codebase (the huggingface OSS projects) - a small local model (llama.cpp) plays the human user, asking simple questions like "how do I run this?" or "how is CI set up?" The result is more than 2,000 Pi session traces which can be used to train or fine-tune LLMs, and optimize them for Pi 🤯 And ofc everything is published on @huggingface ✅

English

355

52.9K

Jianbo Wu@jwu323·3 Haz

@hxiao so interesting

English

Han Xiao@hxiao·2 Haz

Sharing a project I've been heavily using - Dataroom. It's a local-first harness that runs deep research with a small language model and gives a zip file at the end. Deep research is becoming an important first step for long-horizon tasks (the 2nd step being implementation), and I believe a small local model in a disciplined harness handles it well - we shouldn't waste frontier-model tokens on it. Dataroom runs on your own GPU at near-zero marginal cost, and it can keep going for hours until the dataroom is genuinely comprehensive, instead of stopping when a metered budget runs out.

English

189

15K

Jianbo Wu@jwu323·3 Haz

Maybe dropping an LLM with trace analysis into the loop could help steer things a bit more intentionally — not just tweaking the model arch and checking the loss, but actually asking why some changes work better than others. source: blog.huikang.dev/2026/05/31/aut…

English

184

Jianbo Wu retweetledi

Kyle Lo@kylelostat·2 Haz

happy to share another quality tech report w/ the wider research community 🫶 great read for ppl who want to see all the details for methods + infra for scaling up pretraining & RL, esp detailed discussion about data which is often kept vague by other labs