

okcomputer
359 posts





Anthropic pays $750,000+ a year for engineers who can build LLM architectures from scratch. Stanford taught the entire thing in 1 hour lecture & released it for free. Bookmark & watch this today before someone takes it down.




又一次对 Opus 写业务逻辑无比失望。 昨晚上让 Claude Code 做一个功能,需求描述得很清楚,plan mode 讨论了好几轮才开始动手。做了很久,结果出来就有问题。描述了两轮让它修,还是修不好。干脆全部重置,不让它做了。 然后打开 Codex,同样的需求、同样的交互逻辑,一字不差地描述给它。也没讨论,直接告诉它:做完写测试用例,自己验证,要重新 review 一遍,没做完不要停,直到没问题。 今天早上起来一看,功能全部实现了。只有一点点字体偏移的小问题,逻辑没有任何毛病。 说到干活靠谱,还是 Codex 靠谱。写业务代码就应该多用 Codex,少用 Opus,节省生命。Opus 还是留给前期设计和写 UI 比较合适。 但实际用的时候经常忍不住——因为它快,能给即时反馈,写着写着就继续用下去了。这个过程其实挺累的,写的时候时不时冒出 bug,写完之后让 Codex review 还是能查出问题。但同样的东西直接让 Codex 从头写,就没问题。 快和靠谱,有时候真的是两回事。

和@sainingxie 一起挑战7小时播客!他刚和Yann LeCun踏上“世界模型”的创业旅程(AMI Labs)。这是他第一次Podcast、第一次访谈。 2026年2月雪后的一天,我们在纽约布鲁克林,从下午2点,开启了一场始料未及的马拉松式访谈,直到凌晨时分散去。 这篇访谈的中文标题叫做《逃出硅谷》,但他又不厌其烦地枚举了影响他学术生涯的每一个人,并反反复复口头描摹这些人的人物特征(侯晓迪、何恺明、杨立昆、李飞飞…)正是这些,让这篇“逃出硅谷”的对话充斥着人性的温度。 By the way, 下面是访谈的YouTube版本,我们提供了中英字幕。 And yes, 我们是在用播客给这个世界建模😎 A 7-hour podcast with Saining Xie. He has just begun a new journey on world models with Yann LeCun at AMI Labs. This was his first podcast appearance and his first long-form interview. A day after the snowfall in February 2026, in Brooklyn, New York, we started recording at 2 p.m. What followed became an unexpected marathon conversation that lasted until the early hours of the morning. The Chinese title of the interview is “Escaping Silicon Valley.” Yet throughout the conversation, he patiently listed the people who shaped his academic life, repeatedly sketching their personalities in vivid detail: Hou Xiaodi, Kaiming He, Yann LeCun, Fei-Fei Li, and others. These portraits are what give this “escape from Silicon Valley” conversation its human warmth. By the way, the YouTube version of the interview is below, with Chinese and English subtitles. And yes, we are using podcasts to model the world 😎 A 7-hour marathon interview with Saining Xie: World Models, AMI Labs, Ya... youtu.be/rIwgZWzUKm8?si… 来自 @YouTube





GPT-5.3-Codex is here! *Best coding performance (57% SWE-Bench Pro, 76% TerminalBench 2.0, 64% OSWorld). *Mid-task steerability and live updates during tasks. *Faster! Less than half the tokens of 5.2-Codex for same tasks, and >25% faster per token! *Good computer use.
