jack r
40 posts



New Anthropic Fellows Research: a new method for surfacing behavioral differences between AI models. We apply the “diff” principle from software development to compare open-weight AI models and identify features unique to each. Read more: anthropic.com/research/diff-…


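I subscribe to a whole pile of AI tools/models, and in practice each one has its own strongest area: Grok = intelligence officer (investigation/real-time data/real-time research) Manus = scout (brainstorming/research/MVP) Claude = brain (architecture/planning) Codex/GPT = limbs (main code output) MiniMax = laborer (grunt work/testing/mocks) Cursor = scalpel (polishing/debugging) Gemini = auditor (logs/full-repo review) A parallel pipeline: while I debug in Cursor, Codex asynchronously generates the next batch of modules and MiniMax mass-produces tests and mocks. Three lines running in parallel, and it is a joy.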



You can now enable Claude to use your computer to complete tasks. It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk. Research preview in Claude Cowork and Claude Code, macOS only.




Introducing GLM-5V-Turbo: Vision Coding Model
- Native Multimodal Coding: Natively understands multimodal inputs including images, videos, design drafts, and document layouts.
- Balanced Visual and Programming Capabilities: Achieves leading performance across core benchmarks for multimodal coding, tool use, and GUI Agents.
- Deep Adaptation for Claude Code and Claw Scenarios: Works in deep synergy with Agents like Claude Code and OpenClaw.
Try it now: chat.z.ai
API: docs.z.ai/guides/vlm/glm…
Coding Plan trial applications: docs.google.com/forms/d/e/1FAI…














