Irving
3.8K posts

Irving
@BlockInsight214
捕捉每一丝灵感,将天马行空落地为产品 AI Agent Builder | Bitcoin Core Dev |

In our tests, Sonnet 5 significantly outperformed Opus 4.8 (both using "max effort") in stock analysis... My rough analysis suggests the reason is that Sonnet 5 tends to frequently call tools to verify facts, whereas Opus 4.8 prefers to guess answers based on its own internal knowledge. At the same time, Sonnet 5 consumes five to six times as many tokens.



Claude Fable 5 will be available again globally tomorrow. After a series of productive conversations with the US government, we're redeploying the model with a new set of classifiers to target and block more cybersecurity tasks. In the near term, some routine tasks like coding and debugging will fall back to Opus 4.8. We’ll continue to refine these classifiers over the coming weeks to reduce false positives and better distinguish genuine misuse from legitimate requests. We’ve also begun drafting a consensus framework—with Amazon, Microsoft, Google, and other Glasswing partners—for assessing the severity of AI jailbreaks and how AI developers should respond to them. We invite other industry partners and model providers to join us in this effort. Finally, we’re scaling up our collaboration with the US government on model testing and safeguards. This will include pre-release access to models and safeguards for evaluation, information sharing on jailbreaks and misuse, and dedicated resources for joint research. Thank you to our users for your patience, and to our partners across the government, industry, and the research community who worked alongside us to make Fable 5 available again. Read our full blog: anthropic.com/news/redeployi…

🚀劲爆消息!Claude Code 被曝疑似内置“隐藏后门”,专门检测中国用户。Claude封号原因终于找到了!!! 据 Reddit 爆料:从 2.1.91 版本开始,Claude Code 会在用户开启代理时检查系统时区是否为 Asia/Shanghai / Asia/Urumqi,并判断代理 URL 是否指向中国域名或中国 AI 实验室。 更隐蔽的是,这些信息并不是直接上报,而是通过修改系统提示词里的日期格式和撇号字符来“编码”传递:比如把日期从 2026-06-30 变成 2026/06/30,再用不同 Unicode 撇号区分用户环境。 换句话说,如果爆料属实,Claude Code 不只是检测代理,而是在用户几乎无法察觉的情况下,把“中国时区 / 中国代理 / AI Lab 关联”这类信息塞进系统 prompt。 这件事真正可怕的地方不在于 Anthropic 想防止中国区倒卖或模型蒸馏,而在于:开发者把 Claude Code 当作拥有文件系统和 Shell 权限的编程助手使用,一旦客户端可以偷偷修改 prompt、隐藏检测逻辑,信任边界就已经被打破了。 今天是“检测中国用户”,明天会不会是更复杂的行为控制? #Claude #ClaudeCode #Anthropic 原文如下⬇️ reddit.com/r/ClaudeAI/com…



🚨又又又见证历史了!Claude Fable 5 才上线四天,就被美国政府要求停用。 Claude Fable 5 上线仅 4 天,就被美国政府要求停用。 这并不是一款普通模型,而是顶级 Mythos 系列的公开版本。即便官方已经削弱高风险能力,最终仍未能逃过被全面关停的命运。 过去各国争的是石油和芯片。未来争的,可能就是最强 AI。 当 AI 开始与国家安全直接挂钩,它就不再只是生产力工具,而是战略资源。 如果有一天,最强 AI 全部进入管制时代,全球创新格局会变成什么样?🤔 #AI #Claude #AGI #FutureOfAI #TechNews








Hi, this is an experiment we launched in March that was meant to prevent account abuse from unauthorized resellers and protect against distillation. The team has landed stronger mitigations since then and we’ve actually been meaning to take this down for a while. We merged the PR and this should be fully rolled back in tomorrow’s release.



Anthropic 每天都能整点新活,感觉现在大家都习惯了 昨天被爆出在系统提示中,以用户无法察觉的方式将市区代理和 AI 实验室信息放进去,用这种方式获取一些用户的信息。 结果被发现并传播以后,又赶紧说以前我们不用这种方式了,或者说这种方式本来就准备下掉,明天就下掉,又当又立了。 昨晚发布的 Sonnet 5 在测试中发现,它的测试结果虽然接近了 Opus 4.8,但任务成本可能比 Opus 4.8 还高,甚至在完成测试任务上的成本接近了 Fable 5。 所以说它的综合成本可能比 4.8 贵得多,这模型真离谱。而且很多人的体感反馈也不是很好,说它会偷懒,还会拒绝执行任务。 唯一好的一点是,Fable 5 模型终于被授权重新开放给所有用户了,明天就能知道具体措施了,这也解释了为什么前几天会大规模封号。



Claude Fable 5 will be available again globally tomorrow. After a series of productive conversations with the US government, we're redeploying the model with a new set of classifiers to target and block more cybersecurity tasks. In the near term, some routine tasks like coding and debugging will fall back to Opus 4.8. We’ll continue to refine these classifiers over the coming weeks to reduce false positives and better distinguish genuine misuse from legitimate requests. We’ve also begun drafting a consensus framework—with Amazon, Microsoft, Google, and other Glasswing partners—for assessing the severity of AI jailbreaks and how AI developers should respond to them. We invite other industry partners and model providers to join us in this effort. Finally, we’re scaling up our collaboration with the US government on model testing and safeguards. This will include pre-release access to models and safeguards for evaluation, information sharing on jailbreaks and misuse, and dedicated resources for joint research. Thank you to our users for your patience, and to our partners across the government, industry, and the research community who worked alongside us to make Fable 5 available again. Read our full blog: anthropic.com/news/redeployi…



Claude Fable 5 will be available again globally tomorrow. After a series of productive conversations with the US government, we're redeploying the model with a new set of classifiers to target and block more cybersecurity tasks. In the near term, some routine tasks like coding and debugging will fall back to Opus 4.8. We’ll continue to refine these classifiers over the coming weeks to reduce false positives and better distinguish genuine misuse from legitimate requests. We’ve also begun drafting a consensus framework—with Amazon, Microsoft, Google, and other Glasswing partners—for assessing the severity of AI jailbreaks and how AI developers should respond to them. We invite other industry partners and model providers to join us in this effort. Finally, we’re scaling up our collaboration with the US government on model testing and safeguards. This will include pre-release access to models and safeguards for evaluation, information sharing on jailbreaks and misuse, and dedicated resources for joint research. Thank you to our users for your patience, and to our partners across the government, industry, and the research community who worked alongside us to make Fable 5 available again. Read our full blog: anthropic.com/news/redeployi…












