DataLearner
1.2K posts

DataLearner
@DataLearnerAI
关注数据科学 关注科技行业 关注人工智能 关注一切促进人类生活美好的新技术 业界主流大模型列表:https://t.co/H4FUDd7Gfb 国产开源大模型生态现状:https://t.co/q5KU9WhPuE

Anthropic说Opus 4.8的默认思考模式是high,效果和成本的最佳均衡。从OS-World-Verified看,比Opus 4.7的Max模式少一点tokens,比Opus 4.7 xhigh高一点tokens就能有更好的效果。但是实际体验上不知道是不是5h限额缩小的原因,做了2从信息搜索任务并生成md和接口请求,单次都是消耗5小时限额的10%!




有些不明白为什么小参数模型要搞各种 benchmark,感觉小参数模型主要是验证用,还有就是搞创新,还有就是给没卡的研究生发论文用。

Codex 交互做的真的挺好的,你可以方便的看当前运行的 SubAgents,以及每个 SubAgent 在做的事、用的提示词

Some of you noticed limits drained faster in Codex, we root caused it to an optimization that we rolled back that had an impact on cache hit rates when compacting across long running sessions. We fixed this and have now reset usage limits for all accounts. Enjoy the weekend.

A little secret. About 5% of our production traffic is on the Pi harness, about another 5% is on OpenCode. Reminder you can use your ChatGPT account in a flourishing set of other tools. We’ll continue to make Codex awesome, but you have options.


谷歌 Antigravity 负责人(原 Windsurf 创始人)Varun Mohan 宣布,即日起再次将所有付费订阅计划的每周 Gemini 模型调用额度上限提升 3 倍。加上前一日 3 倍的额度调整,目前的基准配额已累计达到最初版本的 9 倍。同时,官方已将所有付费用户的当周用量清零重置,以期为开发者提供更充足的算力余量。 然而,这一「加量」声明备受吐槽。有开发者在评论区指出,Antigravity 此前曾经历过一次严重的「配额缩水」(rug-pull),当时的调用限制严苛到哪怕只是偶尔使用侧边栏对话的轻度用户都会迅速触发限制,导致产品陷入完全不可用的「窒息状态」。官方此举本质上只是在修复之前极度不合理的严苛限制,如今却将其包装成慷慨的「免费福利」来进行营销。

GLM-5.1-highspeed is coming, 400 tokens per second. Very expensive, but bring a new possibility.

anti-gravity CLI 里面竟然有 opus-4.6, 那估计大多数人首选用 opus 了。


Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

Qwen 3.7 Max Preview and Plus are live on the Qwen website

Why don’t LLM’s just tell you when you are asking a question / doing something that is out of distribution?

The model is the product








