yingmisz
63 posts


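I tested DeepSeek V4, and it completely fails to invoke Skills properly.
Instruction following and tool calling are both poor; I can't tell whether that is down to their release or some other problem.
I tested it with my PPT Skills: it couldn't even read the template and just built some web page on its own.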

歸藏(guizang.ai)@op7418
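Wow, DeepSeek V4 is finally here! There are two models, Flash and Pro. The new version's feature support is comprehensive: JSON output, tool calling, chat prefix completion, and FIM completion. Pricing: Flash costs ¥0.2 per million input tokens and ¥1 per million output tokens; Pro costs ¥1 per million input tokens and ¥12 per million output tokens. In addition, the output price doubles for the 1M-token context.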
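To make the quoted pricing concrete, here is a minimal sketch of the arithmetic, using only the figures stated in the post above (they are not verified against any official price list). The function name `estimate_cost`, the model keys, and the `long_context` flag are hypothetical, and applying the doubling to the whole output price is one reading of the post's last sentence.

```python
# Prices in CNY per million tokens, as quoted in the post (unverified).
PRICES = {
    "flash": {"input": 0.2, "output": 1.0},
    "pro": {"input": 1.0, "output": 12.0},
}


def estimate_cost(model, input_tokens, output_tokens, long_context=False):
    """Estimate the cost in CNY of one request.

    Assumption: when long_context is True (the 1M-token context tier),
    the output price is doubled and the input price is unchanged.
    """
    price = PRICES[model]
    output_rate = price["output"] * (2 if long_context else 1)
    total = input_tokens * price["input"] + output_tokens * output_rate
    return total / 1_000_000


if __name__ == "__main__":
    # Example: 200k input tokens and 5k output tokens on each model.
    for name in PRICES:
        print(name, round(estimate_cost(name, 200_000, 5_000), 4))
```

Under these assumptions, output tokens dominate the bill on Pro much more than on Flash, which matters for agent-style workloads that generate long responses.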

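@Jason_Young1231 I noticed it while working in codex, then immediately went to the web version to chat with it, and it really is pleasant. But the perennial "it's not X, it's Y" and "the most important, the most direct, the most..." phrasings are still there.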

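Tell me this isn't absurd.
GPT 5.5 had barely launched when Claude immediately fixed its "dumbing down" bug 🤣
Since March, people have clearly felt that Claude had gotten somewhat dumber.
In certain scenarios especially, answer quality, stability, and coherence all seemed off.
At first I thought I was imagining it.
Then today, right after OpenAI released GPT 5.5, Anthropic immediately published a blog post admitting there really was a problem: High mode had been set to medium, and the issue has now been fixed.
And this apparently isn't the first time.
Some users dug up a similar earlier episode. Last time, not long after OpenAI released a new model, Anthropic also promptly came out with a bug fix; even the timing looked like a replay.
So were these problems really only discovered today?
Or were they known all along and left unfixed, until the competitor shipped a new model and the fix was suddenly fast-tracked?
Users notice the model getting "dumber" first, and the company publishes a postmortem some time later.
Once is a coincidence. Is twice still a coincidence?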
@levelsio@levelsio
I can't believe we were right Claude was dumbified on March 4, just when we noticed!

My tweet last week about Google's AI adoption drew a lot of pushback, to say the least.
Since then, Googlers from multiple orgs have reached out to me independently and anonymously. They've expressed fear of being doxxed, concern about what they saw as bullying of me, and general corroboration of my original tweet. I haven't verified each person's story, but the picture these Googlers paint is consistent across sources. It is more specific than what I originally wrote, and somewhat bleaker.
What they describe is a two-tier system. DeepMind engineers use Claude as a daily tool. Most of the rest of Google does not. When the question of equalizing access came up internally, the proposed response was to remove Claude for everyone — which DeepMind objected to so strongly that several engineers reportedly threatened to leave.
Non-DeepMind engineers get pushed onto internal Gemini variants behind router-style names that obscure which underlying model is actually serving a request. Multiple engineers describe regressions and reliability problems severe enough that some senior people have stopped using the tools. A senior manager on a major product line reportedly flagged attrition concerns over exactly this issue.
Googlers say leadership knows the gap is real. The response has been to mandate AI usage in OKRs and individual expectations, and to stand up an internal token-usage leaderboard. Unfortunately, managers have been told both that the leaderboard won't be used for performance reviews and, separately, that it absolutely will. And I hear other stories that Google's culture is not adapted properly yet for high-volume coding.
Addy Osmani's reply on behalf of Google said over 40,000 SWEs use agentic coding weekly. I don't doubt the number. But weekly use of a thin tool is precisely the box-checking I described in the original post. Volume of opens isn't adoption — and "weekly" is a low bar that includes a lot of people who tried it once and went back to writing code by hand.
The clearest thing I'm hearing is that Googlers do want to use high-quality agentic tools. They are asking repeatedly for better ones. But overall, this is not a picture of an engineering org that is fine.
My goal in the first tweet, and now, is always the same — get more people using AI and agentic coding. Nobody is as far ahead as they might look from the outside, and none of you are as far behind as you might be worried you are.
To all the Googlers who've reached out: thank you. You took a real risk and I appreciate you. Be safe. And good luck getting good models!
yingmisz retweeted

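@Uc0sT5dVoc51418 Comments like this look like "joking" on the surface, but they are a deliberate attempt to deflect. The issues in Jiangyou are school bullying, judicial credibility, and violent law enforcement, not "gun ownership in the United States." Using another country's wounds to cover up your own festering sores is a cheap form of evasion.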

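Jiangyou, a small city in Sichuan, has become a real-life "theater" over the past two days. The plot is so absurd that one wonders who is directing and who is writing the script. It started with nothing more than a school bullying incident: a girl was assaulted by her peers, and a four-minute video acted like a match that ignited the whole city's anger. Yet that anger was met not with justice but with men in black and police batons.
On August 4, the scenes on Jiangyou's streets were heartbreaking: a mother was pinned to the ground, and her son, stepping in to protect her, was dragged away by a group of police officers; someone sang the national anthem and the next second was pinned to the ground; someone shouted "law enforcement must follow the law" and was answered with silent fists and kicks. Residents were even stuffed into a "pig truck" police van, as if to tell everyone that this is not a civil society but livestock management.
Meanwhile, online, some people are busy ordering "Jiangyou pork intestines," acting as if all is well with the world. You think you are being funny, but in fact you are shrinking a public issue down to a plate of food and turning other people's blood into your own punchline. The laughable part is that, with this kind of "fun," in the end you won't even be able to afford the intestines.
The local education system is the very breeding ground for these problems. According to netizens, Mianyang, and Jiangyou with it, has long been a showcase of the "education industry": schools are like factories and children like screws, with only respect and rules missing. SWAT patrols, phone checks, ID checks, heavy security, as if the enemy were among the people. In a place where even voices are driven out, fairness naturally becomes a luxury.
Jiangyou is not an isolated case. It is a mirror reflecting how power handles a crisis: instead of solving the problem, the first reflex is to manufacture fear. Today's Jiangyou could be you and me tomorrow. When a city's sense of security is maintained by violence, and when the space for public discussion is drowned in a bowl of pork intestines, all that stands between this society and civilization is a bad joke.
Remember: the truth is not the screams in the video, nor the "stability maintenance" in the news. The truth is a weeping mother, a bloodstained citizen, a silent crowd. And silence has never changed anyone's fate; it only makes the knife fall on you sooner.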
