vowm
33 posts


看了Bloomberg 采访 Dario 的视频,这应该是 Dario 第一次回应在百度的那段经历,主持人也很懂梗,还专门问了一下当时的工作是否塑造了他今天对中国的看法🤣
主持人问他:早年在百度的硅谷分部工作过,这段经历有没有塑造他对中国的看法。回答是 "Not really, no."
Dario 说自己在那儿只待了一年,主要学到的是语音识别那一类东西。这是他还没去 OpenAI、更没有创立 Anthropic 的时候一份普通的工作。
但他还主动提到了「唯一让我在意的一点」:「当年百度那么多语音识别数据是怎么来的」:"We don't care about privacy in China. So we have all this speech recognition data."(百度在中国不在乎隐私,所以才有这么多语音识别数据。)
youtube.com/watch?v=x2VHFg…

YouTube
中文

@Stealth40k The difficulty level is optional, and you can even skip it altogether - that's a great feature. Some people just want to breeze through the main story, while others want to be immersed in the game world 24/7
English

Naoki Hamaguchi says regardless of feedback, he refused to reduce the number of minigames in Final Fantasy VII Revelation:
"The request to reduce the number of minigames is a difficult one. I have a very clear policy that I don’t want to change, no matter how much feedback we receive. After all, the original FFVII had an enormous number of minigames. This remake series exists because of the original game, and respecting the original is one of our core policies.
More than in previous installments, we’ve made it so that you don’t have to engage with minigames if you don’t want to. For example, when it comes to minigames tied to battle progression, players who wanted to focus on combat ended up feeling forced to play them. To address that, minigame rewards are now largely unrelated to combat progression. Instead, they offer things like character skins and customization options, so that only those who want to play them are encouraged to do so. Meanwhile, battle progression and the like are tied more closely to combat-oriented side content. Overall, I think the game is much more comfortable to play than past entries.
That said, I imagine there will always be players who dislike a particular minigame but still want its rewards. To help with that, we’ve added difficulty settings specifically for minigames. So now you can lower the difficulty to “Easy” if you wish, and if you try a minigame and immediately decide it’s a lost cause, you can also skip it right away. We’ve made a number of improvements aimed at minimizing frustration.
In that sense, we’ve done everything we reasonably can. We won’t reduce the number of minigames, but we will continue making every effort to ensure as many players as possible can embrace them."
automaton-media.com/en/interviews/…


English

Naoki Hamaguchi says FF7 Revelations will have about the same if not more content than FF7 Rebirth!
He says you won't be able to experience everything in one playthrough, so its hard to say which has more. But Revelations will have a very large volume of content.
automaton-media.com/articles/inter…


English
vowm retweetledi

We’ve finally announced FINAL FANTASY VII REVELATION. I’m very happy we were able to showcase the long-awaited Highwind flight.
We’re committed to delivering the ultimate experience in Spring 2027.
-
ついに『FINAL FANTASY VII REVELATION』を発表しました。長く待ち望まれていたハイウインドでのフライトもお見せでき、とても嬉しく思います。
2027年春、最高の体験をお届けします。
#FF7R
#REVELATION


日本語

My babies are finally home! 🩷
I'm so happy!
They're absolutely stunning! ✨
#cloti #FF7Rebirth




English

几个客观性说明:
1)这一条和 MiniMax 没有任何关系(我从来不接商单)
2)「体感」不等于真实水平,不是量化数据
用了更多的一些体验,整体上 coding 能力对比 m2.7 是质的提升,目前发现的缺点是 1-shot 的结果相比较 Opus 4.6/4.7/gpt5.5 没有那么全面,会出现考虑不是特别全的情况。
但是加另外一个 Agent 作为 gate 和 mentor(例如 Opus 4.7),可以获得一个非常好的结果,M3 写代码,Opus 4.7 做 code review 并给出具体的修改意见,可以做到一个很好的「效果 + token 使用」的平衡。
为了保证客观性,正在做一个更加量化的数据评测:用 Multica 这个项目中真实的 Issue 来做 benchmark。
整个流程均基于 Multica 的 Squads 功能,会同步加上其他主流 OSS 模型作为评测的对象。

Jiayuan (JY) Zhang@jiayuan_jy
已经测试一个早上了,目前体感上接近 Opus 4.7(还需要进一步测试)。 用 M3 来写代码,Opus 4.8 + GPT5.5 来做对抗式的 code review,效果还不错。 已经完成了 1 个 PR github.com/multica-ai/mul…
中文

ChatGPT 的 Translate 功能做的不像是一个前沿 AI 团队的作品,像 10 年前的互联网产品经理设计的水平,ChatGPT 团队会被 Codex 团队合并不是没有理由的。
chatgpt.com/translate
Tibor Blaho@btibor91
OpenAI is working on a new "Translation Block" widget in ChatGPT Fun fact - one of the supported languages is "High Valyrian", a fictional language created for Game of Thrones
中文

阿里云 Model Studio CLI(百炼 CLI) 官方 AI 平台命令行工具来了,支持与 Qwen 文本对话、多模态理解(图片/音频/视频)、图片生成与编辑、视频生成与编辑、语音合成与识别、知识库检索、网页搜索等能力。
github.com/modelstudioai/…

中文

vowm retweetledi

Behind the MiMo API Price Reduction:
The deepest price cut, up to 99%, is for Input (Cache Hit). The core reason is our inference framework now supports hierarchical KV cache optimization for SWA. Production inference engine tests show this optimization increases cached token capacity by 5x, equivalent to an 80% reduction in caching costs. Combined with Cache Read Overlap among multiple Full Attention modules in the Hybrid model, actual costs are further reduced.
Prices for Input (Cache Miss) and Output are also reduced by 60%-80%. This mainly benefits from the extreme 1:7 Full:SWA sparsity ratio brought by the model architecture (the prefill compute of the 70-layer MiMo-V2.5-Pro roughly equals a 10-layer GQA model). This kept our original inference costs well below the industry average, naturally leaving a 2x-3x profit margin in pricing. This price adjustment simply reflects our decision to pass these structural cost efficiencies directly to developers.
Operating at these newly reduced API prices, our production inference engine is running at near full capacity, and we can still essentially break even. We previously advised LLM companies not to "blindly cut prices" precisely because very few model architectures and inference optimizations can keep API costs from running at a loss. If more architectures that save compute and KV cache emerge, along with better inference Infra to drive down API costs, this will form an excellent virtuous cycle in the industry.
More crucially, affordable, high-performance model APIs will drive real, sustained, and at-scale inference demand. This upstream demand pulls forward the development of the entire AI infrastructure chain—including chips, servers, optical transceivers, PCBs, liquid cooling, power, energy storage, and data centers—serving as a strategic fulcrum for a systemic revaluation of AI hardware. In the long run, this injects more affordable and accessible compute into both training and inference pipelines, accelerating the parallel evolution of global AGI across multiple regions and technical routes.
For more technical details, we will release a detailed Blog post later.
English

Codex in the last 3 days has been a nightmare. Hallucination after hallucination, writing so much unwanted code, creating things that are not at all intended, taking too much time and burning tokens unnecessarily. Especially
/goal.
@sama something is wrong. It burned 50% of my Pro plan tokens for the week in 1 night creating a PR of size 104k lines and when asked about the PR : it said it’s useless and cleared the whole PR and closed it.
English

@SMOKINGCHAIN97 Amazing work, bro. Any chance you’re planning to make a wallpaper with Tifa and Aerith together next?
English



















