
Shuyao Tim Xu



Are you up for a challenge? openai.com/parameter-golf


The results are not only poor; the differences between models are also huge: Opus-4.6 fails completely, and Gemini-3.1 performs only half as well as GPT-5.4. This capability does not seem to be stressed equally across frontier models, even though it is essential for math use cases.

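Can anyone confirm: are the two stealth models newly listed on OpenRouter DeepSeek V4? Found via @geekbb. One is called Healer Alpha, a frontier omni-modal model with vision, hearing, reasoning, and action capabilities. It natively perceives visual and audio input, reasons across modalities, and executes complex multi-step tasks precisely and reliably. The other is called Hunter Alpha, a 1-trillion-parameter, 1M-token model built for agent use. It excels at long-horizon planning, complex reasoning, and multi-step task execution, with the reliability and instruction-following precision required by frameworks such as OpenClaw. I get the feeling this AI team probably likes MMORPGs...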







We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models.















