Shuyao Tim Xu

64 posts

Shuyao Tim Xu banner
Shuyao Tim Xu

Shuyao Tim Xu

@TimXu222575

@Kimi_Moonshot

Katılım Temmuz 2023
311 Takip Edilen25 Takipçiler
antirez
antirez@antirez·
Oh, and if you want to smile: Cursor valuation is like 10-20x the one of Moonshot AI.
English
13
2
296
25.1K
Shuyao Tim Xu
Shuyao Tim Xu@TimXu222575·
@_TobiasLee 这个确实只是后训练的一个小方向,不过国内御三家都各有一批人在做
中文
1
0
0
2.2K
Lei Li
Lei Li@_TobiasLee·
2026 了,可以忽略一切拿 one-shot 前端测评 frontier LLM 博主了...
中文
4
2
91
88.5K
外汇交易员
外汇交易员@fxtrader·
美团王兴:AI Agent对我的冲击比ChatGPT更大。
外汇交易员 tweet media
中文
39
59
415
111K
Shuyao Tim Xu
Shuyao Tim Xu@TimXu222575·
best perk of working at an LLM shop: your agent system is blocked on infra, so you get to go home like a normal human being for once
English
0
0
0
59
向阳乔木
向阳乔木@vista8·
比较靠谱的猜测,hunter是蚂蚁的Ling-2.6-1T。 healer是小米的模型。
向阳乔木@vista8

求证:OpenRouter新上的两个隐身模型是DeepSeek V4吗? 发现自 @geekbb 一个叫Healer Alpha(治疗者Alpha) 具有视觉、听觉、推理和行动能力的前沿全模态模型。 原生感知视觉和音频输入、跨模态推理以及精确可靠地执行复杂的多步骤任务。 一个叫Hunter Alpha(狩猎者Alpha) Hunter Alpha是为Agent使用构建的1万亿参数+1M Token模型。 擅长长期规划、复杂推理和多步任务执行。 具有OpenClaw等框架所需的可靠性和instruction-following精度。 感觉这个AI团队估计是喜欢MMORPG游戏的...

中文
5
1
19
12.5K
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
If Hunter-Alpha is V4 and it really has 1T tokens, that's very sad (unless it was trained on Ascends, which would suggest better models not constrained to 2-4K H800s to come soon. But still sad on the V4 side) If it's something like Kimi-DSA/Xiaomi/GLM, that's better.
English
24
1
126
12K
Shuyao Tim Xu
Shuyao Tim Xu@TimXu222575·
@scaling01 kimi agent (the feature on web) has it alalready. One advantage is that you create a full website from scratch with ai generated figures. However i agree this is more for cool demo
English
0
0
0
24
Lisan al Gaib
Lisan al Gaib@scaling01·
I don't know who needs this but Codex CLI will support image gen natively in the next big update
English
6
2
99
6.2K
Shuyao Tim Xu
Shuyao Tim Xu@TimXu222575·
@scaling01 gpt oss was actually a great model. It has the proprietary that chinese model lacks
English
0
0
1
286
Lisan al Gaib
Lisan al Gaib@scaling01·
For some reason I dreamt that OpenAI is going to release two open-source models today but I guess GPT-5.4 will also do
English
18
1
225
14.8K
Shuyao Tim Xu
Shuyao Tim Xu@TimXu222575·
what the heck? hr was scheduling my interview with GOAT @JustinLin610 just last friday🤯🤯🤯
English
0
0
1
93
Shuyao Tim Xu
Shuyao Tim Xu@TimXu222575·
@Teknium Including DeepSeek is purely political here. It feels like DeepSeek is just using Claude for comparison, maybe comparing their judge with Claude's.
English
0
0
6
246
Teknium (e/λ)
Teknium (e/λ)@Teknium·
FYI 150k rubric judgements is just 5000 sample RL training run at groupsize 32. This is like, literally nothing. Fear mongering. DeepSeek gets such a bad rap because Anthropic and our labs are complete shitbags
Teknium (e/λ) tweet media
English
3
12
167
8.5K
Shuyao Tim Xu
Shuyao Tim Xu@TimXu222575·
@pmddomingos Did you notice that the DeepSeek's requests is only at the scale of 150,000?? Including DeepSeek in the report seems purely political. It is nowhere "distilling"
English
0
0
16
793
Anthropic
Anthropic@AnthropicAI·
These attacks are growing in intensity and sophistication. Addressing them will require rapid, coordinated action among industry players, policymakers, and the broader AI community. Read more: anthropic.com/news/detecting…
English
366
367
7K
2.3M
Anthropic
Anthropic@AnthropicAI·
We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models.
English
7.3K
6.3K
55.1K
33.6M
Shuyao Tim Xu
Shuyao Tim Xu@TimXu222575·
@Ryan76589177 @caolei1 1. 豆包线上请求最多的肯定是flash模型,而不是pro模型 2. 单卡A100,不做任何多卡优化,跑gpt-oss 20b可以上万tps
中文
1
0
2
180
曹山石
曹山石@caolei1·
一张A100要1.5万美元
曹山石 tweet media
中文
22
14
110
151.6K
Shuyao Tim Xu
Shuyao Tim Xu@TimXu222575·
@jayair K2.5 thinks very efficiently and is much faster then recent opus 4.6 or codex 5.3. It reminds me of the old sonnet (speed wise). K2.5 is a compelling daily driver.
English
0
0
0
91