woner

8.3K posts

woner

@mywoner

AI丨LLM Security丨Privacy Computing丨Representation Learning丨Nyaa~丨中英日zh,en,ja三语丨雅俗和谐

China shenzhen&gz/HongKong Katılım Ağustos 2013

1.9K Takip Edilen1.3K Takipçiler

woner@mywoner·2d

达妮娅是好女孩

中文

288

woner@mywoner·3d

嗯？海盗船用长鑫颗粒了？国内其他牌子都出货一年多了

中文

woner@mywoner·3d

Cloudflare is down again. Well, it's my leisure time for now

English

376

woner@mywoner·4d

洛克王国，卸载

中文

290

woner@mywoner·18 May

发现chatgpt客户端通过app接口去读代码和讨论，将更精细的需求交给codex去执行的结果质量更高，还能避免额度快速浪费..

中文

woner@mywoner·17 May

空调房压制不住蚊子了！

中文

woner@mywoner·16 May

@thsottiaux 🥰

QME

179

Tibo@thsottiaux·16 May

Codex usage limits have now been reset across all paid plans. Enjoy the weekend!

Tibo@thsottiaux

We found and fixed two issues that could explain this degradation of the capability of GPT-5.5 in Codex over the last ~ 48 hours. We are monitoring over the coming hours to fully confirm and I will reset usage limits this evening. Apologies and now is the time for /fast maxxing.

English

1.1K

493

9.4K

816.1K

woner@mywoner·16 May

@ReitsukiSion 再给点时间吧..算法不会差，推理算力得等下半年的950，pre/post-train 的数据更能拉开差距，composer 2 就是一个例子

中文

麗月シオン@ReitsukiSion·16 May

@mywoner 国内现阶段最强和能真的干活的模型（

中文

woner@mywoner·24 Nis

DeepSeek v4是真牛逼，模型强还用国产芯片，中国半导体要迅猛发展了

中文

108

woner@mywoner·16 May

我codex的最新对话session怎么没了

中文

woner@mywoner·15 May

Jensen Huang's eyes looking extra bright with that TikTok filter. 🤣

English

woner@mywoner·15 May

anthropic 当代虚伪的代表，甭管是不是从百度离开带去的情绪。观其言、察其行、知其底，充分说明他就是发自内心的极端的魔怔人

中文

woner retweetledi

Fox News@FoxNews·14 May

BREAKING: President Trump gives a toast to President Xi and invites him to the White House for an official visit in September: "Thank you again, President Xi, for this beautiful welcome... It is my honor to extend an invitation to you and Madam Peng to visit us at the White House, September 24th, and we look forward to it." "I now like to raise a glass and propose a toast to the rich and enduring ties between the American and Chinese people. It's a very special relationship, and I want to thank you again. This has been an amazing period of time. Thank you, President Xi."

English

535

2.7K

16.5K

773.3K

woner@mywoner·14 May

真批准老黄卖芯片了，爽

中文

woner@mywoner·13 May

不求开放，限量卖一点好GPU让股票多涨涨行不行，两边市场兴致都很高昂

中文

woner@mywoner·12 May

kimi 乞丐版(49软)的消耗比chatgpt plus的多啊（70软），相同prompt下k2.6对话仅1次, 而5.5 high能3次，周额度kimi用了5%，codex才3%.. 任务完成度也是后者更好。

中文

146

woner@mywoner·12 May

It's been an hour, and Google Search still has the issue. 'We're sorry but it appears that there has been an internal server error while processing your request. Our engineers have been notified and are working to resolve the issue.'

English

375

woner retweetledi

Tilde@tilderesearch·8 May

Introducing Aurora, a new optimizer for training frontier-scale models. We train Aurora-1.1B, which achieves 100x data efficiency on open-source internet data. Despite having 25% fewer parameters, 2 orders of magnitude fewer training tokens, and using fully open-source internet-only data, Aurora matches Qwen3-1.7B on several benchmarks. Aurora was developed after identifying a major failure mode that can occur under Muon, an increasingly popular optimizer that has shown strong gains over Adam(W). We find that Muon can cause a huge percentage of neurons to effectively die early in training, reducing effective network capacity so that many parameters no longer meaningfully contribute to network outputs. By redistributing update energy more uniformly across neurons while preserving Muon’s stability properties, Aurora prevents neuron death and recovers substantial model capacity. What makes this work especially exciting is that it points toward a broader direction for ML research: better optimizers may not come purely from elegant mathematical abstractions, but from understanding and addressing the concrete dynamics and pathologies that emerge inside real training systems.

Tilde@tilderesearch

x.com/i/article/2052…

English

176

1.6K

518.1K

woner@mywoner·10 May

openai整的business优惠居然让白嫖社区主动付费了

中文

142

woner@mywoner·8 May

今天公布的非农和密歇根消费指数都不错，半导体又迎来新一轮涨势。个人认为训练和推理芯片的持续迭代，注定会让HBM等一系列存储供应紧张，何况未来车载自动驾驶最需要的本地算力还是一片蓝海，还有终端离线模型部署需求。就技术带起的硬件发展不太适合视为泡沫，啥时候算力自由了才算摸到上限