New research from Databricks AI Research: FlashOptim cuts training memory by over 50% with no measurable loss in model quality.
Training a model with AdamW typically requires 16 bytes per parameter just for weights, gradients, and optimizer state. FlashOptim brings that down to 7 bytes, or 5 with gradient release. For Llama-3.1-8B finetuning, peak GPU memory drops from 175 GiB to 113 GiB.
Two techniques drive this: improved master weight splitting using tighter ULP-normalized error correction, and companded optimizer state quantization that reduces quantization error and improves convergence.
FlashOptim works as a drop-in replacement for SGD, AdamW, and Lion, supports distributed training with DDP and FSDP2, and is open source.
Paper: arxiv.org/html/2602.2334…
Source code: github.com/databricks/fla…
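As a rough sanity check on the figures above: 16 bytes per parameter is the standard full-precision AdamW accounting (4 bytes each for FP32 weights, gradients, and the two Adam moments). The back-of-the-envelope calculation below assumes the 8B parameter count of Llama-3.1-8B; how FlashOptim splits its 7 and 5 bytes is not broken down in the post.

```python
# Back-of-the-envelope memory accounting for an 8B-parameter model.
# 16 B/param matches full-precision AdamW:
#   FP32 weights (4) + FP32 gradients (4) + Adam m (4) + Adam v (4).
# The 7 B and 5 B figures are taken from the post; their internal
# breakdown is not specified there.
PARAMS = 8e9  # Llama-3.1-8B

def state_gib(bytes_per_param: float) -> float:
    """Weights + gradients + optimizer state only (no activations)."""
    return PARAMS * bytes_per_param / 2**30

for label, bpp in [("AdamW (FP32 state)", 16),
                   ("FlashOptim", 7),
                   ("FlashOptim + gradient release", 5)]:
    print(f"{label:32s} ~{state_gib(bpp):6.1f} GiB")
```

AdamW comes out to roughly 119 GiB for state alone; the 175 GiB peak quoted in the post additionally covers activations and framework overhead, which this sketch does not model.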
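Companding means applying a nonlinear transform before uniform quantization so that more quantization levels land near zero, where optimizer moments concentrate. The sketch below shows one generic square-root companding scheme for illustration only; it is not claimed to be FlashOptim's actual quantizer.

```python
import torch

# Generic companded 8-bit quantization (square-root companding).
# Illustrates the idea of "companded optimizer state quantization";
# FlashOptim's actual scheme may differ.
def quantize_companded(x: torch.Tensor, bits: int = 8):
    scale = x.abs().max().clamp(min=1e-12)
    y = torch.sign(x) * torch.sqrt(x.abs() / scale)   # compand into [-1, 1]
    levels = 2 ** (bits - 1) - 1
    q = torch.round(y * levels).to(torch.int8)
    return q, scale

def dequantize_companded(q: torch.Tensor, scale: torch.Tensor, bits: int = 8):
    levels = 2 ** (bits - 1) - 1
    y = q.float() / levels
    return torch.sign(y) * y.pow(2) * scale           # invert the companding

x = torch.randn(1024) * 0.01                          # small values, like Adam moments
q, s = quantize_companded(x)
err = (dequantize_companded(q, s) - x).abs().mean()
print(f"mean abs error: {err:.2e}")
```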
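Since the repo link above is truncated and the API is unverified, the following is only a sketch of what "drop-in replacement for AdamW" usually means in PyTorch; the package name `flash_optim`, the class `FlashAdamW`, and its constructor arguments are assumptions, not the confirmed FlashOptim interface.

```python
import torch
from torch import nn

# Hypothetical import: package and class names are assumptions.
from flash_optim import FlashAdamW  # assumed drop-in AdamW replacement

model = nn.Linear(4096, 4096).cuda()

# Same constructor surface as torch.optim.AdamW, so swapping optimizers
# should not require changes to the rest of the training loop.
opt = FlashAdamW(model.parameters(), lr=1e-4, weight_decay=0.1, betas=(0.9, 0.95))

for _ in range(10):
    x = torch.randn(8, 4096, device="cuda")
    loss = model(x).pow(2).mean()
    loss.backward()
    opt.step()
    opt.zero_grad()
```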

Announcing a significant upgrade to Agentic Document Extraction!
LandingAI's new DPT (Document Pre-trained Transformer) accurately extracts even from complex docs, for example large, complex tables, which is important for many finance and healthcare applications. And a new SDK lets you use it with only 3 simple lines of code. Please see the video for technical details. I hope this unlocks a lot of value from the "dark data" currently stuck in PDF files, and that you'll build something cool with this!
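The post does not show the three lines, and the SDK's package and function names are not verified here, so the snippet below is a hypothetical sketch of what such a call could look like rather than the documented LandingAI API.

```python
# Hypothetical three-line extraction call; the import path, function name,
# and result fields are assumptions, not the verified LandingAI SDK API.
from agentic_doc.parse import parse      # assumed import path

results = parse("quarterly_report.pdf")  # extract text, tables, and fields
print(results[0].markdown)               # assumed structured-output field
```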