David

18.4K posts

David

David

@davidshen84

🐭🦀

Sydney, New South Wales Katılım Haziran 2009
122 Takip Edilen49 Takipçiler
David retweetledi
Databricks AI Research
Databricks AI Research@DbrxMosaicAI·
New research from Databricks AI Research: FlashOptim cuts training memory by over 50% with no measurable loss in model quality. Training a model with AdamW typically requires 16 bytes per parameter just for weights, gradients, and optimizer state. FlashOptim brings that down to 7 bytes, or 5 with gradient release. For Llama-3.1-8B finetuning, peak GPU memory drops from 175 GiB to 113 GiB. Two techniques drive this: improved master weight splitting using tighter ULP-normalized error correction, and companded optimizer state quantization that reduces quantization error and improves convergence. FlashOptim works as a drop-in replacement for SGD, AdamW, and Lion, supports distributed training with DDP and FSDP2, and is open source. Paper: arxiv.org/html/2602.2334… Source code: github.com/databricks/fla…
Databricks AI Research tweet media
English
5
29
213
25.1K
David
David@davidshen84·
"We called him Tortoise because he taught us," 🐢
English
0
0
0
2
David
David@davidshen84·
The more there is of mine, the less there is of yours. #Duchess
English
0
0
0
8
David
David@davidshen84·
#NieRAutomata is so good 👍 really should have a sequel
English
0
0
0
6
David retweetledi
蔡英文 Tsai Ing-wen
蔡英文 Tsai Ing-wen@iingwen·
Merry Christmas! Wishing all those celebrating in Taiwan & around the world a joyful holiday spent with loved ones.
蔡英文 Tsai Ing-wen tweet media
English
980
3.3K
33.3K
529.3K
David retweetledi
西乔 XiQiao
西乔 XiQiao@recatm·
噶了
西乔 XiQiao tweet media
日本語
25
24
660
124.7K
David
David@davidshen84·
Look how much I learned on Duolingo in 2025! How did you do? #Duolingo365
David tweet media
English
0
0
0
2
David retweetledi
Dr. Jonathan N. Stea
Dr. Jonathan N. Stea@jonathanstea·
RFK Hospital: A groundbreaking new series inspired by the medical advice of RFK Jr.
English
68
359
1.2K
91.5K
David retweetledi
阿海
阿海@funabashi_ahai·
P得好有喜感!
阿海 tweet media
中文
132
440
3.3K
350.7K
David
David@davidshen84·
🦆🦆🦆🦆
David tweet media
QME
0
0
0
3
David retweetledi
蔡英文 Tsai Ing-wen
蔡英文 Tsai Ing-wen@iingwen·
Today, I'm heading to #Germany to take part in the inaugural Berlin Freedom Conference. I look forward to sharing #Taiwan's unwavering commitment to freedom & democracy with friends from Germany & around the world.
蔡英文 Tsai Ing-wen tweet media
English
289
783
7.2K
271K
David retweetledi
Andrew Ng
Andrew Ng@AndrewYNg·
Announcing a significant upgrade to Agentic Document Extraction! LandingAI's new DPT (Document Pre-trained Transformer) accurately extracts even from complex docs. For example, from large, complex tables, which is important for many finance and healthcare applications. And a new SDK makes using it require only 3 simple lines of code. Please see the video for technical details. I hope this unlocks a lot of value from the "dark data" currently stuck in PDF files, and that you'll build something cool with this!
English
98
560
3.6K
298.2K
David
David@davidshen84·
Finally, made some progress 😃
David tweet media
English
0
0
0
4
David retweetledi
Bear Liu
Bear Liu@bearliu·
Linus对Vibe Coding的定义很哈哈:效率很低,但 娱乐性拉满 😂
Bear Liu tweet media
中文
24
54
466
91.9K