Daniel Soudry

5 posts

Daniel Soudry banner
Daniel Soudry

Daniel Soudry

@soudry_daniel

Associate Professor at the Technion, try to understand how AI works, and how to make it more efficient.

Haifa, Israel เข้าร่วม Aralık 2019
4 กำลังติดตาม30 ผู้ติดตาม
Daniel Soudry รีทวีตแล้ว
PapersAnon
PapersAnon@papers_anon·
Scaling FP8 training to trillion-token LLMs From Intel. Trained a 7B model using FP8 precision on 256 Gaudi2 accelerators. Matched BF16 with 34% throughput improvement while using 30% less memory. Introduces Smooth-SwiGLU as solution to outlier amplification. Links below
PapersAnon tweet media
English
4
18
106
6.7K
Daniel Soudry รีทวีตแล้ว
Daniel Soudry รีทวีตแล้ว
Gon Buzaglo
Gon Buzaglo@gon_buzaglo·
Q: You sample random neural networks until you find one with perfect training accuracy. What will be the generalization error? A: Typically good — We prove that when a “simple explanation” exists, such sampled NNs (MLP/CNNs) generalize well! arxiv.org/abs/2402.06323
English
6
37
156
38K