Daniel Soudry

5 posts

Daniel Soudry banner
Daniel Soudry

Daniel Soudry

@soudry_daniel

Associate Professor at the Technion, try to understand how AI works, and how to make it more efficient.

Haifa, Israel شامل ہوئے Aralık 2019
4 فالونگ30 فالوورز
Daniel Soudry ری ٹویٹ کیا
PapersAnon
PapersAnon@papers_anon·
Scaling FP8 training to trillion-token LLMs From Intel. Trained a 7B model using FP8 precision on 256 Gaudi2 accelerators. Matched BF16 with 34% throughput improvement while using 30% less memory. Introduces Smooth-SwiGLU as solution to outlier amplification. Links below
PapersAnon tweet media
English
4
18
106
6.7K
Daniel Soudry ری ٹویٹ کیا
Daniel Soudry ری ٹویٹ کیا
Gon Buzaglo
Gon Buzaglo@gon_buzaglo·
Q: You sample random neural networks until you find one with perfect training accuracy. What will be the generalization error? A: Typically good — We prove that when a “simple explanation” exists, such sampled NNs (MLP/CNNs) generalize well! arxiv.org/abs/2402.06323
English
6
37
156
38K