Daniel Soudry

5 posts

Daniel Soudry banner
Daniel Soudry

Daniel Soudry

@soudry_daniel

Associate Professor at the Technion, try to understand how AI works, and how to make it more efficient.

Haifa, Israel Tham gia Aralık 2019
4 Đang theo dõi30 Người theo dõi
Daniel Soudry đã retweet
PapersAnon
PapersAnon@papers_anon·
Scaling FP8 training to trillion-token LLMs From Intel. Trained a 7B model using FP8 precision on 256 Gaudi2 accelerators. Matched BF16 with 34% throughput improvement while using 30% less memory. Introduces Smooth-SwiGLU as solution to outlier amplification. Links below
PapersAnon tweet media
English
4
18
106
6.7K
Daniel Soudry đã retweet
Daniel Soudry đã retweet
Gon Buzaglo
Gon Buzaglo@gon_buzaglo·
Q: You sample random neural networks until you find one with perfect training accuracy. What will be the generalization error? A: Typically good — We prove that when a “simple explanation” exists, such sampled NNs (MLP/CNNs) generalize well! arxiv.org/abs/2402.06323
English
6
37
156
38K