Reza

14 posts

@Reza_LOD

Never stop fighting

Vancouver, British Columbia · Joined May 2017
84 Following · 66 Followers
Reza (@Reza_LOD)
4/4 Want to learn more about our optimization techniques? Dive into the Snowflake Arctic Cookbook Series on building an efficient training system for Arctic for in-depth insights.
Reza (@Reza_LOD)
3/4 Last but not least, communication optimization! By choosing the parallelization topology carefully and overlapping communication with computation, we minimize communication overhead for Arctic's MoE architecture. That means faster training and smoother performance.
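A minimal sketch of the overlap idea mentioned in 3/4, under loud assumptions: this is not Arctic's or DeepSpeed's code. `fake_all_to_all`, `dense_compute`, and `step_overlapped` are hypothetical names, and a background thread with a sleep stands in for a real expert-parallel exchange. It only shows the pattern: start the communication, keep computing, and wait for the result at the point it is actually needed.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fake_all_to_all(tokens, delay=0.05):
    """Hypothetical stand-in for a token exchange between experts.

    It sleeps to mimic network latency and returns a reversed copy.
    """
    time.sleep(delay)
    return tokens[::-1]


def dense_compute(xs):
    """Local work that does not depend on the exchanged tokens."""
    return [x * x for x in xs]


def step_overlapped(tokens, xs):
    """Launch the exchange, do local compute meanwhile, then wait."""
    with ThreadPoolExecutor(max_workers=1) as pool:
        pending = pool.submit(fake_all_to_all, tokens)  # communication starts
        local = dense_compute(xs)                       # compute runs meanwhile
        exchanged = pending.result()                    # block only when needed
    return local, exchanged


if __name__ == "__main__":
    print(step_overlapped([1, 2, 3], [2, 3]))
```

In a real training system the same shape would typically be expressed with asynchronous collectives and separate device streams rather than Python threads.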
Reza (@Reza_LOD)
2/4 And that's not all! Let's delve into selective activation-checkpointing. By strategically reusing parts of the computation graph and quantizing activations, we find the sweet spot between speed and memory usage. It's all about maximizing efficiency!
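A toy sketch of the idea in 2/4, assuming a single ReLU layer and plain Python lists. It is an illustration only, not the Arctic implementation, and `quantize`, `dequantize`, `forward`, and `backward` are hypothetical helper names. The forward pass saves just an int8-style quantized copy of the input, and the backward pass recomputes what it needs from that copy instead of keeping full-precision activations.

```python
def quantize(xs, bits=8):
    """Symmetric per-tensor quantization: floats -> small ints plus one scale."""
    qmax = 2 ** (bits - 1) - 1
    scale = (max(abs(x) for x in xs) / qmax) or 1.0  # avoid a zero scale
    return [round(x / scale) for x in xs], scale


def dequantize(q, scale):
    """Approximate reconstruction of the original floats."""
    return [v * scale for v in q]


def relu(xs):
    return [max(0.0, x) for x in xs]


def forward(xs):
    """Save only a quantized input; the ReLU output is cheap to recompute."""
    saved = quantize(xs)
    return relu(xs), saved


def backward(grad_out, saved):
    """Rebuild the ReLU mask from the dequantized input and apply it."""
    xs = dequantize(*saved)
    return [g if x > 0 else 0.0 for g, x in zip(grad_out, xs)]


if __name__ == "__main__":
    out, saved = forward([-1.5, 0.25, 2.0, -0.1])
    print(out, backward([1.0, 1.0, 1.0, 1.0], saved))
```

The trade-off is the one the tweet describes: less memory held between forward and backward, paid for with a small recomputation cost and a bounded rounding error.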
Reza (@Reza_LOD)
1/4 Have you wondered how to optimize sys-perf for training Arctic-like models (MoE arch)? Let’s dive in! Our first technique: custom fused kernels. By crafting these kernels, we streamline irregular and sparse operators, boosting efficiency. #SnowflakeArctic #SystemOptimization
DeepSpeed (@DeepSpeedAI)
Introducing Mixtral, Phi2, Falcon, and Qwen support in #DeepSpeed-FastGen!
- Up to 2.5x faster LLM inference
- Optimized SplitFuse and token sampling
- Exciting new features like RESTful API and more!
For more details: github.com/microsoft/Deep… #DeepSpeeed #AI
Stas Bekman (@StasBekman)
I'm super excited to start working at @contextualai where I will be training LLMs w/ Retrieval to help businesses deploy AI that overcomes hallucination, keeps data up-to-date and runs much faster inference. If you're new to Contextual.AI, see: contextual.ai/announcing-nex… Applied ML here I come!
Reza (@Reza_LOD)
@MSFTDeepSpeed Just love working on this team! You start adding a new module, and the other team members join in and strengthen it so much that you no longer recognize the original design. You can now use DeepSpeed-Chat with all the new features and with high efficiency!
DeepSpeed (@DeepSpeedAI)
🚀 Exciting Updates for #DeepSpeedChat! 🤖
- Llama-2 Support: Enjoy 7.1x faster generation with DeepSpeed Hybrid Engine!
- Improved efficiency and accessibility through MixZ++ and ZeRO-Offload.
- Improved stability and software enhancements.
Blog: github.com/microsoft/Deep…
Reza (@Reza_LOD)
@teknium Does it remember the previous context? It seems it is not storing the cache!
Reza (@Reza_LOD)
@tri_dao Correcting myself, I am actually seeing a 14% e2e speedup! Thanks a lot @tri_dao for this amazing work.
Reza (@Reza_LOD)
@tri_dao Sorry, I labeled the experiments wrongly: the first one is with flash-attn 2.0 and the second one is with the previous version.
Tri Dao (@tri_dao)
Announcing FlashAttention-2! We released FlashAttention a year ago, making attn 2-4x faster, and it is now widely used in most LLM libraries. Recently I’ve been working on the next version: 2x faster than v1, 5-9x vs standard attn, reaching 225 TFLOPs/s training speed on A100. 1/
clem 🤗 (@ClementDelangue)
This is my 5-minute testimony before the US Congress! Open science and open source AI distribute economic gains by enabling hundreds of thousands of small companies and startups to build with AI. This fosters innovation and fair competition between all. Thanks to ethical openness, it creates a safer path for development of artificial intelligence by giving civil society, non-profits, academia, and policy makers the capabilities they need to counterbalance the power of big private companies. Open science and open source AI prevent blackbox systems, make companies more accountable, and help solve today’s challenges like mitigating biases, reducing misinformation, promoting copyright, & rewarding all stakeholders including artists & content creators in the value creation process. Let's go!