Reza
14 posts

Reza
@Reza_LOD
Never stop fighting
Vancouver, British Columbia Katılım Mayıs 2017
84 Takip Edilen66 Takipçiler

1/4 Have you wondered how to optimize sys-perf for training Arctic-like models (MoE arch)? Let’s dive in! Our first technique: custom fused kernels. By crafting these kernels, we streamline irregular and sparse operators, boosting efficiency. #SnowflakeArctic #SystemOptimization

English

@iliasmiraoui @MSFTDeepSpeed Do you mean the logits after each token-generation step?
English

@MSFTDeepSpeed Can we get log probs from the inference server?
English

Introducing Mixtral, Phi2, Falcon, and Qwen support in #DeepSpeed-FastGen!
- Up to 2.5x faster LLM inference
- Optimized SplitFuse and token sampling
- Exciting new features like RESTful API and more!
For more details: github.com/microsoft/Deep…
#DeepSpeeed #AI

English

More updates o deepspeed inference support. The performance improvement of the MoE model (Mixtral) is quite substantial. Kudos to all the folks at DeepSpeed team :)
DeepSpeed@DeepSpeedAI
Introducing Mixtral, Phi2, Falcon, and Qwen support in #DeepSpeed-FastGen! - Up to 2.5x faster LLM inference - Optimized SplitFuse and token sampling - Exciting new features like RESTful API and more! For more details: github.com/microsoft/Deep… #DeepSpeeed #AI
English

I'm super excited to start working at @contextualai where I will be training LLMs w/ Retrieval to help businesses deploy AI that overcomes hallucination, keeps data up-to-date and runs much faster inference.
If you're new to Contextual.AI, see: contextual.ai/announcing-nex…
Applied ML here I come!
English

@MSFTDeepSpeed Just love working at this team! You work on adding a new module and all the other team members come join you and make it strong in a way that you no longer recognize it as it was originally designed. You can now use DeepSpeed-Chat with all new features and with high efficiency!
English

🚀 Exciting Updates for #DeepSpeedChat! 🤖
- Llama-2 Support: Enjoy 7.1x faster generation with DeepSpeed Hybrid Engine!
- Improved efficiency and accessibility through MixZ++ and ZeRO-Offload.
- Improved stability and software enhancements.
Blog: github.com/microsoft/Deep…

English

Here are some ways to test Llama 2:
replicate.com/a16z-infra/lla…
English

@ClementDelangue Open-source community should be honored to have such transparent, open, and dedicated representative. Thanks @ClementDelangue
English

This is my 5-minute testimony before the US Congress!
Open science and open source AI distribute economic gains by enabling hundreds of thousands of small companies and startups to build with AI. It fosters innovation, and fair competition between all.
Thanks to ethical openness, it creates a safer path for development of artificial intelligence by giving civil society, non-profits, academia, and policy makers the capabilities they need to counterbalance the power of big private companies.
Open science and open source AI prevent blackbox systems, make companies more accountable, and help solving today’s challenges like mitigating biases, reducing misinformation, promoting copyright, & rewarding all stake-holders including artists & content creators in the value creation process.
Let's go!
English



