Sam Ade Jacobs

194 posts

@samadejacobs

PhD, Computer Science (Texas A&M University). R&D expertise and experience in large-scale big-data (graph) analytics, machine (deep) learning, and robotics.

San Francisco, CA · Joined September 2010
119 Following · 92 Followers
Pinned Tweet
Sam Ade Jacobs (@samadejacobs)
#SC20 starts today! It is exciting to have our work on AI/HPC-enabled drug design for COVID-19 named a finalist for the prestigious Gordon Bell Special Prize. Congratulations to our team; the "sleepless" nights in a chaotic summer were not in vain!
3 replies · 2 retweets · 8 likes
Sam Ade Jacobs retweeted
DeepSpeed (@DeepSpeedAI)
🚀 Introducing Ulysses-Offload 🚀
- Unlock the power of long-context LLM training and finetuning with our latest system optimizations
- Train LLaMA3-8B on a 2M-token context using 4x A100-80GB
- Achieve over 55% MFU
Blog: shorturl.at/Spx6Y
Tutorial: shorturl.at/bAWu5
1 reply · 30 retweets · 97 likes · 5.8K views
Sam Ade Jacobs retweeted
DeepSpeed (@DeepSpeedAI)
Announcing that DeepSpeed now runs natively on Windows. This exciting combination unlocks DeepSpeed optimizations for Windows users and empowers more people and organizations with AI innovations.
- HF inference & finetuning
- LoRA
- CPU offload
Blog: shorturl.at/a7TF8
1 reply · 6 retweets · 38 likes · 4.3K views
Sam Ade Jacobs retweeted
DeepSpeed (@DeepSpeedAI)
Introducing Universal Checkpointing for boosting training efficiency.
- Change parallelism (PP, SP, TP, ZeRO-DP) or GPU count mid-stream
- Improve resilience by scaling down to healthy nodes 💪
- Increase throughput by scaling up to elastic nodes 🚀
Blog: rb.gy/aup3pn
0 replies · 5 retweets · 23 likes · 4.3K views
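The "change parallelism or GPU count mid-stream" workflow announced above can be sketched roughly as follows: a saved ZeRO checkpoint is first converted to universal format (DeepSpeed ships a `ds_to_universal.py` conversion script), and training then resumes, possibly on a different number of GPUs, with a config that requests the universal format. The `"load_universal"` key and the values below follow the public universal-checkpointing examples; treat the details as assumptions, not a definitive recipe.

```python
# Hedged sketch of resuming training from a universal checkpoint.
# Step 1 (outside this snippet): convert a saved ZeRO checkpoint with
# DeepSpeed's ds_to_universal.py script.
# Step 2: restart training, possibly with a new GPU count or parallelism
# layout, using a config that asks DeepSpeed to load the converted
# checkpoint. Key name per the universal-checkpointing examples.
ds_config = {
    "train_micro_batch_size_per_gpu": 2,
    "zero_optimization": {"stage": 1},
    "checkpoint": {"load_universal": True},  # load the converted checkpoint
}
```

The point of the conversion step is that the universal format is parallelism-agnostic, so the resumed run is free to repartition states across however many healthy nodes remain.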
Sam Ade Jacobs retweeted
Stas Bekman (@StasBekman)
If you were holding off on trying @MSFTDeepSpeed ZeRO++, it looks like deepspeed@master should work well now: github.com/microsoft/Deep…

ZeRO++'s main feature is allowing you to use a hybrid approach if you can fit the model on a single node of 8 GPUs: it takes advantage of the super-fast NVLink within the node and only needs to reduce gradients across nodes over the slow link. So if the slow inter-node network was impacting your TFLOPS, enabling ZeRO++ should give you a sizeable boost. The numbers will vary with your situation, but in my experiments I saw a 5%+ boost with a 7B LLaMA. This is similar to hybrid FSDP. To try it, see: deepspeed.ai/tutorials/zero…

I was talking about the hybrid solution; I have yet to try the quantized weights/grads also offered by ZeRO++, which should speed things up even further, since they put even less stress on the network. Just remember: until the next release is made, you want deepspeed@master.
3 replies · 12 retweets · 77 likes · 7.9K views
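The hybrid approach described above is enabled through the DeepSpeed config. A minimal sketch, with key names following the public ZeRO++ tutorial and all values illustrative:

```python
# Sketch of a DeepSpeed config enabling ZeRO++ on top of ZeRO stage 3.
# Key names follow the public ZeRO++ tutorial; values are illustrative.
zero_pp_config = {
    "train_micro_batch_size_per_gpu": 1,
    "zero_optimization": {
        "stage": 3,
        # Hierarchical partitioning (hpZ): keep a secondary copy of weights
        # within each 8-GPU node so all-gathers stay on fast intra-node
        # NVLink -- the "hybrid" part of the tweet above.
        "zero_hpz_partition_size": 8,
        # Quantized communication (the part not yet tried in the tweet):
        "zero_quantized_weights": True,
        "zero_quantized_gradients": True,
    },
}
```

Setting `zero_hpz_partition_size` to the number of GPUs per node is what confines the weight all-gather to the fast link; the two quantization flags further shrink inter-node traffic.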
Sam Ade Jacobs retweeted
DeepSpeed (@DeepSpeedAI)
Introducing Mixtral, Phi-2, Falcon, and Qwen support in #DeepSpeed-FastGen!
- Up to 2.5x faster LLM inference
- Optimized SplitFuse and token sampling
- Exciting new features like a RESTful API and more!
For more details: github.com/microsoft/Deep… #DeepSpeed #AI
10 replies · 88 retweets · 416 likes · 49.5K views
Sam Ade Jacobs retweeted
DeepSpeed (@DeepSpeedAI)
🚀 Excited to announce our paper "ZeRO++: Extremely Efficient Collective Communication for Large Model Training" has been accepted at #ICLR2024! 🔍 ZeRO++ significantly reduces communication volume by 4x, achieving up to 3.3x speedup. microsoft.com/en-us/research… #DeepSpeed #AI
2 replies · 20 retweets · 93 likes · 5.7K views
Sam Ade Jacobs retweeted
OpenAI (@OpenAI)
We're rolling out new features and improvements that developers have been asking for:
1. Our new model GPT-4 Turbo supports 128K context and has fresher knowledge than GPT-4. Its input and output tokens are respectively 3× and 2× less expensive than GPT-4. It's available now to all developers in preview.
2. Assistants API and new tools (Retrieval, Code Interpreter) will help developers build world-class AI assistants within their own apps.
3. The platform is becoming multimodal. GPT-4 Turbo with Vision, DALL·E 3, and text-to-speech are all now available to developers.
Oh… and we're doubling GPT-4 rate limits.
openai.com/blog/new-model…
894 replies · 2.7K retweets · 14.5K likes · 4M views
Sam Ade Jacobs retweeted
DeepSpeed (@DeepSpeedAI)
Introducing DeepSpeed-FastGen 🚀 Serve LLMs and generative AI models with
- 2.3x higher throughput
- 2x lower average latency
- 4x lower tail latency
w. Dynamic SplitFuse batching. Auto TP, load balancing w. perfect linear scaling, plus an easy-to-use API.
github.com/microsoft/Deep…
6 replies · 115 retweets · 548 likes · 112.8K views
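The easy-to-use API mentioned above is exposed through the DeepSpeed-MII library. A minimal sketch, assuming `pip install deepspeed-mii`; the model name, prompt, and generation arguments are illustrative, and the model-loading part is guarded so the snippet only runs FastGen when MII and a CUDA GPU are actually present:

```python
# Hedged sketch of serving a model with DeepSpeed-FastGen via DeepSpeed-MII.
# Model name and generation arguments are illustrative; loading the model
# requires a CUDA GPU and downloads the weights, so that part is guarded.
import importlib.util

prompt = "DeepSpeed is"
gen_kwargs = {"max_new_tokens": 64}

have_mii = importlib.util.find_spec("mii") is not None
have_gpu = False
if importlib.util.find_spec("torch") is not None:
    import torch
    have_gpu = torch.cuda.is_available()

if have_mii and have_gpu:
    import mii

    # Non-persistent pipeline: Dynamic SplitFuse batching runs under the
    # hood of this simple call.
    pipe = mii.pipeline("mistralai/Mistral-7B-v0.1")
    responses = pipe([prompt], **gen_kwargs)
    print(responses[0].generated_text)
```

For production serving, MII also offers a persistent deployment (`mii.serve`) that the RESTful API from the later announcement builds on.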
Sam Ade Jacobs retweeted
DeepSpeed (@DeepSpeedAI)
🚀 Exciting new updates on #DeepSpeed ZeRO-Inference, with 20X faster generation!
- 4x lower memory usage through 4-bit weight quantization, with no code change needed
- 4x larger batch sizes through KV-cache offloading
Available in DeepSpeed v0.10.3: aka.ms/z3-inference
2 replies · 28 retweets · 167 likes · 18.2K views
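ZeRO-Inference's core trick is running inference under ZeRO stage 3 with parameters offloaded to CPU, so models larger than GPU memory can still generate. A minimal config sketch, assuming that setup; the 4-bit weight quantization and KV-cache offload announced above are additional options documented at aka.ms/z3-inference and are not reproduced here:

```python
# Sketch of a ZeRO-Inference config: ZeRO stage 3 with parameters
# offloaded to CPU so a model larger than one GPU's memory can generate.
# The announced 4-bit quantization and KV-cache offload are extra options
# described in the linked blog and are intentionally omitted here.
ds_config = {
    "train_micro_batch_size_per_gpu": 1,  # required field; unused at inference
    "fp16": {"enabled": True},
    "zero_optimization": {
        "stage": 3,
        "offload_param": {"device": "cpu", "pin_memory": True},
    },
}
```

This config would typically be passed as `deepspeed.initialize(model=model, config=ds_config)`, with the returned engine run in eval mode for generation.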
Sam Ade Jacobs retweeted
Jaime Teevan (@jteevan)
Sometimes it takes an external push to really recognize you're in the middle of something big. Just seeing how many people I know and respect are on the first-ever #TIME100 AI list makes me feel like I'm a part of history. time.com/collection/tim…
8 replies · 2 retweets · 81 likes · 6.1K views
Sam Ade Jacobs retweeted
DeepSpeed (@DeepSpeedAI)
Want to train 1 million token context lengths (all 7 of the Harry Potter books!📚) on a GPT-like model w. 64 GPUs? Announcing DeepSpeed-Ulysses🚀 This release enables highly efficient and scalable LLM training with extremely long sequence lengths🤯 github.com/microsoft/Deep…
1 reply · 40 retweets · 142 likes · 15.7K views
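DeepSpeed-Ulysses makes the "1M tokens on 64 GPUs" claim work by splitting inputs along the sequence dimension, then using an all-to-all so each GPU attends over the full sequence for a subset of heads. A back-of-envelope sketch of that partitioning, with the head count assumed for illustration:

```python
# Back-of-envelope sketch of DeepSpeed-Ulysses partitioning: activations
# are split along the sequence dimension; an all-to-all re-partitions
# along the head dimension so each rank computes attention over the FULL
# sequence for a subset of heads. num_heads is an assumption (it must be
# divisible by the sequence-parallel group size).
seq_len   = 1_000_000   # total context length (per the tweet)
sp_size   = 64          # GPUs in the sequence-parallel group (per the tweet)
num_heads = 64          # assumed attention-head count

tokens_per_rank = seq_len // sp_size    # sequence shard each GPU holds
heads_per_rank  = num_heads // sp_size  # heads each GPU attends with

print(tokens_per_rank, heads_per_rank)  # 15625 1
```

Because attention for each head still sees the full sequence, the math is exact; only the communication pattern changes, which is what keeps the approach efficient at extreme sequence lengths.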
Sam Ade Jacobs retweeted
OpenAI (@OpenAI)
We trained an AI using process supervision (rewarding the thought process rather than the outcome) to achieve a new state of the art in mathematical reasoning. An encouraging sign for alignment of advanced AIs: openai.com/research/impro…
410 replies · 802 retweets · 4.5K likes · 1.8M views