Abhinav Bhatele

502 posts

Abhinav Bhatele banner
Abhinav Bhatele

Abhinav Bhatele

@bhatele

Prof @UMDCS and @umiacs, Lead Parallel Software & Systems Group @hpc_group. @IITKanpur & @UofIllinois Alum. Ex-@Livermore_Lab. Views are my own.

Katılım Haziran 2009
275 Takip Edilen681 Takipçiler
Abhinav Bhatele retweetledi
Parallel Software and Systems Group
Congratulations to Dr. Joy Kitson for successfully defending her PhD dissertation last Friday (April 10)! That makes her the fifth PhD student to graduate from our group. Dissertation title: "Scaling Agent-based Epidemic Diffusion on HPC Clusters"
Parallel Software and Systems Group tweet media
English
0
1
4
124
Abhinav Bhatele
Abhinav Bhatele@bhatele·
If you are interested in collaborating with us or you are an undergrad planning to apply for a PhD, come and talk to us (don't feel shy!). You'll find me or PSSG students at the events below. #SC25
Parallel Software and Systems Group@hpc_group

We have arrived in St. Louis for @Supercomputing 2025. Learn more about our group's research through the various talks, panels and tutorials below. We will also be at the @UofMaryland booth (3123) at various times. #SC25 #UMDCS #HPC #AI #MLSys #AI4Science

English
0
1
6
1.3K
Abhinav Bhatele
Abhinav Bhatele@bhatele·
A large number of PhD students in my group have graduated or will be graduating by Spring, so I am recruiting several PhD students for the next admission cycle (Fall 2026). If you want to work with us, apply by Dec 5 and drop me a short email. Please repost/share widely. #HPC #AI
Abhinav Bhatele tweet media
English
9
64
215
38.2K
Abhinav Bhatele retweetledi
Lightning AI ⚡️
Lightning AI ⚡️@LightningAI·
Fine-tuning massive LLMs requires reliability at scale. The Lightning open source stack gives you just that. Check out Democratizing Al: Open-source Scalable LLM Training on GPU-based Supercomputers. ✅ 405B param MoE language model ✅ Fully finetuned on 32,768 GPUs on a massive 1.4 exaflop/s cluster ✅ All running on LitGPT Led by @siddharth_3773 and @bhatele at @UofMaryland. Read the paper ➡️ go.lightning.ai/44EjLFu Use LitGPT ➡️ github.com/Lightning-AI/l…
Lightning AI ⚡️ tweet media
English
4
10
72
5K
Abhinav Bhatele
Abhinav Bhatele@bhatele·
Congrats @siddharth_3773! Siddharth is going to join NVIDIA after he graduates to continue working on parallel training and inference. Follow him to keep track of what interesting stuff he’ll do next 😊
Parallel Software and Systems Group@hpc_group

We are on a roll, second successful dissertation defense in a week (March 28)! Congratulations to @siddharth_3773 on becoming the second PhD graduate from PSSG!! Dissertation title: "Optimizing Communication in Parallel Deep Learning on Exascale-class Machines" #HPC #AI #HPC4AI

English
0
1
15
847
Abhinav Bhatele retweetledi
Satoshi Matsuoka
Satoshi Matsuoka@ProfMatsuoka·
Riken, including our center R-CCS, is hiring researchers big time, from PIs to postdocs to interns. We have the best support for international researches; in fact the % of non-JP researchers at R-CCS is approaching 50 % now. Come work with us! riken.jp/en/careers/ope…
English
0
16
30
12.6K
Abhinav Bhatele
Abhinav Bhatele@bhatele·
Incredibly proud moment for me as an advisor. The first cohort of students in @hpc_group started in 2020 and they are now starting to graduate. Also a bittersweet moment as we bid adieu to some senior students this Spring & welcome new ones in the Fall. Congrats @DanielNichols10!
Parallel Software and Systems Group@hpc_group

We present our very first newly minted Dr. Daniel Nichols (@DanielNichols10), who successfully defended his dissertation today. Congrats & best wishes for a bright future ahead!! Dissertation Title: "On Learning Behaviors of Parallel Code and Systems Across Modalities" #HPC #AI

English
0
2
22
1.2K
Abhinav Bhatele retweetledi
Soheil Feizi
Soheil Feizi@FeiziSoheil·
Come see our #NeurIPS2024 poster today on" Loki: Low-rank Keys for Efficient Sparse Attention" Paper: arxiv.org/abs/2406.02542 Code: github.com/hpcgroup/loki Loki is a sparse attention method that reduces the computational and memory costs of LLM inference! By exploiting the low-dimensional space of key vectors in self-attention, Loki achieves faster attention computation without compromising model performance.
Parallel Software and Systems Group@hpc_group

Can we get away w/ reducing attention keys to a lower-dimensional space to optimize compute during inference? @prajwal1210 & @siddharth_3773 investigated using PCA on key vectors & found that the rank of attention keys is much lower than the full dimensionality. #NeurIPS2024

English
1
9
44
6K
Abhinav Bhatele
Abhinav Bhatele@bhatele·
@prajwal1210 will be presenting this work today (Dec 13) @NeurIPSConf in the poster session from 11 am-2 pm in East Exhibit Hall A-C (Poster# 2000). Come and say hi! #NeurIPS2024 #HPC #AI
Parallel Software and Systems Group@hpc_group

Can we get away w/ reducing attention keys to a lower-dimensional space to optimize compute during inference? @prajwal1210 & @siddharth_3773 investigated using PCA on key vectors & found that the rank of attention keys is much lower than the full dimensionality. #NeurIPS2024

English
0
1
11
976
Abhinav Bhatele retweetledi
Sean McLeish
Sean McLeish@SeanMcleish·
Why is addition hard for next token predictors? Come hear about our fix for this at #NeurIPS! We’re presenting Abacus Embeddings 🧮 tomorrow (Friday) from 11-2 in the East Exhibit hall at poster 2907. Drop by to see how we can improve your language model.
Sean McLeish@SeanMcleish

Introducing 🧮Abacus Embeddings, a simple tweak to positional embeddings that enables LLMs to do addition, multiplication, sorting, and more. Our Abacus Embeddings trained only on 20-digit addition generalise near perfectly to 100+ digits. 1/n

English
0
12
24
3.7K
Abhinav Bhatele retweetledi
OLCF
OLCF@OLCFGOV·
🏆 Researchers from the @UofMaryland received the HPC Innovation Excellence Award from @Hyperion_HPC for their work on AxoNN—a scalable framework that’s pushing the limits of Large Language Model (LLM) training! 🚀 💫 bit.ly/3CsmjKM 🏆 bit.ly/3V9HFTw
OLCF tweet media
English
0
4
10
572
Abhinav Bhatele retweetledi
UMD Research
UMD Research@UMDResearch·
A team of parallel computing and machine learning experts from @UMDscience and @umiacs has been named a finalist in a global competition recognizing outstanding contributions in high-performance computing. The team—led by Abhinav Bhatele and Tom Goldstein—is one of six vying for top honors in this year’s ACM, Association for Computing Machinery’s Gordon Bell Prize competition. umiacs.umd.edu/about-us/news/…
UMD Research tweet media
English
0
1
20
951