Arnav Chavan

15 posts

Arnav Chavan

Arnav Chavan

@ArnavChavan6

Senior Applied Scientist @ Amazon Lab126 | 1x Founder | Efficient & Sustainable AI | Prev. - Google Deepmind, Microsoft Research

Sunnyvale, CA Katılım Şubat 2022
298 Takip Edilen99 Takipçiler
Aditya Tomar
Aditya Tomar@adityastomar_·
Excited to begin my summer research internship at @nvidia today. I’ll be working in the Applied Deep Learning Research team in the Santa Clara HQ office. Let me know if you are around and would like to meet!
Aditya Tomar tweet media
English
38
4
459
14.6K
Arnav Chavan retweetledi
Steve Laskaridis
Steve Laskaridis@stevelaskaridis·
📣 We're extending the AdaptFM @ ICML'26 paper submission deadline to May 8! This will align better with the ICML acceptance announcements and NeurIPS deadlines. 👉 Submit your work at: openreview.net/group?id=ICML.… 🌐 CfP: adaptfm.gitlab.io
Steve Laskaridis tweet media
English
0
3
3
443
Arnav Chavan retweetledi
Steve Laskaridis
Steve Laskaridis@stevelaskaridis·
📢 Announcing the AdaptFM Workshop @icmlconf As foundation models grow in scale and ubiquity, the ability to adapt inference dynamically to the task and available resources becomes critical. Submission deadline: May 1, 2026 (AoE) 📍Seoul, South Korea 🌐adaptfm.gitlab.io
Steve Laskaridis tweet media
English
2
3
5
1K
Arnav Chavan retweetledi
Nyun AI
Nyun AI@Nyun_AI_·
🔍 From tedious to streamlined! Nyun Zero's 'Nyun Kompress' module transforms model optimization with automated compression techniques that maintain performance. 🚀📊 #AIModel #TechNews #Efficiency Learn more at nyunai.com
English
0
1
1
198
Arnav Chavan retweetledi
Nyun AI
Nyun AI@Nyun_AI_·
Our Team at @Nyun_AI_ recently compiled an experimental survey with a thorough codebase - Faster and Lighter LLMs: A Survey on Current Challenges and Way Forward. We provide a comprehensive overview of multiple LLM compression methodologies along with system-level approaches.
Nyun AI tweet media
English
1
1
4
185
Arnav Chavan retweetledi
Nyun AI
Nyun AI@Nyun_AI_·
Say hello to Nyun Zero 💡- where AI meets efficiency! Reduce inference costs, speed up training, and secure your data like never before. 🛡️ Join the revolution in AI productivity. #EfficientAI #DeepLearning 🚀 Sign up here ➡️ [forms.office.com/r/NxYwkmGypG]
English
0
3
6
646
Arnav Chavan retweetledi
AK
AK@_akhaliq·
Rethinking Compression: Reduced Order Modelling of Latent Features in Large Language Models paper page: huggingface.co/papers/2312.07… Due to the substantial scale of Large Language Models (LLMs), the direct application of conventional compression methodologies proves impractical. The computational demands associated with even minimal gradient updates present challenges, particularly on consumer-grade hardware. This paper introduces an innovative approach for the parametric and practical compression of LLMs based on reduced order modelling, which entails low-rank decomposition within the feature space and re-parameterization in the weight space. Notably, this compression technique operates in a layer-wise manner, obviating the need for a GPU device and enabling the compression of billion-scale models within stringent constraints of both memory and time. Our method represents a significant advancement in model compression by leveraging matrix decomposition, demonstrating superior efficacy compared to the prevailing state-of-the-art structured pruning method.
AK tweet media
English
0
6
24
9K
Chenlin Meng
Chenlin Meng@chenlin_meng·
We will be presenting our @CVPR Award Candidate paper On Distillation of Guided Diffusion Models tomorrow Wed 21 Jun 4:30-6 p.m. at West Building Exhibit Halls ABC 186! Super honored that our paper is selected as one of the 12 award candidates! 🙏 #CVPR2023
Chenlin Meng tweet media
English
7
22
140
47.8K
Arnav Chavan retweetledi
AK
AK@_akhaliq·
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning paper page: huggingface.co/papers/2306.07… present Generalized LoRA (GLoRA), an advanced approach for universal parameter-efficient fine-tuning tasks. Enhancing Low-Rank Adaptation (LoRA), GLoRA employs a generalized prompt module to optimize pre-trained model weights and adjust intermediate activations, providing more flexibility and capability across diverse tasks and datasets. Moreover, GLoRA facilitates efficient parameter adaptation by employing a scalable, modular, layer-wise structure search that learns individual adapter of each layer. Originating from a unified mathematical formulation, GLoRA exhibits strong transfer learning, few-shot learning and domain generalization abilities, as it adjusts to new tasks through additional dimensions on weights and activations. Comprehensive experiments demonstrate that GLoRA outperforms all previous methods in natural, specialized, and structured benchmarks, achieving superior accuracy with fewer parameters and computations on various datasets. Furthermore, our structural re-parameterization design ensures that GLoRA incurs no extra inference cost, rendering it a practical solution for resource-limited applications.
AK tweet media
English
3
106
510
97.7K
JQUAVE
JQUAVE@jquave·
If your working on any of these, DM me: 1. In memory vector database 2. Generalized LoRA adapters dev environment+marketplace. (npm for LoRA adapters) 3. Marketplace for ongoing rlhf tasks 4. LLM schema fine tuning for consistent outputs in, for example, YAML
English
2
1
3
370