Aseem Srivastava

272 posts

Aseem Srivastava banner
Aseem Srivastava

Aseem Srivastava

@as3eem

NLP Postdoc at @mbzuai | Prev: PhD @IIITDelhi / @lcs2lab @flamenlp

Abu Dhabi, UAE Katılım Ağustos 2017
972 Takip Edilen270 Takipçiler
Aseem Srivastava retweetledi
MBZUAI
MBZUAI@mbzuai·
What does loneliness look like in the age of AI? And can AI respond to it responsibly? A team from MBZUAI received a Google Academic Research Award (GARA) to explore one of the most urgent psychosocial challenges of our time: loneliness in digital spaces. The project, A Psychosocial Loneliness Framework for Safer AI Companionship, is led by Professor Thamar Solorio, Professor Monojit Choudhury, and postdoctoral researcher Aseem Srivastava in collaboration with Professor Munmun De Choudhury's team from Georgia Institute of Technology. Together, they will examine how loneliness is expressed online, how conversational agents can detect it, and what healthier, more responsible AI companionship could look like. Read more here: mbzuai.ac.ae/news/mbzuai-te… Watch more here: youtube.com/watch?v=_SOWTd…
YouTube video
YouTube
English
2
4
8
963
Aseem Srivastava retweetledi
thamar |
thamar |@thamar_solorio·
Welcome back lunch for RiTUAL lab: a new semester started and we have some new faces and some members completing their appointment with us. I'm thankful for the contributions and connections that the researchers in my group bring. I'm still hiring, visiting students, postdocs, short term research visits. Get in touch. See our research topics here: ritual-mbzuai.github.io/web/
thamar | tweet media
English
1
4
21
1.1K
hallerite
hallerite@hallerite·
I am thinking of starting a reading group / weekly space for LLM-RL. Would anyone be interested in that?
English
171
9
643
51.2K
Aseem Srivastava retweetledi
Tanmoy Chakraborty
Tanmoy Chakraborty@Tanmoy_Chak·
Our #ICML25 Paper "𝐄𝐧𝐨𝐮𝐠𝐡 𝐨𝐟 𝐒𝐜𝐚𝐥𝐢𝐧𝐠 𝐋𝐋𝐌𝐬! 𝐋𝐞𝐭'𝐬 𝐅𝐨𝐜𝐮𝐬 𝐨𝐧 𝐃𝐨𝐰𝐧𝐬𝐜𝐚𝐥𝐢𝐧𝐠" challenges the obsession with scaling laws -- it's time to #downscale LLMs for efficiency, sustainability & real-world usability. Remember: SLMs are the future! Kudos to my students: Ayan Sengupta, Yash Goel @icmlconf @lcs2lab @iitdelhi
English
0
1
23
701
Aseem Srivastava retweetledi
#CVPR2026
#CVPR2026@CVPR·
#CVPR2025 Area Chairs (ACs) identified a number of highly irresponsible reviewers, those who either abandoned the review process entirely or submitted egregiously low-quality reviews, including some generated by large language models (LLMs). 1/2
English
15
55
578
111.1K
Aseem Srivastava retweetledi
#CVPR2026
#CVPR2026@CVPR·
Following a thorough investigation, the Program Chairs (PCs) decided to desk-reject 19 papers authored by confirmed highly irresponsible reviewers, which would have been accepted otherwise, in accordance with the previously communicated CVPR 2025 policies. 2/2
English
11
37
444
81.7K
Aseem Srivastava retweetledi
FLaMe Research Lab
FLaMe Research Lab@flamenlp·
🎉 Excited to share that our work has been accepted to #Findings of NAACL 2025! 📜 Title: Target-Augmented Shared Fusion-based Multimodal Sarcasm Explanation Generation 👥 Authors: Palaash Goel, Dushyant Singh Chauhan, Md Shad Akhtar #NAACL2025 #NLP #Multimodal #AIResearch
English
0
2
8
166
Aseem Srivastava retweetledi
Tanmoy Chakraborty
Tanmoy Chakraborty@Tanmoy_Chak·
Kicking off the year with a bang -- 4 papers accepted in prestigious venues this month! #ICLR2025 -- 𝐋𝐋𝐌 𝐜𝐨𝐦𝐩𝐫𝐞𝐬𝐬𝐢𝐨𝐧: We introduce 𝐏𝐫𝐮𝐧𝐞𝐍𝐞𝐭, a novel, dataset-free policy learning approach to model pruning, achieving high compression efficiency and performance retention, demonstrated by compressing LLaMA-2-7B with over 80% zero-shot accuracy retention at a 30% compression ratio. @iclr_conf URL: shorturl.at/HEO7O #𝐍𝐀𝐀𝐂𝐋2025 -- 𝐈𝐧𝐯𝐞𝐬𝐭𝐢𝐠𝐚𝐭𝐢𝐧𝐠 𝐦𝐮𝐥𝐭𝐢𝐥𝐢𝐧𝐠𝐮𝐚𝐥 𝐥𝐨𝐧𝐠-𝐜𝐨𝐧𝐭𝐞𝐱𝐭 𝐛𝐞𝐡𝐚𝐯𝐢𝐨𝐫 𝐢𝐧 𝐋𝐋𝐌𝐬: We introduce 𝐌𝐋𝐍𝐞𝐞𝐝𝐥𝐞, the first systematic evaluation of multilingual long-context retrieval in LLMs, revealing significant performance variations across languages and context positions, with insights to guide future evaluations. @naaclmeeting Preprint: lnkd.in/gtRAXjmh 𝐍𝐀𝐀𝐂𝐋'25 -- 𝐂𝐨𝐮𝐧𝐭𝐞𝐫𝐬𝐩𝐞𝐞𝐜𝐡 𝐞𝐯𝐚𝐥𝐮𝐚𝐭𝐢𝐨𝐧 𝐛𝐞𝐧𝐜𝐡𝐦𝐚𝐫𝐤 𝐚𝐧𝐝 𝐦𝐞𝐭𝐫𝐢𝐜𝐬: We introduce 𝐂𝐒𝐄𝐯𝐚𝐥, a dataset for evaluating counterspeech across four dimensions and a prompt-based framework using auto-calibrated CoT, offering better alignment with human judgment than traditional metrics. @naaclmeeting 𝐍𝐚𝐭𝐮𝐫𝐞 𝐌𝐚𝐜𝐡𝐢𝐧𝐞 𝐈𝐧𝐭𝐞𝐥𝐥𝐢𝐠𝐞𝐧𝐜𝐞: In collaboration with AIIMS (All India Institute of Medical Sciences, New Delhi), NIMHANS, Bangalore and other NGOs, we wrote how GenAI can potentially empower multisectoral suicide prevention efforts, particularly in resource-constrained settings like India. @NatMachIntell
Tanmoy Chakraborty tweet media
English
1
2
26
1.5K
Aseem Srivastava retweetledi
FLaMe Research Lab
FLaMe Research Lab@flamenlp·
#WWW2025 | Our work has been accepted for oral presentation at TheWebConf 2025 Title: Figurative-cum-Commonsense Knowledge Infusion for Multimodal Mental Health Meme Classification Stay tuned for preprint, data, and code on our lab's webpage: @flamenlp 1/n
English
1
2
6
157
Aseem Srivastava retweetledi
Tanmoy Chakraborty
Tanmoy Chakraborty@Tanmoy_Chak·
🌟 𝐀 𝐍𝐞𝐰 T𝐞𝐱𝐭𝐛𝐨𝐨𝐤 -- 𝐈𝐧𝐭𝐫𝐨𝐝𝐮𝐜𝐭𝐢𝐨𝐧 𝐭𝐨 𝐋𝐚𝐫𝐠𝐞 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐌𝐨𝐝𝐞𝐥𝐬 🌟 I am excited to share the release of my new textbook, 𝘐𝘯𝘵𝘳𝘰𝘥𝘶𝘤𝘵𝘪𝘰𝘯 𝘵𝘰 𝘓𝘢𝘳𝘨𝘦 𝘓𝘢𝘯𝘨𝘶𝘢𝘨𝘦 𝘔𝘰𝘥𝘦𝘭𝘴 (#LLMs) -- Perhaps the first textbook on LLMs. Target Audience: 👉 Students/beginners, Looking for a structured starting point to learn LLMs 👉 Teachers, planning to offer a course on LLMs 👉 Industry professional, seeking to deepen their understanding of LLMs Explore the Book: 🔗 Book Website: tanmoychak.com/llmbook/ 📑 Table of Contents: tanmoychak.com/llmbook/toc.pdf 🛒 Available on Amazon: amazon.in/dp/936386474X/ Enhance Your Learning Experience: 👉 Slides & Lecture Videos: Chapter-wise resources -- lcs2-iitd.github.io/ELL881-AIL821-… 👉 Exercises & Solutions: Practice with detailed chapter exercises (solutions available on request). 👉 Upcoming @nptel_official Course: Starting January 2025! Preview here: onlinecourses.nptel.ac.in/noc25_cs45/pre… Book Endorsement: 📖 Foreword by Prof. Tim Baldwin @eltimster 👏 Endorsements from Prof. Iryna Gurevych @IGurevych and Prof. Pushpak Bhattacharyya #LLMs #Textbook @iitdelhi @WileyIndiaPL @lcs2lab
Tanmoy Chakraborty tweet media
English
0
22
76
9.2K
Aseem Srivastava retweetledi
Shital Shah
Shital Shah@sytelus·
Are you ready for an early Christmas present from our team at Microsoft Research? Introducing the most powerful smol model ever built in the world! Welcome to Phi-4! 👇
Shital Shah tweet media
English
37
130
1.6K
215.7K
Aseem Srivastava
Aseem Srivastava@as3eem·
I will be in Miami this week for EMNLP '24! I’ll be presenting our latest work in my PhD and would like to discuss career opportunities (industry/academia). Let’s connect, exchange ideas, and make the most of this #EMNLP.
English
0
1
17
1.2K
Aseem Srivastava retweetledi
Graham Neubig
Graham Neubig@gneubig·
The AI community has been lacking models that are: - multilingual 🗣️ - multimodal 🖼️ - multicultural 🌎🌍🌏 Our new paper introduces - Pangea, a model - PangeaInstruct, a dataset - PangeaBench, an eval benchmark towards this goal! neulab.github.io/Pangea/
Graham Neubig tweet media
Xiang Yue@xiangyue96

🌍 I’ve always had a dream of making AI accessible to everyone, regardless of location or language. However, current open MLLMs often respond in English, even to non-English queries! 🚀 Introducing Pangea: A Fully Open Multilingual Multimodal LLM supporting 39 languages! 🌐✨ neulab.github.io/Pangea/ arxiv.org/pdf/2410.16153 The Pangea family includes three major components: 🔥 Pangea-7B: A state-of-the-art multilingual multimodal LLM capable of 39 languages! Not only does it excel in multilingual scenarios, but it also matches or surpasses English-centric models like Llama 3.2, Molmo, and LlavaOneVision in English performance. 📝 PangeaIns: A 6M multilingual multimodal instruction tuning dataset across 39 languages. 🗂️ With 40% English instructions and 60% multilingual instructions, it spans various domains, including 1M culturally-relevant images sourced from LAION-Multi. 🎨 🏆 PangeaBench: A comprehensive evaluation benchmark featuring 14 datasets in 47 languages. Evaluation can be tricky, so we carefully curated existing benchmarks and introduced two new datasets: xChatBench (human-annotated wild queries with fine-grained evaluation criteria) and xMMMU (a meticulously machine-translated version of MMMU). 🙌 This is a joint leading effort with @yueqi_song. Also kudos to the amazing team @AkariAsai, @seungonekim, @Jeande_d, @simi_97k, @anjali_ruban, @lintangsutawika, @Sathya8NR, @gneubig for their hard work! Check out more results and insights we conclude from our training in the thread below. 👇

English
2
28
178
11.7K
Aseem Srivastava retweetledi
Mian Zhang
Mian Zhang@_Guuuuuuuu_·
📢 Excited to introduce CBT-Bench (huggingface.co/papers/2410.13…), a new benchmark that systematically evaluates LLMs’ capabilities in Cognitive Behavioral Therapy (CBT) across three Levels:
English
1
7
17
12.7K
Rose
Rose@rose_e_wang·
Thanks so much @arankomatsuzaki for promoting this work. I remember the magical feeling I had using GitHub CoPilot -- I wished all people in all different domains could experience that in live interactions!!! Fast forward, voila, Tutor CoPilot :)
Aran Komatsuzaki@arankomatsuzaki

Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise Presents the first large-scale intervention of a Human-AI Approach that has statistically significant positive learning gains w/ 900 tutors & 1,800+ K12 students arxiv.org/abs/2410.03017

English
2
3
23
2.8K
Declan Grabb, MD
Declan Grabb, MD@declangrabbmd·
Excited to be at @COLM_conf to present our work, “Risks from Language Models for Automated Mental Healthcare” (arxiv.org/abs/2406.11852) with @MLamparth and @NinaVasan. As a practicing physician & forensic psychiatrist, I’d love to chat about human-centered AI, health AI, and alignment! Please reach out!
English
3
8
40
4.3K