Soham Deshmukh

116 posts

Soham Deshmukh

@sohamdesh_

language and audio @Sesame Prev: @Microsoft @CarnegieMellon

Redmond, WA Katılım Temmuz 2015

257 Takip Edilen188 Takipçiler

Soham Deshmukh retweetledi

Satvik Dixit@SatvikDixit9·4 Ara

Our paper "Mellow: a small audio language model for reasoning" was accepted to #NeurIPS2025! Catch our poster in Session 3 at 11am today.

Soham Deshmukh@sohamdesh_

we show for the first time ever that sub-billion audio models can reason. we introduce mellow, a small audio-language model (167M) that gets SoTA on different audio reasoning tasks. by using our method and data, you can train an alm within 24 hrs on academic resources (1/n 🧵)

English

719

Soham Deshmukh@sohamdesh_·25 Eki

@HirofumiInaguma @rdesh26 Feel free to DM if you are looking for speech/audio roles

English

150

Soham Deshmukh@sohamdesh_·23 Eki

@MimansaJ Sent a DM

English

1.6K

Mimansa Jaiswal@MimansaJ·23 Eki

I was impacted by Meta layoffs today. As a Research Scientist working on LLM posttraining (reward models, DPO/GRPO) & automated evaluation pipelines, I’ve focused on understanding why/wehere models fail & how to make them better. I’m looking for opportunities; please reach out!

Susan Zhang@suchenzang

👀

English

121

231

3.1K

841K

Soham Deshmukh@sohamdesh_·19 Eki

@animesh_garg @dwarkesh_sp I completely agree with the post. But I think the point here was to troll Lex 😂

English

810

Animesh Garg@animesh_garg·19 Eki

Why is credentialed gatekeeping still a thing? @dwarkesh_sp has gained more technical insight, during his deep preparations for the breadth of speakers he meets, than most people would have working full time! And yet, he almost always admits not being the expert in each of his podcasts. Experience is more valuable than pedigree!

taimur@taimurabdaal

I’m intrigued by this Dwarkesh guy but what are his qualifications? E.g. has he done research at a top school like MIT?

English

432

98.1K

Soham Deshmukh retweetledi

arXiv Sound@ArxivSound·22 Tem

Shikhar Bharadwaj, Samuele Cornell, Kwanghee Choi, Satoru Fukayama, Hye-jin Shim, Soham Deshmukh, Shinji Watanabe, "OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder," arxiv.org/abs/2507.14129

Indonesia

6.5K

Soham Deshmukh retweetledi

Dimitris Papailiopoulos@DimitrisPapail·6 Haz

Some of the most impactful work you can do in academia isn’t cool new algos or novel architectures. It’s data research. Data research isn’t just dumping tokens into a json. It requires a ton of rigorous experimentation, algorithmic thinking, and actually talking to your models.

Ludwig Schmidt@lschmidt3

Very excited to finally release our paper for OpenThoughts! After DataComp and DCLM, this is the third large open dataset my group has been building in collaboration with the DataComp community. This time, the focus is on post-training, specifically reasoning data.

English

179

15.2K

Soham Deshmukh@sohamdesh_·27 Nis

@anishmaybe @AllTrails More importantly how was the hike?

English

Anish@anishmaybe·27 Nis

That moment when you forget to log "finish" in your @AllTrails app. Still, I'll take the ATH

English

129

Soham Deshmukh@sohamdesh_·25 Nis

Being presented currently in Hall 3 - poster number 48 - 10 am to 12:30 pm. Come drop by to discuss robustness evaluation for ASR systems! #ICLR2025

Ahmed Shah@AhmedSh1494

Happy to announce that Speech Robust Bench (SRB) is being presented at #ICLR2025. SRB is a comprehensive multi-lingual robustness benchmark for speech recognition. paper: openreview.net/forum?id=D0LuQ… code: github.com/ahmedshah1494/… data: huggingface.co/datasets/mshah… more in 🧵

English

317

Soham Deshmukh@sohamdesh_·16 Nis

@makemytripcare Absolutely worst customer service! My ticket was confirmed after boarding closed, forcing me to buy another ticket at the counter. After a week of chasing 5 different people, still no updates.

English

Soham Deshmukh@sohamdesh_·6 Nis

Just landed in Hyderabad for ICASSP 2025! Excited to reconnect with familiar faces and meet some new ones. Feel free to reach out or swing by the Microsoft booth for a quick chat #ICASSP2025

English

867

Soham Deshmukh@sohamdesh_·1 Nis

@harshit_sikchi @scottniekum @yayitsamyzhang @marcgbellemare @yukez @PeterStone_TX Congratulations Harshit! 🎉🎊

English

136

Harshit Sikchi@harshit_sikchi·1 Nis

Successfully defended my Ph.D. today 🎓🥳! @scottniekum and @yayitsamyzhang are the best advisors I could have ever asked for. A big thanks to my committee members @marcgbellemare @yukez @PeterStone_TX . The full presentation video will be uploaded soon... Excited about what's to come!

English

197

11.1K

Soham Deshmukh@sohamdesh_·15 Mar

collaboration with @SatvikDixit9 and advised by rita singh and bhiksha raj demo: tinyurl.com/mellowredirect huggingface: huggingface.co/soham97/mellow github: github.com/soham97/mellow paper: arxiv.org/abs/2503.08540

English

218

Soham Deshmukh@sohamdesh_·15 Mar

also, if one scales data (audio), mellow performance improves further, highlighting the effectiveness of our minimal recipe. we release the checkpoint, reasonaqa dataset, and will soon open-source the training code to make research on audio-language models more accessible!

English

218

Soham Deshmukh@sohamdesh_·15 Mar

arXiv Sound@ArxivSound

``Mellow: a small audio language model for reasoning,'' Soham Deshmukh, Satvik Dixit, Rita Singh, Bhiksha Raj, ift.tt/BFgXl2L

English

2.5K

Soham Deshmukh retweetledi

𝚐𝔪𝟾𝚡𝚡𝟾@gm8xx8·12 Mar

Mellow: a small audio language model for reasoning

English

100

8.2K

Soham Deshmukh retweetledi

arXiv Sound@ArxivSound·12 Mar

``Mellow: a small audio language model for reasoning,'' Soham Deshmukh, Satvik Dixit, Rita Singh, Bhiksha Raj, ift.tt/BFgXl2L

Indonesia

3.4K

Soham Deshmukh@sohamdesh_·10 Şub

ADIFF is accepted at ICLR 2025! We study the comparative reasoning ability of audio language models. Subsequently, we introduce audio difference explanation task, release datasets and trained models. To be presented at Singapore in April #ICLR2025

arXiv Sound@ArxivSound

``ADIFF: Explaining audio difference using natural language,'' Soham Deshmukh, Shuo Han, Rita Singh, Bhiksha Raj, ift.tt/TYbkawd

English

1.6K

Keşfet

@HirofumiInaguma @rdesh26 @MimansaJ @animesh_garg @dwarkesh_sp @anishmaybe @AllTrails @makemytripcare