Shoval Messica
@ShovalMessica

Audio & Speech MSc student at HUJI @cseHUJI

Joined May 2024
107 Following · 35 Followers
15 posts
Shoval Messica retweeted
Omri Fahn @OmriFahn
MFA is a beautiful visualization, but not only that. It’s a practical tool: competitive localization and better steering, while revealing that concepts live in regions, not just single directions. Grateful I got to contribute to this with an awesome team!
Or Shafran @OrShafran

It's time to look past dictionary learning for decomposing LM activations. What happens when we instead leverage local geometry? We find a natural region-based decomposition that yields better steering and localization 🧵 1/

Shoval Messica retweeted
Omri Fahn @OmriFahn
🤔Can an LLM "unconscious" feel a lie bubbling up before it speaks? 🤔And what mysteries hide in its linear space of uncertainty? Thrilled to share our new paper: “Pre-trained LLMs Learn Multiple Types of Uncertainty” (@roicohen9 @OmriFahn @GerardDeMelo) arxiv.org/abs/2505.21218
Shoval Messica retweeted
Gallil Maimon @GallilMaimon
🚨New paper on SLM evaluation🚨 We present SALMon🍣 which is a suite of benchmarks for evaluating how much Speech Language Models model acoustic elements like sentiment or background noise. Project: pages.cs.huji.ac.il/adiyoss-lab/sa… 🧵👇🏻
Shoval Messica retweeted
Michael Hassid @MichaelHassid
Which is better, running a 70B model once, or a 7B model 10 times? The answer might be surprising! Presenting our new @COLM_conf paper: "The Larger the Better? Improved LLM Code-Generation via Budget Reallocation" arxiv.org/abs/2404.00725 1/n
Shoval Messica retweeted
Mohammad Salama @MohammadSalaama
I am excited to share my first work: "Dataset Size Recovery from LoRA Weights". Ever wondered if you could find out how many samples a model was trained on using just its weights? Well now you can! Project: vision.huji.ac.il/dsire/ 👇
Shoval Messica retweeted
Yossi Adi @adiyossLC
Speech tokenizers are fundamental in building speech LMs. In a recent study we show the common tokenization method (a.k.a “semantic tokens”) is not robust to different signal variations. We then present NAST! a noise aware speech tokenizer w. @ShovalMessica Code and models 👇
Shoval Messica @ShovalMessica

🚨I’m excited to share our #INTERSPEECH2024 paper “NAST: Noise Aware Speech Tokenization for Speech Language Models” 🥳 W/ @adiyossLC Paper: arxiv.org/abs/2406.11037 Code: github.com/ShovalMessica/…

Shoval Messica retweeted
Guy Yariv @guy_yariv
1/ Commonsense reasoning needs multimodal knowledge, yet current LLMs focus mostly on text, limiting their integration of crucial visual information. We introduce vLMIG, a method that enhances LLMs' visual commonsense by integrating images into the decision-making process
Shoval Messica retweeted
arXiv Sound @ArxivSound
“Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation,” Or Tal, Alon Ziv, Itai Gat, Felix Kreuk, Yossi Adi, ift.tt/tP4Hua8
Shoval Messica retweeted
arXiv Sound @ArxivSound
“NAST: Noise Aware Speech Tokenization for Speech Language Models,” Shoval Messica, Yossi Adi, ift.tt/TwrHMcg