Julius Richter (@JuliusRichter13) - Twitter Profili

Julius Richter@JuliusRichter13·21 Ara

Truly a pleasure to be involved in building such a versatile sound event detection model. Huge shout-out to my great collaborators: @apoorv2904 @mhnt1580 @BowenShi20 @bigpon517 👏

English

0

52

Julius Richter@JuliusRichter13·21 Ara

Fun example of PE-A-Frame confidently predicting “a man speaking” and “a man coughing”, but being much less confident about “a man predicting the future” 😄 (Though @SchmidhuberAI may have been right back then) 📄 Paper: tinyurl.com/pe-av-paper 💻 Code: tinyurl.com/pe-av-code

English

1

0

2

72

Julius Richter@JuliusRichter13·28 Kas

@ymas0315 Haha nice! And was thinking about a DJ mixer. Maybe the auto-translation messed up 😆

English

1

0

65

まっすー@ymas0315·28 Kas

@JuliusRichter13 Vitamix for smoothies😅

English

1

0

70

まっすー@ymas0315·28 Kas

ブラックフライデーに論文通ったので調子に乗ってお高めのミキサーを買ってしまった

日本語

1

0

9

729

Julius Richter@JuliusRichter13·1 Kas

🙏 Many thanks to my supervisor Timo Gerkmann for his invaluable guidance, to the reviewers Shinji Watanabe and Simon Leglaive for their insightful feedback, and to commission members Sören Laue and Jianwei Zhang. Grateful to all collaborators who made this journey so rewarding!

English

0

116

Julius Richter@JuliusRichter13·1 Kas

🎓 I’m thrilled to share that I successfully defended my PhD on generative speech enhancement at the University of Hamburg! My work explored diffusion-based and audio-visual generative models for robust speech enhancement and audio restoration. 👉 tinyurl.com/thesis-richter

English

3

0

8

300

Julius Richter@JuliusRichter13·9 Eki

@adriiiarizav2 Code

English

0

54

Adrian Ariza@adriiiarizav2·9 Eki

Nicee! I have a lots of code invite for Sora 2, FOLLOW ME and comment here “CODE” and I will send by X chat 😊🤘🏽

English

40

11

12

1.4K

Julius Richter@JuliusRichter13·29 Eyl

@ArxivSound Reference [6] is not my work.

English

0

1

79

arXiv Sound@ArxivSound·29 Eyl

Naisong Zhou, Saisamarth Rajesh Phaye, Milos Cernak, Tijana Stojkovic, Andy Pearce, Andrea Cavallaro, Andy Harper, "Shortcut Flow Matching for Speech Enhancement: Step-Invariant flows via single stage training," arxiv.org/abs/2509.21522

English

1

7

677

Julius Richter@JuliusRichter13·21 Ağu

@forthshinji I am curious about the result for generative SE …

English

0

2

93

Shinnosuke Takamichi / 高道慎之介@forthshinji·19 Ağu

草 (音声強調を繰り返しかけると無になる)

Shinnosuke Takamichi / 高道慎之介 tweet media

日本語

2

8

60

5.2K

Julius Richter@JuliusRichter13·19 Tem

@serrjoa Absolutely. I also have speech enhancement/restoration in mind. Replacing corrupted areas in the spectrogram using CLAP similarity on speech.

English

1

0

1

112

Joan Serrà@serrjoa·19 Tem

Let's do this for audio, no?

Kwang Moo Yi@kwangmoo_yi

Preprint of today: Beyer et al., "Highly Compressed Tokenizer Can Generate Without Training" -- github.com/lukaslaobeyer/… The latent space of tokenizers already provides a good enough abstraction to work with -- you don't have to use a diffusion model on top to inpaint, etc!

English

3

1

15

1.3K

Julius Richter@JuliusRichter13·16 Tem

@wuyusongwys Amazing! Thanks for answering the questions my brother asked you 😊 I couldn’t make it to ICML this time

English

0

78

Yusong Wu@wuyusongwys·15 Tem

We are presenting our poster soon at West Exhibition Hall B2-B3 W-502, at 3:30-4:30pm! Check it out online: icml.cc/virtual/2025/p…

Yusong Wu@wuyusongwys

It’s been a thrilling journey building FLAM! 🚀 Super proud of what we achieved open‑vocabulary audio event detection using calibrated frame‑wise modeling. FLAM will be presented at ICML 2025, come check it out! 📄 Paper: arxiv.org/abs/2505.05393 🎧 Demo: flam-model.github.io

English

2

0

14

692

Julius Richter@JuliusRichter13·3 Tem

@BenUFO @notyumiko1000 What's the one at 1:55:00? Pure fire 🔥

English

0

22

Ben UFO@BenUFO·7 May

it's up! on.soundcloud.com/E5kWDqX3RVKxGi… @notyumiko1000

English

1

2

23

6.6K

Julius Richter@JuliusRichter13·26 Haz

@wuyusongwys Great contribution!

English

0

45

Yusong Wu@wuyusongwys·24 Haz

It’s been a thrilling journey building FLAM! 🚀 Super proud of what we achieved open‑vocabulary audio event detection using calibrated frame‑wise modeling. FLAM will be presented at ICML 2025, come check it out! 📄 Paper: arxiv.org/abs/2505.05393 🎧 Demo: flam-model.github.io

Justin Salamon@justin_salamon

I think we finally cracked it? FLAM can detect *any* sound via text prompts arXiv (ICML'25): arxiv.org/abs/2505.05335… demos: flam-model.github.io @AdobeResearch+@MIT+@Mila_Quebec led by @wuyusongwys w/@tsirigoc @Kotentorothy @huangcza @AaronCourville @urinieto @pseetharaman

English

4

10

67

5.5K

Julius Richter@JuliusRichter13·25 Haz

@justin_salamon @AdobeResearch @MIT @Mila_Quebec @wuyusongwys @tsirigoc @Kotentorothy @huangcza @AaronCourville @urinieto @pseetharaman Great work! Will you release the code/checkpoints?

English

0

1

237

Justin Salamon@justin_salamon·24 Haz

I think we finally cracked it? FLAM can detect *any* sound via text prompts arXiv (ICML'25): arxiv.org/abs/2505.05335… demos: flam-model.github.io @AdobeResearch+@MIT+@Mila_Quebec led by @wuyusongwys w/@tsirigoc @Kotentorothy @huangcza @AaronCourville @urinieto @pseetharaman

English

6

37

267

26.1K

Julius Richter@JuliusRichter13·10 Haz

@ArxivSound Great work!

English

0

51

arXiv Sound@ArxivSound·10 Haz

``FLAM: Frame-Wise Language-Audio Modeling,'' Yusong Wu, Christos Tsirigotis, Ke Chen, Cheng-Zhi Anna Huang, Aaron Courville, Oriol Nieto, Prem Seetharaman, Justin Salamon, ift.tt/vfyP8jR

Filipino

1

0

12

841

Julius Richter@JuliusRichter13·10 May

@wataru9871 👏

QME

0

1

313

took@wataru9871·9 May

ｴｯﾎｴｯﾎｴｯﾎｴｯﾎ残響を保持した音声復元ができるって伝えなきゃｴｯﾎｴｯﾎｴｯﾎｴｯﾎ残響の制御もできるって伝えなきゃｴｯﾎみんなに伝えなきゃ paper: arxiv.org/abs/2505.05077 demo: google.github.io/df-conformer/r…

GIF

日本語

2

42

206

28.9K

Julius Richter retweetledi

arXiv Sound@ArxivSound·9 May

``Normalize Everything: A Preconditioned Magnitude-Preserving Architecture for Diffusion-Based Speech Enhancement,'' Julius Richter, Danilo de Oliveira, Timo Gerkmann, ift.tt/0WZoJm7

English

0

1

9

814

Julius Richter@JuliusRichter13·23 Nis

Check out the slides here: shorturl.at/TUK5G. Please note that the PDF is 36 MB in size due to its audio and video content, and it is best viewed using Acrobat Reader.

English

0

2

0

540

Julius Richter@JuliusRichter13·21 Nis

Join me tomorrow for a webinar on "Generative Audio Restoration in Multimodal Applications"! I'll introduce the tractable Schrödinger bridge and discuss the differences between flow matching and score-based methods. signalprocessingsociety.org/blog/sps-webin…

English

1

3

14

1.6K

Julius Richter@JuliusRichter13·7 Nis

Will present our paper, "Investigating Training Objectives for Generative Speech Enhancement" at #ICASSP2025! 🗓 Wed, 5:00-6:30 PM (AASP-P12) 🔊 Discussing diffusion bridges (Schrödinger bridge) & connections to score-based models—let’s chat!

English

0

3

21

2.1K

Julius Richter

Keşfet