Nicholas J. Bryan (@NicholasJBryan) - Twitter profile

Pinned Tweet

🎶 V2M-Zero: SOTA time-sync'd video-to-music! 🎶 * Music sync'd to dance, scene cuts, * Easily adapt existing TTM models, * No paired video-music data, * SOTA objective + human preference. Exceptional work @yblin98 + @casebeer! w/@mtlong_88 @aniruddha26398 @gberta227, me

Yan-Bo Lin@yblin98

🎵🎵What if we could generate video soundtracks without paired video–music data? Introducing V2M-Zero, a method that generates music synchronized with video events. arxiv.org/abs/2603.11042 w. @CasebeerJonah @mtlong_88 @aniruddha26398 @gberta227 @NicholasJBryan

Nicholas J. Bryan@NicholasJBryan·12 Mar

@AiquestAcademy Check out x.com/yblin98/status…

AIQUEST@AiquestAcademy·12 Mar

🤗 huggingface.co/papers/2603.11… 📄 arxiv.org/abs/2603.11042 🌐 genjib.github.io/v2m_zero/

AIQUEST@AiquestAcademy·12 Mar

V2M-Zero: what if you could make music that perfectly matches your video's every move? this new AI makes it real, generating soundtracks that precisely align with events in your footage. it focuses on matching the *flow* and *changes* within your video and music separately, then brings them together, without needing tons of paired examples. the results are super synced up. 🎶 demo available. links 👇

Nicholas J. Bryan@NicholasJBryan·12 Mar

@ArxivSound Check out x.com/yblin98/status…

arXiv Sound@ArxivSound·12 Mar

Yan-Bo Lin, Jonah Casebeer, Long Mai, Aniruddha Mahapatra, Gedas Bertasius, Nicholas J. Bryan, "V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation," arxiv.org/abs/2603.11042

Nicholas J. Bryan@NicholasJBryan·24 Feb

Audio VAEs + VQ-VAEs designed for #GenAI! * Ultra-fast encoding for on-the-fly training pipelines, * ~2x more compression (13Hz) w/frontier quality, * Any format (mono, stereo LR, MS, mel, raw), * Cont. or discrete latents. 👏 @CasebeerJonah! w/@__gzhu__ @zhepeiw03, me

Jonah Casebeer@CasebeerJonah

GenAE: An audio autoencoder engineered for generative modeling. To appear at ICASSP 2026. w/ @__gzhu__ @zhepeiw03 @NicholasJBryan arXiv: arxiv.org/abs/2602.15749 Video: youtu.be/gDIIuLb0cf0

Nicholas J. Bryan@NicholasJBryan·19 Feb

TAC: Timestamped Audio Captioning 👇

Justin Salamon@justin_salamon

This is big. SOTA audio reasoning. SOTA video reasoning. SOTA audio captioning. SOTA sound event detection. Better than Gemini. Better than Qwen. TAC: Timestamped Audio Captioning 📑 paper: lnkd.in/getEz5xU 🌐 website with more demos: lnkd.in/gdw5TTuS

Nicholas J. Bryan@NicholasJBryan·11 Feb

@ArxivSound Check out x.com/slseanwu/statu…

Shih-Lun (Sean) Wu@slseanwu

Excited to announce our ICASSP 2026 paper "Stemphonic: All-at-once Flexible Multi-stem Music Generation" ! w/ @__gzhu__, @j_p_caceres, @huangcza, and @NicholasJBryan 🔊Demo stemphonic-demo.vercel.app 📰Paper arxiv.org/abs/2602.09891 More details in🧵

arXiv Sound@ArxivSound·11 Feb

Shih-Lun Wu, Ge Zhu, Juan-Pablo Caceres, Cheng-Zhi Anna Huang, Nicholas J. Bryan, "Stemphonic: All-at-once Flexible Multi-stem Music Generation," arxiv.org/abs/2602.09891

Nicholas J. Bryan@NicholasJBryan·11 Feb

Thrilled about Stemphonic! All-at-once Flexible Multi-stem Music Generation! w/@slseanwu @__gzhu__ @j_p_caceres @huangcza and myself

Nicholas J. Bryan Retweeted

Luiz Marques@stargliderbr·11 Nov

@LudovicCreator I've been playing with Generate Soundtrack a lot today. It is pretty great, and really fast. I've been collecting a bunch of alternate music options from my videos.

Nicholas J. Bryan Retweeted

Drashya Kuruwa@drashyakuruwa·10 Nov

Why is no one talking about the new Meta AI - Image and Video generation, which is partnered with Midjourney and Black Forest Labs?? The generations are fascinating. Image and Video generated with @Meta @AIatMeta Music - @Adobe Generate Soundtrack @alexandr_wang #MetaAI

Nicholas J. Bryan Retweeted

Alexandra Aisling@AllaAisling·7 Dec

@dreamina_ai Added sound in @AdobeFirefly , generate soundtrack feature

Nicholas J. Bryan@NicholasJBryan·17 Dec

🎵Hiring summer interns on AI music 2026! @AdobeResearch adobe.ly/4dS1zc6 Past intern papers: arxiv.org/abs/2504.15217 TMLR arxiv.org/abs/2507.07867 MLSP arxiv.org/abs/2410.05167 ICLR arxiv.org/abs/2403.10493 SPL arxiv.org/abs/2401.12179 ICML arxiv.org/abs/2311.07069 TASLP

Nicholas J. Bryan Retweeted

Shih-Lun (Sean) Wu@slseanwu·7 Nov

Thrilled to announce “MIDI-LLM: Adapting LLMs for Text-to-MIDI Music Generation” w/ @huangcza and Yoon Kim! 🎸 Live Demo midi-llm-demo.vercel.app 💻 github.com/slSeanWU/MIDI-… 🤗 huggingface.co/slseanwu/MIDI-… From a text prompt, it generates MIDIs you can edit directly in a DAW 🧵

Nicholas J. Bryan@NicholasJBryan·29 Oct

Congrats @JCJesseLai and Team!

Chieh-Hsin (Jesse) Lai@JCJesseLai

Tired of going back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》, with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core ideas that shaped diffusion modeling and explains how today’s models work, why they work, and where they’re heading. 🧵You’ll find the link and a few highlights in the thread. We’d love to hear your thoughts and join some discussions! ⚡ Stay tuned for our markdown version, where you can drop your comments!

Nicholas J. Bryan Retweeted

The Verge@verge·28 Oct

Adobe’s new AI audio tools can add soundtracks and voice-overs to videos theverge.com/news/807809/ad…

Nicholas J. Bryan@NicholasJBryan·28 Oct

Adobe's #GenerateSoundtrack is LIVE today! 🎉 Studio-quality music for storytellers🎵 * Trained on #Licensed data, * Commercially safe, royalty-free, and cleared for any use, & * Exported with #ContentCredentials for transparency and attribution. Get started: firefly.adobe.com/generate/sound… Powered by the #FireflyAudioModel, and built by an incredible R&D team: @j_p_caceres @CasebeerJonah @__gzhu__ @zhepeiw03 @ailiefraser @NicholasJBryan Excited for the Team. Much more to come! #Adobe #AdobeMAX #GenerateSoundtrack #FireflyAudioModel

Nicholas J. Bryan@NicholasJBryan·22 Apr

@wesroth More info below! x.com/NicholasJBryan…

Nicholas J. Bryan@NicholasJBryan

Introducing "DRAGON: Distributional Rewards Optimize Diffusion Generative Models"! 📖: arxiv.org/abs/2504.15217 🎹: ml-dragon.github.io/web/ A new framework for fine-tuning gen models towards a target distribution. By Yatong Bai w/@CasebeerJonah @somayeh_sojoudi @NicholasJBryan

Wes Roth@WesRoth·22 Apr

Adobe just dropped DRAGON on Hugging Face DRAGON (Distributional RewArds for Generative OptimizatioN) is a fine-tuning framework for media generation from images to text-to-music models. Unlike RLHF or DPO, DRAGON can optimize for individual samples and entire distributions of outputs giving it next-level flexibility for real-world creative tasks.

AK@_akhaliq

Adobe announced DRAGON on Hugging Face Distributional Rewards Optimize Diffusion Generative Models

Nicholas J. Bryan@NicholasJBryan·22 Apr

@_akhaliq Check out more info at x.com/NicholasJBryan…

Nicholas J. Bryan Retweeted

AK@_akhaliq·22 Apr

Adobe announced DRAGON on Hugging Face Distributional Rewards Optimize Diffusion Generative Models

Nicholas J. Bryan@NicholasJBryan·22 Apr

@ArxivSound For more info, check out x.com/NicholasJBryan…

arXiv Sound@ArxivSound·22 Apr

"DRAGON: Distributional Rewards Optimize Diffusion Generative Models," Yatong Bai, Jonah Casebeer, Somayeh Sojoudi, Nicholas J. Bryan, ift.tt/0JpvRfW

Nicholas J. Bryan@NicholasJBryan·22 Apr

DRAGON introduces a new approach to designing and optimizing reward functions to enhance human-perceived quality.

Nicholas J. Bryan@NicholasJBryan·22 Apr

With an appropriate exemplar set, DRAGON achieves a 60.95% human-voted music quality win rate without training on human preference annotations.

Nicholas J. Bryan@NicholasJBryan·22 Apr

Introducing "DRAGON: Distributional Rewards Optimize Diffusion Generative Models"! 📖: arxiv.org/abs/2504.15217 🎹: ml-dragon.github.io/web/ A new framework for fine-tuning gen models towards a target distribution. By Yatong Bai w/@CasebeerJonah @somayeh_sojoudi @NicholasJBryan
