FaRo

10K posts


@faroit

@[email protected] Audio-AI researcher at @audioshakeai (Before: @inria, @FraunhoferIIS / @uniFAU). All in 17.68% of grey

Montpellier, France · Joined April 2008
717 Following · 1K Followers
FaRo retweeted
AudioShake
AudioShake@AudioShakeAI·
DJs making music magic, create yours with AudioShake stems. djay Pro from @algoriddim now includes AudioShake’s tech to isolate vocals, instruments, and drums at the highest quality available to the industry. On mobile. 🎧📷#djtools #beatmaking
English
0
6
19
3.2K
FaRo retweeted
Ondřej Cífka
Ondřej Cífka@cifkao·
Our paper on lyrics transcription evaluation is on arXiv with updated and extended results (including Whisper v3)! 📄 arxiv.org/abs/2311.13987 Also, the benchmark is now on @paperswithcode: 🏆 paperswithcode.com/dataset/jam-alt 👏 @faroit @h_schreiber @Luke_Miner @cnst_ant @AudioShakeAI
AudioShake@AudioShakeAI

This month at #ISMIR2023, AudioShake’s Research team presented a new benchmark for automatic lyric transcription systems– one that accounts for the nuances of music. You can read more on their new paper on AudioShake: audioshake.ai/post/new-bench…

English
1
2
15
1.3K
FaRo
FaRo@faroit·
@ISMIRConf it's 3 PM and it's not open :-/
English
1
0
0
332
ISMIR Conference
ISMIR Conference@ISMIRConf·
📢 LATE-BREAKING DEMO REOPENS 📢 today at 3 p.m. (CEST) and will remain open until the originally announced deadline. We can guarantee acceptance of a limited number of papers (venue capacity will be confirmed later) and apply a priority to papers submitted early. #ISMIR2023
English
1
5
15
2.9K
FaRo
FaRo@faroit·
@zhaojw1998 @gkspearow closing this after a couple of hours doesn't sound like a good review process to me
English
0
1
1
119
Jingwei Zhao
Jingwei Zhao@zhaojw1998·
@gkspearow I believe that is the most likely reason. Since there are only 15 entries to be accepted, I guess it's reasonable that all spots are filled quite early.
English
1
0
1
191
Jingwei Zhao
Jingwei Zhao@zhaojw1998·
ISMIR LBD just started and acceptance is rolling-based. CMT was still open this morning but closed just now, before I finished my draft. Never expected such competitiveness🥲 But anyway, it seems many people have great progress to share. Looking forward to connecting with #ISMIR2023 in Milan :-)
English
5
1
16
4.3K
FaRo
FaRo@faroit·
@zhaojw1998 Same here for us. Very unfortunate and not really fair, as it can't be automated. Why not let more papers be submitted and review them by quality instead of submission time?
English
0
0
2
150
FaRo
FaRo@faroit·
@serrjoa Btw. Awesome paper!
English
0
0
1
121
Joan Serrà
Joan Serrà@serrjoa·
Want to convert from mono to stereo? 🔊➡️🔊🔊 In our latest work, we posit that upmixing mono to stereo is a great avenue for generative modeling, and that parametric stereo coding can facilitate things. Paper: arxiv.org/abs/2306.14647
English
5
16
100
19K
FaRo
FaRo@faroit·
@serrjoa Out of curiosity, is that parametric stereo robust enough to extract spatial parameters from a mixture and apply them to the sources?
English
1
0
0
213
FaRo
FaRo@faroit·
I am looking for reviewers for a @JOSS_TheOJ submission with expertise in speech and Python. The software under review is a new speech enhancement module of github.com/espnet/espnet (hence it's so difficult to get reviewers without conflicts of interest). Any pointer is helpful!
English
1
1
9
1.2K
Julien Chaumond
Julien Chaumond@julien_c·
Hugging Face Lyon office
English
10
4
150
16.3K
FaRo retweeted
Yusong Wu
Yusong Wu@wuyusongwys·
Join us for a special edition of our Mila Music + AI Reading Group from February 8th to 22nd! We're excited to host 5 teams from the 2022 AI Song Contest, an international contest where musicians and scientists collaborate to explore human-AI co-creativity.
English
1
10
56
14.1K
FaRo retweeted
Loreto Parisi
Loreto Parisi@loretoparisi·
#Pytorch implementation of MusicLM, new SOTA model for music generation using attention networks plus embeddings from MuLan, a text-audio contrastive learned model github.com/lucidrains/mus…
English
0
3
7
585
Jim Fan
Jim Fan@DrJimFan·
@WilliamLamkin Same, I’m surprised by the momentum in audio this year. 4 models in one week is insane even by modern AI’s pace.
English
3
0
18
8.7K
Jim Fan
Jim Fan@DrJimFan·
Music & sound effect industry has not fully understood the size of the storm about to hit. There’re not just one, or two, but FOUR audio models in the past week *alone* If 2022 is the year of pixels for generative AI, then 2023 is the year of sound waves. Deep dive with me: 🧵
English
82
914
4.3K
1.1M
FaRo retweeted
AIcrowd
AIcrowd@aicrowdHQ·
🎻 The SDX23 challenge introduces a new formulation of audio source separation: cinematic sound separation. The task is to separate a movie's audio into three tracks: dialogue, sound effects & music. 📕 Give it a try using the starter kit. aicrowd.com/challenges/sou…
English
1
3
5
1.1K
FaRo
FaRo@faroit·
@naotokui_en @csteinmetz1 Training is just a very small aspect of it. Lawyers will first go after the obvious things: the startups that can generate new Taylor Swift songs.
English
0
0
1
43
Nao Tokui
Nao Tokui@naotokui_en·
@csteinmetz1 It can be problematic because it’s completely legal to train AI models on copyrighted material in some countries (including Japan)
English
2
0
7
575
FaRo
FaRo@faroit·
@JonathanLeRoux @ethanmanilow For the challenge we take the mean across songs. But many papers also (rightfully) show the median across songs, as outliers can be dramatic.
English
1
0
1
128
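A minimal sketch of the mean-versus-median point in the reply above. The per-song scores are invented for illustration (they are not challenge results), and `aggregate_scores` is a hypothetical helper name:

```python
from statistics import mean, median

# Hypothetical per-song SDR scores in dB (made up for illustration).
# The last song is a deliberate outlier, e.g. a near-silent target stem.
per_song_sdr_db = [6.0, 7.0, 8.0, 7.0, -20.0]


def aggregate_scores(scores):
    """Return (mean, median) of a list of per-song scores."""
    return mean(scores), median(scores)


mean_sdr, median_sdr = aggregate_scores(per_song_sdr_db)
print(f"mean across songs:   {mean_sdr:.2f} dB")
print(f"median across songs: {median_sdr:.2f} dB")
```

With these made-up numbers a single bad song pulls the mean far below the typical score, while the median stays at the level of the other four songs.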
FaRo
FaRo@faroit·
Hey music separation researchers. We added a new definition of the SDR metric when we launched the last @sounddemix. To make it less confusing for future papers, we want to rename the metric. Please vote
English
3
1
8
1.9K
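The tweet above does not spell out the new SDR definition. As a rough sketch only, assuming it is a plain energy ratio computed once over the whole song (reference energy divided by error energy, in dB), it could look like this. The function name `global_sdr_db` and the `eps` value are hypothetical:

```python
import math


def global_sdr_db(reference, estimate, eps=1e-7):
    """Signal-to-distortion ratio in dB over a whole signal.

    Assumed form: 10 * log10(sum(s^2) / sum((s - s_hat)^2)).
    eps (hypothetical value) guards against division by zero
    and log of zero for silent references or perfect estimates.
    """
    if len(reference) != len(estimate):
        raise ValueError("reference and estimate must have the same length")
    signal_energy = sum(r * r for r in reference) + eps
    error_energy = sum((r - e) ** 2 for r, e in zip(reference, estimate)) + eps
    return 10.0 * math.log10(signal_energy / error_energy)


# Toy example: the estimate is the reference scaled by 0.9.
reference = [1.0, -1.0, 1.0, -1.0]
estimate = [0.9, -0.9, 0.9, -0.9]
print(f"{global_sdr_db(reference, estimate):.2f} dB")
```

A per-song score like this is what would then be averaged (or median-aggregated) across songs.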