lester violeta

312 posts

lester violeta

@lesterphv

teaching computers to speak || research scientist @dubguild || phd @nagoyauniv

republika ng pilipinas Katılım Temmuz 2020

670 Takip Edilen213 Takipçiler

Sabitlenmiş Tweet

lester violeta@lesterphv·19 Mar

The first SVCC 2025 baseline system is now out! 🥳 We introduce Serenade: A Singing Style Conversion Framework Based On Audio Infilling. This preliminary investigation covers the main difficulties of singing style conversion (SSC) and details our findings.

English

3.4K

lester violeta retweetledi

Wen-Chin Huang@unilightwf·3d

🔥Announcing the VoiceMOS Challenge 2026🔥 We are REVERTING BACK to the VoiceMOS Challenge, as there are still unsolved problems in the evaluation of speech itself 🤨 Pre-registration: forms.gle/L6YdkUf1PJdSSw… Website: sites.google.com/view/voicemos-… See the tracks in the thread!

English

2.7K

lester violeta@lesterphv·6d

@Muramasa_2 @ricepamo 返信ありがとうございます！！内部の比較では、同じ5万時間なら1Bより3Bの方が安定していてパラ言語表現は出やすい傾向がありました。今後もスケーリング方向性は見ていく予定なので、次回の8B検証では、実験設定についてももう少し丁寧に評価したいと思います。

日本語

131

Muramasa@Muramasa_2·6d

@lesterphv @ricepamo これ興味深く見させていただいたんですが, モデルサイズスケーリングが効いてるのかデータスケーリングが効いてるのかは結局わからない気もして, 色々試している感じどちらも効いてるのでしょうか？ (実況風スタイルとか読みが安定する, とかもデータが増えれば1Bでも同じように安定する気もしたり)

日本語

255

lester violeta@lesterphv·12 Nis

TTSモデルを1Bから3Bまでスケールさせました！よかったら読んでみてね〜

DubGuild@DubGuild

テックブログを公開しました。「Scaling Speech AI」の下、1Bから3Bへと音声言語モデルをスケールさせた際のTTS性能へ影響を検証しました。日本語特有の読みや表記揺れ、表現の広がりがみられるに加え、現状の課題についても整理しています。日本語音声生成・SpeechLM・TTSに関心のある方はぜひご覧ください。 blog.dubguild.com/melte/llm-tts-… 1B/3Bモデルの構築にあたって実施した、データ前処理・事前学習・事後学習の詳細も、今後順次公開していく予定です。続報もお待ちいただければ幸いです。

日本語

4.1K

lester violeta retweetledi

DubGuild@DubGuild·12 Nis

会社HPを更新しました。株式会社DubGuildでは、「Scaling Speech AI」を掲げ、大規模音声言語基盤モデルの開発に取り組んでいます。会社ページはこちらから→dubguild.com ソフトバンク様の支援プログラム「AIFS」に関するプレスリリースはこちら→prtimes.jp/main/html/rd/p…

日本語

1.1K

lester violeta retweetledi

reo yoneyama@ricepamo·1 Nis

本日からお世話になります！有難いことに博士卒でも研修受けさせてくれるので、暫くは足りないスキルをしっかり修行してから本配属で活躍できるよう頑張ります💪

日本語

122

6.2K

lester violeta retweetledi

took@wataru9871·28 Mar

🚀 Thrilled to release DialogueSidon! Our joint two-speaker dialogue restoration & separation model. Need full-duplex dialogue data for models like Moshi or PersonaPlex? We've got you covered. 🎙️ 👇 Try it with your own samples! hf.co/spaces/sarulab…

English

8.8K

lester violeta@lesterphv·26 Mar

@erica_cooper Thank you Erica!!

English

erica@erica_cooper·26 Mar

@lesterphv Congrats!!!

English

lester violeta@lesterphv·25 Mar

本日、名古屋大学にて博士（情報学）の学位を取得しました！この5年間、支えてくださった皆さまに心より感謝します。卒業後はリサーチエンジニアとして働く予定なので、これからもニューラル音声モデルの研究開発に取り組んでいきたいと思います！！！！

日本語

8.1K

lester violeta@lesterphv·26 Mar

@heiga_zen Thanks a lot Zen-san!

English

Heiga Zen (全炳河)@heiga_zen·26 Mar

@lesterphv Congrats!

English

468

lester violeta@lesterphv·26 Mar

@unilightwf この5年間、ご指導いただき本当にありがとうございました。 WenChin先輩には、実験にいつも真剣に向き合っていただき、そのおかげで研究者として大きく成長できたと思っています。先輩と一緒に研究してこなければ、今の自分はなかったと思います。これから頑張っていきます！💪

日本語

Wen-Chin Huang@unilightwf·25 Mar

Lesterは，私は初めて深く関わった後輩だ。この5年間の成長本当にとんでもなくて，私の言うことすごく反論してくる時期もあった（笑）でもボスに「彼が一人前の研究者になった証だね」って言われて，ほっこりした。😊 今後も活躍してくれるでしょ。ほんとにおめでとうございます！

lester violeta@lesterphv

日本語

2.3K

lester violeta@lesterphv·25 Mar

@Muramasa_2 ありがとうございます！！ぜひぜひ飲みに行きましょう〜〜！

日本語

107

Muramasa@Muramasa_2·25 Mar

@lesterphv 卒業おめでとうございます🎉 また機会あれば飲みでもいきましょう！

日本語

187

lester violeta@lesterphv·25 Mar

@kwmrtko ありがとう！！河村くんもおめでとう🙌

日本語

138

Takao Kawamura@kwmrtko·25 Mar

@lesterphv おめでとう🙌🙌

日本語

163

lester violeta retweetledi

Shinnosuke Takamichi / 高道慎之介@forthshinji·17 Mar

日本語初の構音障害音声コーパスです！みなさん使って下さいね！ (音響学会で私が明日発表します)

ワッシー@kwashizzz

huggingface.co/datasets/JDSC-…

日本語

7.1K

lester violeta retweetledi

Keisuke Kamahori@KeisukeKamahori·4 Şub

Excited to share our new preprint on VoxServe, a serving system for Speech Language Models 🎙️⚡️ Speech model serving is challenging: complex pipelines, diverse architectures, and strict real-time requirements for low-latency + streamable inference. (1/2)

Baris Kasikci@bariskasikci

🚀🎙️ New preprint: VoxServe — a streaming-first serving system for SpeechLMs. VoxServe delivers blazing-fast, high-throughput model inference for real-time Text-to-Speech / Speech-to-Speech applications. 🔗 arxiv.org/abs/2602.00269 💻 github.com/vox-serve/vox-…

English

7.6K

lester violeta@lesterphv·18 Oca

Thanks to my co-authors for all their help! @xueyao_98 @jiatongshi @unilightwf @drwuz + Prof Yasuda & Prof Toda

English

166

lester violeta@lesterphv·18 Oca

Our paper discussing the SVCC 2025 summary has been accepted to ICASSP 2026! 🥳 Check it out here: arxiv.org/abs/2509.15629 We're still working on an extension journal paper that covers more details about SVCC, so stay tuned 😄

English

1.4K

lester violeta retweetledi

Takuya Fujimura@i15fujimura1t·14 Eki

今年もCEATEC

日本語

1.9K

lester violeta@lesterphv·22 Eyl

Special thanks as well to the organizing team for all their efforts in organizing this event! @xueyao_98 @jiatongshi @/Prof Yasuda @unilightwf @drwuz @/Prof Toda

English

129

lester violeta@lesterphv·22 Eyl

For now, try it out yourself! We release the dataset on HuggingFace, including test sets from submissions of different participants. This helps a lot in being able to compare your system with the systems described in the paper! huggingface.co/datasets/leste…

English

115

lester violeta@lesterphv·22 Eyl

Results from the Singing Voice Conversion Challenge 2025 (SVCC2025) are now out! 🥳🎶 We extend beyond identity conversion to singing style conversion (SSC), a harder problem involving control on both static + dynamic voice features. Check it out here: arxiv.org/abs/2509.15629

English

6.2K

Keşfet

@Muramasa_2 @ricepamo @erica_cooper @heiga_zen @unilightwf @kwmrtko @elonmusk @BarackObama