Hayato Futami

798 posts

@emonosuke

Research engineer at Sony, Speech and language AI. Views are my own.

Tokyo / Kyoto · Joined January 2022

610 Following · 405 Followers

Hayato Futami@emonosuke·5h

i was surprised my submission number was over 10k

Hayato Futami@emonosuke·6h

EMNLP done!

Hayato Futami@emonosuke·1d

EMNLP is tough

Hayato Futami@emonosuke·16 May

@nagohachi_base Nice find ☺️

なごはち@nagohachi_base·15 May

Futami senpai! ieeexplore.ieee.org/document/11515…

Hayato Futami@emonosuke·15 May

@Muramasa_2 Congratulations!! 🙌 Let's both keep doing our best!!

Muramasa@Muramasa_2·15 May

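On a personal note, we registered our marriage today 💍 I'll refocus and keep working hard at both my job and my research, so I look forward to your continued support!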

Hayato Futami@emonosuke·1 May

I wonder whether Kyutai's announcement was deliberately timed to overlap with Sakana's. I'm curious.

Hayato Futami retweeted

🐿️🐒🗻📚🐹🦈@SythonUK·28 Apr

The "let's use tons of web data for audio AI model development" camp VS the "never use web data for audio AI model development" camp VS Darkrai

Hayato Futami retweeted

Yui Sudo@yuisudo24·28 Apr

I’m excited to share that sarashina2.2-tts, a high-performance Japanese TTS model, has been released!

SB Intuitions@sbintuitions

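🚀 We have released sarashina2.2-tts! It is an LLM-based speech synthesis system specialized for Japanese ✨️ It achieves remarkably natural expressiveness and high reproducibility. 🇯🇵 High accuracy 🔊 Diverse expression 🌐 Japanese and English support ✨ Voice quality reproduction 👇 Details here huggingface.co/sbintuitions/s… #SBIntuitions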

Hayato Futami retweeted

🐿️🐒🗻📚🐹🦈@SythonUK·16 Apr

All speech research will be done by GDM. Pokémon Pokopia is all I can do

Hayato Futami retweeted

Yahoo!ニュース@YahooNewsTopics·12 Apr

[Domestic AI development company established with four companies at its core] news.yahoo.co.jp/pickup/6575967

Hayato Futami retweeted

Alexandr Wang@alexandr_wang·8 Apr

1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵

Hayato Futami@emonosuke·8 Apr

The age of myth

Hayato Futami retweeted

Shinji Watanabe@shinjiw_at_cmu·7 Apr

6 papers (4 main and 2 findings) were accepted at #ACL2026! All are speech papers :)

Hayato Futami retweeted

Taka@ElevenLabs@tkhr410·6 Apr

Lately, at events and in meetings, I have increasingly been asked, "Doesn't ElevenLabs have STS?" This note blog post is ElevenLabs' answer to that question. Please take a look if you are interested. note.com/taka_410/n/nfc…

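Hayato Futami retweeted

あゆ@aya172957·5 Apr

We have released a Japanese SpeechLLM based on LLM-JP-4, which received an award at an NLP2026 workshop! Both the chat model and the speech recognition model are released under a license that permits commercial use!!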

あゆ@aya172957

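At NLP2026's 2nd workshop on "Fine-Tuning Techniques and Evaluation of Large Language Models," our work "Development of a Japanese Speech LLM Using Synthetic Data" was awarded first place in the free-form task! Thank you to the organizers for providing large-scale computing resources and more!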

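Hayato Futami retweeted

mamita@chemical_tree·4 Apr

Since I recently started doing speech-related R&D at work, I agreed to review for a certain speech venue for the first time. Maybe I just got lucky, but many of the papers were real finds, interesting and educational. It may be the first time in a while that I have stayed just barely positive even while giving up my days off to review.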

Hayato Futami retweeted

Microsoft AI@MicrosoftAI·2 Apr

The most accurate model across 25 languages, faster transcription speeds, and stronger performance in real‑world noise. MAI‑Transcribe‑1 sets a new bar for speech recognition. Learn more + try it today: msft.it/6019QLa8B

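Hayato Futami retweeted

ぬこぬこ / NUKO 🇯🇵@nukonuko·2 Apr

Gemma 4: a multimodal model from Google DeepMind. Apache 2.0. Intelligence per parameter is higher than ever before. Four variants: Effective 2B, 4B, 26B MoE, and 31B. Supports image, video, and audio input. The context window is 128k to 256k. Supports more than 140 languages. Available on Hugging Face and elsewhere. blog.google/innovation-and…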

Hayato Futami retweeted

Eustache Le Bihan@eustachelb·2 Apr

HF audio team member here 👋🤗 Don’t want to be the party pooper here, but those look a little… questionable 🙊 Would love to be proven wrong though, @WillowVoiceAI what about adding the model to the leaderboard? BTW We’re working on private test sets for the Open ASR Leaderboard to address this type of question, but here, the model is the closest you can get (understandable ofc since your product is built on it)

Willow@WillowVoiceAI

Most models score 5-7% word error rate on clean audio. In real-world conditions they fall to 10-15%. Atlas 1 holds at 1.2% on clean audio and 2.1% in production. The gap widens in noisy environments.

Hayato Futami retweeted

AlphaCephei@alphacep·2 Apr

github.com/k2-fsa/OmniVoi…
