
Zhizheng Wu

















💥 Video generation is booming lately 🤤! 🤩 Try our 🎙️ 𝐅𝐨𝐥𝐞𝐲𝐂𝐫𝐚𝐟𝐭𝐞𝐫 to add sound effects. 🌟 Homepage: foleycrafter.github.io ❤️ Thanks to all the co-authors: Yicheng Gu @zengyh1900 @LeoXing8 Yuancheng Wang @drwuz @kaichen100 #FoleyCrafter #AudioGeneration #AISoundEffect


🚀 Excited to share SD-Eval, a benchmark dataset that can help LLMs understand speech better than ChatGPT 4o! It covers the understanding of emotions, accents, age, and background environmental sounds. Paper: arxiv.org/abs/2406.13340 GitHub: github.com/amphionspace/S…

Amphion now supports FACodec, the core component of NaturalSpeech3, and the pretrained checkpoints have been released. Paper: arxiv.org/abs/2403.03100 Checkpoints: huggingface.co/amphion/natura… Demo: huggingface.co/spaces/amphion… Code: github.com/open-mmlab/Amp… @xutan_tx @yuancwang





