Post

Diffio AI
Diffio AI@diffioai·
Word alignment error relative to SNR. See 🧵for details.
Diffio AI tweet media
English
3
2
4
56
Diffio AI
Diffio AI@diffioai·
- WhisperX github.com/m-bain/whisperX WhisperX performs forced alignment with an external phoneme/CTC aligner, typically a wav2vec2-based model, to align a known transcript to the waveform and recover word timestamps.
English
0
0
2
35
Chia sẻ