Word alignment error relative to SNR. See 🧵for details.
Diffio AI@diffioai·25 MarWord alignment error relative to SNR. See 🧵for details.Traduire English32456
Diffio AI@diffioai·25 Mar- WhisperX github.com/m-bain/whisperX WhisperX performs forced alignment with an external phoneme/CTC aligner, typically a wav2vec2-based model, to align a known transcript to the waveform and recover word timestamps.Traduire English00235