|

N.B. Yoma, C. Molina, J.
Silva, C. Busso, "Modeling, Estimating and Compensating
low-bit rate Coding Distortion in Speech Recognition,"
IEEE Transactions on Audio Speech and Language Processing,
vol 14, No 1, pp:223-231, January 2006.
Abstract
A solution to the problem of speech recognition
with signals distorted by low-bit rate coders is presented
in this paper. A model for the coding-decoding distortion,
a HMM compensation method to include this model, and an
EM-based adaptation algorithm to estimate this distortion
are proposed here. Medium vocabulary continuous-speech speaker-independent
recognition experiments with 8 kbps G.729(CS-CELP), 13 kbps
RPE-LTP (GSM), 5.3 kbps G723.1, 4.8 kbps FS-1016 and 32
kbps G.726(ADPCM) coders show that the approach described
in this paper is able to dramatically reduce the effect
of the coding distortion and, in some cases, gives a word
accuracy higher than the baseline system with uncoded speech.
Finally, the EM estimation algorithm requires only one adapting
utterance and the approach described is certainly suitable
for dialogue systems where just a few adapting utterances
are available.
Index
Terms:
Coding Distortion, EM estimation algorithm, HMM compensation,
low-bit rate coders,speech recognition.
File:
pdf
home
research
publications presentations
sail links
|