Dynamic Bayesian Networks for Audio-Visual Speech Recognition
The use of visual features in audio-visual speech recognition (AVSR) is justified by both the speech generation mechanism, which is essentially bimodal in audio and visual representation, and by the need for f...
Citation: EURASIP Journal on Advances in Signal Processing 2002 2002:783042