Dynamic Bayesian Networks for Audio-Visual Speech Recognition
The use of visual features in audio-visual speech recognition (AVSR) is justified both by the speech generation mechanism, which is essentially bimodal in audio and visual representation, and by the need for f...