Silence detection and vowel/consonant discrimination in video sequences

title: Silence detection and vowel/consonant discrimination in video sequences
author(s): Jacek C. Wojdel and Leon J.M. Rothkrantz
published in: December 2000
appeared in: Proceedings of the Eight Australian International Conference on Speech Science and Technology, Canberra, Australia
pages: 104-109
PostScript (132 KB)

Abstract

In this paper we present a set of experiments that were aimed at investigation of feasibility of using artificial neural networks (ANNs) in a lip-reading task. We present here the method for data extraction that is applied on video sequences containing lower half of the face of speaking subject. Further the data is used to evaluate the performance of ANNs in a task of classifying the frames in the video stream into three possible classes: vowel, consonant or silence.

 
blue line
University logo