Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/15222

Title: Motion estimation analysis for unsupervised training for lip reading user authentication systems
Authors: Hanan A. Mahmoud; Khaled Alghathbar; Fahad Bin Muhaya
Keywords: lip reading, speech recognition, motion
Issue Date: 2009
Publisher: World Scientific and Engineering Academy and Society (WSEAS)
Abstract: This paper proposes a lip-reading technique for speech recognition based on motion estimation analysis. Motion estimation is performed on image sequences of lip movement that represent speech. In this methodology, the motion estimation is computed without extracting the speaker's lip contours or location, which yields robust visual features for the lip movements that represent utterances. Our methodology comprises two phases: a training phase and a recognition phase. In both phases, each n × n video frame of the image sequence for an utterance (an alphanumeric character, a word, or, in more complicated analysis, a sentence) is divided into m × m blocks. Our method calculates and fits eight curves for each frame, each representing the motion estimation of that frame in a specific direction. Together, these eight curves form the feature set of a frame and are extracted in an unsupervised manner. The feature set consists of the integral values of the motion estimation. These features are expected to be extremely effective in the training phase. The feature sets characterize specific utterances without any additional acoustic feature set. A corpus of utterances and their motion estimation features is built in the training phase. In the recognition phase, the feature set is extracted from a new image sequence of lip movement for an utterance and compared with the corpus using the mean square error metric.
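The last two steps of the abstract (integral features and mean square error matching against a corpus) can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the trapezoidal integration with unit sample spacing, the flat feature vector per utterance, and the nearest-match decision rule are all assumptions made for illustration; the abstract does not specify them.

```python
from typing import Dict, List, Sequence


def integral_features(curves: Sequence[Sequence[float]]) -> List[float]:
    """Return one integral value per sampled curve.

    Assumption: each curve is sampled at unit spacing and integrated
    with the trapezoidal rule; the abstract does not name a rule.
    """
    return [
        sum((a + b) / 2.0 for a, b in zip(curve, curve[1:]))
        for curve in curves
    ]


def mean_square_error(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean of squared element-wise differences between two feature vectors."""
    if len(a) != len(b) or not a:
        raise ValueError("feature vectors must be non-empty and of equal length")
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def recognize(features: Sequence[float], corpus: Dict[str, List[float]]) -> str:
    """Return the corpus label whose stored features have the lowest MSE.

    Assumption: recognition picks the single nearest corpus entry.
    """
    return min(corpus, key=lambda label: mean_square_error(features, corpus[label]))


if __name__ == "__main__":
    # Hypothetical corpus: one 8-value feature vector (one per direction) per utterance.
    corpus = {
        "utterance_a": [0.0] * 8,
        "utterance_b": [1.0] * 8,
    }
    query = [0.9] * 8
    print(recognize(query, corpus))
```

In this sketch a new utterance is assigned to whichever corpus entry it differs from least on average, so no acoustic features are involved, in line with the abstract.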
URI: http://hdl.handle.net/123456789/15222
Appears in Collections: College of Computer and Information Sciences

Files in This Item:

File: Dr. Hanan Mahmoud-3-conf.docx
Size: 13.98 kB
Format: Microsoft Word XML

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.