A novel automatic lip reading method based on polynomial fitting

Meng Li*, Yiu Ming CHEUNG

*Corresponding author for this work

Research output: Chapter in book/report/conference proceeding; Conference proceeding; peer-review

4 Citations (Scopus)

Abstract

This paper addresses the problem of isolated number recognition using visual information only. We use intensity transformation and spatial filtering to estimate the minimum enclosing rectangle of the mouth in each frame. For each utterance, we obtain two vectors composed of the width and height of the mouth, respectively. We then present a method to recognize the speech based on polynomial fitting. First, both the width and height vectors are normalized and resampled to a constant length via interpolation. Second, the least-squares method is used to produce two third-order polynomials that represent the main trend of the two vectors, respectively, and reduce the noise caused by estimation error. Last, the positions of three crucial points (i.e., the maximum, the minimum, and the right boundary point) in each third-order polynomial curve form a feature vector. For each utterance, we average the feature vectors of the training data to make a template, and use the Euclidean distance between the template and the testing data to perform classification. Experiments show promising results for the proposed approach in comparison with existing methods.
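
The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, the resampled length `n=32`, the min-max normalization, the grid-based search for the maximum and minimum, and the layout of the feature vector are all assumptions made for the example. Mouth detection is omitted, so the per-frame width and height sequences are taken as given.

```python
import numpy as np

def resample(v, n=32):
    """Min-max normalize a 1-D sequence and linearly interpolate it to n samples.
    (Normalization scheme and n are assumptions, not taken from the paper.)"""
    v = np.asarray(v, dtype=float)
    span = v.max() - v.min()
    v = (v - v.min()) / span if span > 0 else np.zeros_like(v)
    x_old = np.linspace(0.0, 1.0, len(v))
    x_new = np.linspace(0.0, 1.0, n)
    return x_new, np.interp(x_new, x_old, v)

def cubic_features(v, n=32):
    """Fit a least-squares cubic and return the (x, y) positions of its
    maximum, minimum, and right boundary point on the sampled grid."""
    x, y = resample(v, n)
    fitted = np.poly1d(np.polyfit(x, y, 3))(x)  # third-order least-squares fit
    i_max, i_min = fitted.argmax(), fitted.argmin()
    return np.array([x[i_max], fitted[i_max],
                     x[i_min], fitted[i_min],
                     x[-1], fitted[-1]])

def utterance_features(widths, heights, n=32):
    """Concatenate the features of the width and height curves."""
    return np.concatenate([cubic_features(widths, n), cubic_features(heights, n)])

def make_template(feature_vectors):
    """Average the training feature vectors of one class into a template."""
    return np.mean(np.stack(feature_vectors), axis=0)

def classify(features, templates):
    """Return the label whose template is nearest in Euclidean distance."""
    return min(templates, key=lambda label: np.linalg.norm(features - templates[label]))
```

In use, one template would be built per spoken number from its training utterances, and a test utterance would be assigned the label of the closest template. Fitting a low-order polynomial before extracting features smooths out frame-level estimation noise, which is the motivation the abstract gives for the approach.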

Original language: English
Title of host publication: Active Media Technology - 6th International Conference, AMT 2010, Proceedings
Pages: 296-305
Number of pages: 10
Publication status: Published - 2010
Event: 2010 6th International Conference on Active Media Technology, AMT 2010 - Toronto, ON, Canada
Duration: 28 Aug 2010 → 30 Aug 2010

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 6335 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 2010 6th International Conference on Active Media Technology, AMT 2010
Country/Territory: Canada
City: Toronto, ON
Period: 28/08/10 → 30/08/10

Scopus Subject Areas

  • Theoretical Computer Science
  • General Computer Science
