Statistical Lip-Appearance Models Trained Automatically Using Audio Information

Daubias, Philippe; Deléglise, Paul

doi:10.1155/S1110865702206186

Research Article
Published: 28 November 2002

Statistical Lip-Appearance Models Trained Automatically Using Audio Information

Philippe Daubias^1,2 &
Paul Deléglise¹

EURASIP Journal on Advances in Signal Processing volume 2002, Article number: 720534 (2002) Cite this article

1131 Accesses
7 Citations
Metrics details

Abstract

We aim at modeling the appearance of the lower face region to assist visual feature extraction for audio-visual speech processing applications. In this paper, we present a neural network based statistical appearance model of the lips which classifies pixels as belonging to the lips, skin, or inner mouth classes. This model requires labeled examples to be trained, and we propose to label images automatically by employing a lip-shape model and a red-hue energy function. To improve the performance of lip-tracking, we propose to use blue marked-up image sequences of the same subject uttering the identical sentences as natural nonmarked-up ones. The easily extracted lip shapes from blue images are then mapped to the natural ones using acoustic information. The lip-shape estimates obtained simplify lip-tracking on the natural images, as they reduce the parameter space dimensionality in the red-hue energy minimization, thus yielding better contour shape and location estimates. We applied the proposed method to a small audio-visual database of three subjects, achieving errors in pixel classification around 6%, compared to 3% for hand-placed contours and 20% for filtered red-hue.

Author information

Authors and Affiliations

Laboratoire d'Informatique de l'Université du Maine (LIUM), Institut d'Informatique Claude Chappe, Le Mans, Cedex 9 F-72085, France
Philippe Daubias & Paul Deléglise
Laboratoire d'Informatique Graphique Image et Modélisation (LIGIM), Bâtiment 710, 8, bd Niels Bohr, Villeurbanne, Cedex F-69622, France
Philippe Daubias

Authors

Philippe Daubias
View author publications
You can also search for this author in PubMed Google Scholar
Paul Deléglise
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Daubias, P., Deléglise, P. Statistical Lip-Appearance Models Trained Automatically Using Audio Information. EURASIP J. Adv. Signal Process. 2002, 720534 (2002). https://doi.org/10.1155/S1110865702206186

Download citation

Received: 01 November 2001
Revised: 19 June 2002
Published: 28 November 2002
DOI: https://doi.org/10.1155/S1110865702206186

Statistical Lip-Appearance Models Trained Automatically Using Audio Information

Abstract

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords