Volume 9 Number 7 (Jul. 2014)
JCP 2014 Vol.9(7): 1628-1638 ISSN: 1796-203X
doi: 10.4304/jcp.9.7.1628-1638

Analysis and Determination of Inner Lip Texture Descriptors for Visual Speech Representation

Xibin Jia1, Hua Du1, Yanfang Han1, David M W Powers1,2
1College of Computer Science, Beijing University of Technology, Beijing, China
2School of Computer Science, Engineering & Mathematics, Flinders University, Adelaide, Australia


Abstract—Visual speech representation for bimodal speech recognition poses particular challenges in modeling the inner lip texture, such as the appearance of the teeth and tongue, which reflects different pronunciations. This paper proposes and analyzes several candidate statistical inner lip texture descriptors in order to determine an effective and discriminative feature. Simply using grayscale without fully specifying the underlying color model tends to lose significant discriminative information. A thorough exploration of which color space components to use in computing the local inner lip texture is therefore a primary goal of the present research. The L channel of the Lab color space is ultimately selected as the basis for the inner lip texture model. The final classification of visual speech is performed through feature-level fusion of the proposed inner lip texture descriptor with standard geometric features. Together with audio speech, this work furthers the development of a CHMM-based bimodal Chinese character pronunciation recognition system. The experimental results show that local inner texture descriptors, such as the color moment combined with geometric features, outperform holistic inner texture descriptors, such as the statistical histogram, in representing visual speech, offering comparable discriminability at lower dimensionality.
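
As a rough illustration of the kind of descriptor the abstract refers to, the sketch below computes three color moments on the L channel of a set of inner lip pixels and concatenates them with geometric features. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the choice of mean, standard deviation and skewness as the moments, the sRGB/D65 conversion to L*, and the two example geometric values are all assumptions made for illustration, since the abstract does not specify them.

```python
import math

def srgb_to_lightness(r, g, b):
    """CIE L* (0..100) of one 8-bit sRGB pixel, assuming a D65 white point."""
    def linearize(v):
        # Undo the sRGB transfer function to obtain linear light.
        c = v / 255.0
        return c / 12.92 if c <= 0.04045 else ((c + 0.055) / 1.055) ** 2.4

    # Relative luminance Y from linear sRGB primaries.
    y = 0.2126 * linearize(r) + 0.7152 * linearize(g) + 0.0722 * linearize(b)
    delta = 6.0 / 29.0
    f = y ** (1.0 / 3.0) if y > delta ** 3 else y / (3 * delta ** 2) + 4.0 / 29.0
    return 116.0 * f - 16.0

def color_moments(values):
    """Mean, standard deviation and signed cube-root skewness of a channel."""
    n = len(values)
    mean = sum(values) / n
    second = sum((v - mean) ** 2 for v in values) / n
    third = sum((v - mean) ** 3 for v in values) / n
    skew = math.copysign(abs(third) ** (1.0 / 3.0), third)
    return [mean, math.sqrt(second), skew]

def fuse(texture, geometric):
    """Feature-level fusion by simple concatenation (an assumed scheme)."""
    return list(texture) + list(geometric)

# Hypothetical inner lip pixels (R, G, B) and hypothetical geometric
# features (for example, normalized mouth width and height).
inner_lip_pixels = [(200, 120, 110), (235, 230, 225), (90, 40, 45), (150, 80, 85)]
lightness = [srgb_to_lightness(r, g, b) for r, g, b in inner_lip_pixels]
feature_vector = fuse(color_moments(lightness), [0.42, 0.18])
print(len(feature_vector))  # three texture moments plus two geometric values
```

The low dimensionality mentioned in the abstract is visible here: a moment-based descriptor contributes only a few numbers per channel, whereas a histogram descriptor contributes one value per bin.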

Index Terms—inner lip texture descriptor, local feature, feature fusion, visual speech representation

Cite: Xibin Jia, Hua Du, Yanfang Han, David M W Powers, "Analysis and Determination of Inner Lip Texture Descriptors for Visual Speech Representation," Journal of Computers, vol. 9, no. 7, pp. 1628-1638, 2014.
