A Proposal of Mouth Shapes Sequence Code for Japanese Pronunciation

Miyazaki, Tsuyoshi; Nakashima, Toyoshiro; Ishii, Naohiro

doi:10.1007/978-3-642-22288-7_5

A Proposal of Mouth Shapes Sequence Code for Japanese Pronunciation

Tsuyoshi Miyazaki³,
Toyoshiro Nakashima⁴ &
Naohiro Ishii⁵

Conference paper

824 Accesses
2 Citations

Part of the book series: Studies in Computational Intelligence ((SCI,volume 368))

Abstract

In this paper, we examine a method in which distinctive mouth shapes are processed using a computer.When lip-reading skill holders do lip-reading, they stare at the changes in mouth shape of a speaker. In recent years, some researches into lip-reading using information technology has been pursued. There are some researches based on the changes in mouth shape. The researchers analyze all data of the mouth shapes during an utterance, whereas lip-reading skill holders look at distinctive mouth shapes. We found that there was a high possibility for lip-reading by using the distinctive mouth shapes. To build the technique into a lip-reading system, we propose an expression method of the distinctive mouth shapes which can be processed using a computer. In this way, we acquire knowledge about the relation between Japanese phones and mouth shapes. We also propose a method to express order of the distinctive mouth shapes which are formed by a speaker.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Kenji, M., Pentland, A.: Automatic Lip-reading by Optical-flow Analysis. The Transactions of the Institute of Electronics, Information and Communication Engineers J73-D-II(6), 796–803 (1990) (in Japanese)
Google Scholar
Takashi, O., Teruhiko, O.: Automatic Lipreading of Station Names Using Optical Flow and HMM. Technical Report of IEICE PRMU 102(471), 25–30 (2002) (in Japanese)
Google Scholar
Mang, L., Issei, Y., Yoshihiro, K., Hidemitsu, O.: Automatic Lipreading by Subspace Method. Technical Report of IEICE PRMU 97(251), 9–14 (1997) (in Japanese)
Google Scholar
Takeshi, S., Ryosuke, K.: Lip Reading Based on Trajectory Feature. The IEICE Transactions on Information and Systems (Japanese edition) J90-D(4), 1105–1114 (2007) (in Japanese)
Google Scholar
Kimiyasu, K., Keiichi, U.: An Utered Word Recognition Using Lip Image Information. The Transactions of the Institute of Electronics, Information and Communication Engineers J76-D-II(3), 812–814 (1993) (in Japanese)
Google Scholar
Akihiro, O., Yoshitaka, H., Kenji, O., Toshihiko, M.: Speech Recognition Based on Integration of Visual and Auditory Information. Transactions of Information Processing Society of Japan 39(12), 3232–3241 (1998) (in Japanese)
Google Scholar
Yasuyuki, N., Moritoshi, A.: Lipreading Method Using Color Extraction Method and Eigenspace Technique. The Transactions of the Institute of Electronics, Information and Communication Engineers J85-D-II(12), 1813–1822 (2002) (in Japanese)
Google Scholar

Download references

Author information

Authors and Affiliations

Kanagawa Institute of Technology, 1030 Shimo-ogino, Atsugi, Kanagawa, Japan
Tsuyoshi Miyazaki
Sugiyama Jogakuen University, 17-3 Hoshigaoka-motomachi, Chikusa, Nagoya, Aichi, Japan
Toyoshiro Nakashima
Aichi Institute of Technology, 1247 Yachigusa, Yakusa, Toyota, Aichi, Japan
Naohiro Ishii

Authors

Tsuyoshi Miyazaki
View author publications
You can also search for this author in PubMed Google Scholar
Toyoshiro Nakashima
View author publications
You can also search for this author in PubMed Google Scholar
Naohiro Ishii
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Software Engineering & Information Technology Institute, Central Michigan University, 48859, Mt. Pleasant, MI, U.S.A.
Roger Lee

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Miyazaki, T., Nakashima, T., Ishii, N. (2011). A Proposal of Mouth Shapes Sequence Code for Japanese Pronunciation. In: Lee, R. (eds) Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing 2011. Studies in Computational Intelligence, vol 368. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22288-7_5

Download citation

DOI: https://doi.org/10.1007/978-3-642-22288-7_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22287-0
Online ISBN: 978-3-642-22288-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics