顔領域の違いによる読話認識性能比較

池田, 大輔; 桂田, 浩一; 入部, 百合絵; 新田, 恒雄; Daisuke, Ikeda; Kouichi, Katsurada; Yurie, Iribe; Tsuneo, Nitta

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

顔領域の違いによる読話認識性能比較

https://ipsj.ixsq.nii.ac.jp/records/79361

名前 / ファイル	ライセンス	アクション
IPSJ-SLP11089018.pdf (771.6 kB)	Copyright (c) 2011 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2011-12-12

タイトル

顔領域の違いによる読話認識性能比較

タイトル

言語

タイトル

Comparison of Lipreading Recognition Using Different Facial Regions

言語

jpn

キーワード

主題Scheme

Other

主題

ポスターセッション

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

豊橋技術科学大学

著者所属

豊橋技術科学大学

著者所属

豊橋技術科学大学

著者所属

豊橋技術科学大学

著者所属(英)

Toyohashi University of Technology

著者所属(英)

Toyohashi University of Technology

著者所属(英)

Toyohashi University of Technology

著者所属(英)

Toyohashi University of Technology

著者名

池田, 大輔

著者名(英)

Daisuke, Ikeda

論文抄録

内容記述タイプ

Other

内容記述

読話とは口の動きや形状を読み取り発話内容を理解することである．従来の読話の研究の多くは口唇領域に対して行われてきた．しかし，発話する音によっては口の動作が大きく周辺の皺や顎の形状の変化が大きい音や，口の動作が小さい音がある．そこで本論文では (A) 顔全体，(B) 口周辺，(C) 口唇領域の 3 つの領域を用いて単語認識，母音・子音認識を行った．実験の結果，母音の認識は顔全体領域が最も高い性能を示し，一方で子音の/r/や/s/は口唇領域が最も高い値を示すことが分かった．

論文抄録(英)

内容記述タイプ

Other

内容記述

Lipreading is the technique to recognize speaker 's utterances from the motion with changing shape of the mouth. Although most of previous approaches to lipreading focus on the limited region of the mouth, utterances of some phonemes often accompanying with the motion of surrounding areas together with the mouth. We have compared three regions, (A) entire face region, (B) mouth and adjacent region, and (C) mouth region, based on these facts. Experimental results of word recognition and vowel/consonant recognition show that vowel recognition using the entire face region results in the highest performance, while the mouth region outputs the best performance for recognizing consonants ‘s’ and ‘r’.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

研究報告音声言語情報処理（SLP）

巻 2011-SLP-89, 号 18, p. 1-6, 発行日 2011-12-12

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-21 20:15:50.036784

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

顔領域の違いによる読話認識性能比較

× 池田, 大輔

× Daisuke, Ikeda

Versions

Share

Cite as

エクスポート