音声認識のための複数の認識器を利用した能動学習

濱中, 悠三; 江森, 正; 越仲, 孝文; 篠田, 浩一; 古井, 貞熙; Yuzo, Hamanaka; Tadashi, Emori; Takafumi, Koshinaka; Koichi, Shinoda; Sadaoki, Furui

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

音声認識のための複数の認識器を利用した能動学習

https://ipsj.ixsq.nii.ac.jp/records/67046

名前 / ファイル	ライセンス	アクション
IPSJ-SLP09079004.pdf (127.8 kB)	Copyright (c) 2009 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2009-12-14

タイトル

音声認識のための複数の認識器を利用した能動学習

タイトル

言語

タイトル

Active learning using multiple recognizers for speech recognition

言語

jpn

キーワード

主題Scheme

Other

主題

【Session-1 音響モデル】

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

東京工業大学

著者所属

株式会社NEC情報システムズ

著者所属

東京工業大学／日本電気株式会社

著者所属

東京工業大学

著者所属

東京工業大学

著者所属(英)

Tokyo Institute of Technology

著者所属(英)

NEC Informatec Systems, Ltd.

著者所属(英)

Tokyo Institute of Technology / NEC Corporation

著者所属(英)

Tokyo Institute of Technology

著者所属(英)

Tokyo Institute of Technology

著者名

濱中, 悠三江森, 正越仲, 孝文篠田, 浩一古井, 貞熙

著者名(英)

Yuzo, Hamanaka Tadashi, Emori Takafumi, Koshinaka Koichi, Shinoda Sadaoki, Furui

論文抄録

内容記述タイプ

Other

内容記述

大語彙連続音声認識器の学習データに対する書き起こしコスト削減のための複数の認識器を利用した能動学習手法を提案する．この手法では複数の認識器から得られた複数の異なる認識結果文を用いて発話の選択を行う．認識結果文をアラインメントするためのプログレッシブ法と Voting Entropy を発話選択に用いている．提案手法を日本語話し言葉コーパスの 190 時間の音声データを使い評価し，能動学習を行わないランダムな発話選択より顕著に良い結果を得た．74% の単語正解精度を得るために必要な書き起こし付きデータ量はランダム選択では 97 時間，単語事後確率を用いた従来手法では 72 時間であるが，提案手法では 60 時間で済むという結果になった．

論文抄録(英)

内容記述タイプ

Other

内容記述

We propose an active learning method with multiple recognizers for large vocabulary continuous speech recognition. In this approach, the recognition results obtained from recognizers are used for selecting utterances. Here, a progressive search method is used for aligning sentences, and voting entropy is used as a measure for selecting utterances. Our method was evaluated by using 190-hour speech data in the Corpus of Spontaneous Japanese. It proved to be significantly better than random selection. It only required 60 h of data to achieve a word accuracy of 74%, while standard training (i.e., random selection) required 97 h of data. The recognition accuracy of our proposed method was also better than that of the conventional uncertainty sampling method using word posterior probabilities as the confidence measures for selecting sentences.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

音声言語情報処理（SLP）

巻 2009-SLP-79, 号 4, p. 1-5, 発行日 2009-12-14

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-22 00:44:33.262469

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

音声認識のための複数の認識器を利用した能動学習

× 濱中, 悠三江森, 正越仲, 孝文篠田, 浩一古井, 貞熙

× Yuzo, Hamanaka Tadashi, Emori Takafumi, Koshinaka Koichi, Shinoda Sadaoki, Furui

Versions

Share

Cite as

エクスポート

インデックスリンク

インデックスツリー

アイテム

音声認識のための複数の認識器を利用した能動学習

× 濱中, 悠三 江森, 正 越仲, 孝文 篠田, 浩一 古井, 貞熙

× Yuzo, Hamanaka Tadashi, Emori Takafumi, Koshinaka Koichi, Shinoda Sadaoki, Furui

Versions

Share

Cite as

エクスポート

× 濱中, 悠三江森, 正越仲, 孝文篠田, 浩一古井, 貞熙