ウエーブレットパケット分解による残響に頑健な音声認識

ゴメス・ランディ; 河原, 達也; Randy, Gomez; Tatsuya, Kawahara

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

ウエーブレットパケット分解による残響に頑健な音声認識

https://ipsj.ixsq.nii.ac.jp/records/82942

名前 / ファイル	ライセンス	アクション
IPSJ-SLP12092011.pdf (581.4 kB)	Copyright (c) 2012 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2012-07-12

タイトル

ウエーブレットパケット分解による残響に頑健な音声認識

タイトル

言語

タイトル

Wavelet Packet Decomposition Approach to Reverberant Speech Recognition

言語

eng

キーワード

主題Scheme

Other

主題

雑音下音声認識

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

京都大学学術情報メディアセンター

著者所属

京都大学学術情報メディアセンター

著者所属(英)

Academic Center for Computing and Media Studies (ACCMS), Kyoto University.

著者所属(英)

Academic Center for Computing and Media Studies (ACCMS), Kyoto University.

著者名

ゴメス・ランディ
河原, 達也

著者名(英)

Randy, Gomez
Tatsuya, Kawahara

論文抄録

内容記述タイプ

Other

内容記述

頑健な音声認識のための残響抑圧を目的として、複数の分解能からなるウエーブレット分析の手法を述べる。提案するウエーブレットパケット分解では、遅い残響成分と音声の成分を効果的に分離するように、各々の分解能を設定する。これにより、各々に適切なウエーブレット基底を用いることで、観測された残響のある信号から効果的なウイナーゲインを計算することができる。残響抑圧は、ウエーブレットパケットの係数をウイーナゲインでフィルタすることで行われる。大語彙連続音声認識(JNASタスク)の評価実験において、提案手法はウエーブレット分析に基づく従来法や他の残響抑圧手法と比べて、高い性能を示した。

論文抄録(英)

内容記述タイプ

Other

内容記述

This paper describes a multiple-resolution signal analysis to suppress late reflection of reverberation for robust automatic speech recognition (ASR). Wavelet packet tree (WPT) decomposition offers a finer resolution to discriminate the late reflection subspace from the speech subspace. By selecting appropriate wavelet basis in the WPT for speech and late reflection, we can effectively estimate the Wiener gain directly from the observed reverberant data. Moreover, the selection procedure is performed in accordance with the likelihood of acoustic model used by the speech recognizer. Dereverberation is realized by filtering the wavelet packet coefficients with the Wiener gain to suppress the effects of the late reflection. Experimental evaluations with large vocabulary continuous speech recognition (LVCSR) in real reverberant conditions show that the proposed method outperforms conventional wavelet-based methods and other dereverberation techniques.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

研究報告音声言語情報処理（SLP）

巻 2012-SLP-92, 号 11, p. 1-6, 発行日 2012-07-12

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-21 18:46:52.607716

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

ウエーブレットパケット分解による残響に頑健な音声認識

× ゴメス・ランディ

× 河原, 達也

× Randy, Gomez

× Tatsuya, Kawahara

Versions

Share

Cite as

エクスポート