Robust Speech Recognition Using Optimized Wavelet Denoising with Noise Profiles

Randy Gomez; Tatsuya Kawahara

https://ipsj.ixsq.nii.ac.jp/records/72658

Name / File	License
IPSJ-SLP11085012.pdf (198.5 kB)	Copyright (c) 2011 by the Information Processing Society of Japan
Open access

Item type

SIG Technical Reports(1)

Publication date

2011-01-28

Title (Japanese)

ウエーブレットの最適化と雑音プロファイルを用いた雑音抑圧による頑健な音声認識

Title (English)

Robust Speech Recognition Using Optimized Wavelet Denoising with Noise Profiles

Language

eng

Keywords

Subject scheme

Other

Subject

Speech recognition

Resource type

Resource type identifier

http://purl.org/coar/resource_type/c_18gh

Resource type

technical report

Author affiliation

Academic Center for Computing and Media Studies (ACCMS), Kyoto University

Authors

Randy Gomez
Tatsuya Kawahara

Abstract

Description type

Other

Description

This paper improves wavelet-based denoising for automatic speech recognition (ASR) by using noise profiles. During training, the wavelet parameters are optimized for speech and for each of several noise profiles, which gives a more accurate estimate of the Wiener gain used for filtering. During recognition, the noise profile of the input is identified and the noisy wavelet coefficients are filtered with the corresponding Wiener gain. Scale factors are also applied to the Wiener gain during decoding to compensate for the mismatch caused by distortion introduced by the denoising process. The experimental evaluation compares the method with an existing wavelet-based approach and tests its robustness under different noise conditions.
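The decoding steps described in the abstract (identify a noise profile, then filter the noisy wavelet coefficients with a scaled Wiener gain) can be illustrated with a minimal sketch. This is not the authors' implementation. Everything specific here is an assumption made for illustration: a fixed one-level Haar wavelet instead of optimized wavelet parameters, a per-coefficient gain S/(S+N) with the speech power estimated by subtracting the profile's noise variance, profile selection by nearest per-band noise variance, and the hypothetical names `wiener_gain`, `identify_profile`, `denoise`, and the `profiles` dictionary.

```python
import math

SQRT2 = math.sqrt(2.0)

def haar_dwt(x):
    """One-level Haar DWT of an even-length sequence -> (approx, detail)."""
    approx = [(x[i] + x[i + 1]) / SQRT2 for i in range(0, len(x) - 1, 2)]
    detail = [(x[i] - x[i + 1]) / SQRT2 for i in range(0, len(x) - 1, 2)]
    return approx, detail

def haar_idwt(approx, detail):
    """Inverse of haar_dwt."""
    x = []
    for a, d in zip(approx, detail):
        x.extend([(a + d) / SQRT2, (a - d) / SQRT2])
    return x

def wiener_gain(noisy_coeff, noise_var, scale=1.0):
    """Scaled Wiener gain scale * S / (S + N) for one wavelet coefficient.

    Assumption: speech power S is estimated as max(y^2 - N, 0).
    """
    speech_pow = max(noisy_coeff * noisy_coeff - noise_var, 0.0)
    denom = speech_pow + noise_var
    gain = speech_pow / denom if denom > 0.0 else 1.0
    return scale * gain

def band_power(coeffs):
    """Mean squared value of the coefficients in one subband."""
    return sum(c * c for c in coeffs) / len(coeffs)

def identify_profile(noise_segment, profiles):
    """Return the name of the profile whose per-band noise variances are
    closest (squared distance) to those measured on a noise-only segment."""
    approx, detail = haar_dwt(noise_segment)
    measured = (band_power(approx), band_power(detail))

    def distance(name):
        stored = profiles[name]["noise_var"]
        return sum((m - v) ** 2 for m, v in zip(measured, stored))

    return min(profiles, key=distance)

def denoise(noisy, profile):
    """Filter each subband with the profile's scaled Wiener gain."""
    approx, detail = haar_dwt(noisy)
    var_a, var_d = profile["noise_var"]
    scale = profile["scale"]
    approx = [wiener_gain(c, var_a, scale) * c for c in approx]
    detail = [wiener_gain(c, var_d, scale) * c for c in detail]
    return haar_idwt(approx, detail)

# Hypothetical profiles: (approx, detail) noise variances and a scale factor.
profiles = {
    "quiet": {"noise_var": (0.0001, 0.0001), "scale": 1.0},
    "loud": {"noise_var": (0.04, 0.04), "scale": 0.9},
}

name = identify_profile([0.01, -0.01, 0.01, 0.01], profiles)
cleaned = denoise([0.5, 0.4, -0.3, -0.35], profiles[name])
```

In this sketch the scale factor simply multiplies the gain; how the scale factors are actually estimated and applied is described in the report itself, not here.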

Bibliographic record ID

Source identifier type

NCID

Source identifier

AN10442647

Bibliographic information

SIG Technical Reports: Spoken Language Processing (SLP)

Vol. 2011-SLP-85, No. 12, pp. 1-6, issued 2011-01-28

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

Publisher

Information Processing Society of Japan
