離散混合出力分布型HMMを用いた非定常雑音下の音声認識

小坂, 哲夫; 加藤正治; 好田, 正紀; Tetsuo, Kosaka; Masaharu, Katoh; Masaki, Kohda

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

離散混合出力分布型HMMを用いた非定常雑音下の音声認識

https://ipsj.ixsq.nii.ac.jp/records/57170

名前 / ファイル	ライセンス	アクション
IPSJ-SLP03049020.pdf (557.5 kB)	Copyright (c) 2003 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2003-12-18

タイトル

離散混合出力分布型HMMを用いた非定常雑音下の音声認識

タイトル

言語

タイトル

Speech Recognition Under Non - Stationary Noise Conditions using Discrete - Mixture HMMs

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

山形大学工学部

著者所属

山形大学工学部

著者所属

山形大学工学部

著者所属(英)

Faculty of Engineering, Yamagata University

著者所属(英)

Faculty of Engineering, Yamagata University

著者所属(英)

Faculty of Engineering, Yamagata University

著者名

小坂, 哲夫加藤正治好田, 正紀

著者名(英)

Tetsuo, Kosaka Masaharu, Katoh Masaki, Kohda

論文抄録

内容記述タイプ

Other

内容記述

従来，定常的な雑音への対処としてスペクトルサブトラクションやPMCなどの種々の手法が提案され一定の成果を収めている．一方非定常雑音はスペクトルの特徴のバリエーションが大きく，大量のデータを集めてモデル化しにくいという問題がある．そのため，未知雑音，非定常雑音への対処は重要な課題の一つとなっている．本研究では，離散混合出力型HMM(DMHMM)を用い，未知の非定常雑音が重畳した音声に対するロバストな音声認識手法を提案する．我々はこれまでMAP推定によるDMHMMを用いた雑音下音声認識の検討を行い，その有効性を示してきた．本研究では，missing feature theory(MFT)に基づくDMHMMの離散分布の補正法を提案する．未知の非定常雑音が重畳すると，音響モデルに学習されない特徴が現れるが，閾値を設けることによりDMHMMの離散分布の補正を行うことにより対処する．提案手法の有効性を示すため，従来法である混合連続分布型HMM(CHMM)およびCHMMへMFTを適用した手法の一つであるacoustic backing-off法と比較実験を行った．その結果3手法の中で最良の結果が得られ，またベースラインと比較して26.0％の認識率の向上を得た．

論文抄録(英)

内容記述タイプ

Other

内容記述

Several promising methods of robust speech recognition in stationary noise environments, e.g., spectral subtraction and PMC, have been proposed. Meanwhile, it is difficult to collect a large amount of non-stationary noise for model training because there are many types of non-stationary noise. As stated above, speech recognition under unknown and non stationary noise environments is one of the most difficult issues. This paper introduces a new method of robust speech recognition under unknown and non-stationary noise environments based on discrete-mixture HMMs (DMHMMs). Recently, we applied the DMHMMs based on MAP estimation to speech recognition in noisy conditions, and found that they were effective for the conditions. In this paper, we propose a compensation method for discrete distributions of DMHMMs based on missing feature theory (MFT). It is more likely that acoustic outlier values appear in unknown and non-stationary noise environments. In order to avoid the influence of the values, discrete distributions are compensated by setting thresholds for discrete probability. The proposed method was compared with both conventional continuous-mixture HMMs and acoustic backing-off method. It showed the best performance, and obtained an average relative improvement of 26.0% over the baseline.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

情報処理学会研究報告音声言語情報処理（SLP）

巻 2003, 号 124(2003-SLP-049), p. 115-120, 発行日 2003-12-18

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-22 04:36:59.311753

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

離散混合出力分布型HMMを用いた非定常雑音下の音声認識

× 小坂, 哲夫加藤正治好田, 正紀

× Tetsuo, Kosaka Masaharu, Katoh Masaki, Kohda

Versions

Share

Cite as

エクスポート

インデックスリンク

インデックスツリー

アイテム

離散混合出力分布型HMMを用いた非定常雑音下の音声認識

× 小坂, 哲夫 加藤正治 好田, 正紀

× Tetsuo, Kosaka Masaharu, Katoh Masaki, Kohda

Versions

Share

Cite as

エクスポート

× 小坂, 哲夫加藤正治好田, 正紀