音声の時間変化モデルに基づく音声信号推定法を用いた　非定常雑音下での音声認識

藤本, 雅清; 有木, 康雉; Masakiyo, Fujimoto; Yasuo, Ariki

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

音声の時間変化モデルに基づく音声信号推定法を用いた　非定常雑音下での音声認識

https://ipsj.ixsq.nii.ac.jp/records/57453

名前 / ファイル	ライセンス	アクション
IPSJ-SLP00034004.pdf (565.6 kB)	Copyright (c) 2000 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2000-12-21

タイトル

音声の時間変化モデルに基づく音声信号推定法を用いた　非定常雑音下での音声認識

タイトル

言語

タイトル

Speech Recognition under Non - stationary Noisy Environments Using Signal Estimation Method Based on Speech State Transition Model

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

龍谷大学理工学部

著者所属

龍谷大学理工学部

著者所属(英)

Faculty of Science and Technology, Ryukoku University

著者所属(英)

Faculty of Science and Technology, Ryukoku University

著者名

藤本, 雅清

著者名(英)

Masakiyo, Fujimoto

論文抄録

内容記述タイプ

Other

内容記述

本研究では，音声の時間変化モデルに基づいた非定常雑音に対する雑音除去法を提案する．提案手法では，音声の時間変化モデルをカルマンフィルタによる推定問題に適用することにより，音楽等のような非定常雑音が重畳した音声から，クリーンな音声信号を推定している．音声の時間変化モデルは，雑音重畳音声におけるクリーン音声の時間変動を，Taylor展開を用いることにより表現したモデルである．モデルの構成に必要なパラメータの1つである雑音の変動成分は，線形予測法により推定を行っている．提案手法の評価のために，3種類の音楽が重畳した音声を用いて大語棄連続音声認識を行ない，単語正解精度において，従来法であるParallel Model Combination(PMC)法と比較を行った．その結果，提案手法により，PMC法よりも高い単語正解精度が得られた．

論文抄録(英)

内容記述タイプ

Other

内容記述

In this paper, we propose a non-stationary noise reduction method based on speech state transition model. Our proposed method estimates the speech signal under non-stationary noisy environments such as musical background by applying speech state transition model to Kalman filtering estimation. The speech state transition model represents the state transition of speech component in non-stationary noisy speech and is modeled by using Taylor expansion. In this model, the state transition of noise component is estimated by using linear predictive estimation. In order to evaluate the proposed method, we carried out large vocabulary continuous speech recognition experiments under 3 types of musics and compared the results with conventionally used Parallel Model Combination(PMC) method in word accuracy rate. As a result, the proposed method obtained word accuracy rate superior to PMC.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

情報処理学会研究報告音声言語情報処理（SLP）

巻 2000, 号 119(2000-SLP-034), p. 19-24, 発行日 2000-12-21

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-22 04:28:46.078892

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

音声の時間変化モデルに基づく音声信号推定法を用いた　非定常雑音下での音声認識

× 藤本, 雅清

× Masakiyo, Fujimoto

Versions

Share

Cite as

エクスポート

インデックスリンク

インデックスツリー

アイテム

音声の時間変化モデルに基づく音声信号推定法を用いた 非定常雑音下での音声認識

× 藤本, 雅清

× Masakiyo, Fujimoto

Versions

Share

Cite as

エクスポート

音声の時間変化モデルに基づく音声信号推定法を用いた　非定常雑音下での音声認識