特定の騒音環境下における音声認識のためのノイズ除去の検討と評価実験

佐野, 将太; 村上, 史尚; 川喜田, 佑介; 宮崎, 剛; 田中, 博; Shota, Sano; Fumitaka, Murakami; Yuusuke, Kawakita; Tsuyoshi, Miyazaki; Hiroshi, Tanaka

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

特定の騒音環境下における音声認識のためのノイズ除去の検討と評価実験

https://ipsj.ixsq.nii.ac.jp/records/211282

名前 / ファイル	ライセンス	アクション
IPSJ-MBL21099002.pdf (3.0 MB)	Copyright (c) 2021 by the Institute of Electronics, Information and Communication Engineers This SIG report is only available to those in membership of the SIG.
MBL:会員：¥0, DLIB:会員：¥0

Item type

SIG Technical Reports(1)

公開日

2021-05-20

タイトル

特定の騒音環境下における音声認識のためのノイズ除去の検討と評価実験

タイトル

言語

タイトル

Investigation and Evaluation Experiment of Noise Removal for Voice Recognition in Specific Noisy Environment

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

神奈川工科大学大学院工学研究科情報工学専攻

著者所属(英)

Information & Computer Sciences, Kanagawa Institute of Technology

著者名

佐野, 将太
村上, 史尚
川喜田, 佑介
宮崎, 剛
田中, 博

著者名(英)

Shota, Sano
Fumitaka, Murakami
Yuusuke, Kawakita
Tsuyoshi, Miyazaki
Hiroshi, Tanaka

論文抄録

内容記述タイプ

Other

内容記述

本稿では，人込みや電車内などの騒音環境下における音声認識精度向上のため，特定の状況に限定してノイズ除去を行った際のノイズ除去性能と音声認識精度の検討結果について述べる．実験ではノイズ除去手法として SS 法と DAE を適用した．人込み，電車内を想定したノイズ 2 種類と，SN 比 -10, -5, 0,5, 10, 15dB の 6 種類でノイズを重畳した音声を作成し，DAE では複数のノイズを混合させて学習モデルを作成した場合と，それらを混合せずにノイズに応じた個別のモデルを用いた場合でノイズ除去を行った．ノイズ除去後に出力された音声に対し，ノイズ重畳前の音声データとのコサイン類似度と，スペクトログラム画像に対する正規化相互相関値，音声認識精度の 3 つからノイズ除去性能の評価を行った．その結果，どの評価方法でも複数のノイズを混合させて作成したモデルより，個別の学習モデルが最も良い結果となることを確認した．また，SN 比 10dB では個別の条件で作成したモデルのみ 80% 程の精度での音声認識が可能であることが確認できた．

論文抄録(英)

内容記述タイプ

Other

内容記述

In this manuscript, the noise removal performance and speech recognition accuracy is described when noise is removed by assuming the specific situation in order to improve speech recognition accuracy in a noisy environment such as a crowded spot or in a train. Noise removal was performed by using the SS and DAE method in the experiment. We created speech data with noise superimposed with 2 types of noise assuming crowded spot and inside a train, and 6 types of SN ratio of -10, -5, 0, 5, 10, 15 dB. In the DAE method, the noise was removed and compared by using the model created by mixing multiple noises, and learning models individually created by adding each noise with SN condition. The noise removal performance was evaluated by the cosine similarity to the time-series data, the similarity of the spectrogram image by the normalized correlation, and the speech recognition accuracy between speech data before noise superimposition and the noise removal. It was verified that the individual learning model gave better results than the results by the model created by mixing noise. Also it was confirmed that speech recognition was possible with an accuracy of about 80% only for the model individually created under the conditions of SN ratio of 10dB.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AA11851388

書誌情報

研究報告モバイルコンピューティングと新社会システム（MBL）

巻 2021-MBL-99, 号 2, p. 1-6, 発行日 2021-05-20

ISSN

収録物識別子タイプ

ISSN

収録物識別子

2188-8817

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 17:51:12.810047

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

特定の騒音環境下における音声認識のためのノイズ除去の検討と評価実験

× 佐野, 将太

× 村上, 史尚

× 川喜田, 佑介

× 宮崎, 剛

× 田中, 博

× Shota, Sano

× Fumitaka, Murakami

× Yuusuke, Kawakita

× Tsuyoshi, Miyazaki

× Hiroshi, Tanaka

Versions

Share

Cite as

エクスポート