音声理解のための音声認識評価尺度とベイズリスク最小化デコーディング

南條浩輝; 河原, 達也; Hiroaki, Nanjo; Tatsuya, Kawahara

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

音声理解のための音声認識評価尺度とベイズリスク最小化デコーディング

https://ipsj.ixsq.nii.ac.jp/records/57078

名前 / ファイル	ライセンス	アクション
IPSJ-SLP04054042.pdf (829.6 kB)	Copyright (c) 2004 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2004-12-22

タイトル

音声理解のための音声認識評価尺度とベイズリスク最小化デコーディング

タイトル

言語

タイトル

ASR Evaluation Measure and Minimum Bayes - Risk Decoding for Open - domain Speech Understanding

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

龍谷大学理工学部情報メディア学科

著者所属

京都大学学術情報メディアセンター

著者所属(英)

Faculty of Science and Technology, Ryukoku University

著者所属(英)

Academic Center for Computing and Media Studies, Kyoto University

著者名

南條浩輝

著者名(英)

Hiroaki, Nanjo

論文抄録

内容記述タイプ

Other

内容記述

ドメインを限定しない自然な話し言葉の音声理解を目的とした音声認識の評価尺度とそれに基づくデコーディング手法を提案する．従来，音声認識の一般的な評価尺度として，全ての単語を一様に扱う「単語誤り率(word error rate: WER)」が用いられてきた．これに対して，情報検索の観点から各単語の重要度を考慮した「重みつきキーワード」誤り率(weighted keyword error rate: WKER)」を提案する．講演音声からの重要文抽出のタスクにおいて，重みつきキーワード誤り率が重要文抽出の制度と相関が高いことを示す．その上で，ベイズリスク最小化(Minimum Bayes-Risk: MBR)」の枠組みに基づいて，重みつきキーワード誤り率の最小化を行う音声認識を実現する．CSJの学会講演17講演を用いて評価を行い，提案する認識手法が重みつきキーワード誤り率及び重要文抽出精度の改善に効果があることを示す．

論文抄録(英)

内容記述タイプ

Other

内容記述

A new evaluation measure of speech recognition and a decoding strategy for keyword-based open-domain speech understanding are presented. Conventionally, WER (word error rate) has been widely used as an evaluation measure of speech recognition, which treats all words in a uniform manner. In this paper, we define a weighted keyword error rate (WKER) which gives a weight on errors from a viewpoint of information retrieval. We first demonstrate that this measure is more appropriate for predicting the performance of key sentence indexing of oral presentations. Then, we formulate a decoding method to minimize WKER based on Minimum Bayes-Risk (MBR) framework, and show that the decoding method works reasonably for improving WKER and key sentence indexing.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

情報処理学会研究報告音声言語情報処理（SLP）

巻 2004, 号 131(2004-SLP-054), p. 247-252, 発行日 2004-12-22

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-22 04:40:59.282007

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

音声理解のための音声認識評価尺度とベイズリスク最小化デコーディング

× 南條浩輝

× Hiroaki, Nanjo

Versions

Share

Cite as

エクスポート