System Request Detection by Integration with Speech Recognition

佐古, 淳; 山形, 知行; 滝口, 哲也; 有木, 康雄; Atsushi, SAKO; Tomoyuki, YAMAGATA; Tetsuya, TAKIGUCHI; Yasuo, ARIKI

https://ipsj.ixsq.nii.ac.jp/records/56778

Name / File	License	Action
IPSJ-SLP07069025.pdf (463.8 kB)	Copyright (c) 2007 by the Information Processing Society of Japan
Open access

Item type

SIG Technical Reports(1)

Publication date

2007-12-20

Title

System Request Detection by Integration with Speech Recognition (original Japanese title: 音声認識との統合によるシステム要求検出)

Title

System Request Discrimination Based on AdaBoost

Language

jpn

Resource type

Resource type identifier

http://purl.org/coar/resource_type/c_18gh

Resource type

technical report

Author affiliation

Graduate School of Science and Technology, Kobe University

Author affiliation

Graduate School of Engineering, Kobe University

Author affiliation

Graduate School of Engineering, Kobe University

Author affiliation

Graduate School of Engineering, Kobe University

Author names

佐古, 淳; 山形, 知行; 滝口, 哲也; 有木, 康雄

Author names (English)

Atsushi, SAKO; Tomoyuki, YAMAGATA; Tetsuya, TAKIGUCHI; Yasuo, ARIKI

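Abstract

Description type

Other

Description

When speech is used as an interface, the system must determine whether an utterance is addressed to it or to people nearby. To address this problem while still accepting flexibly phrased utterances, we have previously proposed a method that applies boosting to speech recognition results to classify each utterance as a system request or chat. However, because recognition results can contain errors, recognition errors sometimes caused system requests and chat to be misclassified. This paper describes a more accurate request detection method that incorporates system request detection into the formulation of speech recognition and thereby makes use of the recognition hypotheses as well. As before, boosting is used for system request detection. Because the boosting output score is not a probability, it is converted into a pseudo-probability with a sigmoid function so that it can be integrated with speech recognition. In experiments, recall improved over the conventional method that classifies from the recognition result alone, and the method achieved a precision of 0.98, a recall of 0.94, and an F-measure of 0.96.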

Abstract (English)

Description type

Other

Description

Speech user interfaces need to discriminate system requests from human-to-human conversation. We previously proposed a boosting method that discriminates system requests from chat based on the 1-best result of a speech recognition system. Thanks to the boosting algorithm, this method can handle a wide variety of expressions. However, it makes discrimination errors when the speech recognition result contains a misrecognized keyword. In this paper, we propose a system request detection method that considers not only the 1-best result but also the other speech recognition hypotheses. The proposed method is formulated by incorporating system request detection into speech recognition. Boosting is used as the system request discrimination model, but its output score is not a probability. The boosting score is therefore converted into a pseudo-probability with a sigmoid function so that system request discrimination and speech recognition can be integrated. The experimental results showed a precision of 0.98, a recall of 0.94, and an F-measure of 0.96.
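The abstract does not give the formulation itself, so the following is only a minimal sketch of the general idea, under stated assumptions: each recognition hypothesis carries a log score from the recognizer and a real-valued boosting score, the boosting score is mapped to a pseudo-probability by a sigmoid, and the hypothesis and request/chat label are chosen jointly by maximizing the combined log score. The function names, the `slope` and `weight` parameters, and all numbers are hypothetical and are not taken from the report.

```python
import math

EPS = 1e-12  # keeps log() finite when the sigmoid saturates


def sigmoid_pseudo_probability(score, slope=1.0):
    """Map a real-valued boosting score to a pseudo-probability in (0, 1).

    `slope` is an assumed tuning parameter, not a value from the report.
    """
    z = slope * score
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)  # z < 0, so exp(z) cannot overflow
    return e / (1.0 + e)


def detect_request(hypotheses, slope=1.0, weight=1.0):
    """Jointly pick a hypothesis and a request/chat label.

    hypotheses: list of (text, asr_log_score, boosting_score) tuples.
    Returns (text, is_request, combined_log_score) for the best pair.
    The weighted log-linear combination is an assumption of this sketch.
    """
    best = None
    for text, asr_log_score, boosting_score in hypotheses:
        p = sigmoid_pseudo_probability(boosting_score, slope)
        p = min(max(p, EPS), 1.0 - EPS)
        for is_request, prob in ((True, p), (False, 1.0 - p)):
            combined = asr_log_score + weight * math.log(prob)
            if best is None or combined > best[2]:
                best = (text, is_request, combined)
    return best


# Made-up example: the recognizer's 1-best ("right") is a misrecognition
# with a slightly negative boosting score, while the second hypothesis
# is a clear request.
hypotheses = [
    ("turn on the right", -9.8, -0.1),
    ("turn on the light", -10.0, 3.0),
]
best = detect_request(hypotheses)
print(best[0], best[1])
```

With these illustrative scores, classifying only the 1-best hypothesis would label the utterance as chat, whereas the joint search selects the second hypothesis and labels it as a request. This is the kind of recovery from recognition errors that the abstract describes, though the actual weighting and search used in the report may differ.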

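Bibliographic record ID

Source identifier type

NCID

Source identifier

AN10442647

Bibliographic information

IPSJ SIG Technical Reports, Spoken Language Processing (SLP) (情報処理学会研究報告音声言語情報処理)

Vol. 2007, No. 129 (2007-SLP-069), pp. 143-148, issued 2007-12-20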

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

Publisher

Information Processing Society of Japan
