PodDiarizer:ポッドキャスト音声認識・理解のためのユーザ訂正活用型音響ダイアライゼーションシステム

佐々木, 洋子; 緒方, 淳; 後藤, 真孝; Yoko, Sasaki; Jun, Ogata; Masataka, Goto

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

PodDiarizer:ポッドキャスト音声認識・理解のためのユーザ訂正活用型音響ダイアライゼーションシステム

https://ipsj.ixsq.nii.ac.jp/records/75439

名前 / ファイル	ライセンス	アクション
IPSJ-SLP11087016.pdf (290.4 kB)	Copyright (c) 2011 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2011-07-14

タイトル

PodDiarizer:ポッドキャスト音声認識・理解のためのユーザ訂正活用型音響ダイアライゼーションシステム

タイトル

言語

タイトル

PodDiarizer: An Audio Diarization System Based on User Corrections for Speech Recognition and Understanding of Podcasts

言語

jpn

キーワード

主題Scheme

Other

主題

アプリケーション

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

産業技術総合研究所

著者所属

産業技術総合研究所

著者所属

産業技術総合研究所

著者所属(英)

National Institute of Advanced Industrial Science and Technology (AIST)

著者所属(英)

National Institute of Advanced Industrial Science and Technology (AIST)

著者所属(英)

National Institute of Advanced Industrial Science and Technology (AIST)

著者名

佐々木, 洋子

著者名(英)

Yoko, Sasaki

論文抄録

内容記述タイプ

Other

内容記述

本稿では，ポッドキャスト等の Web 上の音コンテンツ中の典型的な音響イベント（背景音楽，Jingle，効果音，話者別発話，笑い声等）の区間やそれらの混合区間を自動的に検出する音響ダイアライゼーションシステム「PodDiarizer」を提案する．こうした音コンテンツに対する音響ダイアライゼーションは，実用的な音声認識・理解のために不可欠である．PodDiarizer では，音響信号の再生に同期して，音響イベントの検出結果をスクロール表示し，その誤りをユーザが容易に訂正できるユーザインタフェースを提供する．その誤り訂正結果を用いて音響イベントのモデルを改善することで，音響ダイアライゼーションシステムの性能を向上させることができる．ポッドキャストを対象とした実験により，音響ダイアライゼーションの性能を評価し，音声認識システムに適用した際の性能向上を確認した．

論文抄録(英)

内容記述タイプ

Other

内容記述

The paper proposes an audio diarization system, PodDiarizer, that can automatically detect typical audio events such as background music, jingle, sound effect, spoken voice of each speaker, laugh, and a combination of these, in podcast audio files on the web. Audio diarization of such audio content is indispensable for practical speech recognition and understanding. PodDiarizer provides a user interface in which users can easily correct diarization errors by editing on the scrolling visualization in synchronization with the audio playback. The results of the error correction can then be used to improve the performance of our diarization system by updating models for audio events. We evaluated the performance of audio diarization for podcasts and confirmed that diarization results improved speech recognition performances.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

研究報告音声言語情報処理（SLP）

巻 2011-SLP-87, 号 16, p. 1-6, 発行日 2011-07-14

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-21 21:14:46.690304

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

PodDiarizer:ポッドキャスト音声認識・理解のためのユーザ訂正活用型音響ダイアライゼーションシステム

× 佐々木, 洋子

× Yoko, Sasaki

Versions

Share

Cite as

エクスポート