予稿の話し言葉変換に基づく言語モデルによる講演音声認識

渡邉, 真人; 秋田, 祐哉; 河原, 達也; Makoto, Watanabe; Yuya, Akita; Tatsuya, Kawahara

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

予稿の話し言葉変換に基づく言語モデルによる講演音声認識

https://ipsj.ixsq.nii.ac.jp/records/79344

名前 / ファイル	ライセンス	アクション
IPSJ-SLP11089001.pdf (349.8 kB)	Copyright (c) 2011 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2011-12-12

タイトル

予稿の話し言葉変換に基づく言語モデルによる講演音声認識

タイトル

言語

タイトル

Automatic Transcription of Lecture Speech using Language Model based on Speaking-Style Transformation of Proceeding Texts

言語

jpn

キーワード

主題Scheme

Other

主題

言語モデル・辞書

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

京都大学情報学研究科

著者所属

京都大学情報学研究科

著者所属

京都大学情報学研究科

著者所属(英)

Graduate School of Informatics, Kyoto University

著者所属(英)

Graduate School of Informatics, Kyoto University

著者所属(英)

Graduate School of Informatics, Kyoto University

著者名

渡邉, 真人
秋田, 祐哉
河原, 達也

著者名(英)

Makoto, Watanabe
Yuya, Akita
Tatsuya, Kawahara

論文抄録

内容記述タイプ

Other

内容記述

講演のような話し言葉の音声認識では，言語モデルがドメインに関連する表現とフィラーや口語表現などの話し言葉特有の表現の両方をカバーすることが求められる．本研究では，単語・構文などの情報に基づくルールベースの話し言葉テキスト変換と，N-gram の統計的話し言葉変換を組み合わせて，書き言葉スタイルの予稿テキストから話し言葉スタイルの言語モデルを構築する手法を提案する．学会講演音声を対象とした評価実験において，提案手法の効果の評価を行った．

論文抄録(英)

内容記述タイプ

Other

内容記述

For automatic speech recognition of spontaneous lecture speech, language models need to cover spoken-style expressions such as fillers and colloquial expressions, as well as domain-dependent topic words. We propose an approach to make a spoken-style language model from written-style texts by combining two transformation methods: a rule-based text transformation using lexical and syntactic information, and statistical transformation of N-gram entries. Experiments over academic presentations were conducted to evaluate the proposed approach.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

研究報告音声言語情報処理（SLP）

巻 2011-SLP-89, 号 1, p. 1-6, 発行日 2011-12-12

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-21 20:15:17.655917

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

予稿の話し言葉変換に基づく言語モデルによる講演音声認識

× 渡邉, 真人

× 秋田, 祐哉

× 河原, 達也

× Makoto, Watanabe

× Yuya, Akita

× Tatsuya, Kawahara

Versions

Share

Cite as

エクスポート