ベイズ推論を用いた連続音声からの言語モデル学習

Graham, NEUBIG; 三村, 正人; 森, 信介; 河原, 達也; Graham, Neubig; Masato, Mimura; Shinsuke, Mori; Tatsuya, Kawahara

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

ベイズ推論を用いた連続音声からの言語モデル学習

https://ipsj.ixsq.nii.ac.jp/records/69914

名前 / ファイル	ライセンス	アクション
IPSJ-SLP10082016.pdf (514.8 kB)	Copyright (c) 2010 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2010-07-15

タイトル

ベイズ推論を用いた連続音声からの言語モデル学習

タイトル

言語

タイトル

Learning a Language Model from Continuous Speech using Bayesian Inference

言語

jpn

キーワード

主題Scheme

Other

主題

言語モデル

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

京都大学情報学研究科

著者所属

京都大学情報学研究科

著者所属

京都大学情報学研究科

著者所属

京都大学情報学研究科

著者所属(英)

Kyoto University, Graduate School of Informatics

著者所属(英)

Kyoto University, Graduate School of Informatics

著者所属(英)

Kyoto University, Graduate School of Informatics

著者所属(英)

Kyoto University, Graduate School of Informatics

著者名

Graham, NEUBIG
三村, 正人
森, 信介
河原, 達也

著者名(英)

Graham, Neubig
Masato, Mimura
Shinsuke, Mori
Tatsuya, Kawahara

論文抄録

内容記述タイプ

Other

内容記述

本稿ではテキストを用いず，連続音声のみから言語モデルを学習する方法を提案する．音響モデルのみを用いて作成された音素ラティスに対して推論を行い，単語境界と言語モデルを同時に学習する．具体的には，ノンパラメトリックベイズ法に基づく階層的 Pitman-Yor 言語モデルを利用し，パラメータは WFST に基づいたギブスサンプリングで推定する．会議音声を用いた実験において，提案手法によって学習された言語モデルはパープレキシティ及び音声認識の音素誤り率を有意に改善することができた．さらに，ラティス処理と単語単位の獲得が誤り率の改善に貢献していることがわかった．

論文抄録(英)

内容記述タイプ

Other

内容記述

This paper proposes a technique for learning a language model directly from continuous speech, without the use of text. Inference is performed over phoneme lattices generated using only acoustic model scores, and word boundaries and a language model are learned simultaneously. A Bayesian non-parametric Hierarchical Pitman-Yor language model is used, and parameters are estimated with WFST-based Gibbs sampling. An experiment was performed using meeting speech, and language models built using the proposed techniques were able to significantly lower ASR phoneme error rates. In addition, lattice processing and word boundary discovery were shown to contribute significantly to this improvement.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

研究報告音声言語情報処理（SLP）

巻 2010-SLP-82, 号 16, p. 1-6, 発行日 2010-07-15

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-21 23:43:09.790012

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

ベイズ推論を用いた連続音声からの言語モデル学習

× Graham, NEUBIG

× 三村, 正人

× 森, 信介

× 河原, 達也

× Graham, Neubig

× Masato, Mimura

× Shinsuke, Mori

× Tatsuya, Kawahara

Versions

Share

Cite as

エクスポート