Hidden Conditional Neural Fieldsを用いた音声認識の検討

藤井, 康寿; 山本, 一公; 中川, 聖一; Yasuhisa, Fujii; Kazumasa, Yamamoto; Seiichi, Nakagawa

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

Hidden Conditional Neural Fieldsを用いた音声認識の検討

https://ipsj.ixsq.nii.ac.jp/records/70707

名前 / ファイル	ライセンス	アクション
IPSJ-SLP10083001.pdf (173.4 kB)	Copyright (c) 2010 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2010-10-22

タイトル

Hidden Conditional Neural Fieldsを用いた音声認識の検討

タイトル

言語

タイトル

A Study of Automatic Speech Recognition using Hidden Conditionan Neural Fields

言語

jpn

キーワード

主題Scheme

Other

主題

一般講演

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

豊橋技術科学大学知能・情報工学系

著者所属

豊橋技術科学大学知能・情報工学系

著者所属

豊橋技術科学大学知能・情報工学系

著者所属(英)

Department of Computer Science and Engineering, Toyohashi University of Technology

著者所属(英)

Department of Computer Science and Engineering, Toyohashi University of Technology

著者所属(英)

Department of Computer Science and Engineering, Toyohashi University of Technology

著者名

藤井, 康寿

著者名(英)

Yasuhisa, Fujii

論文抄録

内容記述タイプ

Other

内容記述

近年，識別モデルを用いた音声認識手法が注目を集めている．特に，Hidden Conditional Randam Fields(HCRF) を用いた音声認識手法は，HMM の自然な拡張となっており，今後の発展が期待できる．HCRF は有望なモデルであるが，仮説のスコアを特徴量の重み付き線形和によって計算するため，特徴量間の非線形な関係をうまくモデル化できないという問題があった．本稿では，HCRF にゲート関数を導入することで，特徴量間の非線形な関係をモデル化することができるように拡張した Hidden Conditional Neural Fields (HCNF) を用いた音声認識手法を提案する．HCNF は，一切の初期モデルを必要とせずに学習することが可能であり，種々の特徴量を使用することも容易である．TIMIT コーパスにおける core テストセット上での monophone を用いた音素認識実験の結果，HCNF による認識結果は，HCRF および，MPE 学習した HMM による認識結果よりもよく，提案法の有効性を示すことができた．

論文抄録(英)

内容記述タイプ

Other

内容記述

Recently, there has been increasing attention in automatic speech recognition using discriminative models. Especially, Hidden Conditional Random Fields(HCRF) is a natural extension of traditional HMM and therefore very promising. However, because HCRF computes the score of a hypothesis by summing up linearly weighted feature values, it cannot consider non-linearity between feature values that will be very crucial for speech recognition. In this paper, we extend HCRF by incorporating gate function used in neural networks and propose a new model called Hidden Conditional Neural Fields(HCNF). Differently with conventional approaches, HCNF can be trained without any initial model and incorporate any kinds of features. Experimental results of continuous phoneme recognition on TIMIT core test set using monophone showed that HCNF was superior to HCRF and HMM trained in MPE manner.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

研究報告音声言語情報処理（SLP）

巻 2010-SLP-83, 号 1, p. 1-6, 発行日 2010-10-22

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-21 23:23:12.932817

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

Hidden Conditional Neural Fieldsを用いた音声認識の検討

× 藤井, 康寿

× Yasuhisa, Fujii

Versions

Share

Cite as

エクスポート