［招待講演］音声認識の方法論の変遷と展望-Acoustic-to-Wordモデルを中心に－

河原, 達也; Tatsuya, Kawahara

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

［招待講演］音声認識の方法論の変遷と展望-Acoustic-to-Wordモデルを中心に－

https://ipsj.ixsq.nii.ac.jp/records/192704

名前 / ファイル	ライセンス	アクション
IPSJ-SLP18125007.pdf (795.4 kB)	Copyright (c) 2018 by the Institute of Electronics, Information and Communication Engineers This SIG report is only available to those in membership of the SIG.
SLP:会員：¥0, DLIB:会員：¥0

Item type

SIG Technical Reports(1)

公開日

2018-12-03

タイトル

［招待講演］音声認識の方法論の変遷と展望-Acoustic-to-Wordモデルを中心に－

タイトル

言語

タイトル

[Invited Talk] Review of Automatic Speech Recognition Methodology—Outlook of Acoustic-to-Word Model—

言語

jpn

キーワード

主題Scheme

Other

主題

オーガナイズドセッション招待講演

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

京都大学情報学研究科

著者所属(英)

School of Informatics, Kyoto University

著者名

河原, 達也

著者名(英)

Tatsuya, Kawahara

論文抄録

内容記述タイプ

Other

内容記述

音声認識の方法論は深層学習，特に End-to-End モデルの導入で大きく変わりつつある．本稿では，従来の方法論を概観し，End-to-End モデルに至るまでの変遷を述べる．単語単位の End-to-End モデルである Acoustic-to-Word モデルは，音響特徴量系列から単語列を直接求めるもので，音響モデルと言語モデルを内包し，発音辞書や複雑な認識プログラムを必要としない革新的な方式である．この方式の課題と解決法についても述べる．

論文抄録(英)

内容記述タイプ

Other

内容記述

The methodology of speech recognition has been changing due to the introduction of deep learning, in particular end-to-end modeling. This article gives a brief overview of the conventional methodologies leading to the end-to-end models. Word-based end-to-end model, referred to as acoustic-to-word model, directly converts a sequence of acoustic features into a word sequence. This model contains acoustic and language models, and does not require a pronunciation lexicon and a complex decoding program. The problems of this new promising model and current solutions are also described.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

研究報告音声言語情報処理（SLP）

巻 2018-SLP-125, 号 7, p. 1-6, 発行日 2018-12-03

ISSN

収録物識別子タイプ

ISSN

収録物識別子

2188-8663

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-20 00:02:18.598093

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

［招待講演］音声認識の方法論の変遷と展望-Acoustic-to-Wordモデルを中心に－

× 河原, 達也

× Tatsuya, Kawahara

Versions

Share

Cite as

エクスポート