音素に関する事前知識を埋め込んだアイヌ語 End-to-end 音声認識

李, 在詠; 三村, 正人; 河原, 達也

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

音素に関する事前知識を埋め込んだアイヌ語 End-to-end 音声認識

https://ipsj.ixsq.nii.ac.jp/records/230179

名前 / ファイル	ライセンス	アクション
IPSJ-Z85-1W-04.pdf (166.0 kB)	Copyright (c) 2023 by the Information Processing Society of Japan

Item type

National Convention(1)

公開日

2023-02-16

タイトル

音素に関する事前知識を埋め込んだアイヌ語 End-to-end 音声認識

言語

eng

キーワード

主題Scheme

Other

主題

人工知能と認知科学

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_5794

資源タイプ

conference paper

著者所属

京大

著者所属

京大

著者所属

京大

著者名

李, 在詠
三村, 正人
河原, 達也

論文抄録

内容記述タイプ

Other

内容記述

A method using explicit prior knowledge about phonemes is presented for improving automatic speech recognition (ASR) performance in low resource settings. First, a fixed-length encoding is defined for each phoneme, where each element of the encoding represents a distinct phonetic feature. Second, a phonetic feature prediction layer is put in a deep neural network (DNN) and the feature predictions are used to make the final token predictions. Experiments are conducted in multilingual settings where Ainu is the target low-resource language. Effectiveness and robustness of this method is explored with varying amounts of training data.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN00349328

書誌情報

第85回全国大会講演論文集

巻 2023, 号 1, p. 787-788, 発行日 2023-02-16

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 11:15:27.663558

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

音素に関する事前知識を埋め込んだアイヌ語 End-to-end 音声認識

× 李, 在詠

× 三村, 正人

× 河原, 達也

Versions

Share

Cite as

エクスポート