ログイン 新規登録
言語:

WEKO3

  • トップ
  • ランキング
To
lat lon distance
To

Field does not validate



インデックスリンク

インデックスツリー

メールアドレスを入力してください。

WEKO

One fine body…

WEKO

One fine body…

アイテム

  1. 研究報告
  2. 音声言語情報処理(SLP)
  3. 2018
  4. 2018-SLP-125

Using Functional Load for Optimizing DPGMM based Zero Resource Sub-word Unit Discovery

https://ipsj.ixsq.nii.ac.jp/records/192701
https://ipsj.ixsq.nii.ac.jp/records/192701
46b7f069-c257-4d71-8871-e1d7073283e3
名前 / ファイル ライセンス アクション
IPSJ-SLP18125004.pdf IPSJ-SLP18125004.pdf (842.7 kB)
Copyright (c) 2018 by the Information Processing Society of Japan
オープンアクセス
Item type SIG Technical Reports(1)
公開日 2018-12-03
タイトル
タイトル Using Functional Load for Optimizing DPGMM based Zero Resource Sub-word Unit Discovery
タイトル
言語 en
タイトル Using Functional Load for Optimizing DPGMM based Zero Resource Sub-word Unit Discovery
言語
言語 eng
キーワード
主題Scheme Other
主題 セッション2 単語獲得・感情認識
資源タイプ
資源タイプ識別子 http://purl.org/coar/resource_type/c_18gh
資源タイプ technical report
著者所属
Nara Institute of Science and Technology
著者所属
Nara Institute of Science and Technology/RIKEN, Center for Advanced Intelligence Project AIP
著者所属
Beijing Language and Culture University
著者所属
Nara Institute of Science and Technology/RIKEN, Center for Advanced Intelligence Project AIP
著者所属(英)
en
Nara Institute of Science and Technology
著者所属(英)
en
Nara Institute of Science and Technology / RIKEN, Center for Advanced Intelligence Project AIP
著者所属(英)
en
Beijing Language and Culture University
著者所属(英)
en
Nara Institute of Science and Technology / RIKEN, Center for Advanced Intelligence Project AIP
著者名 Bin, Wu

× Bin, Wu

Bin, Wu

Search repository
Sakriani, Sakti

× Sakriani, Sakti

Sakriani, Sakti

Search repository
Jinsong, Zhang

× Jinsong, Zhang

Jinsong, Zhang

Search repository
Satoshi, Nakamura

× Satoshi, Nakamura

Satoshi, Nakamura

Search repository
著者名(英) Bin, Wu

× Bin, Wu

en Bin, Wu

Search repository
Sakriani, Sakti

× Sakriani, Sakti

en Sakriani, Sakti

Search repository
Jinsong, Zhang

× Jinsong, Zhang

en Jinsong, Zhang

Search repository
Satoshi, Nakamura

× Satoshi, Nakamura

en Satoshi, Nakamura

Search repository
論文抄録
内容記述タイプ Other
内容記述 Unsupervised sub-word discovery of the zero resource language gains attention recently. One of the methods to tackle the problem is using an unsupervised clustering algorithm to recover the discrete phone-like units from the speech, such as the Dirichlet Process Gaussian Mixture Model (DPGMM), which currently achieves top results in the Zero Resource Speech Challenge. However, the DPGMM model is too sensitive to the acoustic variation and often produces too many types of sub-word units. This paper proposes to apply functional load to reduce the size of sub-word units from DPGMM. The functional load is the measurement of how much information in communication is conveyed by contrasts of these units. Then, the aim is to ignore the contrasts of the sub-word units that contribute little in conveying the information of the speech leading to decrease of the number of sub-word classes. We experiment on the official Zerospeech 2015 measuring with ABX error rate.
論文抄録(英)
内容記述タイプ Other
内容記述 Unsupervised sub-word discovery of the zero resource language gains attention recently. One of the methods to tackle the problem is using an unsupervised clustering algorithm to recover the discrete phone-like units from the speech, such as the Dirichlet Process Gaussian Mixture Model (DPGMM), which currently achieves top results in the Zero Resource Speech Challenge. However, the DPGMM model is too sensitive to the acoustic variation and often produces too many types of sub-word units. This paper proposes to apply functional load to reduce the size of sub-word units from DPGMM. The functional load is the measurement of how much information in communication is conveyed by contrasts of these units. Then, the aim is to ignore the contrasts of the sub-word units that contribute little in conveying the information of the speech leading to decrease of the number of sub-word classes. We experiment on the official Zerospeech 2015 measuring with ABX error rate.
書誌レコードID
収録物識別子タイプ NCID
収録物識別子 AN10442647
書誌情報 研究報告音声言語情報処理(SLP)

巻 2018-SLP-125, 号 4, p. 1-2, 発行日 2018-12-03
ISSN
収録物識別子タイプ ISSN
収録物識別子 2188-8663
Notice
SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.
出版者
言語 ja
出版者 情報処理学会
戻る
0
views
See details
Views

Versions

Ver.1 2025-01-20 00:02:22.807224
Show All versions

Share

Mendeley Twitter Facebook Print Addthis

Cite as

エクスポート

OAI-PMH
  • OAI-PMH JPCOAR
  • OAI-PMH DublinCore
  • OAI-PMH DDI
Other Formats
  • JSON
  • BIBTEX

Confirm


Powered by WEKO3


Powered by WEKO3