Prosody Conversion for Emotional Mandarin Speech Synthesis Using the Tone Nucleus Model
https://ipsj.ixsq.nii.ac.jp/records/75425
License | Access |
---|---|
Copyright (c) 2011 by the Information Processing Society of Japan | Open access |
Item type | SIG Technical Reports(1) | |||||||
---|---|---|---|---|---|---|---|---|
Publication date | 2011-07-14 | |||||||
Title | ||||||||
Title | Prosody Conversion for Emotional Mandarin Speech Synthesis Using the Tone Nucleus Model | |||||||
Language | en | |||||||
Language | ||||||||
Language | eng | |||||||
Keywords | ||||||||
Subject scheme | Other | |||||||
Subject | Synthesis | |||||||
Resource type | ||||||||
Resource type identifier | http://purl.org/coar/resource_type/c_18gh | |||||||
Resource type | technical report | |||||||
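Author affiliation | ||||||||
Graduate School of Engineering, The University of Tokyo | ||||||||
Author affiliation | ||||||||
Graduate School of Engineering, The University of Tokyo | ||||||||
Author affiliation | ||||||||
Graduate School of Information Science and Technology, The University of Tokyo | ||||||||
Author affiliation | ||||||||
Graduate School of Information Science and Technology, The University of Tokyo | ||||||||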
Author affiliation (English) | ||||||||
en | ||||||||
Department of Electrical Engineering and Information Systems, the University of Tokyo | ||||||||
Author affiliation (English) | ||||||||
en | ||||||||
Department of Electrical Engineering and Information Systems, the University of Tokyo | ||||||||
Author affiliation (English) | ||||||||
en | ||||||||
Department of Information and Communication Engineering, the University of Tokyo | ||||||||
Author affiliation (English) | ||||||||
en | ||||||||
Department of Information and Communication Engineering, the University of Tokyo | ||||||||
Author names | Miaomiao, Wen; Miaomiao, Wang; Keikichi, Hirose; Nobuaki, Minematsu | |||||||
Abstract | ||||||||
Description type | Other | |||||||
Description | In this paper, the tone nucleus model is employed to represent and convert F0 contours for synthesizing emotional Mandarin speech from neutral speech. Compared with previous prosody transformation methods, the proposed method 1) converts only the tone nucleus part of each syllable rather than the whole F0 contour, to avoid the data sparseness problem; and 2) builds mapping functions for well-chosen tone nucleus model parameters to better capture Mandarin tonal information. With only a modest amount of training data, the perceptual accuracy achieved by our method was shown to be comparable to that obtained by a professional speaker. | |||||||
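Bibliographic record ID | ||||||||
Source identifier type | NCID | |||||||
Source identifier | AN10442647 | |||||||
Bibliographic information | SIG Technical Reports: Spoken Language Processing (SLP), Vol. 2011-SLP-87, No. 2, pp. 1-6, issued 2011-07-14 | |||||||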
Notice | ||||||||
SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc. | ||||||||
Publisher | ||||||||
Language | ja | |||||||
Publisher | Information Processing Society of Japan (情報処理学会)