Item type |
SIG Technical Reports(1) |
Publication date |
2024-06-07 |
Title |
An experimental study of accent embedding for text to accented speech synthesis |
Title language |
en |
Language |
eng |
Keywords |
Subject scheme |
Other |
Subject |
Poster Session 2 |
Resource type |
Resource type identifier |
http://purl.org/coar/resource_type/c_18gh |
Resource type |
technical report |
Author affiliation |
The University of Tokyo |
Author affiliation |
The University of Tokyo |
Author affiliation |
The University of Tokyo |
Authors |
Hewei, Zhang
Daisuke, Saito
Nobuaki, Minematsu
|
Abstract |
Description type |
Other |
Description |
In Text-to-Speech (TTS), end-to-end models have been introduced that take text as input and produce audio as output. This makes it hard to control style without supervision, especially accent, which comprises many kinds of acoustic features. We propose an unsupervised accent embedding extraction model based on Phonetic PosteriorGrams (PPGs). Experiments demonstrated the model's ability to extract accent features from a given utterance, its robustness to training datasets with different accent levels, and its deeper potential. |
Bibliographic record ID |
Source identifier type |
NCID |
Source identifier |
AN10438388 |
Bibliographic information |
研究報告音楽情報科学(MUS) (IPSJ SIG Technical Reports, Music and Computer)
Vol. 2024-MUS-140,
No. 45,
p. 1-5,
Issue date 2024-06-07
|
ISSN |
Source identifier type |
ISSN |
Source identifier |
2188-8752 |
Notice |
|
|
|
SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc. |
Publisher |
Language |
ja |
Publisher |
情報処理学会 (Information Processing Society of Japan) |