Joho-gaku Hiroba (情報学広場): IPSJ Electronic Library


Voice Conversion Based on Variational Bayesian Method (変分ベイズ法に基づく声質変換)

https://ipsj.ixsq.nii.ac.jp/records/56796

Name / File	License	Action
IPSJ-SLP07069043.pdf (435.1 kB)	Copyright (c) 2007 by the Information Processing Society of Japan
Open access

Item type

SIG Technical Reports(1)

Publication date

2007-12-21

Title

Title (ja)

変分ベイズ法に基づく声質変換

Title (en)

Voice Conversion based on Variational Bayesian Method

Language

jpn

Resource type

Resource type identifier

http://purl.org/coar/resource_type/c_18gh

Resource type

technical report

Author affiliation (same for all five author entries)

Department of Computer Science and Engineering, Graduate School of Engineering, Nagoya Institute of Technology (名古屋工業大学大学院工学研究科情報工学専攻)

Author affiliation (English; same for all five author entries)

en

Department of Computer Science and Engineering, Nagoya Institute of Technology

Author name

Author name (English)

Masahiro, MARUME

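Abstract (translated from the Japanese)

Description type

Other

Description

With the growing demand for speech synthesis, there is a need to synthesize speech with diverse speaker characteristics and speaking styles. However, synthesizing such speech requires preparing a model for each speaker and speaking style, which is impractical. Voice conversion based on Gaussian mixture models (GMMs) has therefore been proposed; it makes it possible to synthesize speech with diverse speaker characteristics from a small amount of training data. However, conventional GMM-based voice conversion uses point estimates of the model parameters under the maximum likelihood (ML) criterion, so the accuracy of the estimated model may degrade when sufficient training data are not available. We therefore apply the variational Bayes method to GMM-based voice conversion and perform voice conversion under the Bayesian criterion. Compared with the ML criterion, the proposed method was confirmed to improve both the sound quality and the speaker individuality of the converted speech, indicating that a more accurately estimated model is obtained.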

Abstract (English)

Description type

Other

Description

As the demand for speech synthesis increases, a technique for synthesizing speech with various speaker characteristics and speaking styles is desired. However, a large amount of training data is required to construct a system for each speaker characteristic and speaking style. Voice conversion based on the Gaussian Mixture Model (GMM) is one technique that can solve this problem. The GMM is estimated from a small amount of training data based on the Maximum Likelihood (ML) criterion. However, GMM-based voice conversion still suffers from overfitting, caused by insufficient training data and the point estimation of the ML criterion. To address this problem, we applied the variational Bayes method to GMM-based voice conversion. Experiments confirmed that the proposed technique improves the quality of the converted voice because of its higher generalization ability compared with the conventional ML-based approach.

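Bibliographic record ID

Source identifier type

NCID

Source identifier

AN10442647

Bibliographic information

IPSJ SIG Technical Reports, Spoken Language Processing (SLP) (情報処理学会研究報告音声言語情報処理（SLP）)

Vol. 2007, No. 129 (2007-SLP-069), pp. 247-252, issued 2007-12-21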

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

Publisher

Language

ja

Publisher

Information Processing Society of Japan (情報処理学会)
