Jōhōgaku Hiroba: IPSJ Electronic Library (情報学広場：情報処理学会電子図書館)

A Method for Absorbing the Time Expansion That Accompanies Speech-Rate Conversion (話速変換に伴う時間伸張を吸収するための一方法)

https://ipsj.ixsq.nii.ac.jp/records/37574

Name / File	License	Action
IPSJ-HI92044007.pdf (1.3 MB)	Copyright (c) 1992 by the Information Processing Society of Japan
Open access

Item type

SIG Technical Reports(1)

Publication date

1992-09-10

Title

話速変換に伴う時間伸張を吸収するための一方法

Title (language: en)

A METHOD OF ABSORBING TEMPORAL ENLARGEMENT OF SPEECH LENGTHS IN THE VOICE SPEED CONVERTING SYSTEM FOR ELDERLY

Language

jpn

Resource type

Resource type identifier

http://purl.org/coar/resource_type/c_18gh

Resource type

technical report

Author affiliation (same for all five authors)

NHK放送技術研究所

Author affiliation (English; same for all five authors)

NHK Science and Technical Research Laboratories

Authors

池沢 龍; 中村 章; 清山 信正; 都木 徹; 宮坂 栄一

Authors (English)

Ryou Ikezawa; Akira Nakamura; Nobumasa Seiyama; Tohru Takagi; Eiichi Miyasaka

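Abstract

Description type

Other

Description

When a speech-rate conversion system slows the speech rate (the speed of speaking), the utterance time stretches, and the resulting lag behind the time at which the speech is actually uttered inevitably becomes a problem. To solve this, we developed a method with two parts. First, the silent intervals between sentences are shortened to the minimum length that causes no perceptual unnaturalness. Second, rather than being fixed, the speech rate follows the coarse variation of the pitch: taking the interval from one voice onset to the next as the unit, the rate is slow at the start of the interval and is gradually increased toward its end. This makes it possible to convert speech into slow, easy-to-follow speech while keeping the total utterance time equal to that of the original speech.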

Abstract (English)

Description type

Other

Description

The voice speed converting system recently developed by NHK can slow speech below its normal speed at a constant rate while leaving other features (such as pitch and personality) unchanged. This constant-rate conversion lengthens the speech in time. This paper presents a new algorithm that absorbs the resulting discrepancy in length between the converted speech and the original, while the processed speech still sounds perceptually slower. The algorithm has two characteristic features: (1) Every long pause between adjacent sentences is shortened as far as perceptual naturalness allows. (2) The voice speed, set slow at the onset of voicing, is raised step by step, roughly following the envelope of the pitch frequencies, until it reaches a value faster than normal.
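The two features described in the abstract can be illustrated with a small timing sketch. This is not the authors' implementation: the record gives no parameter values, so the stretch factors `start_stretch` and `end_stretch`, the pause floor `min_pause`, the function names, and the segment list are all hypothetical. The sketch also assumes a linear ramp within each onset-to-onset unit, whereas the report describes the speed as roughly following the pitch envelope. It computes durations only and does no audio processing.

```python
def stretched_length(unit_len, start_stretch, end_stretch):
    """Output length of one onset-to-onset unit under a linear stretch ramp.

    The local stretch factor goes linearly from start_stretch (> 1, slow)
    to end_stretch (< 1, faster than normal), so the output length is the
    input length times the mean of the two factors.
    """
    return unit_len * (start_stretch + end_stretch) / 2.0


def plan_output(segments, start_stretch=1.4, end_stretch=0.9, min_pause=0.3):
    """Plan output durations for ("speech" | "pause", seconds) segments.

    Speech units are stretched with the ramp above, which builds up lag.
    Each pause is then shortened to absorb that lag, but never below
    min_pause (hypothetical floor standing in for "perceptual naturalness").
    Returns the planned segments and the lag left at the end.
    """
    lag = 0.0
    plan = []
    for kind, dur in segments:
        if kind == "speech":
            new_dur = stretched_length(dur, start_stretch, end_stretch)
            lag += new_dur - dur
        else:
            floor = min(dur, min_pause)      # a short pause is never lengthened
            new_dur = max(floor, dur - lag)  # absorb as much lag as allowed
            lag -= dur - new_dur
        plan.append((kind, new_dur))
    return plan, lag


# Toy input: two sentences, each followed by a pause (durations in seconds).
segments = [("speech", 2.0), ("pause", 1.0), ("speech", 4.0), ("pause", 0.5)]
plan, lag = plan_output(segments)
for kind, dur in plan:
    print(f"{kind:6s} {dur:.2f} s")
print(f"residual lag: {lag:.2f} s")
```

With these made-up numbers the first pause fully absorbs the lag of the first sentence, while the second pause hits the floor and leaves about 0.4 s of lag. Any lag left over would have to be absorbed by later pauses or by a faster end-of-unit speed, which is the trade-off the abstract describes.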

Bibliographic record ID

Source identifier type

NCID

Source identifier

AA1221543X

Bibliographic information

IPSJ SIG Technical Reports: Human-Computer Interaction (HCI)

Vol. 1992, No. 69 (1992-HI-044), pp. 49-56, issued 1992-09-10

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

Publisher

Language

ja

Publisher

Information Processing Society of Japan (情報処理学会)
