<?xml version='1.0' encoding='UTF-8'?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
  <responseDate>2026-04-12T11:51:00Z</responseDate>
  <request identifier="oai:ipsj.ixsq.nii.ac.jp:00176408" metadataPrefix="oai_dc" verb="GetRecord">https://ipsj.ixsq.nii.ac.jp/oai</request>
  <GetRecord>
    <record>
      <header>
        <identifier>oai:ipsj.ixsq.nii.ac.jp:00176408</identifier>
        <datestamp>2025-01-20T05:53:06Z</datestamp>
        <setSpec>1164:5159:8497:9012</setSpec>
      </header>
      <metadata>
        <oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns="http://www.w3.org/2001/XMLSchema" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
          <dc:title>A comparative study on modeling and controlling emotional acoustic parameters in neural networks based Japanese and Spanish speech synthesis</dc:title>
          <dc:title>A comparative study on modeling and controlling emotional acoustic parameters in neural networks based Japanese and Spanish speech synthesis</dc:title>
          <dc:creator>Jaime, Lorenzo-Trueba</dc:creator>
          <dc:creator>Shinji, Takaki</dc:creator>
          <dc:creator>Junichi, Yamagishi</dc:creator>
          <dc:creator>Jaime, Lorenzo-Trueba</dc:creator>
          <dc:creator>Shinji, Takaki</dc:creator>
          <dc:creator>Junichi, Yamagishi</dc:creator>
          <dc:subject>音声合成</dc:subject>
          <dc:description>In neural network based speech synthesis, adding a one-hot vector to the input is an easy, intuitive but useful way to model multiple speakers or multiple language. In this paper, we use the one-hot vector for modeling and controlling emotional acoustic parameters. We have used Spanish and Japanese databases having the multiple acted emotional speech uttered by professional speakers in our experiment and will show the performance of the one-hot vector approach.</dc:description>
          <dc:description>In neural network based speech synthesis, adding a one-hot vector to the input is an easy, intuitive but useful way to model multiple speakers or multiple language. In this paper, we use the one-hot vector for modeling and controlling emotional acoustic parameters. We have used Spanish and Japanese databases having the multiple acted emotional speech uttered by professional speakers in our experiment and will show the performance of the one-hot vector approach.</dc:description>
          <dc:description>technical report</dc:description>
          <dc:publisher>情報処理学会</dc:publisher>
          <dc:date>2016-12-13</dc:date>
          <dc:format>application/pdf</dc:format>
          <dc:identifier>研究報告音声言語情報処理（SLP）</dc:identifier>
          <dc:identifier>22</dc:identifier>
          <dc:identifier>2016-SLP-114</dc:identifier>
          <dc:identifier>1</dc:identifier>
          <dc:identifier>6</dc:identifier>
          <dc:identifier>2188-8663</dc:identifier>
          <dc:identifier>AN10442647</dc:identifier>
          <dc:identifier>https://ipsj.ixsq.nii.ac.jp/record/176408/files/IPSJ-SLP16114022.pdf</dc:identifier>
          <dc:language>eng</dc:language>
        </oai_dc:dc>
      </metadata>
    </record>
  </GetRecord>
</OAI-PMH>
