楽曲生成AIによって生成された楽曲の主観的評価と音楽的特徴の関係について

井口, 滉大; 保谷, 哲也; Hiroto, Iguchi; Tetsuya, Hoya

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

楽曲生成AIによって生成された楽曲の主観的評価と音楽的特徴の関係について

https://ipsj.ixsq.nii.ac.jp/records/234707

名前 / ファイル	ライセンス	アクション
IPSJ-SLP24152020.pdf (1.9 MB)	Copyright (c) 2024 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2024-06-07

タイトル

楽曲生成AIによって生成された楽曲の主観的評価と音楽的特徴の関係について

タイトル

言語

タイトル

Relationship between subjective evaluations and musical characteristics of music generated by Music Generation AI

言語

jpn

キーワード

主題Scheme

Other

主題

ポスターセッション1

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

日本大学

著者所属

日本大学

著者所属(英)

Nihon University

著者所属(英)

Nihon University

著者名

井口, 滉大
保谷, 哲也

著者名(英)

Hiroto, Iguchi
Tetsuya, Hoya

論文抄録

内容記述タイプ

Other

内容記述

本研究では，複数トラックの楽曲を同時に生成可能なモデルである Multi-Music Transformer（MMT）を用いて，生成された楽曲についての調査を行った．実験ではモデルの入力次元数を「64」「128」「256」「512」「1024」の 5 つの条件で変化させ，これら 5 つの条件下で 3 曲ずつ，計 15 曲の楽曲生成を行なった．その後，各楽曲に対して計 13 名の被験者による主観的評価の調査を行った結果，次元数「64」で生成された音楽は一貫して低評価であり，それに対して次元数「256」「512」「1024」で生成された音楽は高評価を受けたことが確認された．また，これらの生成された音楽の不協和音数を計測した場合，楽曲によって差が生じたものの，次元数が低い「64」「128」場合では不協和音数が多く，その一方で次元数が比較的高い「256」「512」「1024」場合において不協和音数が少ないことも確認された．

論文抄録(英)

内容記述タイプ

Other

内容記述

This study focuses on the Multi-Music Transformer (MMT), a model capable of generating multiple tracks of music simultaneously, to investigate the music generated. To generate the tracks of music, the input dimension of the model was varied under five conditions: "64", "128", "256", "512", and "1024", and, under these conditions, three pieces of music were generated for each, t otaling 15 pieces. Subsequently, a subjective survey was conducted on each piece of music with a total of 13 participants , and it was confirmed that the music generated with the dimension "64" consistently received low ratings, whereas the music generated with dimensi ons "256", "512", and "1024" received relatively high ratings. The measurements of dissonance in the generated music pieces indicate that, while there were variations across compositions, lower dimensions such as "64" and "128" exhibited higher counts of dissonance, whereas higher dimensions "256", "512", and "1024" corresponded to fewer instances of dissonance.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

研究報告音声言語情報処理（SLP）

巻 2024-SLP-152, 号 20, p. 1-6, 発行日 2024-06-07

ISSN

収録物識別子タイプ

ISSN

収録物識別子

2188-8663

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 09:43:06.800628

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

楽曲生成AIによって生成された楽曲の主観的評価と音楽的特徴の関係について

× 井口, 滉大

× 保谷, 哲也

× Hiroto, Iguchi

× Tetsuya, Hoya

Versions

Share

Cite as

エクスポート