Background Music Generation depending on Game Situation using Text-to-Music model (テキスト条件付き音楽生成モデルを用いたゲームの場面に応じたBGMの生成)

Tougo Fujisawa (藤澤,透冴); Makoto Murakami (村上,真)

https://ipsj.ixsq.nii.ac.jp/records/2008671

Name / File	License	Action
IPSJ-CVIM26245005.pdf (1.3 MB), available for download from 9999-01-01.	Copyright (c) 2026 by the Institute of Electronics, Information and Communication Engineers. This SIG report is only available to those in membership of the SIG.
CVIM: members ¥0, DLIB: members ¥0

Item type

SIG Technical Reports(1)

Publication date

2026-03-17

Title

テキスト条件付き音楽生成モデルを用いたゲームの場面に応じたBGMの生成

Title (English)

Background Music Generation depending on Game Situation using Text-to-Music model

Language

jpn

Keywords

Subject scheme

Other

Subject

PRMU

Resource type

Resource type identifier

http://purl.org/coar/resource_type/c_18gh

Resource type

technical report

Author affiliation

東洋大学大学院総合情報研究科総合情報学専攻

Author affiliation

東洋大学大学総合情報学部

Author affiliation (English)

Toyo University Graduate School of Information Science, Department of Information Science

Author affiliation (English)

Toyo University of Information Science

Author name

藤澤,透冴
村上,真

Author name (English)

Tougo Fujisawa
Makoto Murakami

Abstract

Description type

Other

Description

本研究の目的は，個人でゲームを制作している開発者にとって負担となるゲームBGMを簡単に制作できるシステムを構築することである．テキストを入力すれば楽曲を生成することができるテキスト条件付き音楽生成モデルでは，曲のジャンルやムードや楽器構成といった音楽的特徴を記述したテキストを入力する必要があるが，専門的な音楽知識を有しないゲーム開発者がそういった内容のテキストを記述することは容易ではない．一方，ゲーム映像の内容を自然言語で記述することは容易であると考え，本研究ではゲームの場面情報を記述したテキストからゲームBGMの生成を行う．具体的には，場面情報を記述したテキストを大規模言語モデルにより音楽的特徴を記述したテキストに変換し，そのテキストをテキスト条件付き楽曲生成モデルであるMusicGenに入力することでゲームBGMを生成する．

Abstract (English)

Description type

Other

Description

The objective of this study is to build a system that lets independent game developers easily create background music (BGM), which is often a significant burden in solo game production. Text-to-music models can generate music from textual input; however, they require descriptions of musical attributes such as genre, mood, and instrumentation. For game developers without specialized musical knowledge, writing such musically detailed descriptions is not easy. Describing the content of a game scene in natural language, on the other hand, is relatively easy. This study therefore proposes a method for generating game BGM from text that describes scene information. Specifically, text describing the game scene is first transformed by a large language model (LLM) into text describing musical attributes. The resulting text is then given as input to MusicGen, a text-to-music model, to generate the game BGM.
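The two-stage flow described in the abstract (scene text, then a music description produced by an LLM, then audio from a text-to-music model) might be sketched in Python roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not give the prompt wording, the LLM used, or how MusicGen is invoked, so `PROMPT_TEMPLATE`, `build_llm_prompt`, `scene_to_bgm`, and the two stand-in stages are all hypothetical names introduced here. Both models are passed in as plain callables so that the sketch makes no claim about any real library's API.

```python
from typing import Callable

# Hypothetical instruction wording; the abstract does not give the actual prompt.
PROMPT_TEMPLATE = (
    "Describe background music for the following game scene. "
    "State the genre, mood, and instrumentation in one sentence.\n"
    "Scene: {scene}"
)


def build_llm_prompt(scene: str) -> str:
    """Wrap a scene description in the (assumed) instruction for the LLM."""
    scene = scene.strip()
    if not scene:
        raise ValueError("scene description must not be empty")
    return PROMPT_TEMPLATE.format(scene=scene)


def scene_to_bgm(
    scene: str,
    describe_music: Callable[[str], str],
    generate_audio: Callable[[str], list[float]],
) -> tuple[str, list[float]]:
    """Run the two stages: scene text -> music description -> audio samples.

    describe_music stands in for the LLM and generate_audio for the
    text-to-music model (MusicGen in the report). Both are injected, so
    this function assumes nothing about either model's real interface.
    """
    music_description = describe_music(build_llm_prompt(scene)).strip()
    if not music_description:
        raise ValueError("LLM returned an empty music description")
    return music_description, generate_audio(music_description)


# Stand-in stages for demonstration only; real models would replace these.
def fake_llm(prompt: str) -> str:
    return "slow orchestral piece, tense mood, strings and timpani"


def fake_music_model(description: str) -> list[float]:
    return [0.0] * 8  # placeholder for generated audio samples


description, audio = scene_to_bgm(
    "The hero enters a dark cave before the boss fight.",
    fake_llm,
    fake_music_model,
)
print(description)
```

Keeping the two stages behind callables means the developer only ever writes the scene description, while the intermediate music description stays available for inspection or manual correction before audio is generated.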

Bibliographic record ID

Source identifier type

NCID

Source identifier

AA11131797

Bibliographic information

研究報告コンピュータビジョンとイメージメディア（CVIM）

Vol. 2026-CVIM-245, No. 5, pp. 1-5, issued 2026-03-17

ISSN

Source identifier type

ISSN

Source identifier

2188-8701

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

Publisher

Information Processing Society of Japan (情報処理学会)

Versions

Ver.1

2026-03-11 07:08:25.432287
