非負値行列分衡皐を用いた楽曲中のボーカルパート抽出に関する検討

安井, 優太; 坂野, 秀樹; 板倉, 文忠; Yuta, Yasui; Hideki, Banno; Fumitada, Itakural

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

非負値行列分衡皐を用いた楽曲中のボーカルパート抽出に関する検討

https://ipsj.ixsq.nii.ac.jp/records/79356

名前 / ファイル	ライセンス	アクション
IPSJ-SLP11089013.pdf (525.2 kB) 2100年1月1日からダウンロード可能です。	Copyright (c) 2011 by the Institute of Electronics, Information and Communication Engineers This SIG report is only available to those in membership of the SIG.
SLP:会員：¥0, DLIB:会員：¥0

Item type

SIG Technical Reports(1)

公開日

2011-12-12

タイトル

非負値行列分衡皐を用いた楽曲中のボーカルパート抽出に関する検討

タイトル

言語

タイトル

Study on extraction of vocal part in music signal by using non-negative matrix factorization

言語

jpn

キーワード

主題Scheme

Other

主題

ポスターセッション

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

名城大学大学院理工学研究科

著者所属

名城大学理工学部

著者所属

名城大学理工学部

著者所属(英)

Graduate School of Science and Technology, Meijo University

著者所属(英)

Meijo University

著者所属(英)

Meijo University

著者名

安井, 優太

著者名(英)

Yuta, Yasui

論文抄録

内容記述タイプ

Other

内容記述

本研究では，歌声信号と伴奏信号を重ね合わせた楽曲信号から非負値行列分解を用いて歌声信号を抽出する方法について検討する．非負値行列分解は，入カスペクトログラムに対し，スペクトログラム上に現れる類似したスペクトルパターンを一つの基底ベクトルとして表現することで，複数の基底ベクトルと，それぞれの時間変化情報に分離することができる．しかし，歌声に現れるビブラートなどスペクトルが時間的に変動する信号に対しては有限個の基底で表現することが困難なため，歌声の抽出に適していない．この問題を解決するために，楽曲信号中の歌声信号の基本周波数を基準となる音高に一致させることで基本周波数によるスペクトルの変動を除去し，この信号に対して非負値行列分解を行う手法を提案する．抽出された歌声信号と伴奏信号を S/Ｎ比により評価した結果，従来法に比べ提案法は合成信号の劣化が表れ，S/Ｎ比は低くなる傾向があったが，一部の楽曲信号で有効性が確認された．

論文抄録(英)

内容記述タイプ

Other

内容記述

This paper describes extraction methods of vocal signal in music signal which is a mixture of vocal signal and accompaniment signal by using non-negative matrix factorization. Non-negative matrix factorization (NMF) can factorize an input spectrogram into a finite number of basis vectors and its temporal activity information, because it represents similar spectral patterns appeared on the input spectrogram with a single basis vector. However, NMF is not suitable for extraction of vocal signal because factorization of vocal signal including temporal spectral fluctuation appeared in vibrato of singing voice into a finite number of basis vectors is quite difficult. To solve this problem, we propose a preprocessing method that removes the spectral fluctuation by using a linear frequency axis warping of the spectrum so that a fundamental frequency of vocal signal included in the input music signal aligns to a reference frequency. Then, NMF is applied to this preprocessed signal. We have performed evaluation by SNR of extracted vocal signal and extracted accompaniment signal, in comparison with the conventional method. As a result, it was found that the generated signals by the proposed method had lower quality and SNR. However, the proposed method obtained slight better results for some music signals.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

研究報告音声言語情報処理（SLP）

巻 2011-SLP-89, 号 13, p. 1-6, 発行日 2011-12-12

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-21 20:15:41.188569

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

非負値行列分衡皐を用いた楽曲中のボーカルパート抽出に関する検討

× 安井, 優太

× Yuta, Yasui

Versions

Share

Cite as

エクスポート