不均衡データに対する多段階学習を用いたアンサンブルモデルによる2クラス分類アルゴリズムの提案

藤原, 和樹; 繁野, 麻衣子; 住田, 潮; Kazuki, Fujiwara; Maiko, Shigeno; Ushio, Sumita

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

不均衡データに対する多段階学習を用いたアンサンブルモデルによる2クラス分類アルゴリズムの提案

https://ipsj.ixsq.nii.ac.jp/records/194472

名前 / ファイル	ライセンス	アクション
IPSJ-MPS19122009.pdf (1.5 MB)	Copyright (c) 2019 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2019-02-21

タイトル

不均衡データに対する多段階学習を用いたアンサンブルモデルによる2クラス分類アルゴリズムの提案

タイトル

言語

タイトル

A New Ensemble Model for Imbalanced Two-class Classification by Learning Multistage

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

筑波大学

著者所属

筑波大学

著者所属

筑波大学

著者所属(英)

University of Tsukuba

著者所属(英)

University of Tsukuba

著者所属(英)

University of Tsukuba

著者名

藤原, 和樹
繁野, 麻衣子
住田, 潮

著者名(英)

Kazuki, Fujiwara
Maiko, Shigeno
Ushio, Sumita

論文抄録

内容記述タイプ

Other

内容記述

近年，機械学習を用いた 2 クラス分類アルゴリズムは，種々のアプリケーション実現のための中核的な技術要素として応用されている．一方で，応用される多くの事例では対象データが少数の陽性と多数の陰性から構成される不均衡データであり，この不均衡性によって陽性の分類精度が低くなってしまうことが問題になっている．本稿では，不均衡データに対する陽性の分類精度の向上を目的とした 2 クラス分類アルゴリズムについて研究する．一般的に，不均衡データに対する陽性の分類精度を向上させようとした場合，偽陰性の減少と引き換えに偽陽性が増加する可能性がある．そこで本研究では，不均衡データに対して偽陰性・偽陽性を共に減少させる多段階学習を用いたアンサンブルモデルによる 2 クラス分類アルゴリズムを提案する．提案手法では，分類が難しいと判断されたデータに対して学習を繰り返し，期毎に複数のモデルを作成する．多段階的に作成された複数のモデルを用いて，偽陰性を減少させるための最適線形結合モデルと偽陽性を減少させるためのカスケード結合モデルをそれぞれ構築し，この 2 つのモデルを統合して最終的な分類を行う．実験を通じて，提案手法を用いることによって，既存手法よりも陽性の分類精度が向上すること，偽陰性と偽陽性が共に減少することを示した．

論文抄録(英)

内容記述タイプ

Other

内容記述

In recent years, algorithms for classification with two possible outcomes have played important roles in machine learning. In various applications in the real world, the analyzing datasets are hard to deal with because the sizes of classes are imbalanced, i.e., the data contain a few positive outcomes while contain many negative outcomes. This imbalance causes the low accuracy of classification for positive. Many algorithms for two-class classification have been developed to improve the accuracy of classification even if the dataset is imbalanced. However, such algorithms tend to yield increasing false positives in return for reducing false negatives. We propose an algorithm tries to reduce both false negative and false positives in multistage learning. By using several learning machines, our algorithm constructs several models. To reduce false negatives, we take an optimal linear combination of those models. On the other hand, to reduce false positives, we take a cascade classifier. Integrating these two models, the solution for classification is given. Computational experiments are performed to verify the proposed method and to ascertain that the classification accuracy of positive is better than the existing method. The proposed method succeeded to decrease both the false negative and false positives.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10505667

書誌情報

研究報告数理モデル化と問題解決（MPS）

巻 2019-MPS-122, 号 9, p. 1-6, 発行日 2019-02-21

ISSN

収録物識別子タイプ

ISSN

収録物識別子

2188-8833

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 23:28:22.033509

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

不均衡データに対する多段階学習を用いたアンサンブルモデルによる2クラス分類アルゴリズムの提案

× 藤原, 和樹

× 繁野, 麻衣子

× 住田, 潮

× Kazuki, Fujiwara

× Maiko, Shigeno

× Ushio, Sumita

Versions

Share

Cite as

エクスポート