Self-Playを用いた深層強化学習におけるスコア分布予測型モデルの提案

神子島, 一弥; 坂地, 泰紀; 野田, 五十樹

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

Self-Playを用いた深層強化学習におけるスコア分布予測型モデルの提案

https://ipsj.ixsq.nii.ac.jp/records/232917

名前 / ファイル	ライセンス	アクション
IPSJ-GI24051029.pdf (2.8 MB)	Copyright (c) 2024 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2024-03-01

タイトル

Self-Playを用いた深層強化学習におけるスコア分布予測型モデルの提案

タイトル

言語

タイトル

A Proposal of Score Distribution Predictive Model in Self-Play Deep Reinforcement Learning

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

北海道大学

著者所属

北海道大学

著者所属

北海道大学

著者所属(英)

The University of Hokkaido

著者所属(英)

The University of Hokkaido

著者所属(英)

The University of Hokkaido

著者名

神子島, 一弥
坂地, 泰紀
野田, 五十樹

論文抄録

内容記述タイプ

Other

内容記述

本稿ではゲーム AI で用いられる Self-Play による深層強化学習において，スコアの確率分布を予測するモデルを提案する．提案モデルでは，一般に用いられているスコアの期待値の代わりに，スコアの確率分布を求める．それを直接用いることによって，スコア学習における性能低下問題を解決する．既存モデルと比較した評価実験により，性能低下問題が解決されることが分かった．更にスコアに対してより精密な操作を可能とする結果も得られた．

論文抄録(英)

内容記述タイプ

Other

内容記述

We propose a model for predicting the probability distribution of score in Self-Play deep reinforcement learning, which is used in game AI. In the proposed model, the probability distribution of score is obtained instead of expected value of score that is commonly used. By using it directly, the performance degradation problem in score learning is solved. Evaluation experiments comparing the proposed model with existing models show that the performance degradation problem is solved. Furthermore, the proposed model allowed more precise manipulation of score.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AA11362144

書誌情報

研究報告ゲーム情報学（GI）

巻 2024-GI-51, 号 29, p. 1-8, 発行日 2024-03-01

ISSN

収録物識別子タイプ

ISSN

収録物識別子

2188-8736

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 10:16:40.677431

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

Self-Playを用いた深層強化学習におけるスコア分布予測型モデルの提案

× 神子島, 一弥

× 坂地, 泰紀

× 野田, 五十樹

Versions

Share

Cite as

エクスポート