MC Softmax 探索における局面の並列評価： GPU とニューラルネットワークモデルの利用

吉野, 拓真; 五十嵐, 治一; 川島, 馨; Takuma, Yoshino; Harukazu, Igarashi; Kaoru, Kawashima

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

MC Softmax 探索における局面の並列評価： GPU とニューラルネットワークモデルの利用

https://ipsj.ixsq.nii.ac.jp/records/207648

名前 / ファイル	ライセンス	アクション
IPSJ-GPWS2020004.pdf (7.2 MB)	Copyright (c) 2020 by the Information Processing Society of Japan
オープンアクセス

Item type

Symposium(1)

公開日

2020-11-06

タイトル

MC Softmax 探索における局面の並列評価： GPU とニューラルネットワークモデルの利用

タイトル

言語

タイトル

Parallel State Evaluation in MC Softmax Search using GPU and Neural Network Models

言語

jpn

キーワード

主題Scheme

Other

主題

コンピュータ将棋

キーワード

主題Scheme

Other

主題

モンテカルロソフトマックス探索

キーワード

主題Scheme

Other

主題

ニューラルネットワークモデル

キーワード

主題Scheme

Other

主題

dlshogi

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_5794

資源タイプ

conference paper

著者所属

芝浦工業大学

著者所属

芝浦工業大学

著者所属

HEROZ 株式会社

著者所属(英)

Shibaura Institute of Technology

著者所属(英)

Shibaura Institute of Technology

著者所属(英)

HEROZ, Inc.

著者名

吉野, 拓真
五十嵐, 治一
川島, 馨

著者名(英)

Takuma, Yoshino
Harukazu, Igarashi
Kaoru, Kawashima

論文抄録

内容記述タイプ

Other

内容記述

選択探索の一種として，モンテカルロソフトマックス探索が提案されている．一般に，大規模ニューラルネットワークモデルによる評価関数を利用する場合，計算に時間がかかることから，αβ探索のような全幅探索よりは，選択探索の方が向いている．特に，モンテカルロソフトマックス探索においては，兄弟ノード局面をまとめて評価する際に，GPU による並列計算を用いれば，評価関数が大規模なニューラルネットワークモデルであっても容易に並列化できる可能性がある．本研究では，dlshogi のソースコードを改変し，モンテカルロソフトマックス探索とニューラルネットワークモデルの評価関数を組み合わせたプログラムを作成した．特に，ニューラルネットワークモデルの入力層に提示する局面の特徴量表現を工夫することにより，GPU で兄弟局面を同時に並列計算する際の処理時間を短縮することを試みた．さらに，ノード選択方策にPolicy Network の出力値を取り入れることにより，探索精度の向上を試みた．

論文抄録(英)

内容記述タイプ

Other

内容記述

As a kind of selective search, the Monte Carlo softmax search has been proposed. In general, selective search is better than full-width search, such as the alpha-beta search, because of the computational time required to use the evaluation function of large-scale neural network models. In particular, in the Monte Carlo softmax search, parallelization can be easily achieved even in the case of large-scale neural network models by using GPU parallelization in the evaluation of sibling nodes. In this study, we have modified the source code of dlshogi to create a program that combines Monte Carlo softmax search and evaluation functions of neural network models. We devised a feature representation of the input layer of the neural network model to reduce the processing time of the parallel computation for the sibling nodes on the GPU. Furthermore, we att empted to improve the accuracy of the search by incorporating the output of the Policy Network into a node selection policy.

書誌情報

ゲームプログラミングワークショップ2020論文集

巻 2020, p. 16-21, 発行日 2020-11-06

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 19:06:11.007419

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

MC Softmax 探索における局面の並列評価： GPU とニューラルネットワークモデルの利用

× 吉野, 拓真

× 五十嵐, 治一

× 川島, 馨

× Takuma, Yoshino

× Harukazu, Igarashi

× Kaoru, Kawashima

Versions

Share

Cite as

エクスポート