WEKO3
-
RootNode
アイテム
Playing mini-Hanabi card game with Q-learning
https://ipsj.ixsq.nii.ac.jp/records/205142
https://ipsj.ixsq.nii.ac.jp/records/205142a43ffc64-868d-426a-9aa0-57777008ab51
名前 / ファイル | ライセンス | アクション |
---|---|---|
![]() |
Copyright (c) 2020 by the Information Processing Society of Japan
|
Item type | National Convention(1) | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
公開日 | 2020-02-20 | |||||||||||
タイトル | ||||||||||||
タイトル | Playing mini-Hanabi card game with Q-learning | |||||||||||
言語 | ||||||||||||
言語 | eng | |||||||||||
キーワード | ||||||||||||
主題Scheme | Other | |||||||||||
主題 | 人工知能と認知科学 | |||||||||||
資源タイプ | ||||||||||||
資源タイプ識別子 | http://purl.org/coar/resource_type/c_5794 | |||||||||||
資源タイプ | conference paper | |||||||||||
著者所属 | ||||||||||||
京大 | ||||||||||||
著者所属 | ||||||||||||
名大 | ||||||||||||
著者所属 | ||||||||||||
ボッシュ株式会社 | ||||||||||||
著者名 |
ひい, とう
× ひい, とう
× 市来, 正裕
× 中里, 研一
|
|||||||||||
論文抄録 | ||||||||||||
内容記述タイプ | Other | |||||||||||
内容記述 | Hanabi card game is a cooperative card game. Unlike the other games, the players can't see their own cards and can only see other people's. So, it is very challenging for AI players to learn this game. In this study we simulated the Hanabi card game and trained the AI player by using the Q-learning method. However, Q-learning method will take a large amount of time if space states is numerous. Therefore, we parameterized the numbers and kinds of cards to estimate the size of the space states. Finally, we minimized the cards number and trained the AI player by using Q-learning in a short time. | |||||||||||
書誌レコードID | ||||||||||||
収録物識別子タイプ | NCID | |||||||||||
収録物識別子 | AN00349328 | |||||||||||
書誌情報 |
第82回全国大会講演論文集 巻 2020, 号 1, p. 41-42, 発行日 2020-02-20 |
|||||||||||
出版者 | ||||||||||||
言語 | ja | |||||||||||
出版者 | 情報処理学会 |