| Item type |
Symposium(1) |
| Publication date |
2019-11-01 |
| Title |
|
|
Title |
An Attempt to Improve Generalization Performance in Reinforcement Learning with Deterministic World Models and WGANs |
| Language |
|
|
Language |
eng |
| Resource type |
|
|
Resource type identifier |
http://purl.org/coar/resource_type/c_5794 |
|
Resource type |
conference paper |
| Author affiliation |
|
|
|
Department of Information and Communication Engineering, The University of Tokyo |
| Author affiliation |
|
|
|
Department of Information and Communication Engineering, The School of Information Science and Technology, The University of Tokyo |
| Author name |
Tianshuai, Yu
Yoshimasa, Tsuruoka
|
| Abstract |
|
|
Description type |
Other |
|
Description |
Significant progress has been made in Reinforcement Learning (RL) in recent years. Using artificial neural networks, researchers can train agents that play video games as well as, or even better than, human experts. However, the same environments are commonly used for both training and testing, so agents fail to generalize to other environments. In this work, we propose a method that uses environment models and generative models to generate virtual game levels, with the aim of improving the generalization performance of RL agents. We tested the proposed method on a fully observable, deterministic, discrete maze game. However, the proposed method failed to converge during training because our environment model was not able to predict the future of unseen levels accurately. |
| Bibliographic information |
Proceedings of the Game Programming Workshop 2019 (ゲームプログラミングワークショップ2019論文集)
Vol. 2019,
p. 150-154,
Issue date 2019-11-01
|
| Publisher |
|
|
Language |
ja |
|
Publisher |
Information Processing Society of Japan (情報処理学会) |