2024-03-29T06:17:40Zhttps://ipsj.ixsq.nii.ac.jp/ej/?action=repository_oaipmhoai:ipsj.ixsq.nii.ac.jp:001563682017-03-31T05:33:31Z08512:08642:08643:08588:08590
F_004 方策こう配法を用いた行動学習 : 方策中での状態遷移確率の表現(F分野:人工知能・ゲーム)F_004 Behavior Learning Based on a Policy Gradient Method : an Expression of State Transition Probabilities Included in the Policiesjpnhttp://id.nii.ac.jp/1001/00156334/Conference Paperhttps://ipsj.ixsq.nii.ac.jp/ej/?action=repository_action_common_download&item_id=156368&item_no=1&attribute_id=1&file_no=1Copyright (c) 2006 by IEICE,IPSJ近畿大学工学部芝浦工業大学工学部石原, 聖司五十嵐, 治一AA11740605情報科学技術フォーラム一般講演論文集522532542006-08-212016-02-18