マルチエージェント強化学習を用いた 門司港花火大会における人流誘導の検討

宮﨑, 和也; 木山, 真人; 尼崎, 太樹; 岡本, 利章; Kazuya, Miyazaki; Masato, Kiyama; Motoki, Amagasaki; Toshiaki, Okamoto

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

マルチエージェント強化学習を用いた門司港花火大会における人流誘導の検討

https://ipsj.ixsq.nii.ac.jp/records/232719

名前 / ファイル	ライセンス	アクション
IPSJ-CVIM24237028.pdf (2.8 MB)	Copyright (c) 2024 by the Institute of Electronics, Information and Communication Engineers This SIG report is only available to those in membership of the SIG.
CVIM:会員：¥0, DLIB:会員：¥0

Item type

SIG Technical Reports(1)

公開日

2024-02-25

タイトル

マルチエージェント強化学習を用いた門司港花火大会における人流誘導の検討

タイトル

言語

タイトル

Multi-agent reinforcement learning based control method for large-scale crowd movement on Mojiko Fireworks Festival dataset

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

熊本大学大学院自然科学教育部

著者所属

熊本大学大学院先端科学研究部

著者所属

熊本大学大学院先端科学研究部

著者所属

熊本大学大学院自然科学教育部

著者所属(英)

Department of Computer Science and Electrical Engineering, Kumamoto University

著者所属(英)

Faculty of Advanced Science and Technology, Kumamoto University

著者所属(英)

Faculty of Advanced Science and Technology, Kumamoto University

著者所属(英)

Department of Computer Science and Electrical Engineering, Kumamoto University

著者名

宮﨑, 和也
木山, 真人
尼崎, 太樹
岡本, 利章

著者名(英)

Kazuya, Miyazaki
Masato, Kiyama
Motoki, Amagasaki
Toshiaki, Okamoto

論文抄録

内容記述タイプ

Other

内容記述

イベントなどにおける事故を受け，人流誘導の重要性が増している．近年，シミュレーションに強化学習を適用する研究がある．強化学習エージェントはシミュレータから現在の状態を受け取り，失敗と成功を繰り返しながら報酬が最大となるような行動を学習する．本研究では，マルチエージェント強化学習の手法である MA-DDPG(Multi-Agent Deep deterministic policy gradient) を人流誘導に適用する．さらに MA-DDPG における actor に対し行動価値関数を学習させる手法を提案する．そして，門司港花火大会のデータセットにおける混雑解消を行う．評価において，先行研究モデルと提案モデルの混雑度を比較した．各手法の比較を行なった結果，提案手法は既存手法と比べ混雑度の緩和に成功し，提案手法は他手法と比べエージェント同士が協力しながら混雑緩和を行っていることがわかった．

論文抄録(英)

内容記述タイプ

Other

内容記述

The importance of human ﬂow guidance is increasing in response to accidents at events. In recent years, some research has applied reinforcement learning to simulation. Reinforcement learning agents receive the current state from the simulator and learn behaviours that maximise rewards through repeated failures and successes. In this study, we apply MA-DDPG, a method of multi-agent reinforcement learning, and propose a method of learning an action value function for an actor in MA-DDPG. Then, we perform congestion reduction on the Mojiko Fireworks Festival dataset. In the evaluation, the congestion of the previous research model and the proposed model are compared.Comparison of the methods shows that the proposed method succeeds in reducing congestion compared to the existing methods. The proposed method was found to be more successful than the other methods in reducing congestion through cooperation between agents.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AA11131797

書誌情報

研究報告コンピュータビジョンとイメージメディア（CVIM）

巻 2024-CVIM-237, 号 28, p. 1-8, 発行日 2024-02-25

ISSN

収録物識別子タイプ

ISSN

収録物識別子

2188-8701

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 10:20:36.323282

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

マルチエージェント強化学習を用いた門司港花火大会における人流誘導の検討

× 宮﨑, 和也

× 木山, 真人

× 尼崎, 太樹

× 岡本, 利章

× Kazuya, Miyazaki

× Masato, Kiyama

× Motoki, Amagasaki

× Toshiaki, Okamoto

Versions

Share

Cite as

エクスポート

インデックスリンク

インデックスツリー

アイテム

マルチエージェント強化学習を用いた 門司港花火大会における人流誘導の検討

× 宮﨑, 和也

× 木山, 真人

× 尼崎, 太樹

× 岡本, 利章

× Kazuya, Miyazaki

× Masato, Kiyama

× Motoki, Amagasaki

× Toshiaki, Okamoto

Versions

Share

Cite as

エクスポート

マルチエージェント強化学習を用いた門司港花火大会における人流誘導の検討