情報学広場：情報処理学会電子図書館

WEKO3

To

lat lon distance

[[sub_check.contents]]

[[sub_check.contents]]

[[sub_radio.contents]]

To

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

MPIを埋め込み可能なGPUプログラミングフレームワークの検討

https://ipsj.ixsq.nii.ac.jp/records/74402

名前 / ファイル	ライセンス	アクション
IPSJ-SACSIS2011064.pdf (904.3 kB)	Copyright (c) 2011 by the Information Processing Society of Japan
オープンアクセス

Item type

Symposium(1)

公開日

2011-05-18

タイトル

タイトル

MPIを埋め込み可能なGPUプログラミングフレームワークの検討

タイトル

言語

en

タイトル

A Study of GPU Programming Framework to Provide Embedded MPI

言語

言語

jpn

キーワード

主題Scheme

Other

主題

プログラミングフレームワーク

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_5794

資源タイプ

conference paper

著者所属

電気通信大学大学院情報システム学研究科

著者所属

電気通信大学大学院情報システム学研究科

著者所属

電気通信大学大学院情報システム学研究科

著者所属

電気通信大学大学院情報システム学研究科

著者所属

電気通信大学大学院情報システム学研究科

著者所属(英)

en

Graduate School of Information Systems, The University of Electro-Communications

著者所属(英)

en

Graduate School of Information Systems, The University of Electro-Communications

著者所属(英)

en

Graduate School of Information Systems, The University of Electro-Communications

著者所属(英)

en

Graduate School of Information Systems, The University of Electro-Communications

著者所属(英)

en

Graduate School of Information Systems, The University of Electro-Communications

著者名

著者名(英)

Takefumi, Miysohi

論文抄録

内容記述タイプ

Other

内容記述

GPU を持つ複数の計算ノードで構成されるクラスタ計算機で，所望の計算を効率良く実行するためには，複数の GPU に効率良く処理を割り当てる必要がある．クラスタ計算機内の複数ノードの GPU を使用するプログラムは，GPU 上の処理を記述するコードとデータ通信処理を行う CPU のコードに分断される．分断されたコードは，プログラマにとって見通しが悪く，また，機械的な最適化や形式検証を難しくする．そこで，GPU 同士のデータ授受を見通しよく記述できるようにするために，GPU コード中に MPI を埋め込み可能なプログラミングフレームワークを提案する．埋め込まれたMPI処理は，コンパイル時に CPU へのMPI処理の要求に変換され，実行時に CPU 上のランタイムルーチンによって適切に処理される．本稿では，まず，提案するプログラミングフレームワークにより，複数の GPU を用いるプログラムの記述が容易になることを示す．次に，このフレームワークを実現するためのコード変換およびランタイムルーチンの設計について述べる．また，提案するフレームワークを用いたプログラムの実行時間をプログラマが CPU コード中に MPI 処理を直接記述したプログラムの実行時間と比較し，実行性能が遜色ないことを示す．

論文抄録(英)

内容記述タイプ

Other

内容記述

For leveraging multiple GPUs in a cluster system, it is necessary to assign application tasks to multiple GPUs and execute those tasks with appropriately using communication primitives to handle data transfer among GPUs. In current GPU programming models, communication primitives such as MPI functions cannot be used within GPU. Instead, such functions should be used in the CPU code. Therefore, programmer must handle both GPU and CPU codes for data communications. This makes GPU programming and its optimization very difficult. In this paper, we propose a programming framework which enables programmers to use MPI functions within GPU kernels. Our framework automatically transforms MPI functions written in a GPU kernel into runtime routines executed on the CPU. With this framework, programmability and readability of programs will much improve. We evaluate the performance and overhead of the framework. The result shows that the proposed framework achieves comparable execution performance with a MPI code written by hand in traditional way.

書誌情報

先進的計算基盤システムシンポジウム論文集

巻 2011, p. 298-305, 発行日 2011-05-18

出版者

言語

ja

出版者

情報処理学会

戻る

0

views

	Views

Versions

Ver.1

2025-01-21 21:36:37.581829

Show All versions

Share

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX