宇宙輻射輸送コードにおけるOpenCLによるFPGA演算加速最適化

藤田, 典久; 小林, 諒平; 山口, 佳樹; 朴, 泰祐; 吉川, 耕司; 安部, 牧人; 梅村, 雅之; Norihisa, Fujita; Ryohei, Kobayashi; Yoshiki, Yamaguchi; Taisuke, Boku; Kohji, Yoshikawa; Makito, Abe; Masayuki, Umemura

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

宇宙輻射輸送コードにおけるOpenCLによるFPGA演算加速最適化

https://ipsj.ixsq.nii.ac.jp/records/198580

名前 / ファイル	ライセンス	アクション
IPSJ-TACS1203006.pdf (883.0 kB)	Copyright (c) 2019 by the Information Processing Society of Japan
オープンアクセス

Item type

Trans(1)

公開日

2019-07-29

タイトル

宇宙輻射輸送コードにおけるOpenCLによるFPGA演算加速最適化

タイトル

言語

タイトル

Optimization on Astrophysical Radiative Transfer Code for FPGAs with OpenCL

言語

jpn

キーワード

主題Scheme

Other

主題

FPGA，OpenCL，演算加速装置

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_6501

資源タイプ

journal article

著者所属

筑波大学計算科学研究センター

著者所属

筑波大学計算科学研究センター／筑波大学システム情報工学研究科

著者所属

筑波大学計算科学研究センター／筑波大学システム情報工学研究科

著者所属

筑波大学計算科学研究センター／筑波大学システム情報工学研究科

著者所属

筑波大学計算科学研究センター／筑波大学数理物質科学研究科

著者所属

筑波大学計算科学研究センター

著者所属

筑波大学計算科学研究センター／筑波大学数理物質科学研究科

著者所属(英)

Center for Computational Sciences, University of Tsukuba

著者所属(英)

Center for Computational Sciences, University of Tsukuba / Graduate School of Systems and Information Engineering, University of Tsukuba

著者所属(英)

Center for Computational Sciences, University of Tsukuba / Graduate School of Systems and Information Engineering, University of Tsukuba

著者所属(英)

Center for Computational Sciences, University of Tsukuba / Graduate School of Systems and Information Engineering, University of Tsukuba

著者所属(英)

Center for Computational Sciences, University of Tsukuba / Graduate School of Pure and Applied Sciences, University of Tsukuba

著者所属(英)

Center for Computational Sciences, University of Tsukuba

著者所属(英)

Center for Computational Sciences, University of Tsukuba / Graduate School of Pure and Applied Sciences, University of Tsukuba

著者名

藤田, 典久
小林, 諒平
山口, 佳樹
朴, 泰祐
吉川, 耕司
安部, 牧人
梅村, 雅之

著者名(英)

Norihisa, Fujita
Ryohei, Kobayashi
Yoshiki, Yamaguchi
Taisuke, Boku
Kohji, Yoshikawa
Makito, Abe
Masayuki, Umemura

論文抄録

内容記述タイプ

Other

内容記述

近年，High Performance Computing（HPC）におけるチャレンジの中の一つに，高い性能と低い消費電力を持つField Programmable Gate Array（FPGA）技術をどのようにして次世代のスーパーコンピュータに用いるかという問題がある．Graphics Processing Unit（GPU）がHPCにおけるアクセラレータとして最も広く用いられているが，均一な大量の並列計算が必要であり，これが性能上のボトルネックとなる場合がある．一方で，FPGAは再構成回路による柔軟さと効率さを持っており，様々な問題に適応できる可能性を持つ．しかしながら，ハードウェアの動作を記述することは複雑であり，アプリケーションの開発者がFPGA回路を実装することは容易ではない．近年のFPGAにおける開発環境の進歩により，OpenCL言語を用いた高位合成（HLS: High Level Synthesis）開発環境が一般的になってきている．我々のこれまでのOpenCLを用いたカーネル記述の経験より，FPGA向けにアプリケーション記述する際は“co-design”に基づくアグレッシブなプログラミング戦略が高い性能を達成するうえで必要であることが分かっている．本研究では，宇宙輻射輸送を解くプログラムで用いられているアルゴリズムであるAuthentic Radiation Transfer（ART）法をOpenCLで記述してFPGA向けに最適化を行う．OpenCLで記述されたアプリケーションに対してco-designに基づくFPGA向け最適化を適用し，CPU，GPU，FPGA間での性能比較を行った．マルチコアCPU実装と比べて最大4.9倍の高速化が達成され，GPU実装との比較ではGPUと同程度の性能を達成した．FPGA実装の性能はGPUと同程度であるが，FPGAの方が通信オーバヘッドはGPUと比べると小さく，並列計算を行う際の性能はGPUの性能を超えられると考えられることから，今後，並列FPGA計算の実装を行う予定である．

論文抄録(英)

内容記述タイプ

Other

内容記述

One of the recent challenges faced by HPC is how to apply FPGA technology to accelerate a next-generation supercomputer as an efficient method of achieving high performance and low power consumption. GPU is the most commonly used accelerator for HPC supported by regularly executed high degree of parallel operations which causes performance bottleneck in some cases. On the other hand, there are great opportunities to flexibly and efficiently utilize FPGAs in reconfigurable circuits to fit various computing situations. However, it is not easy for application developers to implement FPGA logic circuits for their applications and algorithms, which generally require complicated hardware logic descriptions. Because of the progress made in the FPGA development environment in recent years, the HLS development environment using the OpenCL language has become popular. Based on our experience describing kernels using OpenCL, we found that a more aggressive programming strategy is necessary to realize true high performance based on a “co-design” concept to implement the necessary features and operations to fit the target application in an FPGA design. In this paper, we optimize the ART method used in space radiative transfer problems on an FPGA using OpenCL. Using a co-designed method for the optimized programming of a specific application with OpenCL for an FPGA, we achieved a performance that is 4.9 times faster than that of a multicore CPU implementation, and almost the same performance as a GPU implementation. Considering the current advanced FPGAs with interconnection features, we believe that their parallelized implementation with multiple FPGAs will achieve a higher performance than GPU.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AA11833852

書誌情報

情報処理学会論文誌コンピューティングシステム（ACS）

巻 12, 号 3, p. 64-75, 発行日 2019-07-29

ISSN

収録物識別子タイプ

ISSN

収録物識別子

1882-7829

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 21:58:46.001446

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

宇宙輻射輸送コードにおけるOpenCLによるFPGA演算加速最適化

× 藤田, 典久

× 小林, 諒平

× 山口, 佳樹

× 朴, 泰祐

× 吉川, 耕司

× 安部, 牧人

× 梅村, 雅之

× Norihisa, Fujita

× Ryohei, Kobayashi

× Yoshiki, Yamaguchi

× Taisuke, Boku

× Kohji, Yoshikawa

× Makito, Abe

× Masayuki, Umemura

Versions

Share

Cite as

エクスポート