Subwarp Interleavingを利用したGPU向け命令スケジューリング手法

福原, 淳司; 滝本, 宗宏; Junji, Fukuhara; Munehiro, Takimoto

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

Subwarp Interleavingを利用したGPU向け命令スケジューリング手法

https://ipsj.ixsq.nii.ac.jp/records/226830

名前 / ファイル	ライセンス	アクション
IPSJ-SE23214004.pdf (803.7 kB)	Copyright (c) 2023 by the Institute of Electronics, Information and Communication Engineers This SIG report is only available to those in membership of the SIG.
SE:会員：¥0, DLIB:会員：¥0

Item type

SIG Technical Reports(1)

公開日

2023-07-13

タイトル

Subwarp Interleavingを利用したGPU向け命令スケジューリング手法

タイトル

言語

タイトル

Instruction Scheduling for GPUs Utilizing Subwarp Interleaving

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

東京理科大学創域理工学部情報計算科学科

著者所属

東京理科大学創域理工学部情報計算科学科

著者所属(英)

Department of Information Sciences, Tokyo University of Science

著者所属(英)

Department of Information Sciences, Tokyo University of Science

著者名

福原, 淳司
滝本, 宗宏

著者名(英)

Junji, Fukuhara
Munehiro, Takimoto

論文抄録

内容記述タイプ

Other

内容記述

画像処理装置（Graphics Processing Unit，以下，GPU）は，Single-Instruction Multiple-Thread 実行形式を採用しており，分岐命令があると両分岐先を実行する分岐発散を引き起こす場合がある．分岐発散が発生すると，両分岐先の実行コストがかかるので，GPU 向けプログラムの実行効率が低下する．近年，分岐発散が生じた際，一方の分岐先の命令を実行している間にストールした場合に，もう一方の分岐先の命令を割り込ませる Subwarp Interleaving
（以下，SI）という手法が提案された．SI は分岐発散が生じている分岐内のストールを隠蔽できるので，プログラムの実行効率を改善する．しかし，SI は命令の並び順によっては効果的にストールを隠蔽できない問題があった．本稿では，SI を利用して，より効果的にストールを隠蔽するとともにスレッドレベル並列性を高める GPU 向け命令スケジューリング手法を提案する．実験ではサンプルプログラムに対する効果と本手法の将来的な拡張性を示す．

論文抄録(英)

内容記述タイプ

Other

内容記述

Graphics Processing Units (GPUs) exploit the Single-Instruction Multiple-Thread (SIMT) execution model, which causes branch divergence when SIMT threads in a warp follow diﬀerent execution paths. Once the divergence occurs, the execution of diverged paths is serialized, decreasing the performance of GPU programs. Subwarp Interleaving (SI), which allows GPUs to interleave the execution of some subwarps when one subwarp is stalled, has been proposed. Although SI can hide load-to-use stalls in divergent paths, the eﬀectiveness is limited by the order of instructions in divergent branches. In this paper, we propose the novel instruction scheduling algorithm for GPUs to allow SI to hide load-to-use stalls more eﬃciently and improve performance. Our experimental results show that the eﬀect on sample programs and the future expandability of the proposed method.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10112981

書誌情報

研究報告ソフトウェア工学（SE）

巻 2023-SE-214, 号 4, p. 1-6, 発行日 2023-07-13

ISSN

収録物識別子タイプ

ISSN

収録物識別子

2188-8825

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 12:21:52.789542

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

Subwarp Interleavingを利用したGPU向け命令スケジューリング手法

× 福原, 淳司

× 滝本, 宗宏

× Junji, Fukuhara

× Munehiro, Takimoto

Versions

Share

Cite as

エクスポート