ログイン 新規登録
言語:

WEKO3

  • トップ
  • ランキング
To
lat lon distance
To

Field does not validate



インデックスリンク

インデックスツリー

メールアドレスを入力してください。

WEKO

One fine body…

WEKO

One fine body…

アイテム

  1. 研究報告
  2. ハイパフォーマンスコンピューティング(HPC)
  3. 2025
  4. 2025-HPC-200

Can Tensor Cores Accelerate Non-GEMMWorkloads? An Analytical Study

https://ipsj.ixsq.nii.ac.jp/records/2003140
https://ipsj.ixsq.nii.ac.jp/records/2003140
6cca70d6-7c75-4f0b-9b8d-7db3020ffb97
名前 / ファイル ライセンス アクション
IPSJ-HPC25200003.pdf IPSJ-HPC25200003.pdf (1.4 MB)
 2027年7月28日からダウンロード可能です。
Copyright (c) 2025 by the Information Processing Society of Japan
非会員:¥660, IPSJ:学会員:¥330, HPC:会員:¥0, DLIB:会員:¥0
Item type SIG Technical Reports(1)
公開日 2025-07-28
タイトル
言語 ja
タイトル Can Tensor Cores Accelerate Non-GEMMWorkloads? An Analytical Study
タイトル
言語 en
タイトル Can Tensor Cores Accelerate Non-GEMMWorkloads? An Analytical Study
言語
言語 eng
キーワード
主題Scheme Other
主題 計算方法
資源タイプ
資源タイプ識別子 http://purl.org/coar/resource_type/c_18gh
資源タイプ technical report
著者所属
RIKEN Center for Computational Science
著者所属
University of South Florida
著者所属
Argonne National Laboratory
著者所属
RIKEN Center for Computational Science
著者所属
RIKEN Center for Computational Science
著者所属(英)
en
RIKEN Center for Computational Science
著者所属(英)
en
University of South Florida
著者所属(英)
en
Argonne National Laboratory
著者所属(英)
en
RIKEN Center for Computational Science
著者所属(英)
en
RIKEN Center for Computational Science
著者名 Lingqi,Zhang

× Lingqi,Zhang

Lingqi,Zhang

Search repository
Jiajun,Huang

× Jiajun,Huang

Jiajun,Huang

Search repository
Sheng,Di

× Sheng,Di

Sheng,Di

Search repository
Satoshi,Matsuoka

× Satoshi,Matsuoka

Satoshi,Matsuoka

Search repository
Mohamed,Wahib

× Mohamed,Wahib

Mohamed,Wahib

Search repository
著者名(英) Lingqi Zhang

× Lingqi Zhang

en Lingqi Zhang

Search repository
Jiajun Huang

× Jiajun Huang

en Jiajun Huang

Search repository
Sheng Di

× Sheng Di

en Sheng Di

Search repository
Satoshi Matsuoka

× Satoshi Matsuoka

en Satoshi Matsuoka

Search repository
Mohamed Wahib

× Mohamed Wahib

en Mohamed Wahib

Search repository
論文抄録
内容記述タイプ Other
内容記述 Tensor Cores are specialized units integrated in modern GPUs, designed to accelerate dense matrix operations with remarkable efficiency. They have proven particularly effective in compute-bound workloads, such as those found in deep learning training, where general matrix-matrix multiplication (GEMM) is prevalent. Motivated by this success, recent efforts have explored extending Tensor Core usage to non-GEMM computational patterns. However, despite their potential, effectively utilizing Tensor Cores in broader contexts requires a thorough understanding of their performance characteristics across diverse workloads. This work investigates the applicability of Tensor Cores to non-GEMM workloads, seeking to answer a fundamental question: Can Tensor Cores accelerate non-GEMM kernels?
論文抄録(英)
内容記述タイプ Other
内容記述 Tensor Cores are specialized units integrated in modern GPUs, designed to accelerate dense matrix operations with remarkable efficiency. They have proven particularly effective in compute-bound workloads, such as those found in deep learning training, where general matrix-matrix multiplication (GEMM) is prevalent. Motivated by this success, recent efforts have explored extending Tensor Core usage to non-GEMM computational patterns. However, despite their potential, effectively utilizing Tensor Cores in broader contexts requires a thorough understanding of their performance characteristics across diverse workloads. This work investigates the applicability of Tensor Cores to non-GEMM workloads, seeking to answer a fundamental question: Can Tensor Cores accelerate non-GEMM kernels?
書誌レコードID
収録物識別子タイプ NCID
収録物識別子 AN10463942
書誌情報 研究報告ハイパフォーマンスコンピューティング(HPC)

巻 2025-HPC-200, 号 3, p. 1-8, 発行日 2025-07-28
ISSN
収録物識別子タイプ ISSN
収録物識別子 2188-8841
Notice
SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.
出版者
言語 ja
出版者 情報処理学会
戻る
0
views
See details
Views

Versions

Ver.1 2025-07-10 01:42:33.741414
Show All versions

Share

Mendeley Twitter Facebook Print Addthis

Cite as

エクスポート

OAI-PMH
  • OAI-PMH JPCOAR
  • OAI-PMH DublinCore
  • OAI-PMH DDI
Other Formats
  • JSON
  • BIBTEX

Confirm


Powered by WEKO3


Powered by WEKO3