マルチコアアーキテクチャのための密行列LU分解のプログラミング技術

里城, 晴紀; 吉瀬, 謙二; 小長谷, 明彦; Haruki, Satoshiro; Kenji, Kise; Akihiko, Konagaya

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

マルチコアアーキテクチャのための密行列LU分解のプログラミング技術

https://ipsj.ixsq.nii.ac.jp/records/70782

名前 / ファイル	ライセンス	アクション
IPSJ-TACS0303018.pdf (1.0 MB)	Copyright (c) 2010 by the Information Processing Society of Japan
オープンアクセス

Item type

Trans(1)

公開日

2010-09-17

タイトル

マルチコアアーキテクチャのための密行列LU分解のプログラミング技術

タイトル

言語

タイトル

On-chip Parallel Programming Techniques for Dense LU Decomposition

言語

jpn

キーワード

主題Scheme

Other

主題

並列計算

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_6501

資源タイプ

journal article

著者所属

東京工業大学

著者所属

東京工業大学

著者所属

東京工業大学

著者所属(英)

Tokyo Institute of Technology

著者所属(英)

Tokyo Institute of Technology

著者所属(英)

Tokyo Institute of Technology

著者名

里城, 晴紀

著者名(英)

Haruki, Satoshiro

論文抄録

内容記述タイプ

Other

内容記述

近年，シングルコアプロセッサは消費電力と発熱の制限により性能限界に達したため，多数のプロセッサコアによって性能向上を図るマルチコア，メニーコアのプロセッサが主流となっている．マルチコアプロセッサの性能を引き出すためには，すべてのコアを無駄なく動作させるための並列性の確保と，多数のコアが同時アクセスすることで生じるメモリアクセスボトルネックの解消を同時に満たすことが課題となっている．密な線型方程式を効率良く解くLU分解は高性能計算の代表的なベンチマークとして知られている．これまで，LU分解の高速実行アルゴリズムとしては，並列処理を最大限に活用できるright-looking法が適しているといわれていた．しかしながら，マルチコアプロセッサにおいては演算性能に比べメモリ性能が相対的に低いため，データ転送量の多いright-looking法が必ずしも最大性能を示すとは限らない．本論文では，LU分解を題材に，参照局所性が高いleft-looking法が，最大並列性を実現するright-looking法よりも高性能を実現するマルチコアアーキテクチャの条件を，性能予測モデルとCell BEでの評価実験での結果をふまえて報告する．

論文抄録(英)

内容記述タイプ

Other

内容記述

Recently, multicore processor architectures have been getting attention from the viewpoint of the balance of design complexity and CPU performance in the constraint of electronic power consumption and transistor size. In the multicore processor architectures, high performance computing requires not only parallelism to make use a number of cores but also efficient data transfer mechanism to avoid memory access bottleneck. Dense linear algebra LU decomposition is one of the well-known algorithms used for benchmarks in high performance computing. It is usually said that the right-looking method is better than the left-looking method due to the available parallelism in the LU decomposition. However, this is not always true in the multicore architectures due to the memory bandwidth bottleneck. In this paper, architectural conditions in which the left-looking method overperformed the right-looking method are described with performance estimation models and empirical evaluation on Cell BE.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AA11833852

書誌情報

情報処理学会論文誌コンピューティングシステム（ACS）

巻 3, 号 3, p. 199-208, 発行日 2010-09-17

ISSN

収録物識別子タイプ

ISSN

収録物識別子

1882-7829

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-21 23:21:48.413084

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

マルチコアアーキテクチャのための密行列LU分解のプログラミング技術

× 里城, 晴紀

× Haruki, Satoshiro

Versions

Share

Cite as

エクスポート