WEKO3
アイテム
Automatic parallelization with OSCAR API Analyzer: a cross-platform performance evaluation
https://ipsj.ixsq.nii.ac.jp/records/87484
https://ipsj.ixsq.nii.ac.jp/records/8748409d4b5a5-7b7b-4581-9943-28c4bb703d82
名前 / ファイル | ライセンス | アクション |
---|---|---|
![]() |
Copyright (c) 2012 by the Information Processing Society of Japan
|
|
オープンアクセス |
Item type | SIG Technical Reports(1) | |||||||
---|---|---|---|---|---|---|---|---|
公開日 | 2012-12-06 | |||||||
タイトル | ||||||||
タイトル | Automatic parallelization with OSCAR API Analyzer: a cross-platform performance evaluation | |||||||
タイトル | ||||||||
言語 | en | |||||||
タイトル | Automatic parallelization with OSCAR API Analyzer: a cross-platform performance evaluation | |||||||
言語 | ||||||||
言語 | eng | |||||||
キーワード | ||||||||
主題Scheme | Other | |||||||
主題 | コンパイラ | |||||||
資源タイプ | ||||||||
資源タイプ識別子 | http://purl.org/coar/resource_type/c_18gh | |||||||
資源タイプ | technical report | |||||||
著者所属 | ||||||||
Waseda University/Universitat Politecnica de Catalunya | ||||||||
著者所属 | ||||||||
Waseda University | ||||||||
著者所属 | ||||||||
Waseda University | ||||||||
著者所属 | ||||||||
Waseda University | ||||||||
著者所属 | ||||||||
Waseda University | ||||||||
著者所属 | ||||||||
Waseda University | ||||||||
著者所属 | ||||||||
Waseda University | ||||||||
著者所属 | ||||||||
Waseda University | ||||||||
著者所属 | ||||||||
Waseda University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Waseda University / Universitat Politecnica de Catalunya | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Waseda University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Waseda University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Waseda University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Waseda University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Waseda University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Waseda University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Waseda University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Waseda University | ||||||||
著者名 |
CeciliaGonzalez-Alvarez
Youhei, Kanehagi
Kosei, Takemoto
Yohei, Kishimoto
Kohei, Muto
Hiroki, Mikami
Akihiro, Hayashi
Keiji, Kimura
Hironori, Kasahara
× CeciliaGonzalez-Alvarez Youhei, Kanehagi Kosei, Takemoto Yohei, Kishimoto Kohei, Muto Hiroki, Mikami Akihiro, Hayashi Keiji, Kimura Hironori, Kasahara
|
|||||||
著者名(英) |
Cecilia, Gonzalez-Alvarez
Youhei, Kanehagi
Kosei, Takemoto
Yohei, Kishimoto
Kohei, Muto
Hiroki, Mikami
Akihiro, Hayashi
Keiji, Kimura
Hironori, Kasahara
× Cecilia, Gonzalez-Alvarez Youhei, Kanehagi Kosei, Takemoto Yohei, Kishimoto Kohei, Muto Hiroki, Mikami Akihiro, Hayashi Keiji, Kimura Hironori, Kasahara
|
|||||||
論文抄録 | ||||||||
内容記述タイプ | Other | |||||||
内容記述 | To satisfy the demands of auto parallelizing compilers in the diverse industry of multicores, we have developed the OSCAR API Analyzer. It allows programs automatically parallelized by the OSCAR compiler with OSCAR API directives to target many different platforms using just sequential compilers. We have evaluated the execution performance of the parallelization of Fortran SPEC benchmarks (tomcatv, swim2000, mgrid2000) and media C benchmarks (AAC encoder, Optical flow, MPEG2 encoder, MPEG2 decoder, Face detect) on five HPC servers and four embedded multicores. Speedups on servers were up to 18x for 32 cores (swim2000 on Hitachi SR16000), whereas on embedded systems, AAC encoder speedup was up to 47x on TilePro64, for 64 homogeneous cores, and up to 32.65x for the optical flow on the heterogeneous multicore RP-X, using 8 cores and 4 accelerators. | |||||||
論文抄録(英) | ||||||||
内容記述タイプ | Other | |||||||
内容記述 | To satisfy the demands of auto parallelizing compilers in the diverse industry of multicores, we have developed the OSCAR API Analyzer. It allows programs automatically parallelized by the OSCAR compiler with OSCAR API directives to target many different platforms using just sequential compilers. We have evaluated the execution performance of the parallelization of Fortran SPEC benchmarks (tomcatv, swim2000, mgrid2000) and media C benchmarks (AAC encoder, Optical flow, MPEG2 encoder, MPEG2 decoder, Face detect) on five HPC servers and four embedded multicores. Speedups on servers were up to 18x for 32 cores (swim2000 on Hitachi SR16000), whereas on embedded systems, AAC encoder speedup was up to 47x on TilePro64, for 64 homogeneous cores, and up to 32.65x for the optical flow on the heterogeneous multicore RP-X, using 8 cores and 4 accelerators. | |||||||
書誌レコードID | ||||||||
収録物識別子タイプ | NCID | |||||||
収録物識別子 | AN10463942 | |||||||
書誌情報 |
研究報告ハイパフォーマンスコンピューティング(HPC) 巻 2012-HPC-137, 号 10, p. 1-8, 発行日 2012-12-06 |
|||||||
Notice | ||||||||
SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc. | ||||||||
出版者 | ||||||||
言語 | ja | |||||||
出版者 | 情報処理学会 |