Item type |
SIG Technical Reports(1) |
公開日 |
2014-12-02 |
タイトル |
|
|
タイトル |
Performance Analysis of MapReduce Implementations for High Performance Homology Search (Unrefereed Workshop Manuscript) |
タイトル |
|
|
言語 |
en |
|
タイトル |
Performance Analysis of MapReduce Implementations for High Performance Homology Search (Unrefereed Workshop Manuscript) |
言語 |
|
|
言語 |
eng |
キーワード |
|
|
主題Scheme |
Other |
|
主題 |
性能評価 |
資源タイプ |
|
|
資源タイプ識別子 |
http://purl.org/coar/resource_type/c_18gh |
|
資源タイプ |
technical report |
著者所属 |
|
|
|
Tokyo Institute of Technology |
著者所属 |
|
|
|
Tokyo Institute of Technology/JST CREST |
著者所属 |
|
|
|
Tokyo Institute of Technology |
著者所属 |
|
|
|
Tokyo Institute of Technology/JST CREST |
著者所属 |
|
|
|
Tokyo Institute of Technology/JST CREST |
著者所属(英) |
|
|
|
en |
|
|
Tokyo Institute of Technology |
著者所属(英) |
|
|
|
en |
|
|
Tokyo Institute of Technology / JST CREST |
著者所属(英) |
|
|
|
en |
|
|
Tokyo Institute of Technology |
著者所属(英) |
|
|
|
en |
|
|
Tokyo Institute of Technology / JST CREST |
著者所属(英) |
|
|
|
en |
|
|
Tokyo Institute of Technology / JST CREST |
著者名 |
Chaojie, Zhang
Koichi, Shirahata
Shuji, Suzuki
Yutaka, Akiyama
Satoshi, Matsuoka
|
著者名(英) |
Chaojie, Zhang
Koichi, Shirahata
Shuji, Suzuki
Yutaka, Akiyama
Satoshi, Matsuoka
|
論文抄録 |
|
|
内容記述タイプ |
Other |
|
内容記述 |
Homology search to be used in emerging bioinformatics problems such as metagenomics is of increasing importance and challenge as its application area grows more broadly while the computational complexity is increasing, thus requiring massive parallel data processing. Earlier work by some of the authors have devised novel algorithms such as GHOSTX, but the master-worker parallelization to enumerate and schedule for data processing was done with a privately developed, MPI-based master-worker framework called GHOST-MP. An alternative is to utilize the now-popular big data software substrates, such as MapReduce with abundant associated software tool-chains, but it is not clear whether the massive resource required by metagenomic homology search would not overwhelm its known limitations. By converting the GHOST-MP master-worker data processing pipeline to accommodate MapReduce, and benchmarking them on a variety of high-performance MapReduce incarnations including Hadoop and Spark, we attempt to characterize the appropriateness of MapReduce as a generic framework for metagenomics that embody extremely resource consuming requirements for both compute and data. |
論文抄録(英) |
|
|
内容記述タイプ |
Other |
|
内容記述 |
Homology search to be used in emerging bioinformatics problems such as metagenomics is of increasing importance and challenge as its application area grows more broadly while the computational complexity is increasing, thus requiring massive parallel data processing. Earlier work by some of the authors have devised novel algorithms such as GHOSTX, but the master-worker parallelization to enumerate and schedule for data processing was done with a privately developed, MPI-based master-worker framework called GHOST-MP. An alternative is to utilize the now-popular big data software substrates, such as MapReduce with abundant associated software tool-chains, but it is not clear whether the massive resource required by metagenomic homology search would not overwhelm its known limitations. By converting the GHOST-MP master-worker data processing pipeline to accommodate MapReduce, and benchmarking them on a variety of high-performance MapReduce incarnations including Hadoop and Spark, we attempt to characterize the appropriateness of MapReduce as a generic framework for metagenomics that embody extremely resource consuming requirements for both compute and data. |
書誌レコードID |
|
|
収録物識別子タイプ |
NCID |
|
収録物識別子 |
AN10096105 |
書誌情報 |
研究報告計算機アーキテクチャ(ARC)
巻 2014-ARC-213,
号 29,
p. 1-7,
発行日 2014-12-02
|
Notice |
|
|
|
SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc. |
出版者 |
|
|
言語 |
ja |
|
出版者 |
情報処理学会 |