Performance Analysis of MapReduce Implementations for High Performance Homology Search

Zhang Chaojie; Koichi Shirahata; Shuji Suzuki; Yutaka Akiyama; Satoshi Matsuoka

Publication Information

Title

Japanese:
English:	Performance Analysis of MapReduce Implementations for High Performance Homology Search

Authors

Japanese:	ザンチャオジエ, 白幡晃一, 鈴木脩司, 秋山泰, 松岡聡.
English:	Zhang Chaojie, Koichi Shirahata, Shuji Suzuki, Yutaka Akiyama, Satoshi Matsuoka.

Language

English

Journal/Book Title

Japanese:
English:

Volume, Number, Pages

Publication Date

2015

Publisher

Japanese:
English:

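Conference Name

Japanese:	2015年ハイパフォーマンスコンピューティングと計算科学シンポジウム (High Performance Computing and Computational Science Symposium 2015)
English:	HPCS2015

Venue

Japanese:	東京都
English:	Tokyo

Official Link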

http://hpcs.hpcc.jp/

Abstract

Homology search, used in emerging bioinformatics problems such as metagenomics, is growing in both importance and difficulty: its application area keeps broadening while its computational complexity increases, so it requires massively parallel data processing. Earlier work by some of the authors devised novel algorithms such as GHOSTX, but the parallelization that enumerates and schedules the data processing was done with a privately developed, MPI-based master-worker framework called GHOST-MP. An alternative is to use the now-popular big data software substrates, such as MapReduce with its abundant associated software tool-chains, but it is not clear whether the massive resources required by metagenomic homology search would overwhelm MapReduce's known limitations. By converting the GHOST-MP master-worker data processing pipeline to MapReduce and benchmarking the result on a variety of high-performance MapReduce incarnations, including Hadoop and Spark, we attempt to characterize the appropriateness of MapReduce as a generic framework for metagenomics, which imposes extremely resource-consuming requirements on both compute and data.
