Publication Information


Title
Japanese: 
English: Performance Analysis of MapReduce Implementations for High Performance Homology Search 
Authors
Japanese: ザン チャオジエ, 白幡 晃一, 鈴木 脩司, 秋山 泰, 松岡 聡.  
English: Zhang Chaojie, Koichi Shirahata, Shuji Suzuki, Yutaka Akiyama, Satoshi Matsuoka.  
Language English 
Journal/Book title
Japanese: 
English: 
Volume, issue, pages        
Publication date 2015 
Publisher
Japanese: 
English: 
Conference name
Japanese: 2015年ハイパフォーマンスコンピューティングと計算科学シンポジウム 
English: HPCS2015 
Venue
Japanese: 東京都 
English: Tokyo 
Official link http://hpcs.hpcc.jp/
 
Abstract Homology search, used in emerging bioinformatics problems such as metagenomics, is becoming both more important and more challenging: its application area is broadening while its computational complexity is increasing, so it requires massively parallel data processing. Earlier work by some of the authors devised novel algorithms such as GHOSTX, but the enumeration and scheduling of data processing tasks was parallelized with a privately developed, MPI-based master-worker framework called GHOST-MP. An alternative is to use now-popular big data software substrates such as MapReduce, which come with abundant associated software tool-chains, but it is not clear whether the massive resources required by metagenomic homology search would overwhelm MapReduce, given its known limitations. By converting the GHOST-MP master-worker data processing pipeline to MapReduce and benchmarking the resulting pipeline on a variety of high-performance MapReduce incarnations, including Hadoop and Spark, we attempt to characterize the appropriateness of MapReduce as a generic framework for metagenomics, which embodies extremely resource-consuming requirements for both compute and data.
