"Lingqi Zhang,Mohamed Wahib,Peng Chen,Jintao Meng,Xiao Wang,Toshio Endo,Satoshi Matsuoka","Revisiting Temporal Blocking Stencil Optimizations","ACM International Conference on Supercomputing (ICS 2023)","proceedings of ACM International Conference on Supercomputing (ICS 2023)",,,,"pp. 251-263",2023,June "Lingqi Zhang,Mohamed Wahib,Peng Chen,Jintao Meng,Xiao Wang,Toshio Endo,Satoshi Matsuoka","PERKS: a Locality-Optimized Execution Model for Iterative Memory-bound GPU Applications","ACM International Conference on Supercomputing (ICS 2023)","proceedings of ACM International Conference on Supercomputing (ICS 2023)",,,,"pp. 167-179",2023,June "Lingqi Zhang,Mohamed Wahib,Peng Chen,Jintao Meng,Xiao Wang,Toshio Endo,Satoshi Matsuoka","Exploiting Scratchpad Memory for Deep Temporal Blocking","the 15th Workshop on General Purpose Processing Using GPU (GPGPU 2023)","proceedings of the 15th Workshop on General Purpose Processing Using GPU (GPGPU 2023)",,,,,2023,Feb. "Lingqi Zhang,Mohamed Wahib,Peng Chen,Jintao Meng,Xiao Wang,Toshio Endo,Satoshi Matsuoka","Breaking the Memory Bottleneck for Iterative Memory-bound Applications Via Persistent Kernels",,"IPSJ SIG Technical Report","IPSJ","Vol. 2022-HPC-187","No. 18",,2022,Dec. "Yosuke Oyama,Naoya Maruyama,Nikoli Dryden,Erin McCarthy,Peter Harrington,Jan Balewski,Satoshi Matsuoka,Peter Nugent,Brian Van Essen","The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism",,"IEEE Transactions on Parallel & Distributed Systems (TPDS)",,"vol. 32","no. 7","pp. 1641-1652",2021,July "Jens Domke,Emil Vatai,Alexsandr Drozd,Peng Chen,Yosuke Oyama,Lingqi Zhang,Shweta Salaria,Daichi Mukunoki,Artur Podobas,Mohamed Wahib,Satoshi Matsuoka","Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws?","International Parallel and Distributed Processing Symposium (IPDPS 2021)",,,,,,2021,May "Jens Domke,Emil Vatai,Alexsandr Drozd,Peng Chen,Yosuke Oyama,Lingqi Zhang,Shweta Salaria,Daichi Mukunoki,Artur Podobas,Mohamed Wahib,Satoshi Matsuoka","Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws?",,,,,,,2020,Oct. "Yosuke Oyama,Naoya Maruyama,Nikoli Dryden,Erin McCarthy,Peter Harrington,Jan Balewski,Satoshi Matsuoka,Peter Nugent,Brian Van Essen","The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism",,,,,,,2020,July "Peng Chen,Mohamed Wahib,Shinichiro Takizawa,Ryousei Takano,Satoshi Matsuoka.","High resolution Image Reconstruction on Supercomputer","GPU Technology Conference Silicon Vally 2020 (GTC' 20)",,,,,,2020,Mar. "Peng Chen,Mohamed Wahib,Shinichiro Takizawa,Ryousei Takano,Hirotaka Ogawa,Satoshi Matsuoka","A Software Systolic Array on GPUs","GPU Technology Conference Silicon Vally 2020 (GTC' 20)",,,,,,2020,Mar. "Kazuaki Matsumura,Hamid Reza Zohouri,Mohamed Wahib,Toshio Endo,Satoshi Matsuoka.","AN5D: Automated Stencil Framework for High-Degree Temporal Blocking on GPUs .","In proceedings of International Symposium on Code Generation and Optimization (CGO 2020)",,,,,,2020,Feb. "Peng Chen,Mohamed Wahib,Shinichiro Takizawa,Ryousei Takano,Satoshi Matsuoka","A versatile software systolic execution model for GPU memory-bound kernels","Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis",,,,,,2019,Nov. "Peng Chen,Mohamed Wahib,Shinichiro Takizawa,Ryousei Takano,Satoshi Matsuoka","iFDK: a scalable framework for instant high-resolution image reconstruction","Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis",,,,,,2019,Nov. "Jens Domke,Satoshi Matsuoka,Ivan R. Ivanov,Yuki Tsushima,Tomoya Yuki,Akihiro Nomura,Shin’ichi Miura,Nic McDonald,Dennis L. Floyd,Nicolas Dub?","HyperX Topology: First At-Scale Implementation and Comparison to the Fat-Tree","International Conference for High Performance Computing, Networking, Storage and Analysis (SC '19)","SC '19 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis","Association for Computing Machinery",,"No. 40",,2019,Nov. "八島慶汰,石川康太,佐藤育郎,野村哲弘,横田理央,松岡聡","早期終了タイミングを予測する:深層学習における確率勾配の分布の変化点検出","第22回情報論的学習理論ワークショップ (IBIS 2019)",,,,,,2019,Nov. "Yosuke Oyama,Naoya Maruyama,Nikoli Dryden,Peter Harrington,Jan Balewski,Satoshi Matsuoka,Marc Snir,Peter Nugent,Brian Van Essen","Toward Training a Large 3D Cosmological CNN with Hybrid Parallelization","48th International Conference on Parallel Processing (ICPP 2019)",,,,,,2019,Aug. "Jens Domke,Satoshi Matsuoka,Ivan R. Ivanov,Yuki Tsushima,Tomoya Yuki,Akihiro Nomura,Shin'ichi Miura,Nic McDonald,Dennis L. Floyd,Nicolas Dube","The First Supercomputer with HyperX Topology: A Viable Alternative to Fat-Trees?","26th Symposium on High-Performance Interconnects (IEEE Hot Interconnects 2019)",,,,,,2019,Aug. "Yohei Tsuji,Kazuki Osawa,Yuichiro Ueno,Akira Naruse,Rio Yokota,Satoshi Matsuoka","Performance Optimizations and Analysis of Distributed Deep Learning with Approximated Second-Order Optimization Method","International Conference on Parallel Processing: The 1st Workshop on Parallel and Distributed Machine Learning","Proceedings of the 48th International Conference on Parallel Processing: Workshops",,,"No. 21",,2019,Aug. "Yosuke Oyama,Naoya Maruyama,Nikoli Dryden,Peter Harrington,Jan Balewski,Satoshi Matsuoka,Marc Snir,Peter Nugent,Brian Van Essen","Toward Training a Large 3D Cosmological CNN with Hybrid Parallelization","The 1st Workshop on Parallel and Distributed Machine Learning 2019 (PDML'19)",,,,,,2019,Aug. "Yosuke Oyama,Naoya Maruyama,Nikoli Dryden,Peter Harrington,Jan Balewski,Satoshi Matsuoka,Marc Snir,Peter Nugent,Brian Van Essen","Toward Training a Large 3D Cosmological CNN with Hybrid Parallelization","第170回ハイパフォーマンスコンピューティング研究発表会",,,,,,2019,July "土川 稔生,遠藤 敏夫,野村 哲弘,近藤正章,大山 洋介,松岡 聡","メモリアクセスデータを用いた機械学習によるアプリケーションの類型化","並列/分散/協調処理に関するサマーワークショップ(SWoPP2019), 情報処理学会研究報告, 2019-HPC-170 No.12",,,,,,2019,July "Kazuki Osawa,Yohei Tsuji,Yuichiro Ueno,Akira Naruse,Rio Yokota,Satoshi Matsuoka","Second-order Optimization Method for Large Mini-batch: Training ResNet-50 on ImageNet in 35 Epochs","IEEE/CVF Conference on Computer Vision and Pattern Recognition",,,,,,2019,June "Yusuke Nagasaka,Akira Nukada,Ryosuke Kojima,Satoshi Matsuoka","Batched Sparse Matrix Multiplication for Accelerating Graph Convolutional Networks","19th Annual IEEE/ACM International Symposium in Cluster, Cloud, and Grid Computing (CCGrid 2019)","Proceedings of the 19th Annual IEEE/ACM International Symposium in Cluster, Cloud, and Grid Computing (CCGRID)","IEEE",,,,2019,May "Hideyuki Jitsumoto,Yuya Kobayashi,Akihiro Nomura,Satoshi Matsuoka","MH-QEMU: Memory-State-Aware Fault Injection Platform","Supercomputing Asia 2019","Supercomputing Frontiers. SCFA 2019. Lecture Notes in Computer Science","Springer","vol. 11416",,"pp. 71-85",2019,Apr. "Yosuke Oyama,Tal Ben-Nun,Torsten Hoefler,Satoshi Matsuoka","u-cuDNN: Accelerating Deep Learning Frameworks with Micro-Batches","GPU Technology Conference 2019 (GTC2019)",,,,,,2019,Mar. "Peng Chen,Mohamed Wahib,Shinichiro Takizawa,Ryousei Takano,SATOSHI MATSUOKA","Efficient Algorithms for the Summed Area Tables Primitive on GPUs","IEEE Cluster 2018",,,,,,2018,Sept. "James Lin,Zhigeng Xu,Linjin Cai,Akira Nukada,Satoshi Matsuoka","Evaluating the SW26010 Many-core Processor with a Micro-benchmark Suite for Performance Optimizations",,"Parallel Computing","Elsevier","Vol. 77",,"pp. 128-143",2018,Sept. "Yosuke Oyama,Tal Ben-Nun,Torsten Hoefler,Satoshi Matsuoka","Accelerating Deep Learning Frameworks with Micro-batches","IEEE Cluster 2018",,,,,,2018,Sept. "Kevin Brown,Nikhil Jain,Satoshi Matsuoka,Martin Schulz,Abhinav Bhatele","Interference between I/O and MPI Traffic on Fat-tree Networks","ICPP 2018: 47th International Conference on Parallel Processing",,"Association for Computing Machinery (ACM)",,,,2018,Aug. "Hamid Reza ZOHOURI,Artur Podobas,SATOSHI MATSUOKA","High-Performance High-Order Stencil Computation on FPGAs Using OpenCL","2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)",,,,,,2018,Aug. "土川稔生,大山洋介,野村哲弘,松岡聡","機械学習による計算機トレースの自動生成","並列/分散/協調処理に関するサマーワークショップ (SWoPP2018)",,,,,,2018,Aug. "八島慶汰,大山洋介,松岡聡","深層学習におけるBatchNormalization使用時の計算時間と精度の関係性","並列/分散/協調処理に関するサマーワークショップ (SWoPP2018)",,,,,,2018,July "Yusuke Nagasaka,Akira Nukada,Satoshi Matsuoka,Kenichi Miura,John Shalf","MRG8 - Random Number Generation for the Exascale Era","PASC 2018: Platform for Advanced Scientific Computing Conference","PASC '18 Proceedings of the Platform for Advanced Scientific Computing Conference","ACM",,,,2018,July "Yosuke Oyama,Tal Ben-Nun,Torsten Hoefler,Satoshi Matsuoka","μ-cuDNN",,,,,,,2018,July "Adrian Perez Dieguez,Margarita Amor,Doallo Ram?n,Akira Nukada,Satoshi Matsuoka","Efficient Solving of Scan Primitive on Multi-GPU Systems","32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2018)","Proceedings of 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2018)","IEEE",,,,2018,May "James Lin,Minhua Wen,Delong Meng,Xin Liu,Akira Nukada,Satoshi Matsuoka","Optimizations of Preconditioned Conjugate Gradient on TaihuLight for OpenFOAM","18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2018)","Proceedings of 2018 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)","IEEE",,,,2018,May "Yosuke Oyama,Tal Ben-Nun,Torsten Hoefler,SATOSHI MATSUOKA","μ-cuDNN: Accelerating Deep Learning Frameworks with Micro-Batching",,,,,,,2018,Apr. "Jian Guo,Akihiro Nomura,Ryan Barton,Haoyu Zhang,Satoshi Matsuoka","Machine Learning Predictions for Underestimation of Job Runtime on HPC System","4th Asian Conference on Supercomputing Frontiers","Supercomputing Frontiers","Springer International Publishing",,,"Page 179-198",2018,Mar. "Chen Peng,Wahib Mohamed,Takizawa Shinichiro,Matsuoka Satoshi","Pushing the Limits for 2D Convolution Computation On CUDA-enabled GPUs","第163回ハイパフォーマンスコンピューティング研究発表会",,,,,,2018,Mar. "Hamid Reza Zohouri,Artur Podobas,Satoshi Matsuoka","Combined Spatial and Temporal Blocking for High-Performance Stencil Computation on FPGAs Using OpenCL","26th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA)","Proceedings of the 2018 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays","ACM, New York, NY, USA",,,"pp. 153-162",2018,Feb. "小林佑矢,實本英之,野村哲弘,松岡聡","メモリアクセスパターン依存故障の注入のためのQEMUベース故障注入器","第163回 ハイパフォーマンスコンピューティング研究会","研究報告ハイパフォーマンスコンピューティング(HPC)","一般社団法人情報処理学会"," 2018-HPC-163"," 8"," 1 - 10",2018,Feb. "Yosuke Oyama,Tal Ben-Nun,Torsten Hoefler,Satoshi Matsuoka","Less is More: Accelerating Deep Neural Networks with Micro-Batching","第162回ハイパフォーマンスコンピューティング研究発表会",,,,,,2017,Dec. "藤田 和宏,鶴見慶,安良岡由規,根本忍,梁井善行,渡邊寿雄,野村 哲弘,三浦信一,額田彰,遠藤敏夫,松岡聡","新スーパーコンピュータTSUBAME3.0の概要.","2017年度大学ICT推進協議会(AXIES)年次大会 No. TC1-6",,,,,,2017,Dec. "Toshio Endo,Satoshi Matsuoka.","TSUBAME3.0: A Green, Accelerated, Big-Data Supercomputer","ATIP Workshop on International Exascale and Next-Generation Computing Programs, in conjunction with SC17.",,,,,,2017,Nov. "Ikuro Sato,Ryo Fujisaki,Yosuke Oyama,Akihiro Nomura,Satoshi Matsuoka","Asynchronous, data-parallel deep convolutional neural network training with linear prediction model for parameter transition","The 24th International Conference On Neural Information Processing (ICONIP 2017)","International Conference on Neural Information Processing",,"volume 10635",,"pp. 305-314",2017,Nov. "Satoshi Matsuoka,Toshio Endo,Akira Nukada,Shinichi Miura,Akihiro Nomura,Hitoshi Sato,Hideyuki Jitsumoto,Aleksandr Drozd.","Overview of TSUBAME3.0, Green Cloud Supercomputer for Convergence of HPC, AI and Big-Data .",,"Global Scientific Information and Computing Center, Tokyo Institute of Technology, e-Science Journal",,"Vol. 16",,"pp. 2--9",2017,Nov. "Shota Kuroda,Toshio Endo,Satoshi Matsuoka.","Applying Temporal Blocking with a Directive-based Approach.","In Proceedings of Fourth Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC), in conjuntion with SC17",,,,,,2017,Nov. "Artur Podobas,Hamid Reza ZOHOURI,SATOSHI MATSUOKA","Evaluating high-level design strategies on FPGAs for high-performance computing","2017 27th International Conference on Field Programmable Logic and Applications (FPL)",,,,,,2017,Oct. "Kevin Brown,Satoshi Matsuoka","Co-locating Graph Analytics and HPC Applications","2017 IEEE International Conference on Cluster Computing (CLUSTER)","2017 IEEE International Conference on Cluster Computing (CLUSTER)",,,,"pp. 659-660",2017,Sept. "Yusuke Nagasaka,Akira Nukada,Satoshi Matsuoka","High-performance and Memory-saving Sparse General Matrix-Matrix Multiplication for NVIDIA Pascal GPU","International Conference on Parallel Processing",,,,,,2017,Sept. "Shweta Salaria,Kevin Brown,HIDEYUKI JITSUMOTO,SATOSHI MATSUOKA","Evaluation of HPC-Big Data Applications Using Cloud Platforms","The 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)",,"IEEE",,,,2017,July "松岡 聡,遠藤 敏夫,額田 彰,三浦 信一,野村 哲弘,佐藤 仁,實本 英之,Drozd Aleksandr","HPCとビッグデータ・AIを融合するグリーン・クラウドスパコンTSUBAME3.0の概要","並列/分散/協調処理に関するサマーワークショップ(SWoPP2017)",,,,,,2017,July "Jian Guo,Kun Qian,Bjorn Schuller,Satoshi Matsuoka","GPU-based Training of Autoencoders for Bird Sound Data Processing","2017 IEEE International Conference on Consumer Electronics - Taiwan","2017 IEEE International Conference on Consumer Electronics - Taiwan","IEEE",,,,2017,July "辻 陽平,野村 哲弘,實本 英之,佐藤 育郎,松岡 聡","動的なプロセス数操作による分散深層学習の耐故障性と性能評価",,"情報処理学会研究報告 IPSJ SIG Technical Report",,,,,2017,July "小林 佑矢,實本 英之,野村 哲弘,松岡 聡","メモリアクセスパターン依存故障の注入のためのQEMUベース故障注入器","2017年並列/分散/協調処理に関する『秋田』サマー・ワークショップ (SWoPP2017)","研究報告ハイパフォーマンスコンピューティング(HPC)",," 2017-HPC-160"," 8"," 1 - 8",2017,July "Yosuke Oyama,Akihiro Nomura,Ikuro Sato,Hiroki Nishimura,Yukimasa Tamatsu,Satoshi Matsuoka","Predicting Probabilistic Parameters of a Large-Scale Asynchronous SGD Deep Learning System","GPU Technology Conference 2017 (GTC 2017)",,,,,,2017,May "Hamid Reza ZOHOURI,Naoya Maruyama,Aaron Smith,Motohiko Matsuda,SATOSHI MATSUOKA","Evaluating and Optimizing OpenCL Kernels for High Performance Computing with FPGAs","Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'16)",,,,,,2017,Mar. "本山 義史,遠藤 敏夫,松岡 聡,横田 理央,福田 圭祐,佐藤 育郎","低ランク近似行列によるCNNにおける畳み込み演算の最適化","第158回ハイパフォーマンスコンピューティング研究発表会","2017-HPC-158 No.25",,,,,2017,Mar. "大山洋介,野村哲弘,佐藤育郎,松岡聡","ディープラーニングのデータ並列学習における少精度浮動小数点数を用いた通信量の削減","第158回ハイパフォーマンスコンピューティング研究発表会",,,,,,2017,Mar. "黒田 勝汰,遠藤 敏夫,松岡 聡","テ?ィレクティフ?による時空間フ?ロッキンク?の自動適用","ハイパフォーマンスコンピューティング研究会",,,,,,2016,Dec. "Yosuke Oyama,Akihiro Nomura,Ikuro Sato,Hiroki Nishimura,Yukimasa Tamatsu,Satoshi Matsuoka","Predicting Statistics of Asynchronous SGD Parameters for a Large-Scale Distributed Deep Learning System on GPU Supercomputers","2016 IEEE International Conference on Big Data (IEEE BigData 2016)",,,,,,2016,Dec. "Jian Guo,Kun Qian,Huijie Xu,Christoph Janott,Bj¨orn Schuller,Satoshi Matsuoka","GPU-Based Fast Signal Processing for Large Amounts of Snore Sound Data","2016 IEEE 5th Global Conference on Consumer Electronics","2016 IEEE 5th Global Conference on Consumer Electronics","IEEE",,,"pp. 1-2",2016,Dec. "Keisuke Fukuda,Motohiko Matsuda,Naoya Maruyama,Rio Yokota,Kenjiro Taura,Satoshi Matsuoka","Tapas: An Implicitly Parallel ProgrammingFramework For Hierarchical N-body Algorithms","The 22nd IEEE International Conference on Parallel And Distributed Systems","The 22nd IEEE International Conference on Parallel And Distributed Systems",,,,"Page 1100-1109",2016,Dec. "Yusuke Nagasaka,Akira Nukada,Satoshi Matsuoka","Fast Sparse General Matrix-Matrix Multiplication on GPU with Low Memory Usage","The International Conference for High Performance Computing, Networking, Storage and Analysis (SC16)",,,,,,2016,Nov. "Mateusz Bysiek,Aleksandr Drozd,Satoshi Matsuoka","Migrating Legacy Fortran to Python While Retaining Fortran-Level Performance Through Transpilation and Type Hints","6th Workshop on Python for High-Performance and Scientific Computing","Proceedings of the 6th Workshop on Python for High-Performance and Scientific Computing","IEEE Press",,,"pp. 9-18",2016,Nov. "長坂侑亮,額田彰,松岡聡","メモリ使用量を抑えた疎行列疎行列積計算のGPU高速化","第156回ハイパフォーマンスコンピューティング研究発表会","情報処理学会研究報告","情報処理学会","Vol. 2016-HPC-156","No. 15","pp. 1-9",2016,Sept. "大山洋介,野村哲弘,佐藤育郎,西村裕紀,玉津幸政,松岡聡","学習条件を考慮した大規模非同期ディープラーニングシステムの性能モデリング","並列/分散/協調処理に関するサマーワークショップ(SWoPP2016)",,,,,,2016,Aug. "小林 佑矢,實本 英之,野村 哲弘,松岡 聡","仮想マシンエミュレータを用いた特定故障パターン発生時に おけるアプリケーションの誤差の評価","2016年並列/分散/協調処理に関する『松本』サマー・ワークショップ (SWoPP2016)","研究報告ハイパフォーマンスコンピューティング(HPC)",," 2016-HPC-155"," 10"," 1 - 7",2016,Aug. "松岡 聡,天野 英晴,中島 研吾,井上 弘士,工藤 知宏,丸山 直也,田浦 健次朗,岩下 武史,片桐 孝洋,塙敏博,遠藤 敏夫","ポストムーア時代におけるFLOPSからBYTESへの変革","並列/分散/協調処理に関するサマーワークショップ(SWoPP2016)","情報処理学会研究報告, 2016-HPC-155 No.32",,,,,2016,Aug. "Yusuke Nagasaka,Akira Nukada,Satoshi Matsuoka","Adaptive Multi-level Blocking Optimization for Sparse Matrix Vector Multiplication on GPU","International Conference on Computational Science (ICCS 2016)","Procedia Computer Science",,"Volume 80",,"pp. 131-142",2016,June "Yuya Kobayashi,HIDEYUKI JITSUMOTO,Akihiro Nomura,SATOSHI MATSUOKA","Evaluating tolerance of applications against realistic DRAM faults","The 25th International Symposium on High-Performance Parallel and Distributed Computing",,,,,,2016,May "Yosuke Oyama,Akihiro Nomura,Ikuro Sato,Hiroki Nishimura,Yukimasa Tamatsu,SATOSHI MATSUOKA","Training Condition Conscious Performance Modeling of an Asynchronous Data-Parallel Deep Learning System","ACM Symposium on High-Performance Parallel and Distributed Computing",,,,,,2016,May "Satoshi Matsuoka,Hideharu Amano,Kengo Nakajima,Koji Inoue,Tomohiro Kudoh,Naoya Maruyama,Kenjiro Taura,Takeshi Iwashita,Takahiro Katagiri,Toshihiro Hanawa,Toshio Endo","From FLOPS to BYTES: Disruptive Change in High-Performance Computing towards the Post-Moore Era","In proceedings of the ACM International Conference on Computing Frontiers (CF'16)",,,,,,2016,May "Pak Markthub,Akihiro Nomura,Satoshi Matsuoka","Reducing Remote GPU Execution’s Overhead with mrCUDA","GPU Technology Conference (GTC)",,,,,,2016,Apr. "Pak Markthub,Akihiro Nomura,Satoshi Matsuoka","Serving More GPU Jobs in Multi-GPU Batch-Queue Systems using Remote GPU Execution and Migration","情報処理学会 第153回ハイパフォーマンスコンピューティング研究発表会","情報処理学会 研究報告",,"Vol. 2016-HPC-153","No. 27","pp. 1-10",2016,Feb. "Pak Markthub,Akihiro Nomura,SATOSHI MATSUOKA","Serving more GPU jobs, with low penalty, using remote GPU execution and migration",,"Proceedings - 2016 IEEE International Conference on Cluster Computing (CLUSTER2016)",,,,"pp. 485-488",2016, "Kevin Brown,Jens Domke,Satoshi Matsuoka","Hardware-Centric Analysis of Network Performance for MPI Applications","2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS)","2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS)","Institute of Electrical and Electronics Engineers (IEEE)",,,"Page 692-699",2015,Dec. "野村 哲弘,佐々木 淳,三浦 信一,遠藤 敏夫,松岡 聡","TSUBAME2におけるジョブスケジューリング効率化への取り組みと検証","大学ICT推進協議会 2015年度年次大会 企画セッション HPCテクノロジー","大学ICT推進協議会 2015年度年次大会 企画セッション HPCテクノロジー",,,,,2015,Dec. "Toshio Endo,Yuki Takasaki,Satoshi Matsuoka","Realizing Extremely Large-Scale Stencil Applications on GPU Supercomputers",,"In Proceedings of The 21st IEEE International Conference on Parallel and Distributed Systems (ICPADS 2015)",,,,,2015,Dec. "Toshio Endo,Akira Nukada,Satoshi Matsuoka","Power Capping Scheduling on TSUBAME2.5 and Upgrade of TSUBAME-KFC.","Building Energy Efficient HPC Working Group Workshop, held with SC15",,,,,,2015,Nov. "Pak Markthub,Akihiro Nomura,Satoshi Matsuoka","mrCUDA: Low-Overhead Middleware for Transparently Migrating CUDA Execution from Remote to Local GPUs","International Conference for High Performance Computing, Networking, Storage and Analysis (SC15)",,,,,,2015,Nov. "Yusuke Nagasaka,Akira Nukada,Satoshi Matsuoka","Multi-level Blocking Optimization for Fast Sparse Matrix Vector Multiplication on GPUs","The International Conference for High Performance Computing, Networking, Storage and Analysis (SC15)",,,,,,2015,Nov. "長坂侑亮,額田彰,松岡聡","疎行列ヘ?クトル積計算を対象とした GPU 向けメ モリアクセス削減手法","第151回ハイパフォーマンスコンピューティング研究発表会","情報処理学会研究報告","情報処理学会","Vol. 2015-HPC-151","No. 8","pp. 1-7",2015,Sept. "Pak Markthub,Akihiro Nomura,Satoshi Matsuoka","mrCUDA: Low-Overhead Middleware for Live Migrating Remote GPU Execution to Local GPU Execution","GTC Japan Poster Session",,,,,,2015,Sept. "寺西 賢人,野村 哲弘,遠藤 敏夫,松岡 聡","ノード内同時実行ジョブにおけるパフォーマンスカウンタによるプロセス毎消費電力のモデル化","2015年並列/分散/協調処理に関する『別府』サマー・ワークショップ (SWoPP2015)","情報処理学会 研究報告",,"Vol. 2015-HPC-150","No. 28","pp. 1-6",2015,July "Pak Markthub,Akihiro Nomura,Satoshi Matsuoka","mrCUDA: A middleware for migrating rCUDA virtual GPUs to native GPUs","2015年並列/分散/協調処理に関する『別府』サマー・ワークショップ (SWoPP2015)","情報処理学会 研究報告",,"Vol. 2015-HPC-150","No. 6","pp. 1-9",2015,July "野村 哲弘,佐々木 淳,三浦 信一,遠藤 敏夫,松岡 聡","TSUBAME2におけるスケジュール効率化への取り組みとユーザ動向の見える化","2015年並列/分散/協調処理に関する『別府』サマー・ワークショップ (SWoPP2015)","情報処理学会 研究報告",,"Vol. 2015-HPC-150","No. 2","pp. 1-7",2015,July "Toshio Endo,Satoshi Matsuoka.","Realizing Extremely Large-Scale Stencil Applications on GPU Supercomputers with a Memory Hierarchy Management Runtime Library. Workshop on Programming Abstractions for Data Locality (PADAL 2015), Berkeley",,,,,,,2015,June "Naoto Sasaki,Kento Sato,Toshio Endo,Satoshi Matsuoka.","Exploration of Lossy Compression for Application-level Checkpoint/Restart","IEEE International Conference on Parallel and Distributed Processing Symposium 2015 (IPDPS2015)",,,,,,2015,May "野村 哲弘,三浦 信一,遠藤 敏夫,松岡 聡","アプリケーションのEmpiricalな性能モデル構築のためのプロファイル情報の収集","2015年ハイパフォーマンスコンピューティングと計算科学シンポジウム",,,,,,2015,May "高嵜 祐樹,遠藤 敏夫,松岡 聡","GPUクラスタにおける大規模都市気流シミュレーションの最適化と性能モデル","情報処理学会ハイパフォーマンスコンピューティングと計算科学シンポジウム (HPCS2015)",,,,,,2015,May "ABDELHALIM AMER,HUIWEI LU,PAVAN BALAJI,SATOSHI MATSUOKA","MPI+ Threads: Runtime Contention and Remedies","ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming","Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming","ACM",,,"p. 239-248",2015,Feb. "Zhang Chaojie,Koichi Shirahata,Shuji Suzuki,Yutaka Akiyama,Satoshi Matsuoka","Performance Analysis of MapReduce Implementations for High Performance Homology Search","HPCS2015",,,,,,2015, "Yusuke Nagasaka,Akira Nukada,SATOSHI MATSUOKA","Cache-aware Sparse Matrix Formats for Kepler GPU","International Conference on Parallel and Distributed Systems (ICPADS2014)","2014 20th IEEE International Conference on Parallel and Distributed Systems ICPADS 2014",,,,"pp. 281-288",2014,Dec. "Pak Markthub,Akihiro Nomura,Satoshi Matsuoka","Using rCUDA to Reduce GPU Resource-assignment Fragmentation caused by Job Scheduler","The 15th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT2014)","Proceedings of the 15th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT2014)",,,,,2014,Dec. "Toshio Endo,Akira Nukada,Satoshi Matsuoka","TSUBAME-KFC: a Modern Liquid Submersion Cooling Prototype towards Exascale Becoming the Greenest Supercomputer in the World","The 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2014)","Proc. of The 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2014)",,,,"pp. 360-367",2014,Dec. "Hideyuki Shamoto,Koichi Shirahata,Aleksandr Drozd,Hitoshi Sato,Satoshi Matsuoka","Large-scale Distributed Sorting for GPU-based Heterogeneous Supercomputers","IEEE BigData 2014",,,,,,2014,Oct. "Keita Iwabuchi,Hitoshi Sato,Yuichiro Yasui,Katsuki Fujisawa,Satoshi Matsuoka","NVM-based Hybrid BFS with Memory Efficient Data Structure","2014 IEEE International Conference on BigData (IEEE BigData 2014)",,,,,,2014,Oct. "Chih-Song Kuo,Aamer Shah,Akihiro Nomura,Satoshi Matsuoka,Felix Wolf","How File Access Patterns Influence Interference Among Cluster Applications","IEEE International Conference on Cluster Computing (CLUSTER2014)",,,,,,2014,Sept. "Kevin Brown,Jens Domke,Satoshi Matsuoka","Tracing Data Movements within MPI Collectives","EuroMPI/ASIA '14 Proceedings of the 21st European MPI Users' Group Meeting","EuroMPI/ASIA '14 Proceedings of the 21st European MPI Users' Group Meeting","Association for Computing Machinery (ACM)",,,"pp. 117-118",2014,Sept. "Gul Agha,Atsushi Igarashi,Naoki Kobayashi,Hidehiko Masuhara,Etsuya Shibayama,Kenjiro Taura,Satoshi Matsuoka,Akinori Yonezawa","Concurrent Objects and Beyond",,"Lecture notes in computer science, LNCS","Springer-Verlag","Vol. 8665",,,2014,Sept. "Koichi Shirahata,Hitoshi Sato,SATOSHI MATSUOKA","Out-of-core GPU Memory Management for MapReduce-based Large-scale Graph Processing","IEEE Cluster 2014",,,,,,2014,Sept. "遠藤敏夫,額田彰,松岡聡","超省エネスーパーコンピュータTSUBAME",,"PETROTECH","石油学会","Vol. 37","No. 8","pp. 605-609",2014,Aug. "Hideyuki Shamoto,Koichi Shirahata,Aleksandr Drozd,Hitoshi Sato,Satoshi Matsuoka","GPU Implementation of Splitter-based Parallel Sorting for Large-scale Heterogeneous Architectures","GTC Japan 2014",,,,,,2014,July "野村哲弘,三浦信一,遠藤敏夫,松岡聡","実アプリケーションを用いた計算機評価ベンチマークと性能リポジトリの開発","2014年並列/分散/協調処理に関する『新潟』サマー・ワークショップ (SWoPP2015)","情報処理学会研究報告",,"Vol. 2014-HPC-145","No. 29","pp. 1-7",2014,July "Yusuke Nagasaka,Akira Nukada,SATOSHI MATSUOKA","Cache-aware Sparse Matrix Format for GPU","International Superconputing Conference (ISC'14) HPC in Asia Posters",,,,,,2014,June "遠藤敏夫,額田彰,松岡聡","TSUBAME-KFC: 液浸冷却を用いた世界一省エネなスーパーコンピュータ",,"TSUBAME e-Science Journal","東京工業大学 学術国際情報センター",,"No. 11","pp. 2-7",2014,June "Akihiro Nomura,Shinichi Miura,Toshio Endo,SATOSHI MATSUOKA","Application Performance Characterization towards Exa-Scale Supercomputers","HPC in Asia 2014",,,,,,2014,June "Kento Sato,Adam Moody,Kathryn Mohror,Todd Gamblin,Bronis R. de Supinski,Naoya Maruyama,Satoshi Matsuoka","FMI: Fault Tolerant Messaging Interface for Fast and Transparent Recovery","The IEEE International Conference on Parallel and Distributed Processing Symposium 2014 (IPDPS2014)",,,,,,2014,May "Kento Sato,Kathryn Mohror,Adam Moody,Todd Gamblin,Bronis R. de Supinski,Naoya Maruyama,Satoshi Matsuoka","A User-level InfiniBand-based File System and Checkpoint Strategy for Burst Buffers","The 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid2014)",,,,,,2014,May "Katsuki Fujisawa,Toshio Endo,Yuichiro Yasui,Hitoshi Sato,Naoki Matsuzawa,Satoshi Matsuoka,Hayato Waki","Peta-scale General Solver for Semidefinite Programming Problems with over Two Million Constraints","IEEE International Conference on Parallel and Distributed Processing Symposium 2014 (IPDPS2014)","Proc. of IEEE International Conference on Parallel and Distributed Processing Symposium 2014 (IPDPS2014)",,,,"pp. 1171-1180",2014,May "長坂侑亮,額田彰,松岡聡","GPU のキャッシュを考慮した疎行列ヘ?クトル積計算手法の性能評価","第144回ハイパフォーマンスコンピューティング研究発表会","情報処理学会研究報告","情報処理学会","Vol. 2014-HPC-144","No. 5","pp. 1-9",2014,May "Koichi Shirahata,Hitoshi Sato,SATOSHI MATSUOKA","Preliminary I/O Performance Evaluation on GPU Accelerator and External Memory","GPU Technology Conference 2014",,,,,,2014,Mar. "Kento Sato,Akira Nukada,Naoya Maruyama,Satoshi Matsuoka","I/O acceleration with GPU for I/O-bound Applications","GPU Technology Conference 2014",,,,,,2014,Mar. "Satoshi Matsuoka,Hitoshi Sato,Osamu Tatebe,Michihiro Koibuchi,Ikki Fujiwara,Shuji Suzuki,Masanori Kakuta,Takashi Ishida,Yutaka Akiyama,Toyotaro Suzumura,Koji Ueno,Hiroki Kanezashi,Takemasa Miyoshi.","Extreme Big Data (EBD): Next Generation Big Data Infrastructure Technologies Towards Yottabyte/Year",,"Supercomputing frontiers and innovations",,"Vol. 1","No. 2","pp. 89-107",2014, "Chih-Song Kuo,Akihiro Nomura,Satoshi Matsuoka,Aamer Shah,Felix Wolf,Ilya Zhukov","Environment Matters: How Competition for I/O among Applications Degrades their Performance","第199回計算機アーキテクチャ・第142回ハイパフォーマンスコンピューティング合同研究発表会(HOKKE-21)","IPSJ SIG Technical Reports","情報処理学会","Vol. 2013-HPC-142","No. 11","pp. 1-7",2013,Dec. "松岡 聡,佐藤 賢斗,遠藤敏夫","エクサスケールスパコンに向けた耐故障性の評価 --- TSUBAME2.0を例にして ---","情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 2013-HPC-141(22)",,,,,,2013,Oct. "Kento Sato,Satoshi Matsuoka,Adam Moody,Kathryn Mohror,Todd Gamblin,Bronis R. de Supinski,Naoya Maruyama","Burst SSD Buffer: Checkpoint Strategy at Extreme Scale","IPSJ SIG Technical Reports 2013-HPC-141",,,,,,2013,Oct. "Guanghao Jin,Toshio Endo,Satoshi Matsuoka","A Parallel Optimization Method for Stencil Computation on the Domain that is Bigger than Memory Capacity of GPUs","IEEE Cluster Computing (CLUSTER2013)","Proc. of IEEE Cluster Computing (CLUSTER2013)",,,,"pp. 1-8",2013,Sept. "白幡 晃一,佐藤 仁,松岡 聡","GPUアクセラレータと不揮発性メモリを考慮したI/O性能の予備評価","第141回ハイパフォーマンスコンピューティング研究発表会","情報処理学会研究報告2013-HPC-141",,,"No. 1","pp. 1-9",2013,Sept. "白幡晃一,佐藤仁,鈴村豊太郎,松岡聡","MapReduce型グラフ処理アルゴリズムの複数GPUによる大規模計算","GTC Japan 2013",,,,,,2013,July "野村哲弘,三浦信一,遠藤敏夫,松岡聡,鈴木惣一朗,丸山直也","システム評価のためのアプリケーション性能リポジトリの構築と性能モデルの評価","2013年並列/分散/協調処理に関する 『北九州』サマー・ワークショップ(SWoPP北九州2013)","情報処理学会 研究報告","情報処理学会","Vol. 2013-HPC-140","No. 4","pp. 1-6",2013,July "Takafumi Saito,Kento Sato,Hitoshi Sato,SATOSHI MATSUOKA","Energy-aware I/O Optimization for Checkpoint and Restart on a NAND Flash Memory System","The Workshop on Fault-Tolerance for HPC at Extreme Scale 2013 (FTXS2013) in conjunction with the International Symposium on High Performance Parallel and Distributed Computing (HPDC13)",,,,,,2013,June "Abdelhalim Amer,Naoya Maruyama,Miquel Pericas,Kenjiro Taura,Rio Yokota,Satoshi Matsuoka","Fork-Join and Data-Driven Execution Models on Multi-core Architectures: Case Study of the FMM","International Supercomputing Conference","Lecture notes in computer science, LNCS",,"Vol. 7905",,"pp. 255-266",2013,June "Koichi Shirahata,Hitoshi Sato,Toyotaro Suzumura,SATOSHI MATSUOKA","A Scalable Implementation of a MapReduce-based Graph Processing Algorithm for Large-scale Heterogeneous Supercomputers","13th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2013)",,,,,,2013,May "Tetsuya Hoshino,Naoya Maruyama,SATOSHI MATSUOKA,Ryoji Takaki","CUDA vs OpenACC: Performance Case Studies with Kernel Benchmarks and a Memory Bound CFD Application","13th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing","Proceedings of the 13th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing","IEEE Computer Science Press",,,,2013,May "Guanghao Jin,Toshio Endo,Satoshi Matsuoka","A Multi-level Optimization Method for Stencil Computation on the Domain that is Bigger than Memory Capacity of GPU","The Third International Workshop on Accelerators and Hybrid Exascale Systems (AsHES)","Proc. of The Third International Workshop on Accelerators and Hybrid Exascale Systems (AsHES)",,,,"pp. 1080-1087",2013,May "Tetsuya Hoshino,Naoya Maruyama,SATOSHI MATSUOKA","CUDA vs OpenACC: Performance Case Studies","GPU Technology Conference",,,,,,2013,Mar. "Koichi Shirahata,Hitoshi Sato,Toyotaro Suzumura,SATOSHI MATSUOKA","A Scalable Implementation of a MapReduce-based Graph Algorithm for Large-scale Heterogeneous Supercomputers","GPU Technology Conference 2013",,,,,,2013,Mar. "Tetsuya Hoshino,Naoya Maruyama,SATOSHI MATSUOKA","Porting and Optimizing a Large-Scale CFD application with CUDA and OpenACC","Society for Industrial and Applied Mathematics Conference on Computational Science and Engineering",,,,,,2013,Feb. "佐藤仁,白幡晃一,松岡聡","大規模へテロスーパーコンピュータ向けデータ並列処理フレームワークの設計と実装","第138回ハイパフォーマンスコンピューティング研究発表会,","情報処理学会研究報告2013-HPC-138",,,"No. 24","pp. 1-7",2013,Feb. "Aleksandr Drozd,Naoya Maruyama,Satoshi Matsuoka","A Multi GPU Read Alignment Algorithm with Model-based Performance Optimization","Vecpar 2012","Springer's Lecture Notes in Computer Science N7851 (2012)",,"vol. 7851",,"pp. 270-277",2013,Jan. "河村知輝,Naoya Maruyama,松岡聡","ステンシル計算における通信の自動最適化に向けた性能モデルの評価","ハイパフォーマンスコンピューティングと計算科学シンポジウム","情報処理学会研究報告","情報処理学会",,,"pp. 1",2013,Jan. "星野哲也,丸山直也,松岡聡","ディレクティブベースプログラミング言語OpenACCの性能評価","ハイパフォーマンスコンピューティングと計算科学シンポジウム","2013年ハイパフォーマンスコンピューティングと計算科学シンポジウム論文集",,,,,2013,Jan. "Keisuke Fukuda,Naoya Maruyama,Miquel Pericas,Satoshi Matsuoka","Fast Multipole Method on a Dynamic Scheduling Engine on Heterogeneous Environments","GPU Technology Conference",,,,,,2013, "Abdelhalim Amer,Naoya Maruyama,Miquel Pericas,Kenjiro Taura,Rio Yokota,SATOSHI MATSUOKA","Fork-Join and Data-Driven Execution Models on Multi-core Architectures: Case Study of the FMM","International Supercomputing Conference","Supercomputing","Springer Berlin Heidelberg","Volume 7905",,,2013, "Shinichiro Takizawa,SATOSHI MATSUOKA,Masanaru Munetomo,Taizo Kobayashi,HIDEYUKI JITSUMOTO","A Virtual Machine Hosting System on e-Scinece Cyberinfrastructure","The 1st International Workshop on Cloud Computing and Applications (IWCCA 2012)",,,,,,2012,Dec. "金 光浩,遠藤 敏夫,松岡 聡","GPUメモリ容量を超える問題規模に対応する高性能ステンシル計算法","ハイパフォーマンスコンピューティングとアーキテクチャの評価に 関する北海道ワークショップ(HOKKE-20)","情報処理学会研究報告","情報処理学会","Vol. 2012-ARC-194/HPC-137",,,2012,Dec. "野村 哲弘,遠藤 敏夫,松岡 聡","TSUBAME2.0におけるMulti-rail InfiniBandネットワークの性能評価","ハイパフォーマンスコンピューティングとアーキテクチャの評価に 関する北海道ワークショップ(HOKKE-20)","情報処理学会研究報告","情報処理学会","Vol. 2012-ARC-194/HPC-137",,,2012,Dec. "Katsuki Fujisawa,Toshio Endo,Hitoshi Sato,Makoto Yamashita,Satoshi Matsuoka,Maho Nakata","High-Performance General Solver for Extremely Large-scale Semidefinite Programming Problems","International Conference for High Performance Computing, Networking, Storage and Analysis (SC12)","Proceedings of IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC12)","IEEE/ACM",,,,2012,Nov. "Kento Sato,Adam Moody,Kathryn Mohror,Todd Gamblin,Bronis R. de Supinski,Naoya Maruyama,Satoshi Matsuoka","Design and Modeling of a Non-Blocking Checkpoint System","the International Conference on High Performance Computing, Networking, Storage and Analysis (SC'12)",,,,,,2012,Nov. "Akira Nukada,Kento Sato,Satoshi Matsuoka","Scalable Multi-GPU 3-D FFT for TSUBAME 2.0 Supercomputer","2012 ACM/IEEE International Conference for High Performance, Networking, Storage, and Analysis (SC’12)","Proceedings of 2012 ACM/IEEE International Conference for High Performance, Networking, Storage, and Analysis (SC’12)","IEEE Computer Society",,,"pp. 44:1--44:10",2012,Nov. "福田圭祐,丸山直也,Miquel Pericas,松岡聡","動的タスクスケジューリングエンジンStarPUによるKIFMMの実装と性能評価","第136回ハイパフォーマンスコンピューティング研究発表会",,,,,,2012,Oct. "Leonardo Bautista Gomez,Thomas Ropars,Franck Cappello,Naoya Maruyama,SATOSHI MATSUOKA","Hierarchical Clustering Strategies for Fault Tolerance in Large Scale HPC Systems","IEEE International Conference on Cluster Computing",,,,,,2012,Sept. "Koichi Shirahata,Hitoshi Sato,Toyotaro Suzumura,Satoshi Matsuoka","A GPU Implementation of Generalized Graph Processing Algorithm GIM-V","The 3rd International Workshop on Parallel Algorithm and Parallel Software (IWPAPS 2012)","2012 IEEE Cluster Workshops (ClusterW 2012)",,,,,2012,Sept. "Kento Sato,Adam Moody,Kathryn Mohror,Todd Gamblin,Bronis R. De Supinski,Naoya Maruyama,Satoshi Matsuoka","Design and Modeling of an Asynchronous Checkpointing System","IPSJ SIG Technical Reports 2012-HPC-135 (SWoPP 2012)",,,,,,2012,Aug. "河村知輝,丸山直也,松岡聡","並列ステンシル計算における通信の自動最適化に向けた性能モデルの評価","並列/分散/協調処理に関するサマー・ワークショップ","情報処理学会研究報告","情報処理学会","Vol. 2012-HPC-135","No. 32","pp. 1--8",2012,Aug. "Leonardo Bautista Gomez,Bogdan Nicolae,Naoya Maruyama,Franck Cappello,SATOSHI MATSUOKA","Scalable Reed-Solomon-based Reliable Local Storage for HPC Applications in IaaS Clouds","International European Conference on Parallel and Distributed Computing",,,,,,2012,Aug. "Abdelhalim Amer,Ahmed Toufik,Walid-Khaled Hidouci,SATOSHI MATSUOKA","Using Bittorrent and SVC for efficient video sharing and streaming","IEEE Symposium on Computers and Communications (ISCC)","IEEE Symposium on Computers and Communications (ISCC)","IEEE",,," 000537 - 000543",2012,July "星野哲也,丸山直也,松岡聡","大規模流体アプリケーションの CUDA・OpenACC への移植性の評価","2012年並列/分散/協調処理に関する『鳥取』サマー・ワークショップ(SWoPP鳥取2012)","情報処理学会研究報告","情報処理学会","Vol. 2012-HPC-135","No. 42","pp. 1-9",2012,July "Akihiro Nomura,Yutaka Ishikawa,Naoya Maruyama,Satoshi Matsuoka","Implementation of Efficient Non-blocking Collective Communication Framework","HPC in Asia Workshop",,,,,,2012,June "Kento Sato,Adam Moody,Kathryn Mohror,Todd Gamblin,Bronis R. de Supinski,Naoya Maruyama,SATOSHI MATSUOKA","Towards a Light-weight Non-blocking Checkpointing System","HPC in Asia Workshop in conjunction with the 2012 International Supercomputing Conference (ISC’12)",,,,,,2012,June "Akihiro Nomura,Yutaka Ishikawa,Naoya Maruyama,Satoshi Matsuoka","Design and Implementation of Portable and Efficient Non-blocking collective Communication","The 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2012)",,,,,,2012,May "星野哲也,丸山直也,松岡聡","大規模流体アプリケーションのGPUによる高速化手法の評価","先進的計算基盤システムシンポジウム","先進的計算基盤システムシンポジウム",,,,,2012,May "河村知輝,丸山直也,松岡聡","Physis フレームワークにおける性能モデルに基づく通信の自動最適化に向けて","先進的計算基盤システムシンポジウム","先進的計算基盤システムシンポジウム","情報処理学会",,,"pp. 1--2",2012,May "Kento Sato,Adam Moody,Kathryn Mohror,Todd Gamblin,Bronis R. de Supinski,Naoya Maruyama,SATOSHI MATSUOKA","Design and Modeling of a Non-Blocking Checkpoint System","ATIP - A*CRC Workshop on Accelerator Technologies in High Performance Computing",,,,,,2012,May "白幡晃一,佐藤仁,鈴村豊太郎,松岡聡","汎用グラフ処理モデルGIM-Vの複数GPUによる大規模計算とデータ転送の最適化","第133回 ハイパフォーマンスコンピューティング研究発表会","情報処理学会研究報告2012-HPC-133","情報処理学会",,,,2012,Mar. "Akira Nukada,Yutaka Maruyama,Satoshi Matsuoka","High Performance 3-D FFT using multiple CUDA GPUs","Fifth Workshop on General Purpose Processing on Graphics Processing Units","Proceedings of Fifth Workshop on General Purpose Processing on Graphics Processing Units (GPGPU-5)","ACM Press",,,,2012,Mar. "Aleksandr Drozd,Naoya Maruyama,Satoshi Matsuoka","Fast GPU Read Alignment with Burrows Wheeler Transform Based Index","Supercomputing 2011","Proceedings of the 2011 ACM/IEEE conference on Supercomputing (SC'11)",,,,,2011,Nov. "Kento Sato,Adam Moody,Kathryn Mohror,Todd Gamblin,Bronis R. de Supinski,Naoya Maruyama,Satoshi Matsuoka","Towards an Asynchronous Checkpointing System","IPSJ SIG Technical Reports 2011-ARC-197 2011-HPC-132 (HOKKE-19)",,,,,,2011,Nov. "福田圭祐,丸山直也,松岡聡","動的タスクスケジューリングによるCPU/GPUヘテロジニアス環境でのFMMの最適化","HOKKE 2011","情報処理学会 研究報告",,"Vol. 2011-HPC-132","No. 28",,2011,Nov. "Naoya Maruyama,Tatsuo Nomura,Kento Sato,Satoshi Matsuoka","Physis: An Implicitly Parallel Programming Model forStencil Computations on Large-Scale GPU-AcceleratedSupercomputers","Supercomputing 2011","Proceedings of the 2010 ACM/IEEE conference on Supercomputing (SC'11)",,,,,2011,Nov. "Massimo Bernaschi,Mauro Bisson,Toshio Endo,Massimiliano Fatica,Satoshi Matsuoka,Simone Melchionna,Sauro Succi","Petaflop Biofluidics Simulations On A Two Million-Core System","International Conference for High Performance Computing, Networking, Storage and Analysis (SC11)","Proceedings of IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC11)","IEEE/ACM",,,,2011,Nov. "Takashi Shimokawabe,Takayuki Aoki,Tomohiro Takaki,Akinori Yamanaka,Akira Nukada,Toshio Endo,Naoya Maruyama,Satoshi Matsuoka","Peta-scale Phase-Field Simulation for Dendritic Solidification on the TSUBAME 2.0 Supercomputer","International Conference for High Performance Computing, Networking, Storage and Analysis (SC11)","Proceedings of IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC11)","IEEE/ACM",,,,2011,Nov. "遠藤 敏夫,額田 彰,松岡 聡,長坂 真路,四津 匡康","グリーンスパコンTSUBAME2.0における電力危機対応運用","ハイパフォーマンスコンピューティングとアーキテクチャの評価に関する北海道ワークショップ(HOKKE-19)","情報処理学会研究報告","情報処理学会","Vol. 2011-ARC-197/HPC-132",,"pp. 1-9",2011,Nov. "Leonardo Bautista,Naoya Maruyama,Dimitri Komatitsch,Tsuboi Seiji,Franck Cappello,SATOSHI MATSUOKA,Nakamura Takeshi","FTI: High performance Fault Tolerance Interface for hybrid systems","International Conference for High Performance Computing, Networking, Storage and Analysis (SC)",,,,,"Page 1-12",2011,Nov. "遠藤 敏夫,額田 彰,松岡 聡","スーパーコンピュータTSUBAME 2.0 における Linpack 性能1 ペタフロップス超の達成",,"情報処理学会論文誌コンピューティングシステム","情報処理学会","Vol. 4","No. 4 (ACS 35)","pp. 169--179",2011,Oct. "滝澤真一朗,松岡聡,友石正彦,佐藤仁,東田 学","Point-of-Presence連携によるe-サイエンス分散環境","インターネットカンファレンス2011",,,,,,2011,Oct. "白幡晃一,佐藤仁,鈴村豊太郎,松岡聡","GPGPUを用いた高速大規模グラフ処理に向けて","第130回 ハイパフォーマンスコンピューティング研究発表会 2011年並列/分散/協調処理に関する 『鹿児島』サマー・ワークショップ(SWoPP鹿児島2011)","情報処理学会研究報告2011-HPC-130",,,"No. 14","pp. 1--8",2011,Aug. "Irina Demeshko,Satoshi Matsuoka,Toshio Endo","GPU-based approach for elastic-plastic deformation simulation","Summer United Workshops on Parallel, Distributed and Cooperative Processing (SWoPP 2011)","IPSJ SIG Technical Report","IPSJ","Vol. 2011-HPC-130","No. 12","pp. 1-7",2011,Aug. "白幡晃一,佐藤仁,松岡聡","GPUを考慮したMapReduceのアクセラレーション","GTC Workshop Japan 2011",,,,,"pp. 119--120",2011,July "斎藤貴文,佐藤仁,松岡聡","大規模並列ファイルシステムに対する ワークフローアプリケーションのI/O性能解析","並列/分散/協調処理に関するサマー・ワークショップ",,,,,"pp. 1-8",2011,July "福田 圭祐,丸山 直也,松岡 聡","CPU/GPUヘテロジニアス環境におけるFMMの最適化","GTC Workshop Japan 2011","GTC Workshop Japan 2011",,,,,2011,July "Aleksandr Drozd,Satoshi Matsuoka,Naoya Maruyama","Fast GPU Read Alignmenntwith Burrows Wheeler Transform Based Index","SWoPP 2011","IPSJ SIG Technical Report",,"Vol. 2011-HPC-130",,,2011,July "滝澤真一朗,棟朝 雅晴,宇野 篤也,小林 泰三,實本英之,松岡聡,石川 裕","広域分散環境を提供するHPCI先端ソフトウェア運用基盤の設計","第130回 ハイパフォーマンスコンピューティング研究発表会 2011年並列/分散/協調処理に関する 『鹿児島』サマー・ワークショップ(SWoPP鹿児島2011)",,,,,,2011,July "Akira Nukada,Hiroyuki Takizawa,Satoshi Matsuoka","NVCR: A Transparent Checkpoint-Restart Library for NVIDIA CUDA","The 20th International Heterogeneity in Computing Workshop (HCW 2011), in conjunction with IEEE IPDPS 2011","Proceedings of the 20th International Heterogeneity in Computing Workshop (HCW 2011), in conjunction with IEEE IPDPS 2011","The IEEE Press",,,"page 1--10",2011,May "斎藤貴文,千葉立寛,佐藤仁,松岡聡","ワークフローアプリケーションに対する計算資源割り当ての最適化","先進的計算基盤システムシンポジウム",,,,,"pp. 1-2",2011,May "斎藤貴文,千葉立寛,佐藤仁,松岡聡","ワークフローアプリケーションに対する計算資源割り当ての最適化","情報処理学会 ハイパフォーマンスコンピューテング研究会",,,,,"pp. 1-7",2011,May "白幡晃一,鈴村豊太郎,佐藤仁,松岡聡","ストリーミング型クラスタリングアルゴリズムの性能評価","先進的計算基盤システムシンポジウム SACSIS2011",,,,,,2011,May "福田 圭祐,丸山 直也,松岡 聡","CPU/GPUヘテロジニアス環境におけるFMMの最適化","SACSIS2011 - 先進的計算基盤システムシンポジウム","先進的計算基盤システムシンポジウム論文集",,,,,2011,May "遠藤 敏夫,額田 彰,松岡 聡","スーパーコンピュータTSUBAME 2.0 における Linpack 性能1 ペタフロップス超の達成","先進的計算基盤システムシンポジウム(SACSIS2011)","情報処理学会SACSIS2011論文集","情報処理学会",,,"pp. 1-8",2011,May "佐藤 仁,松岡聡","TSUBAME2.0上でのHadoopの性能評価","情報処理学会研究報告 2011-HPC-129","情報処理学会研究報告 2011-HPC-129",,,,,2011,Mar. "Tatsuo Nomura,Naoya Maruyama,Toshio Endo,Satoshi Matsuoka","A Sequential Programming Framework for Large-Scale GPU-Accelerated Structured Grids","SIAM Conference on Computational Science and Enginnering",,,,,,2011,Mar. "Kento Sato,Hitoshi Sato,Satoshi Matsuoka","Orchestrated Data Processing Acceleration for Data-Intensive Applications by using VM-based Migration","The 1st Data Intensive Science Workshop",,,,,,2011,Mar. "Koichi Shirahata,Hitoshi Sato,SATOSHI MATSUOKA","MapReduce Acceleration for GPU-based Heterogeneous Clusters","The 1st Data Intensive Science Workshop",,,,,,2011,Mar. "松岡聡, 遠藤敏夫, 丸山直也, 佐藤仁, 滝澤真一朗","TSUBAME 2.0始まる TSUBAME1.0から2.0への長い道のり (後編)",,"TSUBAME e-Science Journal","東京工業大学学術国際情報センター","Vol. 3",,,2011,Feb. "野村 達雄,丸山 直也,遠藤 敏夫,松岡 聡","ステンシル計算を対象とした大規模GPUクラスタ向け自動並列化フレームワーク","ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2011)",,"情報処理学会",,,,2011,Jan. "野村 達雄,丸山 直也,遠藤 敏夫,松岡 聡","ステンシル計算を対象とした大規模GPUクラスタ向け自動並列化フレームワーク","ハイパフォーマンスコンピューティングとアーキテクチャの評価に関する北海道ワークショップ(HOKKE-18)","情報処理学会研究報告","情報処理学会","Vol. 2010-ARC-192/HPC-128",,"pp. 1-9",2010,Dec. "Leonardo Bautista,Akira Nukada,Naoya Maruyama,Franck Cappello,SATOSHI MATSUOKA","Low-overhead diskless checkpoint for hybrid computing systems",,"International Conference on High Performance Computing (HiPC 2010)",,,,,2010,Dec. "遠藤 敏夫,額田 彰,松岡 聡","ヘテロ型スーパーコンピュータTSUBAME 2.0のLinpackによる性能評価","ハイパフォーマンスコンピューティングとアーキテクチャの評価に関する北海道ワークショップ(HOKKE-18)","情報処理学会研究報告","情報処理学会","Vol. 2010-ARC-192/HPC-128",,"pp. 1-6",2010,Dec. "長坂 仁,丸山 直也,額田 彰,遠藤 敏夫,松岡 聡","GPUにおけるモデルに基づいた電力効率の最適化","ハイパフォーマンスコンピューティングとアーキテクチャの評価に関する北海道ワークショップ(HOKKE-18)","情報処理学会研究報告","情報処理学会","Vol. 2010-ARC-192/HPC-128",,"pp. 1-6",2010,Dec. "Leonardo Bautista,Akira Nukada,Naoya Maruyama,Franck Cappello,SATOSHI MATSUOKA","Low-overhead checkpoint for large-scale GPU-accelerated systems",,"ARC192HPC128-22",,,,,2010,Dec. "島田 大地,遠藤 敏夫,丸山 直也,松岡 聡","OpenCLを用いた異種GPUにおける性能特性に応じた最適化","ハイパフォーマンスコンピューティングとアーキテクチャの評価に関する北海道ワークショップ(HOKKE-18)","情報処理学会研究報告","情報処理学会","Vol. 2010-ARC-192/HPC-128",,"pp. 1-7",2010,Dec. "Takashi Shimokawabe,Takayuki Aoki,Chiashi Muroi,Junichi Ishida,Kohei Kawano,Toshio Endo,Akira Nukada,Naoya Maruyama,Satoshi Matsuoka","An 80-Fold Speedup, 15.0 TFlops, Full GPU Acceleration of Non-Hydrostatic Weather Model ASUCA Production Code","International Conference for High Performance Computing, Networking, Storage and Analysis (SC10)","Proceedings of IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC10)","IEEE/ACM",,,,2010,Nov. "松岡聡, 遠藤敏夫, 丸山直也, 佐藤仁, 滝澤真一朗","TSUBAME 2.0始まる TSUBAME1.0から2.0への長い道のり (前編)",,"TSUBAME e-Science Journal","東京工業大学学術国際情報センター","Vol. 2",,,2010,Nov. "Koichi Shirahata,Hitoshi Sato,SATOSHI MATSUOKA","Hybrid Map Task Scheduling for GPU-based Heterogeneous Clusters","The 1st International Workshop on Theory and Practice of MapReduce (MAPRED'2010)","2nd IEEE International Conference on Cloud Computing Technology and Science",,,,,2010,Nov. "Akira Nukada,Satoshi Matsuoka","NukadaFFT: An Auto-Tuning FFT Library for CUDA GPUs","GPU Technology Conference 2010","GPU Technology Conference 2010",,,,,2010,Sept. "Nguyen Toan,Tatsuo Nomura,Hideyuki Jitsumoto,Naoya Maruyama,Toshio Endo,Satoshi Matsuoka","MPI-CUDA Application Checkpointing","GPU Technology Conference 2010","GPU Technology Conference 2010",,,,,2010,Sept. "滝澤真一朗,松岡聡,佐藤仁,東田学,友石正彦","PoP(Point of Presence)によるe-サイエンスリソース連携",,"広帯域ネットワーク利用に関するワークショップ (ADVNET2010)予稿集",,,,,2010,Sept. "松岡聡,遠藤敏夫,丸山直也,佐藤仁,滝澤真一朗","TSUBAME2.0の全貌",,"TSUBAME e-Science Journal","東京工業大学学術国際情報センター",,"No. 1",,2010,Sept. "Hitoshi Nagasaka,Naoya Maruyama,Akira Nukada,Toshio Endo,SATOSHI MATSUOKA","Statistical Power Modeling of GPU Kernels Using Performance Counters","International Green Computing Conference (IGCC'10)","Proceedings of IEEE International Green Computing Conference (IGCC'10),","IEEE",,,"pp. 115-122",2010,Aug. "野村 達雄,丸山 直也,遠藤 敏夫,松岡 聡","GPUクラスタを対象にした並列ステンシル計算の自動コード生成フレームワーク","並列/分散/協調処理に関するサマーワークショップ(SWoPP2010)","情報処理学会研究報告","情報処理学会","Vol. 2010-HPC-126",,"pp. 1-10",2010,Aug. "滝澤真一朗,松岡聡,佐藤仁,東田学,友石正彦,實本英之","e-サイエンス基盤としての計算機センターPOP(Point-of-Presence) 連携",,"並列/分散/協調処理に関するサマー・ワークショップ(SWoPP2010) 予稿集",,,,,2010,Aug. "Nguyen Toan,Hideyuki Jitsumoto,Naoya Maruyama,Tatsuo Nomura,Toshio Endo,Satoshi Matsuoka","MPI-CUDA Applications Checkpointing","Summer United Workshops on Parallel, Distributed and Cooperative Processing (SWoPP 2010)","IPSJ SIG Technical Report","IPSJ","Vol. 2010-HPC-126","No. 18","pp. 1-7",2010,Aug. "白幡晃一,佐藤仁,松岡聡","GPUを考慮したMapReduceのタスクスケジューリング","第126回 ハイパフォーマンスコンピューティング研究発表会 2010年並列/分散/協調処理に関する『金沢』サマー・ワークショップ(SWoPP金沢2010)","情報処理学会研究報告2010-HPC-126",,,"No. 5","pp. 1--8",2010,July "松岡聡,青木尊之,遠藤敏夫,丸山直也,佐藤仁,滝澤真一朗,實本英之","TSUBAMEの造り方から探る PCクラスターと「スパコン」のあいだ",,"月刊ASCII .technologies","アスキー・メディアワークス","Vol. 15","No. 7","pp. 48--55",2010,July "Naoya Maruyama,Satoshi Matsuoka","Model-based Fault Localization: Finding Behavioral Outliers in Large-scale Computing Systems",,"New Generation Computing","Ohmsha, Ltd.","Vol. 28","No. 3","pp. 237--255",2010,July "Ali Cevahir,Akira Nukada,Satoshi Matsuoka","High Performance Conjugate Gradient Solver on Multi-GPU Clusters Using Hypergraph Partitioning","ISC'10","Computer Science ? Research and Development","Springer","Vol. 25","No. 1--2","pp. 83--91",2010,June "遠藤 敏夫,額田 彰,松岡 聡","異種アクセラレータを持つTSUBAMEスーパーコンピュータのLinpack評価",,"応用数理","応用数理学会","Vol. 20","No. 2","pp. 29-36",2010,June "Mohamed Amin Jabri,Satoshi Matsuoka","Authorization within Grid-Computing Using Certificateless Identity-Based Proxy Signature","HPDC 2010","Proc. of the ACM International Symposium on High Performance Distributed Computing (HPDC 2010)","ACM Press",,,,2010,June "Leonardo Bautista,Naoya Maruyama,Franck Cappello,Satoshi Matsuoka","Distributed Diskless Checkpoint for Large Scale Systems",,"10 IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2010)",,,,,2010,May "Nguyen Toan,Tatsuo Nomura,Naoya Maruyama,Satoshi Matsuoka","Fault-Tolerant GPGPU with GPU Checkpointing","先進的計算基盤システムシンポジウム SACSIS2010","先進的計算基盤システムシンポジウム SACSIS2010","IPSJ Symposium Series","Vol. 2010","No. 5","pp. 127-128",2010,May "佐藤賢斗,佐藤仁,松岡聡","仮想マシン動的再配置による大規模データアクセスの高速化",,"情報処理学会先進的計算基盤システムシンポジウム論文集 (SACSIS2010)",,,,,2010,May "Tatsuhiro Chiba,Mathijs den Burger,Thilo Kielmann,SATOSHI MATSUOKA","Dynamic Load-Balanced Multicast for Data-Intensive Applications on Clouds","The 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing",,,,,,2010,May "滝澤真一朗,松岡聡","NAREGIグリッド本格運用に向けたサービス整合性監視システム",,"先進的計算基盤システムシンポジウム (SACSIS2010)予稿集",,,,,2010,May "白幡晃一,佐藤仁,松岡聡","GPUによるMapReduceのアクセラレーション","先進的計算基盤システムシンポジウム SACSIS2010",,,,,"pp. 119--120",2010,May "Naoya Maruyama,Akira Nukada,Satoshi Matsuoka","A High-Performance Fault-Tolerant Software Framework for Memory on Commodity GPUs","24th IEEE International Parallel and Distributed Processing Symposium (IPDPS'10)","24th IEEE International Parallel and Distributed Processing Symposium (IPDPS'10)",,,,,2010,Apr. "Toshio Endo,Akira Nukada,SATOSHI MATSUOKA,Naoya Maruyama","Linpack Evaluation on a Supercomputer with Heterogeneous Accelerators","IEEE International Parallel & Distributed Processing Symposium (IPDPS 2010)","Proceedings of IEEE International Parallel & Distributed Processing Symposium (IPDPS 2010)","IEEE",,,"page 10",2010,Apr. "SATOSHI MATSUOKA","HPC in the Cloud---A Hype, the End of SCs, or Peaceful Coexistence?",,"The 28th Open Grid Forum",,,,,2010,Mar. "渡辺祐也,遠藤敏夫,松岡聡","GPU クラスタにおける科学技術計算の自動最適化","HPC研究会","情報処理学会研究報告","情報処理学会","Vol. 2010-HPC-124","No. 18","pp. 1--7",2010,Feb. "浜野智明,額田彰,遠藤敏夫,松岡聡","GPUクラスタにおける省電力タスクスケジューリング","第124回HPC研究会","情報処理学会研究報告2010-HPC-124","情報処理学会",,,"pp. 1--8",2010,Feb. "SATOSHI MATSUOKA","GPU Acceleration: a Fad or the Yellow Brick Road onto Exascale",,"SIAM Conference on Parallel Processing for Scientific Computing",,,,,2010,Feb. "國府理央,佐藤仁,松岡聡","大規模計算機システムの資源選択を支援するエキスパートシステム",,"情報処理学会研究報告2009-HPC-124",,,,,2010,Feb. "Naoya Maruyama,Akira Nukada,Satoshi Matsuoka","Performance Evaluation of Software Framework for Memory Fault Tolerance in GPU Accelerators",,"SIAM Conference on Parallel Processing and Scientific Computing (PP10), MS36: Trends and Experiences in Heterogeneous Many-core Computing",,,,,2010,Feb. "千葉 立寛,Thilo Kielmann,Mathijs den Burger,松岡 聡","クラウド環境における大規模データブロードキャストの動的最適化",,"ハイパフォーマンスコンピューティングと計算科学シンポジウム (HPCS2010)",,,,,2010,Jan. "SATOSHI MATSUOKA","Accelerated Computing in TSUBAME 1.2/2.0",,"Accelerated Computing Symposium",,,,,2010,Jan. "Ali Cevahir,Cevdet Aykanat,Ata Turk,B. Barla Cambazoglu,Akira Nukada,Satoshi Matsuoka","Efficient PageRank on GPU Clusters","第18回ハイパフォーマンスコンピューティングとアーキテクチャの評価に関する北海道ワークショップ","情報処理学会研究報告","情報処理学会","Vol. 2010-HPC-128","No. 21","pp. 1--6",2009,Nov. "丸山直也,額田 彰,松岡 聡","GPU向け耐メモリエラーソフトウェアフレームワーク",,"情報処理学会研究報告 2009-HPC-123",,,"No. 8","pp. 1--6",2009,Nov. "佐藤仁,小西史一,山本泰智,高木利久,松岡聡","スーパーコンピュータTSUBAME上でのMapReduceの実現",,"情報処理学会研究報告2009-HPC-123(HOKKE17)",,,,,2009,Nov. "Akira Nukada,Satoshi Matsuoka","Auto-Tuning 3-D FFT Library for CUDA GPUs","2009 ACM/IEEE conference on Supercomputing (SC09)","Proceedings of the 2009 ACM/IEEE conference on Supercomputing (SC09)","ACM",,,,2009,Nov. "SATOSHI MATSUOKA","Petascaling Commodity onto Exascale with GPUs and Windows HPC",,"ACM/IEEE Supercomputing (SC09) Microsoft Booth",,,,,2009,Nov. "SATOSHI MATSUOKA","Petascaling Commodity onto Exascale with GPUs on TSUBAME1.2 onto TSUBAME2.0",,"ACM/IEEE Supercomputing (SC09) NVidia Booth",,,,,2009,Nov. "實本英之,中村俊介,遠藤敏夫,松岡聡","増分データとErasure Coding を利用した高速なチェックポイント手法","HPC研究会","情報処理学会研究報告","情報処理学会","Vol. 2009-HPC-122","No. 9","pp. 1--6",2009,Oct. "SATOSHI MATSUOKA","Petascaling Commodity onto Exascale: GPUs as Multithreaded Massively-Parallel Vector Processors - the Only Road to Exascale",,"IEEE Cluster Computing Conference",,,,,2009,Sept. "額田 彰,松岡 聡","CUDA GPU向けの自動最適化FFTライブラリ",,"情報処理学会論文誌コンピューティングシステム(ACS)","情報処理学会","Vol. 2","No. 3","pp. 107--115",2009,Sept. "松岡聡","TSUBAME2.0におけるGPGPUによるスケーラブルなペタフロップス・ベクトル・スーパーコンピューティング",,"GPGPUスクール「計算科学におけるGPGPUを中心とした演算加速機構の利用」",,,,,2009,Sept. "Naoya Maruyama,Akira Nukada,Satoshi Matsuoka","A High-Performance Fault-Tolerant Software Framework for Memory on Commodity GPUs","NVIDIA GPU Technology Conference 2009","NVIDIA GPU Technology Conference 2009",,,,,2009,Sept. "滝澤真一朗,遠藤敏夫,松岡聡","次世代光インターコネクトでのMPI通信に関する研究",,"コンピュータソフトウェア","日本ソフトウェア科学会","Vol. 26","No. 3","pp. 5--19",2009,Aug. "遠藤 敏夫,額田 彰,松岡 聡,丸山 直也","異種アクセラレータを持つヘテロ型スーパーコンピュータ上のLinpack の性能向上手法","並列/分散/協調処理に関するサマーワークショップ(SWoPP2009)","情報処理学会研究報告","情報処理学会","Vol. 2009-HPC-121","No. 24","Page 8",2009,Aug. "國府理央,佐藤仁,松岡聡","大規模計算環境におけるユーザ満足度を考慮した資源管理へむけて","2009年並列/分散/協調処理に関する『仙台』サマー・ワークショップ(SWoPP仙台2009)","電子情報通信学会技術研究報告","電子情報通信学会","Vol. 109","No. 168(CPSY2009-13)","pp. 19--24",2009,July "Satoshi Matsuoka,Takayuki Aoki,Toshio Endo,Akira Nukada,Toshihiro Kato,Atsushi Hasegawa","GPU accelerated computing?from hype to mainstream, the rebirth of vector computing","Scientific Discovery through Advanced Computing (SciDAC 2009)","Journal of Physics: Conference Series","IOP","Vol. 180","No. 1","pp. 012043",2009,July "Naoya Maruyama,Akira Nukada,Satoshi Matsuoka","Software-Based ECC for GPUs","2009 Symposium on Application Accelerators in High Performance Computing (SAAHPC'09)","2009 Symposium on Application Accelerators in High Performance Computing (SAAHPC'09)",,,,,2009,July "滝澤真一朗,遠藤敏夫,松岡聡","光サーキットネットワークの補助的利用によるHPCアプリケーション性能向上",,"情報処理学会 コンピューティングシステム(ACS)",,"Vol. 2","No. 2","pp. 110--121",2009,July "長坂仁,丸山直也,額田 彰,遠藤 敏夫,松岡 聡","GPU における性能と消費電力 の相関性の解析",,"情報処理学会研究報告2009-HPC-121",,,"No. 26","pp. 1--6",2009,July "島田大地,丸山直也,額田彰,遠藤 敏夫,松岡 聡","GPUにおける耐故障性を考慮した数値計算の電力性能",,"情報処理学会研究報告2009-HPC-121",,,"No. 26","pp. 1--5",2009,July "加藤季広,青木尊之,額田彰,遠藤敏夫,松岡聡,長谷川篤史","姫野ベンチマークのGPUマルチノード実行における通信と演算のオーバーラップによる高速化 ? 32GPUで700GFLOPS超を達成 ?","HPC研究会","情報処理学会研究報告「ハイパフォーマンスコンピューティング(HPC)」","情報処理学会","Vol. 2009-HPC-120","No. 3","pp. 1--6",2009,June "松岡聡","TSUBAME2.0における高バンド幅なペタフロップス・コンピューティングの可能性",,"Sun HPCセミナー",,,,,2009,June "SATOSHI MATSUOKA","GPU Accelerated Computing---From Hype to Mainstream, the Rebirth of Vector Computing",,"Scientific Discovery through Advanced Computing Program (SciDAC)",,,,,2009,June "Sumeth Lerthirunwong,Naoya Maruyama,Satoshi Matsuoka","Adaptive Resource Indexing Technique for Unstructured Peer-to-Peer Networks",,"9th IEEE/ACM International Symposium on Cluster Computing and the Grid",,,,"pp. 172--179",2009,May "額田彰,松岡聡","CUDA GPU向けの自動最適化FFTライブラリ","先進的基盤システムシンポジウム SACSIS 2009","先進的基盤システムシンポジウム SACSIS 2009 論文集",,,,"pp. 345--352",2009,May