"Futa Kambe,Toshio Endo","Accelerating Stencil Computations on a GPU by Combining Using Tensor Cores and Temporal Blocking","Workshop on General Purpose Processing using GPU (GPGPU 2024)","proceedings of Workshop on General Purpose Processing using GPU (GPGPU 2024)",,,,,2024,Mar. "Ivan Radanov Ivanov,Oleksandr Zinenko,Jens Domke,Toshio Endo,William S. Moses","Retargeting and Respecializing GPU Workloads for Performance Portability","The International Symposium on Code Generation and Optimization (CGO 2024)","proceedings of The International Symposium on Code Generation and Optimization (CGO 2024)",,,,,2024,Mar. "Ivan Radanov Ivanov,Jens Domke,Toshio Endo,Johannes Doerfert","Automatic Parallelization and OpenMP Offloading of Fortran","LLVM Performance Workshop","proceedings of LLVM Performance Workshop",,,,,2024,Mar. "Ryubu Hosoki,Toshio Endo,Takahiro Hirofuchi,Tsutomu Ikegami","AshPipe: Asynchronous Hybrid Pipeline Parallel for DNN Training","The International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia 2024)","proceedings of The International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia 2024)",,,,"pp. 117-126",2024,Jan. "Shohei Minami,Toshio Endo,Akihiro Nomura","The Aggressive Oversubscribing Scheduling for Interactive Jobs on a Supercomputing System","IEEE High Performance Extreme Computing Conference (HPEC 2023)","proceedings of IEEE High Performance Extreme Computing Conference (HPEC 2023)",,,,,2023,Sept. "岡本 洸琉,遠藤 敏夫","動的スケジューリングライブラリを用いたPythonにおける分散コレスキー分解の実装と評価","並列/分散/協調処理に関するサマーワークショップ(SWoPP2023)","情報処理学会研究報告",," 2023-HPC-190"," 15",,2023,Aug. "Chenyu Wang,Toshio Endo,Takahiro Hirofuchi,Tsutomu Ikegami","Pyramid Swin Transformer for Multi-Task: Expanding to More Computer Vision Tasks","Conference on Advanced Concepts for Intelligent Vision Systems (ACIVS 2023)","proceedings of Conference on Advanced Concepts for Intelligent Vision Systems (ACIVS 2023)",,,,"pp. 53-65",2023,Aug. "神戸 風太,遠藤 敏夫","GPU上のTensor coreを使ったステンシル計算の時間ブロッキングによる高速化","並列/分散/協調処理に関するサマーワークショップ(SWoPP2023)","情報処理学会研究報告",,"Vol. 2023-HPC-190","No. 29",,2023,Aug. "Hayato Fujita,Akihiro Nomura,Toshio Endo,Masakazu Sekijima","Enhancing the Performance of AlphaFold Through Modified Storage Method and Optimization of HHblits on TSUBAME3.0 Supercomputer","2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE)","proceedings of 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE)",,,,,2023,July "Lingqi Zhang,Mohamed Wahib,Peng Chen,Jintao Meng,Xiao Wang,Toshio Endo,Satoshi Matsuoka","Revisiting Temporal Blocking Stencil Optimizations","ACM International Conference on Supercomputing (ICS 2023)","proceedings of ACM International Conference on Supercomputing (ICS 2023)",,,,"pp. 251-263",2023,June "Lingqi Zhang,Mohamed Wahib,Peng Chen,Jintao Meng,Xiao Wang,Toshio Endo,Satoshi Matsuoka","PERKS: a Locality-Optimized Execution Model for Iterative Memory-bound GPU Applications","ACM International Conference on Supercomputing (ICS 2023)","proceedings of ACM International Conference on Supercomputing (ICS 2023)",,,,"pp. 167-179",2023,June "幸 朋矢,遠藤 敏夫","次世代高性能メモリシステムにおけるステンシル計算の局所性向上技術の評価","第188回HPC研究発表会","情報処理学会研究報告","情報処理学会","Vol. 2023-HPC-188","No. 31",,2023,Mar. "William S. Moses,Ivan Radanov Ivanov,Jens Domke,Toshio Endo,Johannes Doerfert,Oleksandr Zinenko","High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs","ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2023)","proceedings of ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2023)",,,,"pp. 119-134",2023,Feb. "Lingqi Zhang,Mohamed Wahib,Peng Chen,Jintao Meng,Xiao Wang,Toshio Endo,Satoshi Matsuoka","Exploiting Scratchpad Memory for Deep Temporal Blocking","the 15th Workshop on General Purpose Processing Using GPU (GPGPU 2023)","proceedings of the 15th Workshop on General Purpose Processing Using GPU (GPGPU 2023)",,,,,2023,Feb. "Chenyu Wang,Toshio Endo,Takahiro Hirofuchi,Tsutomu Ikegami","Pyramid Swin Transformer: Different-Size Windows Swin Transformer for Image Classification and Object Detection","VISAPP 2023","Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications","SciTePress","Vol. 5",,"pp. 583-590",2023,Feb. "Shohei Minami,Toshio Endo,Akihiro Nomura","Effectiveness of the Oversubscribing Scheduling on Supercomputer Systems","International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia '23)","HPC Asia '23: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region","ACM",,,"pp. 18-28",2023,Feb. "William S. Moses,Ivan R. Ivanov,Jens Domke,Toshio Endo,Johannes Doerfert,Oleksandr Zinenko","High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs",,"arXiv",,,,,2023, "Hiroki Aikawa,Toshio Endo,Tomoya Yuki,Takahiro Hirofuchi,Tsutomu Ikegami","Efficient Stencil Computation with Temporal Blocking by Halide DSL","20th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA)","proceedings of 20th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA)",,,,"pp. 870-877",2022,Dec. "Lingqi Zhang,Mohamed Wahib,Peng Chen,Jintao Meng,Xiao Wang,Toshio Endo,Satoshi Matsuoka","Breaking the Memory Bottleneck for Iterative Memory-bound Applications Via Persistent Kernels",,"IPSJ SIG Technical Report","IPSJ","Vol. 2022-HPC-187","No. 18",,2022,Dec. "萩原 汐,吉川 隆英,幸 朋矢,遠藤 敏夫","3D Stacked SRAMを活用したHPC向けメモリアーキテクチャの検討","デザインガイア2022","情報処理学会研究報告","情報処理学会","Vol. 2022-SLDM-200","No. 31",,2022,Nov. "瓜生 侑,遠藤 敏夫","ラムダ式を用いる移植性の高い並列プログラムの実装とCPU・GPU上の評価","並列/分散/協調処理に関するサマーワークショップ(SWoPP2022)","情報処理学会研究報告","情報処理学会","Vol. 2022-HPC-185","No. 30",,2022,July "Chenyu Wang,Toshio Endo,Takahiro Hirofuchi,Tsutomu Ikegami","Speed-up Single Shot Detector on GPU with CUDA","23rd ACIS International Summer Virtual Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD2022-Summer)","Studies in Computational Intelligence",,"Vol. 1074",,"pp. 89-106",2022,July "大沢 泰生,遠藤 敏夫,野村 哲弘","タンパク質構造解析システムAlphafoldの実行時ファイルステージングを用いた高速化","並列/分散/協調処理に関するサマーワークショップ(SWoPP2022)","情報処理学会研究報告",,"Vol. 2022-HPC-185","No. 24",,2022,July "細木 隆豊,遠藤 敏夫,広渕 崇宏,池上 努","負荷分散を改善したハイブリッドパイプライン並列深層学習手法","並列/分散/協調処理に関するサマーワークショップ(SWoPP2022)","情報処理学会研究報告","情報処理学会","Vol. 2022-HPC-185","No. 17",,2022,July "藤田 隼斗,野村 哲弘,遠藤 敏夫,関嶋 政和","タンパク質立体構造予測システム AlphaFold の TSUBAME3.0 上での高速化","情報処理学会 第183回ハイパフォーマンスコンピューティング研究発表会","情報処理学会研究報告",,"Vol. 2022-HPC-183","No. 3","pp. 1--7",2022,Mar. "萩原 汐,児玉 宏喜,吉川 隆英,幸 朋矢,遠藤 敏夫","疎行列演算高速化のためのメモリアーキテクチャ探索","組込み技術とネットワークに関するワークショップ ETNET2022, 情報処理学会研究報告, 2022-SLDM-198, No. 30",,,,,,2022,Mar. "幸 朋矢,遠藤 敏夫","次世代高性能計算ノードにむけたメモリアーキテクチャ探索のためのツールチェーン .","組込み技術とネットワークに関するワークショップ ETNET2022, 情報処理学会研究報告, 2022-ARC-240, No. 20",,,,,,2022,Mar. "Ivan Ivanov,Jens Domke,Toshio Endo.","Automatic translation of CUDA code into high performance CPU code using LLVM IR transformations.","The 4th R-CCS International symposium, Lightning talks session, Online",,,,,,2022,Feb. "遠藤 敏夫","TSUBAMEスーパーコンピュータのAI・ビッグデータ対応と展望","PCクラスタコンソーシアム AI・機械学習技術部会キックオフワークショップ, online",,,,,,2021,Dec. "遠藤 敏夫","TSUBAMEスパコンの過去、現在、未来","PCクラスタコンソーシアム PCCC21「『PCクラスタ』これからの10年」",,,,,,2021,Dec. "相川 洋貴,遠藤 敏夫,幸 朋矢,広渕 崇宏","時間ブロッキングを用いたステンシル計算のHalide言語による高性能実装と評価 .","並列/分散/協調処理に関するサマーワークショップ(SWoPP2021)","情報処理学会研究報告, 2021-HPC-180, No. 16",,,,,2021,July "細木 隆豊,野村 哲弘,遠藤 敏夫","GPUクラスタにおけるハイブリッド並列DNN学習のボトルネック分析と改良 .","並列/分散/協調処理に関するサマーワークショップ(SWoPP2021)","情報処理学会研究報告, 2021-HPC-180, No. 9",,,,,2021,July "Shohei Minami,Toshio Endo,Akihiro Nomura","Measurement and Modeling of Performance of HPC Applications towards Overcommitting Scheduling Systems .","In proceedings of 24th Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP 2021) In Conjunction with IPDPS 2021, 21 pages, Portland (online)",,,,,,2021,May "野村 哲弘,滝澤 真一朗,三浦 信一,遠藤 敏夫,松葉 浩也","センサー情報を意識したジョブスケジューリング実現のための標準ジョブ履歴スキーマの提案","第178回ハイパフォーマンスコンピューティング研究発表会","情報処理学会研究報告","一般社団法人 情報処理学会","Vol. HPC-178","No. 14","pp. 1-8",2021,Mar. "Ivan R. Ivanov,Jens Domke,Akihiro Nomura,Toshio Endo","Improved failover for HPC interconnects through localised routing restoration","The 3rd R-CCS International Symposium, poster session",,,,,,2021,Feb. "Shohei Minami,Toshio Endo,Akihiro Nomura","Performance Modeling of HPC Applications on Overcommitted Systems","HPC Asia 2021","Proceedings of HPC Asia 2021",,,,,2021,Jan. "安良岡 由規,野村 哲弘,遠藤 敏夫","学内インフラとしてのスパコンの対話的利用による利便性向上","2020年度大学ICT推進協議会(AXIES)年次大会",,,,,,2020,Dec. "野村 哲弘,遠藤 敏夫,三浦 信一,朝倉 博紀,越野 俊充,草間 俊博","TSUBAME3のインタラクティブ利用の利便性向上にむけた取り組み","並列/分散/協調処理に関するサマーワークショップ(SWoPP2020), 情報処理学会研究報告, 2020-HPC-175",,,,,,2020,July "南 将平,遠藤 敏夫,野村 哲弘","オーバーコミットスケジュール時のアプリ性能の予備評価","並列/分散/協調処理に関するサマーワークショップ(SWoPP2020), 情報処理学会研究報告, 2020-HPC-175",,,,,,2020,July "Kazuaki Matsumura,Hamid Reza Zohouri,Mohamed Wahib,Toshio Endo,Satoshi Matsuoka.","AN5D: Automated Stencil Framework for High-Degree Temporal Blocking on GPUs .","In proceedings of International Symposium on Code Generation and Optimization (CGO 2020)",,,,,,2020,Feb. "Tomoya Yuki,Toshio Endo.","Toward Latency-Aware Data Arrangement on Many-Core Processors .","HPC Asia 2020, poster session",,,,,,2020,Jan. "Toshio Endo.","Integrating Cache Oblivious Approach with Modern Processor Architecture: The Case of Floyd-Warshall Algorithm.","In proceedings of HPC Asia 2020",,,,,,2020,Jan. "Toshio Endo","Activity Report from Tokyo Tech:Energy Efficiency of TSUBAME3.0.","Energy Efficient HPC State of the Practice Kobe Meeting",,,,,,2019,Aug. "野村 哲弘,三浦 信一,實本 英之,額田 彰,遠藤 敏夫","TSUBAME3.0におけるストレージ利用効率化のためのファイルシステムベンチマーク","並列/分散/協調処理に関するサマーワークショップ(SWoPP2019), 情報処理学会研究報告, 2019-HPC-170 No.24",,,,,,2019,July "土川 稔生,遠藤 敏夫,野村 哲弘,近藤正章,大山 洋介,松岡 聡","メモリアクセスデータを用いた機械学習によるアプリケーションの類型化","並列/分散/協調処理に関するサマーワークショップ(SWoPP2019), 情報処理学会研究報告, 2019-HPC-170 No.12",,,,,,2019,July "Toshio Endo","Current Status of TSUBAME3.0 Operation (as of Mar 2019)","7th Accelerated Data and Computing (ADAC) Workshop",,,,,,2019,Mar. "Yuki Ito,Haruki Imai,Tung Le Duc,Yasushi Negishi,Kiyokuni Kawachiya,Ryo Matsumiya,Toshio Endo","Profiling based out-of-core hybrid method for large neural networks .","the 24th ACM Symposium on Principles and Practice of Parallel Programming, poster session",,,,,,2019,Feb. "Yukinori Sato,Tomoya Yuki,Toshio Endo.","An Autotuning Framework for Scalable Execution of Tiled Code via Iterative Polyhedral Compilation.",,"ACM Transactions on Architecture and Code Optimization (TACO). Volume 15, Issue 4, Article No. 67, 23 pages.",,,,,2019,Jan. "Toshio Endo,Hiroko Midorikawa,Yukinori Sato.","Software Technology That Deals with Deeper Memory Hierarchy in Post-petascale Era.",,"Advanced Software Technologies for Post-Peta Scale Computing","Springer",,,"pp. 227-248",2019,Jan. "遠藤敏夫","光インターコネクト技術を用いたTSUBAME3.0スーパーコンピュータ","平成30年度 光ネットワーク産業・技術研究会 第3回討論会",,,,,,2018,Nov. "Ryo Matsumiya,Toshio Endo","RMA-based Communication Library Featuring Node-local NVMs","In proceedings of 2018 IEEE High Performance Extreme Computing Conference(HPEC '18)",,,,,,2018,Sept. "Toshio Endo","Applying Recursive Temporal Blocking for Stencil Computations to Deeper Memory Hierarchy","In proceedings of the 7th IEEE Non-Volatile Memory Systems and Applications Symposium (NVMSA 2018)",,,,,,2018,Aug. "伊藤 祐貴,今井 晴基,レドゥック トゥン,根岸 康,河内谷 清久仁,松宮 遼,遠藤 敏夫","GPUメモリ管理の実行時最適化による大規模深層学習の高速化","並列/分散/協調処理に関するサマーワークショップ(SWoPP2018)","情報処理学会研究報告, 2018-HPC-165 No.30",,,,,2018,Aug. "見村 朔,遠藤敏夫","LSTM を用いた映像分類システムの学習順序による高速化","The 2st. cross-disciplinary Workshop on Computing Systems, Infrastructures, and Programming (xSIG 2018), Young Researcher Sessions",,,,,,2018,May "Ryo Matsumiya,Toshio Endo","vGASNet: Scalable RMA-based Communication Library for Out-of-core Data Processing","The 2nd. cross-disciplinary Workshop on Computing Systems, Infrastructures, and Programming (xSIG 2018), ポスターセッション",,,,,,2018,May "伊藤 祐貴,Haruki Imai,Tung Le Duc,Yasushi Negishi,Kiyokuni Kawachiya,松宮 遼,遠藤 敏夫","Runtime GPU Memory Optimization for Supporting Large Neural Networks on Chainer","The 2nd. cross-disciplinary Workshop on Computing Systems, Infrastructures, and Programming (xSIG 2018), ポスターセッション",,,,,,2018,May "遠藤敏夫","TSUBAME3.0冷却システムの紹介","日本能率協会 第18回熱設計・対策技術シンポジウム",,,,,,2018,Apr. "Toshio Endo","Realizing Extremely Large-Scale Scientific Applications Using Deep Memory Hierarchy.","SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP18)",,,,,,2018,Mar. "Noboru Tanabe,Toshio Endo.","Evaluation of Memory-Latency Sensitivity on Manycore Processors with Large Cache.","2018 2nd International Conference on High Performance Compilation, Computing and Communications (HP3C-2018)",,,,,,2018,Mar. "Noboru Tanabe,Toshio Endo.","Characterizing Memory-Latency Sensitivity of Sparse Matrix Kernels.","26th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP 2018)",,,,,,2018,Mar. "Yuki Ito,Ryo Matsumiya,Toshio Endo.","ooc_cuDNN: Accommodating Convolutional Neural Networks over GPU Memory Capacity.","In Proceedings of 2017 IEEE International Confenrece on Big Data (IEEE BigData 2017)",,,,,,2017,Dec. "Toshio Endo,Hiroko Midorikawa,Yukinori Sato.","Software Technology that Deals with Deeper Memory Hierarchy in Post-petascale Era.","JST/CREST International Symposium on Post Petascale System Software (ISP2S2-2017)",,,,,,2017,Dec. "藤田 和宏,鶴見慶,安良岡由規,根本忍,梁井善行,渡邊寿雄,野村 哲弘,三浦信一,額田彰,遠藤敏夫,松岡聡","新スーパーコンピュータTSUBAME3.0の概要.","2017年度大学ICT推進協議会(AXIES)年次大会 No. TC1-6",,,,,,2017,Dec. "Satoshi Matsuoka,Toshio Endo,Akira Nukada,Shinichi Miura,Akihiro Nomura,Hitoshi Sato,Hideyuki Jitsumoto,Aleksandr Drozd.","Overview of TSUBAME3.0, Green Cloud Supercomputer for Convergence of HPC, AI and Big-Data .",,"Global Scientific Information and Computing Center, Tokyo Institute of Technology, e-Science Journal",,"Vol. 16",,"pp. 2--9",2017,Nov. "Toshio Endo,Satoshi Matsuoka.","TSUBAME3.0: A Green, Accelerated, Big-Data Supercomputer","ATIP Workshop on International Exascale and Next-Generation Computing Programs, in conjunction with SC17.",,,,,,2017,Nov. "Yuki Ito,Ryo Matsumiya,Toshio Endo.","ooc_cuDNN : A Deep Learning Library Supporting CNNs over GPU Memory Capacity.","ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC17), Research Poster Session.",,,,,,2017,Nov. "Shota Kuroda,Toshio Endo,Satoshi Matsuoka.","Applying Temporal Blocking with a Directive-based Approach.","In Proceedings of Fourth Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC), in conjuntion with SC17",,,,,,2017,Nov. "Takashi Shimokawabe,Toshio Endo,Naoyuki Onodera,Takayuki Aoki.","A Stencil Framework to Realize Large-scale Computations Beyond Device Memory Capacity on GPU Supercomputers.","In Proceedings of IEEE International Conference on Cluster Computing (CLUSTER 2017)",,,,,"pp. 525-529",2017,Sept. "Takashi Shimokawabe,Toshio Endo,Naoyuki Onodera,TAKAYUKI AOKI","A Stencil Framework to Realize Large-scale Computations Beyond Device Memory Capacity on GPU Supercomputers","2017 IEEE International Conference on Cluster Computing (CLUSTER)",,,,,,2017,Sept. "Yukinori Sato,Toshio Endo","An Accurate Simulator of Cache-line Conflicts to Exploit the Underlying Cache Performance","In Proceedings of 23rd International European Conference on Parallel and Distributed Computing (Euro-par 2017)",,,,,,2017,Aug. "松岡 聡,遠藤 敏夫,額田 彰,三浦 信一,野村 哲弘,佐藤 仁,實本 英之,Drozd Aleksandr","HPCとビッグデータ・AIを融合するグリーン・クラウドスパコンTSUBAME3.0の概要","並列/分散/協調処理に関するサマーワークショップ(SWoPP2017)",,,,,,2017,July "伊藤 祐貴,松宮 遼,遠藤 敏夫","ooc_cuDNN: GPU計算機のメモリ階層を利用した大規模深層学習ライブラリの開発","並列/分散/協調処理に関するサマーワークショップ(SWoPP2017)",,,,,,2017,July "幸 朋矢,佐藤 幸紀,遠藤 敏夫","Polyhedralコンパイラを用いたタイリングパラメータ自動調整ツールのメニーコア環境での評価","並列/分散/協調処理に関するサマーワークショップ(SWoPP2017)",,,,,,2017,July "田邊 昇,遠藤 敏夫","Intel Xeon Phiにおける主記憶遅延増加の影響評価","並列/分散/協調処理に関するサマーワークショップ(SWoPP2017)",,,,,,2017,July "松宮 遼,遠藤 敏夫","vGASNet: メモリ階層深化に向けたスケーラブルな低レイヤ通信ライブラリ","並列/分散/協調処理に関するサマーワークショップ(SWoPP2017)",,,,,,2017,July "Yukinori Sato,Tomoya Yuki,Toshio Endo","ExanaDBT: A Dynamic Compilation System for Transparent Polyhedral Optimizations at Runtime","In Proceedings of ACM International Conference on Computing Frontiers 2017",,," 10pages",,,2017,May "伊藤祐貴,松宮遼,遠藤敏夫","メモリ階層の利用によってGPUメモリ容量を超える深層学習手法","The 1st. cross-disciplinary Workshop on Computing Systems, Infrastructures, and Programming (xSIG 2017)",,,,,,2017,Apr. "松宮遼,遠藤敏夫","Flash SSD を活用する PGAS フレームワークに対する協調キャッシングの導入","The 1st. cross-disciplinary Workshop on Computing Systems, Infrastructures, and Programming (xSIG 2017), ポスターセッション",,,,,,2017,Apr. "田邊 昇,遠藤 敏夫","疎行列系アプリケーション性能の主記憶遅延増加の影響評価","情報処理学会研究報告","2017-HPC-158 No.15",,,,,2017,Mar. "本山 義史,遠藤 敏夫,松岡 聡,横田 理央,福田 圭祐,佐藤 育郎","低ランク近似行列によるCNNにおける畳み込み演算の最適化","第158回ハイパフォーマンスコンピューティング研究発表会","2017-HPC-158 No.25",,,,,2017,Mar. "Takashi Shimokawabe,Toshio Endo,Naoyuki Onodera,Takayuki Aoki","Performance Evaluation of Wind Simulation Based on a GPU-computing Framework to Realize Large-scale Stencil Computations Beyond Device Memory Capacity","The 7th AICS International Symposium",,,,,,2017,Feb. "Takashi Shimokawabe,Toshio Endo,Naoyuki Onodera,Takayuki Aoki","Performance Evaluation of Wind Simulation Based on a GPU-computing Framework to Realize Large-scale Stencil Computations Beyond Device Memory Capacity.","The 7th AICS International Symposium, Poster session",,,,,,2017,Feb. "佐藤幸紀,幸朋矢,遠藤敏夫","透過的メモリ階層チューニングのための動的バイナリ変換機構の設計と開発","情報処理学会研究報告","2016-ARC-216 No.35",,,,,2017,Jan. "田邊昇,遠藤敏夫","中遅延大容量メモリ階層出現のインパクトと新たな対応に関する初期検討","情報処理学会研究報告","2015-HPC-157 No.11",,,,,2016,Dec. "黒田 勝汰,遠藤 敏夫,松岡 聡","テ?ィレクティフ?による時空間フ?ロッキンク?の自動適用","ハイパフォーマンスコンピューティング研究会",,,,,,2016,Dec. "Satoshi Imamura,Keitaro Oka,Yuichiro Yasui,Yuichi Inadomi,Katsuki Fujisawa,Toshio Endo,Koji Ueno,Keiichiro Fukazawa,Nozomi Hata,Yuta Kakibuka,Koji Inoue,Takatsugu Ono","Evaluating the Impacts of Code-Level Performance Tunings on Power Efficiency","In Proceedings of IEEE International Conference on Big Data (BigData 2016) (accepted)",,,,," 6pages",2016,Dec. "遠藤敏夫","ポストペタスケール時代のメモリ階層の深化に対応するソフトウェア技術","「ポストペタスケール高性能計算に資するシステムソフトウェア技術の創出」研究領域 平成28年度公開ワークショップ",,,,,,2016,Dec. "Ryo Matsumiya,Toshio Endo","PGAS Communication Runtime for Extreme Large Data Computation","In Proceedings of Second International Workshop on Extreme Scale Programming Models and Middleware (ESPM2), in conjunction with IEEE/ACM SC16 (accepted)",,,,," 8pages",2016,Nov. "Toshio Endo","Realizing Out-of-Core Stencil Computations using Multi-Tier Memory Hierarchy on GPGPU Clusters",,"In Proceedings of IEEE Cluster Computing (CLUSTER2016)",,,,,2016,Sept. "佐藤真平,佐藤幸紀,遠藤敏夫","ステンシル計算コードの性能とメモリレイアウトの関係性について","並列/分散/協調処理に関するサマーワークショップ(SWoPP2016)","情報処理学会研究報告",,"Vol. 2016-HPC-155","No. 37",,2016,Aug. "松岡 聡,天野 英晴,中島 研吾,井上 弘士,工藤 知宏,丸山 直也,田浦 健次朗,岩下 武史,片桐 孝洋,塙敏博,遠藤 敏夫","ポストムーア時代におけるFLOPSからBYTESへの変革","並列/分散/協調処理に関するサマーワークショップ(SWoPP2016)","情報処理学会研究報告, 2016-HPC-155 No.32",,,,,2016,Aug. "松宮 遼,遠藤 敏夫","Flash SSDを含む多階層メモリを活用するPGASランタイムシステム","並列/分散/協調処理に関するサマーワークショップ(SWoPP2016)","情報処理学会研究報告, 2016-HPC-155 No.31",,,,,2016,Aug. "下川辺 隆史,遠藤敏夫,青木 尊之","GPU デバイスメモリを超える計算を可能とするためのステンシル計算フレームワークの拡張とその性能評価","日本計算工学会 第21回計算工学講演会",,,,,,2016,June "下川辺 隆史,遠藤 敏夫,青木 尊之","GPUデバイスメモリを超える計算を可能とするためのステンシル計算フレームワークの拡張とその性能評価","日本計算工学会 第21回計算工学講演会, B-5-3",,,,,,2016,June "Toshio Endo","Operating Experience with SSD and GPUs","Accelerated Data and Computing (ADAC) Workshop",,,,,,2016,June "Satoshi Matsuoka,Hideharu Amano,Kengo Nakajima,Koji Inoue,Tomohiro Kudoh,Naoya Maruyama,Kenjiro Taura,Takeshi Iwashita,Takahiro Katagiri,Toshihiro Hanawa,Toshio Endo","From FLOPS to BYTES: Disruptive Change in High-Performance Computing towards the Post-Moore Era","In proceedings of the ACM International Conference on Computing Frontiers (CF'16)",,,,,,2016,May "Yukinori Sato,Toshio Endo.","Dynamic Compilation for Transparent Data Locality Analysis and Memory Subsystem Tuning .","The International Workshop on Architectural and Micro-Architectural Support for Dynamic Optimization (AMAS-DO), In conjunction with CGO 2016",,,,,,2016,Mar. "遠藤 敏夫","大規模・高性能演算のための多階層メモリの活用","情報処理学会研究報告, 2015-HPC-153 No.14, 7pages",,,,,,2016,Mar. "Toshio Endo","Harnessing Multi-tier Memory Hierarchy of GPU, Host and Flash.","2016 Conference on Advanced Topics and Auto Tuning in High-Performance Scientific Computing",,,,,,2016,Feb. "Shimpei Sato,Yukinori Sato,Toshio Endo","A Cache-aware Temporal Blocking Method for 3D Stencil Computation","3rd International Workshop on High-Performance Stencil Computations (HiStencils 2016), In conjunction with HiPEAC 2016",,,,,,2016,Jan. "松宮遼,遠藤敏夫,大山恵弘","深化する記憶装置階層のための大規模データ処理基盤の提案","第57回情報処理学会プログラミング・シンポジウム, ポスターセッション",,,,,,2016,Jan. "Yukinori Sato,Toshio Endo","Consolidating memory locality information obtained from static and dynamic analysis of code for performance tuning in source code","2nd Annual Meeting on Advanced Computing System and Infrastructure (ACSI2016), ポスターセッション",,,,,,2016,Jan. "Katsuki Fujisawa,Toyotaro Suzumura,Hitoshi Sato,Koji Ueno,Yuichiro Yasui,Keita Iwabuchi,Toshio Endo","Advanced Computing & Optimization Infrastructure for Extremely Large-Scale Graphs on Post Peta-Scale Supercomputers",,"Optimization in the Real World - Toward Solving Real-World Optimization Problems -, Series of Mathematics for Industry","Springer",,,"pp. 1-13",2016, "Fujisawa, K.,Toshio Endo,Yasui, Y.","Advanced computing and optimization infrastructure for extremely large-scale graphs on post peta-scale supercomputers",,"Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",,"Vol. 9725",,"pp. 265-274",2016, "Toshio Endo,Yuki Takasaki,Satoshi Matsuoka","Realizing Extremely Large-Scale Stencil Applications on GPU Supercomputers",,"In Proceedings of The 21st IEEE International Conference on Parallel and Distributed Systems (ICPADS 2015)",,,,,2015,Dec. "野村 哲弘,佐々木 淳,三浦 信一,遠藤 敏夫,松岡 聡","TSUBAME2におけるジョブスケジューリング効率化への取り組みと検証","大学ICT推進協議会 2015年度年次大会 企画セッション HPCテクノロジー","大学ICT推進協議会 2015年度年次大会 企画セッション HPCテクノロジー",,,,,2015,Dec. "Yuki Tsujita,Toshio Endo,Katsuki Fujisawa","The Scalable Petascale Data-Driven Approach for the Cholesky Factorization with Multiple GPUs","In Proceedings of First International Workshop on Extreme Scale Programming Models and Middleware (ESPM2 2015), in conjunction with IEEE/ACM SC15",,,,,,2015,Nov. "Toshio Endo,Akira Nukada,Satoshi Matsuoka","Power Capping Scheduling on TSUBAME2.5 and Upgrade of TSUBAME-KFC.","Building Energy Efficient HPC Working Group Workshop, held with SC15",,,,,,2015,Nov. "Shimpei Sato,Yukinori Sato,Toshio Endo","Investigating Potential Performance Benefits of Memory Layout Optimization based on Roofline Model","In Proceedings of The Second Workshop on Software Engineering for Parallel Systems (SEPS), in conjunction with ACM SPLASH 2015",,,,,,2015,Oct. "Yukinori Sato,Shimpei Sato,Toshio Endo","Exana: An Execution-driven Application Analysis Tool for Assisting Productive Performance Tuning","In Proceedings of The Second Workshop on Software Engineering for Parallel Systems (SEPS), in conjunction with ACM SPLASH 2015",,,,,,2015,Oct. "佐藤幸紀,佐藤真平,遠藤敏夫","CPU性能チューニングを支援するアプリケーション解析ツールExanaのデモ","電子情報通信学会 コンピュータシステム研究会 萌芽的コンピュータシステム研究展示会",,,,,,2015,Oct. "佐藤真平,佐藤幸紀,遠藤敏夫","テンポラルブロッキングを適用したステンシル計算コードのSIMD化とルーフラインモデルを用いた性能解析","情報処理学会 第151回ハイパフォーマンスコンピューテング研究会",,,,,,2015,Sept. "佐藤真平,佐藤幸紀,遠藤敏夫","ルーフラインモデルによる性能幅推定とステンシル計算コードにおけるメモリレイアウト最適化による性能最大化","並列/分散/協調処理に関するサマーワークショップ(SWoPP2015)","情報処理学会研究報告",,"Vol. 2015-ARC-216","No. 32","pp. 1-6",2015,Aug. "佐藤幸紀,遠藤敏夫","実行駆動型キャッシュシミュレーションおよびメモリ参照特性解析におけるオーバーヘッドの評価","並列/分散/協調処理に関するサマーワークショップ(SWoPP2015), 情報処理学会研究報告, 2015-ARC-216 No.31, 7pages",,,,,,2015,Aug. "野村 哲弘,佐々木 淳,三浦 信一,遠藤 敏夫,松岡 聡","TSUBAME2におけるスケジュール効率化への取り組みとユーザ動向の見える化","2015年並列/分散/協調処理に関する『別府』サマー・ワークショップ (SWoPP2015)","情報処理学会 研究報告",,"Vol. 2015-HPC-150","No. 2","pp. 1-7",2015,July "寺西 賢人,野村 哲弘,遠藤 敏夫,松岡 聡","ノード内同時実行ジョブにおけるパフォーマンスカウンタによるプロセス毎消費電力のモデル化","2015年並列/分散/協調処理に関する『別府』サマー・ワークショップ (SWoPP2015)","情報処理学会 研究報告",,"Vol. 2015-HPC-150","No. 28","pp. 1-6",2015,July "Toshio Endo,Satoshi Matsuoka.","Realizing Extremely Large-Scale Stencil Applications on GPU Supercomputers with a Memory Hierarchy Management Runtime Library. Workshop on Programming Abstractions for Data Locality (PADAL 2015), Berkeley",,,,,,,2015,June "Yuki Tsujita,Toshio Endo.","Data Driven Scheduling Approach for the Multi-node Multi-GPU Cholesky Decomposition","Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP), in conjunction with IPDPS 2015",,,,,,2015,May "Kazuki Tsuzuku,Toshio Endo.","Power Capping of CPU-GPU Heterogeneous Systems Using Power and Performance Models","International Conference on Smart Cities and Green ICT Systems (SMARTGREENS2015)",,,,,"pp. 1-8",2015,May "辻田裕紀,遠藤敏夫","マルチノード・マルチGPU上のコレスキー分解に対するデータドリブン型アルゴリズム手法","情報処理学会ハイパフォーマンスコンピューティングと計算科学シンポジウム (HPCS2015), ポスターセッション",,,,,,2015,May "野村 哲弘,三浦 信一,遠藤 敏夫,松岡 聡","アプリケーションのEmpiricalな性能モデル構築のためのプロファイル情報の収集","2015年ハイパフォーマンスコンピューティングと計算科学シンポジウム",,,,,,2015,May "遠藤敏夫","異種プロセッサマシンのメモリ階層を活用するHHRT ライブラリの実装","情報処理学会ハイパフォーマンスコンピューティングと計算科学シンポジウム (HPCS2015), ポスターセッション",,,,,,2015,May "Naoto Sasaki,Kento Sato,Toshio Endo,Satoshi Matsuoka.","Exploration of Lossy Compression for Application-level Checkpoint/Restart","IEEE International Conference on Parallel and Distributed Processing Symposium 2015 (IPDPS2015)",,,,,,2015,May "高嵜 祐樹,遠藤 敏夫,松岡 聡","GPUクラスタにおける大規模都市気流シミュレーションの最適化と性能モデル","情報処理学会ハイパフォーマンスコンピューティングと計算科学シンポジウム (HPCS2015)",,,,,,2015,May "高嵜 祐樹,遠藤敏夫,松岡 聡","GPU搭載システムにおける都市気流シミュレーションの大規模化と性能モデル",,"情報処理学会研究報告. [ハイパフォーマンスコンピューティング]","一般社団法人情報処理学会","Vol. 2015","No. 13","pp. 1-8",2015,Feb. "Toshio Endo","Harnessing Memory Hierarchy towards Extreme Fast and Big Simulations","2015 Conference on Advanced Topics and Auto Tuning in High-Performance Scientific Computing.","Proc. of 2015 Conference on Advanced Topics and Auto Tuning in High-Performance Scientific Computing.",,,,,2015,Feb. "Toshio Endo,Akira Nukada,Satoshi Matsuoka","TSUBAME-KFC: a Modern Liquid Submersion Cooling Prototype towards Exascale Becoming the Greenest Supercomputer in the World","The 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2014)","Proc. of The 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2014)",,,,"pp. 360-367",2014,Dec. "Guanghao Jin,James Lin,Toshio Endo","Efficient Utilization of Memory Hierarchy to Enable the Computation on Bigger Domains for Stencil Computation in CPU-GPU Based Systems","IEEE International Conference on High Performance Computing and Applications (ICHPCA-2014)","Proc. of IEEE International Conference on High Performance Computing and Applications (ICHPCA-2014)",,,,,2014,Dec. "Toshio Endo,Guanghao Jin","Software Technologies Coping with Memory Hierarchy of GPGPU Clusters for Stencil Computations","IEEE Cluster Computing (CLUSTER2014)","Proc. of IEEE Cluster Computing (CLUSTER2014)",,,,"pp. 132-139",2014,Sept. "遠藤敏夫,額田彰,松岡聡","超省エネスーパーコンピュータTSUBAME",,"PETROTECH","石油学会","Vol. 37","No. 8","pp. 605-609",2014,Aug. "野村哲弘,三浦信一,遠藤敏夫,松岡聡","実アプリケーションを用いた計算機評価ベンチマークと性能リポジトリの開発","2014年並列/分散/協調処理に関する『新潟』サマー・ワークショップ (SWoPP2015)","情報処理学会研究報告",,"Vol. 2014-HPC-145","No. 29","pp. 1-7",2014,July "Guanghao Jin,Toshio Endo","Data Management and Loop Controlling to Surpass Memory Capacity of GPU in OpenACC Framework","GTC Technology Conference Japan",,,,,,2014,July "Hiroko Midorikawa,Hideyuki Tan,Toshio Endo","An Evaluation of the Potential of Flash SSD as Large and Slow Memory for Stencil Computations","The 2014 International Conference on High Performance Computing & Simulation (HPCS 2014)","Proc. of The 2014 International Conference on High Performance Computing & Simulation (HPCS 2014)",,,,,2014,July "Toshio Endo","Experiences with the 5.7Pflop/s System TSUBAME2.5 at Tokyo Tech","HP-CAST 22",,,,,,2014,June "遠藤敏夫,額田彰,松岡聡","TSUBAME-KFC: 液浸冷却を用いた世界一省エネなスーパーコンピュータ",,"TSUBAME e-Science Journal","東京工業大学 学術国際情報センター",,"No. 11","pp. 2-7",2014,June "Akihiro Nomura,Shinichi Miura,Toshio Endo,SATOSHI MATSUOKA","Application Performance Characterization towards Exa-Scale Supercomputers","HPC in Asia 2014",,,,,,2014,June "Katsuki Fujisawa,Toshio Endo,Yuichiro Yasui,Hitoshi Sato,Naoki Matsuzawa,Satoshi Matsuoka,Hayato Waki","Peta-scale General Solver for Semidefinite Programming Problems with over Two Million Constraints","IEEE International Conference on Parallel and Distributed Processing Symposium 2014 (IPDPS2014)","Proc. of IEEE International Conference on Parallel and Distributed Processing Symposium 2014 (IPDPS2014)",,,,"pp. 1171-1180",2014,May "渡辺 治,遠藤 敏夫","スーパーコンピューティングコンテスト・2013",,"数学セミナー","日本評論社","Vol. 53","No. 1","pp. 50-55",2014, "松岡 聡,佐藤 賢斗,遠藤敏夫","エクサスケールスパコンに向けた耐故障性の評価 --- TSUBAME2.0を例にして ---","情報処理学会研究報告. [ハイパフォーマンスコンピューティング] 2013-HPC-141(22)",,,,,,2013,Oct. "Guanghao Jin,Toshio Endo,Satoshi Matsuoka","A Parallel Optimization Method for Stencil Computation on the Domain that is Bigger than Memory Capacity of GPUs","IEEE Cluster Computing (CLUSTER2013)","Proc. of IEEE Cluster Computing (CLUSTER2013)",,,,"pp. 1-8",2013,Sept. "野村哲弘,三浦信一,遠藤敏夫,松岡聡,鈴木惣一朗,丸山直也","システム評価のためのアプリケーション性能リポジトリの構築と性能モデルの評価","2013年並列/分散/協調処理に関する 『北九州』サマー・ワークショップ(SWoPP北九州2013)","情報処理学会 研究報告","情報処理学会","Vol. 2013-HPC-140","No. 4","pp. 1-6",2013,July "Yukinori Sato,Hiroko Midorikawa,Toshio Endo","Identifying working data set of particular loop iterations for dynamic performance tuning","Workshop on Architectural and Microarchitectural Support for Binary Translation (AMAS-BT2013)","Proc. of Workshop on Architectural and Microarchitectural Support for Binary Translation (AMAS-BT2013)",,,,"pp. 1-6",2013,June "Guanghao Jin,Toshio Endo,Satoshi Matsuoka","A Multi-level Optimization Method for Stencil Computation on the Domain that is Bigger than Memory Capacity of GPU","The Third International Workshop on Accelerators and Hybrid Exascale Systems (AsHES)","Proc. of The Third International Workshop on Accelerators and Hybrid Exascale Systems (AsHES)",,,,"pp. 1080-1087",2013,May "金 光浩,遠藤 敏夫,松岡 聡","GPUメモリ容量を超える問題規模に対応する高性能ステンシル計算法","ハイパフォーマンスコンピューティングとアーキテクチャの評価に 関する北海道ワークショップ(HOKKE-20)","情報処理学会研究報告","情報処理学会","Vol. 2012-ARC-194/HPC-137",,,2012,Dec. "野村 哲弘,遠藤 敏夫,松岡 聡","TSUBAME2.0におけるMulti-rail InfiniBandネットワークの性能評価","ハイパフォーマンスコンピューティングとアーキテクチャの評価に 関する北海道ワークショップ(HOKKE-20)","情報処理学会研究報告","情報処理学会","Vol. 2012-ARC-194/HPC-137",,,2012,Dec. "藤澤克樹,遠藤敏夫","大規模半正定値計画問題に対する内点法アルゴリズムの高速計算",,"TSUBAME e-Science Journal","東京工業大学 学術国際情報センター",,"No. 7",,2012,Dec. "Katsuki Fujisawa,Toshio Endo,Hitoshi Sato,Makoto Yamashita,Satoshi Matsuoka,Maho Nakata","High-Performance General Solver for Extremely Large-scale Semidefinite Programming Problems","International Conference for High Performance Computing, Networking, Storage and Analysis (SC12)","Proceedings of IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC12)","IEEE/ACM",,,,2012,Nov. "Shiqiao Du,Takuro Udagawa,Toshio Endo,Masakazu Sekijima","Molecular Dynamics Simulation of a Biomolecule with High Speed, Low Power and Accuracy Using GPU-Accelerated TSUBAME2.0 Supercomputer","Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2011)","Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC 2011)",,,,,2011,Dec. "Massimo Bernaschi,Mauro Bisson,Toshio Endo,Massimiliano Fatica,Satoshi Matsuoka,Simone Melchionna,Sauro Succi","Petaflop Biofluidics Simulations On A Two Million-Core System","International Conference for High Performance Computing, Networking, Storage and Analysis (SC11)","Proceedings of IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC11)","IEEE/ACM",,,,2011,Nov. "Takashi Shimokawabe,Takayuki Aoki,Tomohiro Takaki,Akinori Yamanaka,Akira Nukada,Toshio Endo,Naoya Maruyama,Satoshi Matsuoka","Peta-scale Phase-Field Simulation for Dendritic Solidification on the TSUBAME 2.0 Supercomputer","International Conference for High Performance Computing, Networking, Storage and Analysis (SC11)","Proceedings of IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC11)","IEEE/ACM",,,,2011,Nov. "遠藤 敏夫,額田 彰,松岡 聡,長坂 真路,四津 匡康","グリーンスパコンTSUBAME2.0における電力危機対応運用","ハイパフォーマンスコンピューティングとアーキテクチャの評価に関する北海道ワークショップ(HOKKE-19)","情報処理学会研究報告","情報処理学会","Vol. 2011-ARC-197/HPC-132",,"pp. 1-9",2011,Nov. "遠藤 敏夫,額田 彰,松岡 聡","スーパーコンピュータTSUBAME 2.0 における Linpack 性能1 ペタフロップス超の達成",,"情報処理学会論文誌コンピューティングシステム","情報処理学会","Vol. 4","No. 4 (ACS 35)","pp. 169--179",2011,Oct. "遠藤敏夫","GPGPUと東工大TSUBAME2.0スパコン",,"電子情報通信学会ソサイエティ大会講演論文集","一般社団法人電子情報通信学会","Vol. 2011",,"pp. "SS-54"-"SS-55"",2011,Aug. "Irina Demeshko,Satoshi Matsuoka,Toshio Endo","GPU-based approach for elastic-plastic deformation simulation","Summer United Workshops on Parallel, Distributed and Cooperative Processing (SWoPP 2011)","IPSJ SIG Technical Report","IPSJ","Vol. 2011-HPC-130","No. 12","pp. 1-7",2011,Aug. "遠藤敏夫","ペタスケールグリーンスパコンTSUBAME2.0",,"計算工学","日本計算工学会","Vol. 16","No. 3","pp. 34-35",2011,July "遠藤 敏夫,額田 彰,松岡 聡","スーパーコンピュータTSUBAME 2.0 における Linpack 性能1 ペタフロップス超の達成","先進的計算基盤システムシンポジウム(SACSIS2011)","情報処理学会SACSIS2011論文集","情報処理学会",,,"pp. 1-8",2011,May "Tatsuo Nomura,Naoya Maruyama,Toshio Endo,Satoshi Matsuoka","A Sequential Programming Framework for Large-Scale GPU-Accelerated Structured Grids","SIAM Conference on Computational Science and Enginnering",,,,,,2011,Mar. "野村 達雄,丸山 直也,遠藤 敏夫,松岡 聡","ステンシル計算を対象とした大規模GPUクラスタ向け自動並列化フレームワーク","ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2011)",,"情報処理学会",,,,2011,Jan. "遠藤 敏夫,額田 彰,松岡 聡","ヘテロ型スーパーコンピュータTSUBAME 2.0のLinpackによる性能評価","ハイパフォーマンスコンピューティングとアーキテクチャの評価に関する北海道ワークショップ(HOKKE-18)","情報処理学会研究報告","情報処理学会","Vol. 2010-ARC-192/HPC-128",,"pp. 1-6",2010,Dec. "島田 大地,遠藤 敏夫,丸山 直也,松岡 聡","OpenCLを用いた異種GPUにおける性能特性に応じた最適化","ハイパフォーマンスコンピューティングとアーキテクチャの評価に関する北海道ワークショップ(HOKKE-18)","情報処理学会研究報告","情報処理学会","Vol. 2010-ARC-192/HPC-128",,"pp. 1-7",2010,Dec. "長坂 仁,丸山 直也,額田 彰,遠藤 敏夫,松岡 聡","GPUにおけるモデルに基づいた電力効率の最適化","ハイパフォーマンスコンピューティングとアーキテクチャの評価に関する北海道ワークショップ(HOKKE-18)","情報処理学会研究報告","情報処理学会","Vol. 2010-ARC-192/HPC-128",,"pp. 1-6",2010,Dec. "野村 達雄,丸山 直也,遠藤 敏夫,松岡 聡","ステンシル計算を対象とした大規模GPUクラスタ向け自動並列化フレームワーク","ハイパフォーマンスコンピューティングとアーキテクチャの評価に関する北海道ワークショップ(HOKKE-18)","情報処理学会研究報告","情報処理学会","Vol. 2010-ARC-192/HPC-128",,"pp. 1-9",2010,Dec. "Takashi Shimokawabe,Takayuki Aoki,Chiashi Muroi,Junichi Ishida,Kohei Kawano,Toshio Endo,Akira Nukada,Naoya Maruyama,Satoshi Matsuoka","An 80-Fold Speedup, 15.0 TFlops, Full GPU Acceleration of Non-Hydrostatic Weather Model ASUCA Production Code","International Conference for High Performance Computing, Networking, Storage and Analysis (SC10)","Proceedings of IEEE/ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC10)","IEEE/ACM",,,,2010,Nov. "Nguyen Toan,Tatsuo Nomura,Hideyuki Jitsumoto,Naoya Maruyama,Toshio Endo,Satoshi Matsuoka","MPI-CUDA Application Checkpointing","GPU Technology Conference 2010","GPU Technology Conference 2010",,,,,2010,Sept. "松岡聡,遠藤敏夫,丸山直也,佐藤仁,滝澤真一朗","TSUBAME2.0の全貌",,"TSUBAME e-Science Journal","東京工業大学学術国際情報センター",,"No. 1",,2010,Sept. "野村 達雄,丸山 直也,遠藤 敏夫,松岡 聡","GPUクラスタを対象にした並列ステンシル計算の自動コード生成フレームワーク","並列/分散/協調処理に関するサマーワークショップ(SWoPP2010)","情報処理学会研究報告","情報処理学会","Vol. 2010-HPC-126",,"pp. 1-10",2010,Aug. "Nguyen Toan,Hideyuki Jitsumoto,Naoya Maruyama,Tatsuo Nomura,Toshio Endo,Satoshi Matsuoka","MPI-CUDA Applications Checkpointing","Summer United Workshops on Parallel, Distributed and Cooperative Processing (SWoPP 2010)","IPSJ SIG Technical Report","IPSJ","Vol. 2010-HPC-126","No. 18","pp. 1-7",2010,Aug. "Hitoshi Nagasaka,Naoya Maruyama,Akira Nukada,Toshio Endo,SATOSHI MATSUOKA","Statistical Power Modeling of GPU Kernels Using Performance Counters","International Green Computing Conference (IGCC'10)","Proceedings of IEEE International Green Computing Conference (IGCC'10),","IEEE",,,"pp. 115-122",2010,Aug. "松岡聡,青木尊之,遠藤敏夫,丸山直也,佐藤仁,滝澤真一朗,實本英之","TSUBAMEの造り方から探る PCクラスターと「スパコン」のあいだ",,"月刊ASCII .technologies","アスキー・メディアワークス","Vol. 15","No. 7","pp. 48--55",2010,July "遠藤 敏夫,額田 彰,松岡 聡","異種アクセラレータを持つTSUBAMEスーパーコンピュータのLinpack評価",,"応用数理","応用数理学会","Vol. 20","No. 2","pp. 29-36",2010,June "Toshio Endo,Akira Nukada,SATOSHI MATSUOKA,Naoya Maruyama","Linpack Evaluation on a Supercomputer with Heterogeneous Accelerators","IEEE International Parallel & Distributed Processing Symposium (IPDPS 2010)","Proceedings of IEEE International Parallel & Distributed Processing Symposium (IPDPS 2010)","IEEE",,,"page 10",2010,Apr. "浜野智明,額田彰,遠藤敏夫,松岡聡","GPUクラスタにおける省電力タスクスケジューリング","第124回HPC研究会","情報処理学会研究報告2010-HPC-124","情報処理学会",,,"pp. 1--8",2010,Feb. "渡辺祐也,遠藤敏夫,松岡聡","GPU クラスタにおける科学技術計算の自動最適化","HPC研究会","情報処理学会研究報告","情報処理学会","Vol. 2010-HPC-124","No. 18","pp. 1--7",2010,Feb. "實本英之,中村俊介,遠藤敏夫,松岡聡","増分データとErasure Coding を利用した高速なチェックポイント手法","HPC研究会","情報処理学会研究報告","情報処理学会","Vol. 2009-HPC-122","No. 9","pp. 1--6",2009,Oct. "遠藤 敏夫,額田 彰,松岡 聡,丸山 直也","異種アクセラレータを持つヘテロ型スーパーコンピュータ上のLinpack の性能向上手法","並列/分散/協調処理に関するサマーワークショップ(SWoPP2009)","情報処理学会研究報告","情報処理学会","Vol. 2009-HPC-121","No. 24","Page 8",2009,Aug. "滝澤真一朗,遠藤敏夫,松岡聡","次世代光インターコネクトでのMPI通信に関する研究",,"コンピュータソフトウェア","日本ソフトウェア科学会","Vol. 26","No. 3","pp. 5--19",2009,Aug. "長坂仁,丸山直也,額田 彰,遠藤 敏夫,松岡 聡","GPU における性能と消費電力 の相関性の解析",,"情報処理学会研究報告2009-HPC-121",,,"No. 26","pp. 1--6",2009,July "Satoshi Matsuoka,Takayuki Aoki,Toshio Endo,Akira Nukada,Toshihiro Kato,Atsushi Hasegawa","GPU accelerated computing?from hype to mainstream, the rebirth of vector computing","Scientific Discovery through Advanced Computing (SciDAC 2009)","Journal of Physics: Conference Series","IOP","Vol. 180","No. 1","pp. 012043",2009,July "滝澤真一朗,遠藤敏夫,松岡聡","光サーキットネットワークの補助的利用によるHPCアプリケーション性能向上",,"情報処理学会 コンピューティングシステム(ACS)",,"Vol. 2","No. 2","pp. 110--121",2009,July "島田大地,丸山直也,額田彰,遠藤 敏夫,松岡 聡","GPUにおける耐故障性を考慮した数値計算の電力性能",,"情報処理学会研究報告2009-HPC-121",,,"No. 26","pp. 1--5",2009,July "Toshio Endo","Supercomputing on The TSUBAME GPU-Accelerated Cluster","CSIRO GPU Cluster Workshop",,,,,,2009,June "加藤季広,青木尊之,額田彰,遠藤敏夫,松岡聡,長谷川篤史","姫野ベンチマークのGPUマルチノード実行における通信と演算のオーバーラップによる高速化 ? 32GPUで700GFLOPS超を達成 ?","HPC研究会","情報処理学会研究報告「ハイパフォーマンスコンピューティング(HPC)」","情報処理学会","Vol. 2009-HPC-120","No. 3","pp. 1--6",2009,June "Hitoshi Sato,Satoshi Matsuoka,Toshio Endo","File Clustering Based Replication Algorithm in a Grid Environment","The 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","IEEE Computer Society",,,"pp. 204-211",2009,May "島田大地,丸山直也,額田 彰,遠藤 敏夫,松岡 聡","GPUにおける耐故障性を考慮した数値計算の電力性能",,"先進的計算シンポジウム (SACSIS2009)、ポスター発表",,"Vol. 2009","No. 5","pp. 161--163",2009,May "長坂仁,丸山直也,額田 彰,遠藤 敏夫,松岡 聡","GPUにおける性能と消費電力の相関性の解析",,"先進的計算シンポジウム (SACSIS2009)、ポスター発表",,"Vol. 2009","No. 5","pp. 151--152",2009,May "Tomoaki Hamano,Toshio Endo,Satoshi Matsuoka","Power-Aware Dynamic Task Scheduling for Heterogeneous Accelerated Clusters","The Fifth Workshop on High-Performance, Power-Aware Computing (HPPAC), in conjunction to IEEE IPDPS 2009","2009 IEEE International Symposium on Parallel&Distributed Processing",,,,"pp. 1--8",2009,May "Hideyuki Jitsumoto,Toshio Endo,Satoshi Matsuoka","Environmental-aware optimization of MPI checkpointing intervals","HPC ASIA 2009","HPC ASIA 2009",,,,"pp. 8pages-",2009,Mar. "遠藤敏夫,額田彰,松岡聡,丸山直也,實本英之","四種プロセッサからなるヘテロ型スーパーコンピュータにおけるLinpackチューニング","計算機アーキテクチャ・ハイパフォーマンスコンピューティング合同研究発表会(HOKKE-2009)","計算機アーキテクチャ・ハイパフォーマンスコンピューティング合同研究発表会(HOKKE-2009)論文集",,,,,2009,Mar. "細萱祐人,遠藤敏夫,松岡聡","スワップコストの動的推定によるメモリの省電力化手法","計算機アーキテクチャ・ハイパフォーマンスコンピューティング合同研究発表会(HOKKE-2009)",,,,,,2009,Mar. "山崎翔平,遠藤敏夫,松岡聡","プロセス間共通メモリイメージを考慮したマイグレーション最適化","計算機アーキテクチャ・ハイパフォーマンスコンピューティング合同研究発表会(HOKKE-2009)",,,,,,2009,Mar. "遠藤敏夫","東京工業大学TSUBAMEにおけるアクセラレータ活用事例",,"情報処理",,"Vol. 50","No. 2","pp. 100-106",2009,Feb. "遠藤敏夫,額田彰,松岡聡,丸山直也,實本英之","四種プロセッサからなるヘテロ型スーパーコンピュータにおけるLinpackチューニング","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2009)","ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2009)論文集",,,,,2009,Jan. "山崎翔平,遠藤敏夫,松岡聡","プロセス間共通メモリイメージを考慮したマイグレーション最適化","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2009)",,,,,,2009,Jan. "尾形泰彦,額田彰,丸山直也,遠藤敏夫,松岡聡","複数 GPU システムに対応する自動最適化 3D-FFT ライブラリ","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2009)","ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2009)論文集",,,,,2009,Jan. "細萱祐人,遠藤敏夫,松岡聡","SWAPアクセス数の実行時推定によるメモリの省電力化手法","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2009)",,,,,,2009,Jan. "滝澤 真一朗,遠藤敏夫,松岡聡","光サーキットネットワークの補助的利用によるHPCアプリケーション性能向上","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2009)","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2009)",,,,"pp. 65-72",2009,Jan. "Akira Nukada,Yasuhiko Ogata,Toshio Endo,Satoshi Matsuoka","Bandwidth intensive 3-D FFT kernel for GPUs using CUDA","2008 ACM/IEEE conference on Supercomputing (SC08)","Proceedings of the 2008 ACM/IEEE conference on Supercomputing (SC08)","IEEE",,,"pp. 1-11",2008,Nov. "Satoshi Matsuoka,Yutaka Akiyama,Akira Nukada,Toshio Endo,Yasuhiko Ogata,Fumikazu Konishi","HPC-GPGPU: Large-scale commodity accelerated clusters and its application to advanced structural proteomics","AHeDD2008/IPAB2008 Joint Symposium",,,,,,2008,Oct. "Hideyuki Jitsumoto,Toshio Endo,Satoshi Matsuoka","Environmental-aware optimization of MPI checkpointing intervals","The 2008 IEEE International Conference on Cluster Computing (Cluster 2008)","The 2008 IEEE International Conference on Cluster Computing (Cluster 2008)",,,,,2008,Sept. "Hitoshi Sato,Satoshi Matsuoka,Toshio Endo,Naoya Maruyama","Access-pattern and bandwidth aware file replication algorithm in a grid environment","The 9th IEEE/ACM International Conference on Grid Computing (Grid 2008)","The 9th IEEE/ACM International Conference on Grid Computing (Grid 2008)",,,,"pp. 250-257",2008,Sept. "遠藤敏夫","TSUBAMEにおけるアクセラレータの利用状況について","並列生物情報処理イニシアティブ(IPAB)アクセラレータWGセミナー",,,,,,2008,Sept. "遠藤敏夫","アクセラレータを用いた大規模ヘテロ環境におけるLinpack","九州大学情報基盤研究開発センター 先駆的科学計算に関するフォーラム2008",,,,,,2008,Sept. "滝澤 真一朗,遠藤敏夫,松岡聡","光ネットワークの補助的利用によるHPC性能向上","並列/分散/協調処理に関するサマーワークショップ(SWoPP2008)",,,,,,2008,Aug. "額田彰,尾形泰彦,遠藤敏夫,松岡聡","CUDA 環境における高性能3次元FFT",,"情報処理学会論文誌コンピューティングシステム",,"Vol. 1","No. 2","pp. 231-239",2008,Aug. "浜野智明,遠藤敏夫,松岡聡","ヘテロ計算環境のための省電力タスクスケジューリング","並列/分散/協調処理に関するサマーワークショップ(SWoPP2008)",,,,,,2008,Aug. "渡辺祐也,遠藤敏夫,松岡聡","複数GPUにおけるセルフスケジューリングによる並列数値演算","並列/分散/協調処理に関するサマーワークショップ(SWoPP2008)",,,,,,2008,Aug. "佐藤 仁,松岡聡,遠藤敏夫","広域分散ファイルシステムにおけるアクセスパターンと性能を考慮したファイル配置","並列/分散/協調処理に関するサマーワークショップ(SWoPP2008)",,,,,,2008,Aug. "丸山直也,松岡聡,尾形泰彦,額田彰,遠藤敏夫","ソフトウェアECCによるGPUメモリの耐故障性の実現と評価","並列/分散/協調処理に関するサマーワークショップ(SWoPP2008)",,,,,,2008,Aug. "千葉立寛,遠藤敏夫,松岡聡","グリッド環境におけるMPI Scatter/Gather通信アルゴリズムの最適化","並列/分散/協調処理に関するサマーワークショップ(SWoPP2008)",,,,,,2008,Aug. "浜野智明,遠藤敏夫,松岡聡","ヘテロ計算環境のための省電力タスクスケジューリング","情報処理学会 先進的基盤システムシンポジウム (SACSIS2008)",,,,,,2008,June "渡辺裕也,遠藤敏夫,松岡聡","不均一な複数GPUにおけるセルフスケジューリングによる並列数値演算","情報処理学会 先進的基盤システムシンポジウム (SACSIS2008)",,,,,,2008,June "尾形泰彦,遠藤敏夫,丸山直也,松岡聡","性能モデルに基づくCPU及びGPUを併用する効率的なFFTライブラリ",,"情報処理学会論文誌コンピューティングシステム",,"Vol. 1","No. 1","pp. 40-50",2008,June "額田彰,尾形泰彦,遠藤敏夫,松岡聡","CUDA 環境における高性能3次元FFT","情報処理学会 先進的計算基盤システムシンポジウム(SACSIS2008)","情報処理学会 先進的計算基盤システムシンポジウム(SACSIS2008)",,,,"pp. 81-88",2008,June "Yuto Hosogaya,Toshio Endo,Satoshi Matsuoka","Performance evaluation of parallel applications on next generation memory architecture with power-aware paging method","The Fourth Workshop on High-Performance","The Fourth Workshop on High-Performance, Power-Aware Computing (HPPAC), in conjunction with IEEE IPDPS 2008",,,,"pp. 8pages-",2008,Apr. "Toshio Endo,Satoshi Matsuoka","Massive supercomputing coping with heterogeneity of modern accelerators","IEEE International Parallel & Distributed Processing Symposium (IPDPS 2008)","IEEE International Parallel & Distributed Processing Symposium (IPDPS 2008)",,,,"pp. 10pages-",2008,Apr. "Shin'ichiro Takizawa,Toshio Endo,Satoshi Matsuoka","Locality aware MPI communication on a commodity opto-electronic hybrid network","Workshop on Large-Scale Parallel Processing (LSPP)","Workshop on Large-Scale Parallel Processing (LSPP), in conjunction with IEEE IPDPS 2008",,,,"pp. 8pages-",2008,Apr. "Yasuhiko Ogata,Toshio Endo,Naoya Maruyama,Satoshi Matsuoka","An efficient, model-based CPU-GPU heterogeneous FFT library","International Heterogeneity in Computing Workshop (HCW '08)","International Heterogeneity in Computing Workshop (HCW '08), in conjunction with IEEE IPDPS 2008",,,,"pp. 10pages-",2008,Apr. "滝澤 真一朗,遠藤敏夫,松岡聡","情報爆発時代の光インターコネクト上でのMPI通信アルゴリズム","第70回情報処理学会全国大会",,,,,,2008,Mar. "遠藤敏夫,松岡聡","情報爆発時代へ向けた不均一アーキテクチャにおけるスーパーコンピューティング","第70回情報処理学会全国大会",,,,,,2008,Mar. "佐藤 仁,松岡聡,遠藤敏夫","情報爆発時代のグリッドファイルシステム上での大規模データ管理","第70回情報処理学会全国大会",,,,,,2008,Mar. "實本英之,遠藤敏夫,松岡聡","情報爆発に対応する耐故障性 MPI フレームワークの提案","第70回情報処理学会全国大会",,,,,,2008,Mar. "千葉立寛,遠藤敏夫,松岡聡","情報爆発時代のグリッド環境に対応したMPI集団通信アルゴリズムの最適化","第70回情報処理学会全国大会",,,,,,2008,Mar. "細萱祐人,遠藤敏夫,松岡聡","省電力ページング方式を実装した次世代メモリアーキテクチャ上での並列プログラムの評価","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2008)","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2008)",,,,"pp. 25-32",2008,Jan. "尾形泰彦,遠藤敏夫,丸山直也,松岡聡","性能モデルに基づくCPU及びGPUを併用する効率的なFFTライブラリ","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2008)","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2008)",,,,"pp. 107-114",2008,Jan. "滝澤 真一朗,遠藤敏夫,松岡聡","次世代光インターコネクトでの MPI 通信性能の評価","日本ソフトウェア科学会第24回大会(2007年度)",,,,,,2007,Sept. "佐藤 仁,松岡聡,遠藤敏夫","広域分散環境における大規模データ管理のためのノードグルーピング","情報処理学会研究報告",,,,,,2007,Aug. "滝澤 真一朗,遠藤敏夫,松岡聡","次世代光インターコネクト上での MPI アプリケーションの評価","情報処理学会研究報告 2007-HPC-111(SWOPP2007)",,,,,,2007,Aug. "尾形泰彦,遠藤敏夫,松岡聡","CPUおよびGPUを併用するFFTライブラリの提案と評価","情報処理学会研究報告 2007-HPC-111(SWOPP2007)",,,,,,2007,Aug. "細萱祐人,遠藤敏夫,松岡聡","次世代省電力メモリを用いた並列プログラムの省電力化の評価","情報処理学会研究報告 2007-HPC-111(SWOPP2007)",,,,,,2007,Aug. "Tatsuhiro Chiba,Toshio Endo,Satoshi Matsuoka","High-performance MPI broadcast algorithm for grid environments utilizing multi-lane NICs","Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid'07)","Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid'07)",,,,"pp. 487-494",2007,May "實本英之,遠藤敏夫,松岡聡","フォールト/リカバリモデルを考慮した耐故障性をもつMPI フレームワークABARIS の提案と評価","情報処理学会研究報告2007-HPC-109(HOKKE2007)",,,,,,2007,Mar. "Hideyuki Jitsumoto,Toshio Endo,Satoshi Matsuoka","ABARIS: An adaptable fault detection/recovery component framework for MPIs","12th IEEE Workshop on Dependable Parallel","12th IEEE Workshop on Dependable Parallel, Distributed and Network-Centric Systems (DPDNS07), in conjunction with IPDPS2007",,,,"pp. 311-",2007,Mar. "遠藤敏夫,松岡聡,橋爪信明,長坂真路","ヘテロ型スーパーコンピュータTSUBAMEのLinpackによる性能評価",,"情報処理学会論文誌コンピューティングシステム","情報処理学会","Vol. 48","No. SIG8 (ACS18)","pp. 62-70",2007,Jan. "遠藤敏夫,松岡聡,橋爪信明,長坂真路","ヘテロ型スーパーコンピュータTSUBAMEのLinpackによる性能評価","2007年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2007","2007年ハイパフォーマンスコンピューティングと計算科学シンポジウムHPCS2007論文集",,,,"pp. 33-40",2007,Jan. "千葉立寛,遠藤敏夫,松岡聡","グリッド環境におけるマルチレーンを用いたMPIコレクティブ通信アルゴリズム","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2007)","情報処理学会 ハイパフォーマンスコンピューティングと計算科学シンポジウム(HPCS2007)",,,,"pp. 95-102",2007,Jan. "千葉立寛,遠藤敏夫,松岡聡","グリッド環境におけるマルチレーンを用いたMPIコレクティブ通信アルゴリズム",,"情報処理学会論文誌コンピューティングシステム",,"Vol. 48","No. SIG6","pp. 103-113",2007, "合田憲人,大澤清,大角知孝,笠井武史,小野功,實本英之,松岡聡,斎藤秀雄,遠藤敏夫,横山大作,田浦健次朗,近山隆,田中良夫,下坂久司,梶原広輝,廣安知之,藤澤克樹","グリッドチャレンジテストベッドの構築と運用〜グリチャレテストベッドの作り方〜","並列/分散/協調処理に関する『高知』サマー・ワークショップ(SWoPP2006)","情報処理学会研究報告 2006-HPC-107",,,,"pp. 49-54",2006,July "遠藤敏夫,松岡聡,橋爪信明,長坂真路,後藤和茂","ヘテロ型スーパーコンピュータTSUBAMEのLinpackによる性能評価","並列/分散/協調処理に関する『高知』サマー・ワークショップ(SWoPP2006)","情報処理学会研究報告 2006-HPC-107",,,,"pp. 43-48",2006,July