"Yosuke Oyama,Naoya Maruyama,Nikoli Dryden,Erin McCarthy,Peter Harrington,Jan Balewski,Satoshi Matsuoka,Peter Nugent,Brian Van Essen","The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism",,"IEEE Transactions on Parallel & Distributed Systems (TPDS)",,"vol. 32","no. 7","pp. 1641-1652",2021,July "Jens Domke,Emil Vatai,Alexsandr Drozd,Peng Chen,Yosuke Oyama,Lingqi Zhang,Shweta Salaria,Daichi Mukunoki,Artur Podobas,Mohamed Wahib,Satoshi Matsuoka","Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws?","International Parallel and Distributed Processing Symposium (IPDPS 2021)",,,,,,2021,May "Yosuke Oyama","Hierarchical Hybrid Parallel Training of Large-Scale Convolutional Neural Networks",,,,,,,2021,Mar. "Yosuke Oyama","Hierarchical Hybrid Parallel Training of Large-Scale Convolutional Neural Networks",,,,,,,2021,Mar. "Yosuke Oyama","Hierarchical Hybrid Parallel Training of Large-Scale Convolutional Neural Networks",,,,,,,2021,Mar. "Jens Domke,Emil Vatai,Alexsandr Drozd,Peng Chen,Yosuke Oyama,Lingqi Zhang,Shweta Salaria,Daichi Mukunoki,Artur Podobas,Mohamed Wahib,Satoshi Matsuoka","Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws?",,,,,,,2020,Oct. "Yosuke Oyama,Naoya Maruyama,Nikoli Dryden,Erin McCarthy,Peter Harrington,Jan Balewski,Satoshi Matsuoka,Peter Nugent,Brian Van Essen","The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism",,,,,,,2020,July "Yosuke Oyama,Naoya Maruyama,Nikoli Dryden,Peter Harrington,Jan Balewski,Satoshi Matsuoka,Marc Snir,Peter Nugent,Brian Van Essen","Toward Training a Large 3D Cosmological CNN with Hybrid Parallelization","The 1st Workshop on Parallel and Distributed Machine Learning 2019 (PDML'19)",,,,,,2019,Aug. "Yosuke Oyama,Naoya Maruyama,Nikoli Dryden,Peter Harrington,Jan Balewski,Satoshi Matsuoka,Marc Snir,Peter Nugent,Brian Van Essen","Toward Training a Large 3D Cosmological CNN with Hybrid Parallelization","48th International Conference on Parallel Processing (ICPP 2019)",,,,,,2019,Aug. "Yosuke Oyama,Naoya Maruyama,Nikoli Dryden,Peter Harrington,Jan Balewski,Satoshi Matsuoka,Marc Snir,Peter Nugent,Brian Van Essen","Toward Training a Large 3D Cosmological CNN with Hybrid Parallelization","第170回ハイパフォーマンスコンピューティング研究発表会",,,,,,2019,July "土川 稔生,遠藤 敏夫,野村 哲弘,近藤正章,大山 洋介,松岡 聡","メモリアクセスデータを用いた機械学習によるアプリケーションの類型化","並列/分散/協調処理に関するサマーワークショップ(SWoPP2019), 情報処理学会研究報告, 2019-HPC-170 No.12",,,,,,2019,July "Yosuke Oyama,Tal Ben-Nun,Torsten Hoefler,Satoshi Matsuoka","u-cuDNN: Accelerating Deep Learning Frameworks with Micro-Batches","GPU Technology Conference 2019 (GTC2019)",,,,,,2019,Mar. "Yosuke Oyama,Tal Ben-Nun,Torsten Hoefler,Satoshi Matsuoka","Accelerating Deep Learning Frameworks with Micro-batches","IEEE Cluster 2018",,,,,,2018,Sept. "土川稔生,大山洋介,野村哲弘,松岡聡","機械学習による計算機トレースの自動生成","並列/分散/協調処理に関するサマーワークショップ (SWoPP2018)",,,,,,2018,Aug. "Yosuke Oyama,Tal Ben-Nun,Torsten Hoefler,Satoshi Matsuoka","μ-cuDNN",,,,,,,2018,July "八島慶汰,大山洋介,松岡聡","深層学習におけるBatchNormalization使用時の計算時間と精度の関係性","並列/分散/協調処理に関するサマーワークショップ (SWoPP2018)",,,,,,2018,July "Yosuke Oyama,Tal Ben-Nun,Torsten Hoefler,SATOSHI MATSUOKA","μ-cuDNN: Accelerating Deep Learning Frameworks with Micro-Batching",,,,,,,2018,Apr. "Yosuke Oyama,Tal Ben-Nun,Torsten Hoefler,Satoshi Matsuoka","Less is More: Accelerating Deep Neural Networks with Micro-Batching","第162回ハイパフォーマンスコンピューティング研究発表会",,,,,,2017,Dec. "Ikuro Sato,Ryo Fujisaki,Yosuke Oyama,Akihiro Nomura,Satoshi Matsuoka","Asynchronous, data-parallel deep convolutional neural network training with linear prediction model for parameter transition","The 24th International Conference On Neural Information Processing (ICONIP 2017)","International Conference on Neural Information Processing",,"volume 10635",,"pp. 305-314",2017,Nov. "Yosuke Oyama,Akihiro Nomura,Ikuro Sato,Hiroki Nishimura,Yukimasa Tamatsu,Satoshi Matsuoka","Predicting Probabilistic Parameters of a Large-Scale Asynchronous SGD Deep Learning System","GPU Technology Conference 2017 (GTC 2017)",,,,,,2017,May "大山洋介,野村哲弘,佐藤育郎,松岡聡","ディープラーニングのデータ並列学習における少精度浮動小数点数を用いた通信量の削減","第158回ハイパフォーマンスコンピューティング研究発表会",,,,,,2017,Mar. "Yosuke Oyama,Akihiro Nomura,Ikuro Sato,Hiroki Nishimura,Yukimasa Tamatsu,Satoshi Matsuoka","Predicting Statistics of Asynchronous SGD Parameters for a Large-Scale Distributed Deep Learning System on GPU Supercomputers","2016 IEEE International Conference on Big Data (IEEE BigData 2016)",,,,,,2016,Dec. "Yosuke Oyama,Hiroki Ohtsuji,Jun Kato,Kosuke Suzuki,Mitsuru Sato,Eiji Yoshida","Partially-Decompressible Dictionary Based Compression Format for All Flash Array","1st Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems (PDSW-DISCS’16)",,,,,,2016,Nov. "大山洋介,野村哲弘,佐藤育郎,西村裕紀,玉津幸政,松岡聡","学習条件を考慮した大規模非同期ディープラーニングシステムの性能モデリング","並列/分散/協調処理に関するサマーワークショップ(SWoPP2016)",,,,,,2016,Aug. "Yosuke Oyama,Akihiro Nomura,Ikuro Sato,Hiroki Nishimura,Yukimasa Tamatsu,SATOSHI MATSUOKA","Training Condition Conscious Performance Modeling of an Asynchronous Data-Parallel Deep Learning System","ACM Symposium on High-Performance Parallel and Distributed Computing",,,,,,2016,May