Keren Zhou
Keren Zhou
Home
Experience
Projects
Featured
Publications
Talks
Students
Tags
News
Light
Dark
Automatic
HPC
Accelerating High-order Stencils on GPUs
Finite-difference methods based on high-order stencils are commonly used for modeling of seismic wave propagation, weather forecasting, …
Ryuichi Sai
,
John Mellor-Crummey
,
Xiaozhu Meng
,
Keren Zhou
,
Mauricio Araya-Polo
,
Jie Meng
Cite
Project
DOI
URL
An Automated Tool for Analysis and Tuning of GPU-Accelerated Code in HPC Applications
The US Department of Energy’s fastest supercomputers and forthcoming exascale systems employ Graphics Processing Units (GPUs) to …
Keren Zhou
,
Xiaozhu Meng
,
Ryuichi Sai
,
Dejan Grubisic
,
John Mellor-Crummey
Cite
Project
DOI
URL
Analyzing GPU-accelerated Applications Using HPCToolkit
Using HPCToolkit to Measure and Analyze the Performance of GPU-accelerated Applications Tutorial
Apr 1, 2021 9:48 PM — 9:48 PM
Virtual
Keren Zhou
Project
Slides
Video
GPA: A GPU Performance Advisor Based on Instruction Sampling
Developing efficient GPU kernels can be difficult because of the complexity of GPU architectures and programming models. Existing …
Keren Zhou
,
Xiaozhu Meng
,
Ryuichi Sai
,
John Mellor-Crummey
Cite
Project
DOI
URL
Measurement and Analysis of GPU-accelerated Applications with HPCToolkit
To address the challenge of performance analysis on the US DOE’s forthcoming exascale supercomputers, Rice University has been …
Keren Zhou
,
Laksono Adhianto
,
Jonathon Anderson
,
Aaron Cherian
,
Dejan Grubisic
,
Mark Krentel
,
Yumeng Liu
,
Xiaozhu Meng
,
John Mellor-Crummey
Cite
Project
DOI
URL
Measurement and Analysis of GPU-Accelerated OpenCL Computations on Intel GPUs
Graphics Processing Units (GPUs) have become a key technology for accelerating node performance in supercomputers, including the US …
Aaron Thomas Cherian
,
Keren Zhou
,
Dejan Grubisic
,
Xiaozhu Meng
,
John Mellor-Crummey
Cite
Project
DOI
URL
A Tool for Top-down Performance Analysis of GPU-Accelerated Applications
To support performance measurement and analysis of GPU-accelerated applications, we extended the HPCToolkit performance tools with …
Keren Zhou
,
Mark Krentel
,
John Mellor-Crummey
Cite
Project
DOI
URL
GVPROF: A Value Profiler for GPU-Based Clusters
GPGPUs are widely used in high-performance computing systems to accelerate scientific and machine learning workloads. Developing …
Keren Zhou
,
Yueming Hao
,
John Mellor-Crummey
,
Xiaozhu Meng
,
Xu Liu
Cite
Project
DOI
URL
Tools for Top-down Performance Analysis of GPU-Accelerated Applications
This paper describes extensions to Rice University’s HPCToolkit performance tools to support measurement and analysis of …
Keren Zhou
,
Mark W. Krentel
,
John Mellor-Crummey
Cite
Project
DOI
URL
Cite
×