Keren Zhou
Keren Zhou
Home
Experience
Projects
Featured
Publications
Talks
Students
Tags
News
Light
Dark
Automatic
GPU
A Tool for Top-down Performance Analysis of GPU-Accelerated Applications
To support performance measurement and analysis of GPU-accelerated applications, we extended the HPCToolkit performance tools with …
Keren Zhou
,
Mark Krentel
,
John Mellor-Crummey
Cite
Project
DOI
URL
GVPROF: A Value Profiler for GPU-Based Clusters
GPGPUs are widely used in high-performance computing systems to accelerate scientific and machine learning workloads. Developing …
Keren Zhou
,
Yueming Hao
,
John Mellor-Crummey
,
Xiaozhu Meng
,
Xu Liu
Cite
Project
DOI
URL
Tools for Top-down Performance Analysis of GPU-Accelerated Applications
This paper describes extensions to Rice University’s HPCToolkit performance tools to support measurement and analysis of …
Keren Zhou
,
Mark W. Krentel
,
John Mellor-Crummey
Cite
Project
DOI
URL
Optimizing GPU-accelerated Applications with HPCToolkit
Presented the prototype of HPCToolkit’s GPU support at PETASCALE'19
Jul 29, 2019 9:56 PM — 9:56 PM
Lake Tahoe, California
Keren Zhou
Project
Slides
A Tool for Performance Analysis of GPU-accelerated Applications
Presented a talk about our profiling tool at CGO'19
Mar 18, 2019 11:10 PM — 11:10 PM
Washington DC, USA
Keren Zhou
Project
Poster
Slides
A Tool for Performance Analysis of GPU-Accelerated Applications
Architectures for High-Performance Computing (HPC) now commonly employ accelerators such as Graphics Processing Units (GPUs). …
Keren Zhou
,
John Mellor-Crummey
Cite
Project
DOI
URL
A Performance Analysis Framework for Exploiting GPU Microarchitectural Capability
Presented our work on static performance analysis for GPUs at ICS17
Jul 20, 2017 9:36 PM — 9:36 PM
Chicago, IL, USA
Keren Zhou
Slides
Deep Learning on Modern Architectures
Discussed how state-of-the-art deep learning libraries optimize computations by utilizing architectural features.
Apr 1, 2017 10:00 PM — 10:00 PM
Institute of Computing Technology, Chinese Academy of Sciences
Keren Zhou
Slides
A Performance Analysis Framework for Exploiting GPU Microarchitectural Capability
GPUs are widely used in accelerating deep neural networks (DNNs) for their high bandwidth and parallelism. But tuning the performance …
Keren Zhou
,
Guangming Tan
,
Xiuxia Zhang
,
Chaowei Wang
,
Ninghui Sun
Cite
Project
DOI
URL
Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning
In this paper, we present a methodology to understand GPU microarchitectural features and improve performance for compute-intensive …
Xiuxia Zhang
,
Guangming Tan
,
Shuangbai Xue
,
Jiajia Li
,
Keren Zhou
,
Mingyu Chen
Cite
Project
DOI
URL
«
»
Cite
×