Keren Zhou
Keren Zhou
Home
Experience
Projects
Featured
Publications
Talks
Students
Tags
News
Light
Dark
Automatic
Performance Tool
GPA
GPA is a performance advisor for NVIDIA GPUs that suggests potential code optimization opportunities at a hierarchy of levels, including individual lines, loops, and functions. GPA uses data flow analysis to approximately attribute measured instruction stalls to their root causes and uses information about a program’s structure and the GPU to match inefficiency patterns with suggestions for optimization. GPA estimates each optimization’s speedup based on a PC sampling-based performance model.
Code
HPCToolkit
Our tool provides a profile view and a trace view for GPU-accelerated applications. The profile view identifies where GPU APIs are invoked in CPU calling context, approximates calling context for GPU execution, and analyzes instruction mix for GPU kernels. The tool traces CPU and GPU activities for a large number of processes and threads with minimal overhead.
Code
DOC
GVProf
We implemented GVProf, the first value profiler that locates value redundancy problems in applications running on GPU-based clusters. Our experiments show that GVProf incurs acceptable overhead and scales to large executions. GVProf provides useful insights to guide performance optimization. Under the guidance of GVProf, we optimized several HPC and machine learning workloads, obtaining speedups up to 1.93x.
Code
DOC
Practical Performance Optimization for Deep Learning Applications
Presented triton programming language and a deep learning profiler
May 18, 2022 10:02 PM — 10:02 PM
Virtual
Keren Zhou
Project
Project
Slides
ValueExpert: Exploring Value Patterns in GPU-accelerated Applications
Presented a talk about our value profiling tool at ASPLOS'22
Mar 2, 2022 12:00 AM — 12:00 AM
Virtual
Keren Zhou
Project
Slides
Performance Measurement, Analysis, and Optimization of GPU-accelerated Applications
Presented a poster and a talk about my PhD research
Nov 15, 2021 9:54 PM — 9:54 PM
St. Louis, MO, USA
Keren Zhou
Project
Project
Project
Poster
Analyzing GPU-accelerated Applications Using HPCToolkit
Using HPCToolkit to Measure and Analyze the Performance of GPU-accelerated Applications Tutorial
Apr 1, 2021 9:48 PM — 9:48 PM
Virtual
Keren Zhou
Project
Slides
Video
GPA: A GPU Performance Advisor Based on Instruction Sampling
Presented our work on GPU performance advisor at CGO21
Mar 1, 2021 9:34 PM — 9:34 PM
Virtual
Keren Zhou
Project
Slides
GVProf: A Value Profiler for GPU-Based Clusters
Presented a talk about our value profiling tool for GPUs
Nov 19, 2020 9:53 PM — 9:53 PM
Virtual
Keren Zhou
Project
Slides
A Tool for Top-down Performance Analysis of GPU-accelerated Applications
Presented a poster and a short talk about HPCToolkit’s GPU support
Apr 24, 2020 9:50 PM — 9:50 PM
San Diego, CA, USA
Keren Zhou
Project
Poster
Slides
»
Cite
×