Keren Zhou
Keren Zhou
Home
Experience
Projects
Featured
Publications
Talks
Students
Tags
News
Light
Dark
Automatic
Recent & Upcoming Talks
2024
Profiling and Debugging GPU-accelerated AI Applications
Presented our research on debugging and profiling of GPU-accelerated AI applications.
Oct 24, 2024 9:41 PM — 9:41 PM
Virtual
Keren Zhou
Project
Slides
Proton: Introduction and Development
Presented the ongoing work on Proton
Oct 21, 2024 9:41 PM — 9:41 PM
Virtual
Yuanwei Fang
,
Corbin Robeck
,
Keren Zhou
Project
Slides
Dev Tools: Proton/Interpreter
Presented the Proton and Interpreter tools in the Triton project.
Sep 17, 2024 9:41 PM — 9:41 PM
Virtual
Keren Zhou
Project
Slides
Video
Triton Update
Presented a talk about Triton and called for contributions to improving the language
Aug 13, 2024 10:56 PM — 10:56 PM
Lake Tahoe, California
Keren Zhou
Project
Slides
FASTEN: Fast GPU-accelerated Segmented Matrix Multiplication for Heterogenous Graph Neural Networks
Presented the FASTEN work for accelerating segmented matrix multiplication
Jun 1, 2024 9:41 PM — 9:41 PM
Virtual
Keren Zhou
Project
Slides
Update on Triton's Interpreter
Review Triton’s Interpreter’s progress and future plans
Apr 3, 2024 10:03 PM — 10:03 PM
Virtual
Keren Zhou
Project
Slides
Proton: A Profiler for Triton
Went through Proton’s design overview
Feb 20, 2024 10:03 PM — 10:03 PM
Virtual
Keren Zhou
Project
Slides
2023
Technical Review on PyTorch 2.0 and Triton
High-level overview of PyTorch 2.0 and Triton integration
Aug 7, 2023 10:03 PM — 10:03 PM
Virtual
Keren Zhou
Project
Slides
Towards Agile Development of Efficient Deep Learning Operators (Hardware Insights)
Presented a talk about Triton and requested feedback from Intel engineers
Jun 29, 2023 10:56 PM — 10:56 PM
Virtual
Keren Zhou
Project
Slides
Towards Agile Development of Efficient Deep Learning Operators (Call for Contributions)
Presented a talk about Triton and called for contributions to improving the language
Jun 19, 2023 10:56 PM — 10:56 PM
Lake Tahoe, California
Keren Zhou
Project
Slides
2022
Towards Agile Development of Efficient Deep Learning Operators (Pre-MLIR)
Presented triton programming language and its next step
Dec 2, 2022 10:03 PM — 10:03 PM
Virtual
Keren Zhou
Project
Slides
Practical Performance Optimization for Deep Learning Applications
Presented triton programming language and a deep learning profiler
May 18, 2022 10:02 PM — 10:02 PM
Virtual
Keren Zhou
Project
Project
Slides
ValueExpert: Exploring Value Patterns in GPU-accelerated Applications
Presented a talk about our value profiling tool at ASPLOS'22
Mar 2, 2022 12:00 AM — 12:00 AM
Virtual
Keren Zhou
Project
Slides
2021
Performance Measurement, Analysis, and Optimization of GPU-accelerated Applications
Presented a poster and a talk about my PhD research
Nov 15, 2021 9:54 PM — 9:54 PM
St. Louis, MO, USA
Keren Zhou
Project
Project
Project
Poster
Analyzing GPU-accelerated Applications Using HPCToolkit
Using HPCToolkit to Measure and Analyze the Performance of GPU-accelerated Applications Tutorial
Apr 1, 2021 9:48 PM — 9:48 PM
Virtual
Keren Zhou
Project
Slides
Video
GPA: A GPU Performance Advisor Based on Instruction Sampling
Presented our work on GPU performance advisor at CGO21
Mar 1, 2021 9:34 PM — 9:34 PM
Virtual
Keren Zhou
Project
Slides
2020
GVProf: A Value Profiler for GPU-Based Clusters
Presented a talk about our value profiling tool for GPUs
Nov 19, 2020 9:53 PM — 9:53 PM
Virtual
Keren Zhou
Project
Slides
Tools for Top-down Performance Analysis of GPU-Accelerated Applications
Presented the beta version of our comprehensive GPU profiling tool at ICS20
Jul 1, 2020 9:41 PM — 9:41 PM
Virtual
Keren Zhou
Project
Slides
A Tool for Top-down Performance Analysis of GPU-accelerated Applications
Presented a poster and a short talk about HPCToolkit’s GPU support
Apr 24, 2020 9:50 PM — 9:50 PM
San Diego, CA, USA
Keren Zhou
Project
Poster
Slides
2019
Optimizing GPU-accelerated Applications with HPCToolkit
Presented the prototype of HPCToolkit’s GPU support at PETASCALE'19
Jul 29, 2019 9:56 PM — 9:56 PM
Lake Tahoe, California
Keren Zhou
Project
Slides
A Tool for Performance Analysis of GPU-accelerated Applications
Presented a talk about our profiling tool at CGO'19
Mar 18, 2019 11:10 PM — 11:10 PM
Washington DC, USA
Keren Zhou
Project
Poster
Slides
2017
A Performance Analysis Framework for Exploiting GPU Microarchitectural Capability
Presented our work on static performance analysis for GPUs at ICS17
Jul 20, 2017 9:36 PM — 9:36 PM
Chicago, IL, USA
Keren Zhou
Slides
Deep Learning on Modern Architectures
Discussed how state-of-the-art deep learning libraries optimize computations by utilizing architectural features.
Apr 1, 2017 10:00 PM — 10:00 PM
Institute of Computing Technology, Chinese Academy of Sciences
Keren Zhou
Slides
2016
Convolution Methods
Introduced various kinds of convolution methods and analyzed their complexities, memory consumptions, and data access patterns.
Jun 1, 2016 9:58 PM — 9:58 PM
Institute of Computing Technology, Chinese Academy of Sciences
Keren Zhou
Slides
Cite
×