Keren Zhou
ICS
FASTEN: Fast GPU-accelerated Segmented Matrix Multiplication for Heterogenous Graph Neural Networks
This paper introduces FASTEN, a cutting-edge library developed to address the computational challenges inherent in Heterogeneous Graph …
Keren Zhou, Karthik Ganapathi Subramanian, Po-Hsun Lin, Matthias Fey, Binqian Yin, Jiajia Li
Low Overhead and Context Sensitive Profiling of GPU-Accelerated Applications
As we near the end of Moore’s law scaling, the next-generation computing platforms are increasingly exploring heterogeneous …
Keren Zhou, Jonathon Anderson, Xiaozhu Meng, John Mellor-Crummey
Tools for Top-down Performance Analysis of GPU-Accelerated Applications
This paper describes extensions to Rice University’s HPCToolkit performance tools to support measurement and analysis of …
Keren Zhou, Mark W. Krentel, John Mellor-Crummey
A Performance Analysis Framework for Exploiting GPU Microarchitectural Capability
GPUs are widely used in accelerating deep neural networks (DNNs) for their high bandwidth and parallelism. But tuning the performance …
Keren Zhou, Guangming Tan, Xiuxia Zhang, Chaowei Wang, Ninghui Sun