Keren Zhou
Keren Zhou
Home
Experience
Projects
Featured
Publications
Talks
Students
Tags
News
Light
Dark
Automatic
Convolution
A Performance Analysis Framework for Exploiting GPU Microarchitectural Capability
GPUs are widely used in accelerating deep neural networks (DNNs) for their high bandwidth and parallelism. But tuning the performance …
Keren Zhou
,
Guangming Tan
,
Xiuxia Zhang
,
Chaowei Wang
,
Ninghui Sun
Cite
Project
DOI
URL
Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning
In this paper, we present a methodology to understand GPU microarchitectural features and improve performance for compute-intensive …
Xiuxia Zhang
,
Guangming Tan
,
Shuangbai Xue
,
Jiajia Li
,
Keren Zhou
,
Mingyu Chen
Cite
Project
DOI
URL
Cite
×