Keren Zhou
Keren Zhou
Home
Experience
Projects
Featured
Publications
Talks
Students
Tags
News
Light
Dark
Automatic
CPU
HPCToolkit
Our tool provides a profile view and a trace view for GPU-accelerated applications. The profile view identifies where GPU APIs are invoked in CPU calling context, approximates calling context for GPU execution, and analyzes instruction mix for GPU kernels. The tool traces CPU and GPU activities for a large number of processes and threads with minimal overhead.
Code
DOC
Deep Learning on Modern Architectures
Discussed how state-of-the-art deep learning libraries optimize computations by utilizing architectural features.
Apr 1, 2017 10:00 PM — 10:00 PM
Institute of Computing Technology, Chinese Academy of Sciences
Keren Zhou
Slides
Cite
×