2019 journal article
Coordinated CTA Combination and Bandwidth Partitioning for GPU Concurrent Kernel Execution
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 16(3).
2016 article
A Model-Driven Approach to Warp/Thread-Block Level GPU Cache Bypassing
2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC).
2015 conference paper
Analyzing graphics processor unit (GPU) instruction set architectures
Ieee international symposium on performance analysis of systems and, 155–156.
2014 conference paper
Understanding the tradeoffs between software-managed vs. hardware-managed caches in GPUs
Ieee international symposium on performance analysis of systems and, 231–241.
Citation Index includes data from a number of different sources. If you have questions about the sources of data in the Citation Index or need a set of data which is free to re-distribute, please contact us.
Certain data included herein are derived from the Web of Science© and InCites© (2024) of Clarivate Analytics. All rights reserved. You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.