2014 conference paper
Warp-level divergence in GPUs: Characterization, impact, and mitigation
International Symposium on High-Performance Computer Architecture (HPCA), 284–295.
2013 journal article
Locality principle revisited: A probability-based quantitative approach
Journal of Parallel and Distributed Computing, 73(7), 1011–1027.
2012 conference paper
CPU-assisted GPGPU on fused CPU-GPU architectures
International Symposium on High-Performance Computer Architecture (HPCA), 103–114.
2012 conference paper
Locality principle revisited: A probability-based quantitative approach
2012 IEEE 26th International Parallel and Distributed Processing Symposium (IPDPS), 995–1009.
2010 conference paper
An optimizing compiler for GPGPU programs with input-data sharing
ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 343–344.