- ASPLOS'17-Locality-Aware CTA Clustering for Modern GPUs
- ASPLOS'17-Dynamic Resource Management for Efficient Utilization of Multitasking GPUs
- HPCA'17-Dynamic GPGPU Power Management Using Adaptive Model Predictive Control
- ISCA'16-Transparent Offloading and Mapping (TOM): Enabling Programmer-Transparent Near-Data Processing in GPU Systems
- HPCA'17-Controlled Kernel Launch for Dynamic Parallelism in GPUs
- ISCA'16-LaPerm: Locality Aware Scheduler for Dynamic Parallelism on GPUs
- ISCA'16-Virtual Thread Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit
- GTC'17-COOPERATIVE GROUPS