CaLRS: a critical-aware shared LLC request scheduling algorithm on GPGPU

Jianliang Ma; Jinglei Meng; Tianzhou Chen; Minghui Wu

doi:10.1155/2015/848416

CaLRS: a critical-aware shared LLC request scheduling algorithm on GPGPU

ScientificWorldJournal. 2015:2015:848416. doi: 10.1155/2015/848416. Epub 2015 Feb 2.

Authors

Jianliang Ma¹, Jinglei Meng¹, Tianzhou Chen¹, Minghui Wu²

Affiliations

¹ College of Computer Science, Zhejiang University, Zheda Road No. 38, Hangzhou 310013, China.
² Zhejiang University City College, Huzhou Road No. 51, Hangzhou 310015, China.

Abstract

Ultra high thread-level parallelism in modern GPUs usually introduces numerous memory requests simultaneously. So there are always plenty of memory requests waiting at each bank of the shared LLC (L2 in this paper) and global memory. For global memory, various schedulers have already been developed to adjust the request sequence. But we find few work has ever focused on the service sequence on the shared LLC. We measured that a big number of GPU applications always queue at LLC bank for services, which provide opportunity to optimize the service order on LLC. Through adjusting the GPU memory request service order, we can improve the schedulability of SM. So we proposed a critical-aware shared LLC request scheduling algorithm (CaLRS) in this paper. The priority representative of memory request is critical for CaLRS. We use the number of memory requests that originate from the same warp but have not been serviced when they arrive at the shared LLC bank to represent the criticality of each warp. Experiments show that the proposed scheme can boost the SM schedulability effectively by promoting the scheduling priority of the memory requests with high criticality and improves the performance of GPU indirectly.

Publication types

Research Support, Non-U.S. Gov't