Po-Hsun Lin Zhou, K., Subramanian, K. G., Lin, P.-H., Fey, M., Yin, B., & Li, J. (2024). FASTEN: Fast GPU-accelerated Segmented Matrix Multiplication for Heterogeneous Graph Neural Networks. PROCEEDINGS OF THE 38TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ACM ICS 2024, pp. 511–524. https://doi.org/10.1145/3650200.3656593