Jiajia Li

Works (12)

Updated: November 24th, 2025 14:26

2025 article

Advancing Matrix Operations for High-Performance and Memory-Efficient Automata Processing on GPUs

Wu, Z., Ge, T., Li, J., Chen, X., & Liu, H. (2025, November 4). ACM Transactions on Architecture and Code Optimization.

topics (OpenAlex): Network Packet Processing and Optimization; Parallel Computing and Optimization Techniques; Graph Theory and Algorithms
Source: NC State University Libraries
Added: November 22, 2025

2025 article

SRSparse: Generating Codes for High-Performance Sparse Matrix-Vector Semiring Computations

Du, Z., Liu, Y., Sun, N., Cui, H., Feng, X., & Li, J. (2025, March 7). ACM Transactions on Architecture and Code Optimization.

author keywords: High performance computing; sparse matrix computation; auto-tuning; code generator; semiring computation
topics (OpenAlex): Parallel Computing and Optimization Techniques; Network Packet Processing and Optimization; Neural Networks and Applications
Source: Web Of Science
Added: July 28, 2025

2025 article

SymProp: Scaling Sparse Symmetric Tucker Decomposition via Symmetry Propagation

Li, Z., Shivakumar, S., Li, J., & Kannan, R. (2025, June 3).

author keywords: Sparse tensors; symmetric tensors; Tucker decomposition
topics (OpenAlex): Nonlinear Waves and Solitons
Source: Web Of Science
Added: September 29, 2025

2025 article

gHyPart: GPU-friendly End-to-End Hypergraph Partitioner

Wu, Z., Zhao, H., Liu, H., Wen, W., & Li, J. (2025, January 10). ACM Transactions on Architecture and Code Optimization.

author keywords: Hypergraph partitioning; GPU; parallelization strategies
topics (OpenAlex): VLSI and FPGA Design Techniques; Graph Theory and Algorithms; Interconnection Networks and Systems
Source: Web Of Science
Added: May 6, 2025

2024 article

FASTEN: Fast GPU-accelerated Segmented Matrix Multiplication for Heterogenous Graph Neural Networks

Zhou, K., Subramanian, K. G., Lin, P.-H., Fey, M., Yin, B., & Li, J. (2024, May 30).

author keywords: Graph Neural Networks; GPUs; Matrix Multiplication; Batch Processing; Performance Modeling
topics (OpenAlex): Advanced Graph Neural Networks; Ferroelectric and Negative Capacitance Devices; Parallel Computing and Optimization Techniques
Source: Web Of Science
Added: August 5, 2024

2024 article

POSTER: Optimizing Sparse Tensor Contraction with Revisiting Hash Table Design

Feng, G., Jia, W., Sun, N., Tan, G., & Li, J. (2024, February 20).

author keywords: sparse tensor contraction; hash table
topics (OpenAlex): Algorithms and Data Compression; Tensor decomposition and applications; Parallel Computing and Optimization Techniques
Source: Web Of Science
Added: May 13, 2024

2023 article

Fast Parallel Tensor Times Same Vector for Hypergraphs

Shivakumar, S., Amburg, I., Aksoy, S. G., Li, J., Young, S. J., & Aluru, S. (2023, December 18).

author keywords: hypergraphs; sparse symmetric tensor times same vector; tensor eigenvector; generating function
topics (OpenAlex): Tensor decomposition and applications; Computational Physics and Python Applications; Parallel Computing and Optimization Techniques
Source: Web Of Science
Added: July 8, 2024

2023 article

Performance Implication of Tensor Irregularity and Optimization for Distributed Tensor Decomposition

Miao, Z., Calhoun, J. C., Ge, R., & Li, J. (2023, February 7). ACM Transactions on Parallel Computing.

author keywords: Sparse tensor; tensor decomposition; CPD; irregularity
topics (OpenAlex): Tensor decomposition and applications; Parallel Computing and Optimization Techniques; Algorithms and Data Compression; Advanced Neuroimaging Techniques and Applications
TL;DR: This work proposes irregularity-aware distributed CPD that leverages the sparsity and irregularity information to identify the best tradeoff between different imbalances with low time overhead and materializes the idea with two optimization methods. (via Semantic Scholar)
Source: Web Of Science
Added: August 21, 2023

2023 article

Sparse Symmetric Format for Tucker Decomposition

Shivakumar, S., Li, J., Kannan, R., & Aluru, S. (2023, March 29). IEEE Transactions on Parallel and Distributed Systems.

author keywords: Tensors; Symmetric matrices; Sparse matrices; Indexes; Signal processing algorithms; Matrix decomposition; Parallel algorithms; Compressed storage; sparse tensors; symmetric tensors; tensor times matrix chain
topics (OpenAlex): Tensor decomposition and applications; Parallel Computing and Optimization Techniques; Algorithms and Data Compression
TL;DR: The novel Compressed Sparse Symmetric (CSS) format for sparse symmetric tensors is presented, along with an efficient parallel algorithm for the sparse symmetric tensor times matrix chain operation, and it is theoretically established that this operation achieves a better memory versus run-time trade-off compared to state-of-the-art implementations. (via Semantic Scholar)
Source: Web Of Science
Added: July 3, 2023

2022 article

AlphaSparse: Generating High Performance SpMV Codes Directly from Sparse Matrices

Du, Z., Li, J., Wang, Y., Li, X., Tan, G., & Sun, N. (2022, November 1).

author keywords: auto-tuner; sparse matrix-vector multiplication; SpMV; GPU; code generator; sparse data structures
topics (OpenAlex): Parallel Computing and Optimization Techniques; Advanced Data Storage Technologies; Distributed and Parallel Computing Systems
TL;DR: AlphaSparse automatically creates novel machine-designed formats and SpMV kernel implementations entirely from the knowledge of input sparsity patterns and hardware architectures, a superset of all existing works that goes beyond the scope of human-designed format(s) and implementation(s). (via Semantic Scholar)
Source: Web Of Science
Added: June 12, 2023

2022 article

BALA-CPD: BALanced and Asynchronous Distributed Tensor Decomposition

Miao, Z., Li, J., Calhoun, J. C., & Ge, R. (2022, September 1).

author keywords: sparse tensor; tensor decomposition; CPD; asynchronous algorithm
topics (OpenAlex): Tensor decomposition and applications; Parallel Computing and Optimization Techniques; Advanced Neural Network Applications
TL;DR: A novel algorithm BALA-CPD is presented, which achieves the best overall workload balance, and effectively overlaps communication and computation for the popular distributed Canonical Polyadic Decomposition (CPD) algorithms. (via Semantic Scholar)
Source: Web Of Science
Added: February 27, 2023

2022 article

Editorial: High-performance tensor computations in scientific computing and data science

Di Napoli, E., Bientinesi, P., Li, J., & Uschmajew, A. (2022, September 23). Frontiers in Applied Mathematics and Statistics.

author keywords: tensor operation; tensor decomposition; tensor network; multilinear algebra; high performance optimization; low-rank approximation; Deep Learning; tensor library
topics (OpenAlex): Tensor decomposition and applications; Parallel Computing and Optimization Techniques; Quantum many-body systems
Source: Web Of Science
Added: October 31, 2022
