Abstract
Sparse Matrix-Transpose Vector Product (SMTVP) is a frequently used computation pattern in High Performance Computing applications. In current linear algebra packages, it is typically computed as an explicit transposition followed by a Sparse Matrix-Vector Product (SMVP). However, the transposition step can be a serious bottleneck on modern parallel computing platforms. Previous work proposed a relatively complex data structure for efficient SMTVP computation on multi-core CPUs, but it proved inefficient on GPUs. In this work, we show that the Compressed Sparse Row (CSR)-based SMVP algorithm can also compute SMTVP efficiently on modern GPUs. The proposed method uses atomic operations to perform the reduction needed to accumulate each inner product between a row of the transposed matrix and the input vector. Experimental results show that this simple technique can outperform the transposition-plus-SMVP flow provided by the CUSPARSE package by up to 405-fold.
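The following CUDA kernel is a minimal sketch of the idea described above, not the paper's exact implementation: it computes y = A^T x directly from the CSR representation of A, resolving the concurrent per-column accumulations with atomicAdd. The kernel name, argument names, and the one-thread-per-row mapping are illustrative assumptions.

```cuda
#include <cuda_runtime.h>

// Sketch: y = A^T * x using the CSR arrays of A (no explicit transposition).
// Each thread processes one row of A and scatters its contributions into y;
// since several rows may touch the same output element, the updates are atomic.
__global__ void smtvp_csr_atomic(int n_rows,
                                 const int   *row_ptr,  // CSR row pointers of A
                                 const int   *col_idx,  // CSR column indices of A
                                 const float *vals,     // CSR nonzero values of A
                                 const float *x,        // input vector (length n_rows)
                                 float       *y)        // output vector, pre-zeroed
{
    int row = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= n_rows) return;

    float xi = x[row];
    for (int k = row_ptr[row]; k < row_ptr[row + 1]; ++k) {
        // Nonzero A[row][col] contributes vals[k] * x[row] to y[col] of A^T * x.
        atomicAdd(&y[col_idx[k]], vals[k] * xi);
    }
}
```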