Improving performance and energy efficiency of matrix multiplication via pipeline broadcast

Li Tan; Longxiang Chen; Zizhong Chen; Ziliang Zong; Dong Li; Rong Ge

doi:10.1109/CLUSTER.2013.6702672

2013 IEEE International Conference on Cluster Computing (CLUSTER)

Improving performance and energy efficiency of matrix multiplication via pipeline broadcast

Year: 2013, Pages: 1-5

DOI Bookmark: 10.1109/CLUSTER.2013.6702672

Authors

Li Tan, University of California, Riverside, USA
Longxiang Chen, University of California, Riverside, USA
Zizhong Chen, University of California, Riverside, USA
Ziliang Zong, Texas State University-San Marcos, USA
Dong Li, Oak Ridge National Laboratory, USA
Rong Ge, Marquette University, USA

Abstract

Boosting performance and energy efficiency of scientific applications running on high performance computing systems arise cruicially nowadays. Software and hardware based solutions for improving communication performance have been recognized as significant means of achieving performance gain and thus energy savings for such applications. As a fundamental component of most numerical linear algebra algorithms, improving performance and energy efficiency of distributed matrix multiplication is of major concerns. For such purposes, we propose a high performance communication scheme that fully exploits network bandwidth via non-blocking pipeline broadcast with tuned chunk size. Empirically, substantial performance gain up to 8.4% and energy savings up to 6.9% are achieved compared to blocking pipeline broadcast, and against binomial tree broadcast, performance gain up to 6.5% and energy savings up to 6.1% are observed on a 64-core cluster.

Like what you’re reading?

Already a member?Sign In

Member Price

$11

Non-Member Price

$21

Add to Cart Sign In

Get this article FREE with a new membership!

eSMART: Energy-efficient Scalable Multimedia Broadcast for heterogeneous users
2014 IEEE 15th International Symposium on "A World of Wireless, Mobile and Multimedia Networks" (WoWMoM)
Network-coded broadcast incremental power algorithm for energy-efficient broadcasting in wireless ad-hoc network
2014 Applications and Innovations in Mobile Computing (AIMoC)
Analytical bounds on broadcast with hitch-hiking in wireless ad-hoc networks
IEEE International Conference on Mobile Adhoc and Sensor Systems Conference
Enhance content broadcast efficiency in routers with integrated caching
2011 IEEE Symposium on Computers and Communications (ISCC)
TX: Algorithmic Energy Saving for Distributed Dense Matrix Factorizations
2014 5th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (ScalA)
Energy Efficiency of Full Pipelining: A Case Study for Matrix Multiplication
2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)
Broadcast with Hitch-hiking in Wireless Ad-Hoc Networks (Invited Talk Abstract)
Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, International Conference on & Self-Assembling Wireless Networks, International Workshop on
All-To-All Broadcast and Matrix Multiplication in Faulty SIMD Hypercubes
IEEE Transactions on Parallel & Distributed Systems
Memory MISER: Improving Main Memory Energy Efficiency in Servers
IEEE Transactions on Computers
Joint Optimization of User-Experience and Energy-Efficiency in Wireless Multimedia Broadcast
IEEE Transactions on Mobile Computing

Improving performance and energy efficiency of matrix multiplication via pipeline broadcast

Authors

Abstract

Related Articles