(IEEE ISPASS) IEEE International Symposium on Performance Analysis of Systems and Software

Abstract

NVIDIA and AMD GPUs are gaining traction in HPC because of their performance and architectural features. It is therefore important to measure and analyze the relative power of each architecture. In this paper, we analyze the architectures of NVIDIA's Fermi and AMD's Evergreen processors and demonstrate best practices and techniques for making full use of the capabilities of each. Applying our findings, we implemented the FFT on both cards and reached new performance ceilings on both GPUs.
