Speaker
Dr
M Clark
(NVIDIA)
Description
There has been much progress to date in developing scalable sparse linear solver algorithms on GPUs, utilizing well-known mathematical methods such as mixed precision, domain decomposition and pipelining to improve performance, allowing efficient use of large GPU installations such as Blue Waters and Titan. However, there has been less focus on deploying 'mathematically optimal' linear solvers, that have optimal O(N) complexity. In this work we utilize the QUDA framework to deploy adaptive multigrid solvers on GPUs, in particular we describe the architecture abstractions that allow for deployment on heterogeneous systems, utilizing both GPUs and CPUs. We discuss in general the suitability of heterogeneous architectures for hierarchical algorithms, and compare performance against a highly optimized CPU implementation.
Primary author
Dr
M Clark
(NVIDIA)
Co-authors
Dr
Michael Cheng
(Boston University)
Prof.
Richard Brower
(Boston University)