Article ID Journal Published Year Pages File Type
4630340 Applied Mathematics and Computation 2011 11 Pages PDF
Abstract
In this paper we present two new algorithmic variants to compute the Neville elimination, with and without pivoting, which improve data locality and cast most of the computations in terms of high-performance Level 3 BLAS. The experimental evaluation on a state-of-the-art multi-core processor demonstrates that the new blocked algorithms exhibit a much higher degree of concurrency and better cache usage, yielding higher performance while offering numerical accuracy akin to that of the traditional columnwise variant in most cases.
Related Topics
Physical Sciences and Engineering Mathematics Applied Mathematics
Authors
, , , ,