| Article ID | Journal | Published Year | Pages | File Type |
|---|---|---|---|---|
| 4630340 | Applied Mathematics and Computation | 2011 | 11 Pages |
Abstract
In this paper we present two new algorithmic variants to compute the Neville elimination, with and without pivoting, which improve data locality and cast most of the computations in terms of high-performance Level 3 BLAS. The experimental evaluation on a state-of-the-art multi-core processor demonstrates that the new blocked algorithms exhibit a much higher degree of concurrency and better cache usage, yielding higher performance while offering numerical accuracy akin to that of the traditional columnwise variant in most cases.
Related Topics
Physical Sciences and Engineering
Mathematics
Applied Mathematics
Authors
Pedro Alonso, Raquel Cortina, Enrique S. Quintana-OrtÃ, José Ranilla,
