کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
523771 | 868491 | 2015 | 16 صفحه PDF | دانلود رایگان |

• A parallel solver for general tridiagonal irreducible systems is described.
• Solver based on Spike framework and Givens-QR with occasional low-rank modification.
• Modifications handle singularities exposed by QR in blocks of the parallel partition.
• The GPU implementation has similar performance to existing methods.
• Method returns accurate results when current GPU tridiagonal solvers fail.
g-Spike, a parallel algorithm for solving general nonsymmetric tridiagonal systems for the GPU, and its CUDA implementation are described. The solver is based on the Spike framework, applying Givens rotations and QR factorization without pivoting. It also implements a low-rank modification strategy to compute the Spike DS decomposition even when the partitioning defines singular submatrices along the diagonal. The method is also used to solve the reduced system resulting from the Spike partitioning. Numerical experiments with problems of high order indicate that g-Spike is competitive in runtime with existing GPU methods, and can provide acceptable results when other methods cannot be applied or fail.
Journal: Parallel Computing - Volume 49, November 2015, Pages 101–116