Optimizing matrix multiplication for a short-vector SIMD architecture

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
524711	868848	2009	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Optimizing matrix multiplication for a short-vector SIMD architecture – CELL processor

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Single instruction multiple data Instruction level parallelism - همبستگی سطح آموزش Vectorization - وکتور سازی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر

پیش نمایش صفحه اول مقاله

Optimizing matrix multiplication for a short-vector SIMD architecture – CELL processor

چکیده انگلیسی

Matrix multiplication is one of the most common numerical operations, especially in the area of dense linear algebra, where it forms the core of many important algorithms, including solvers of linear systems of equations, least square problems, and singular and eigenvalue computations. The STI CELL processor exceeds the capabilities of any other processor available today in terms of peak single precision, floating point performance, aside from special purpose accelerators like Graphics Processing Units (GPUs).In order to fully exploit the potential of the CELL processor for a wide range of numerical algorithms, fast implementation of the matrix multiplication operation is essential. The crucial component is the matrix multiplication kernel crafted for the short vector Single Instruction Multiple Data architecture of the Synergistic Processing Element of the CELL processor. In this paper, single precision matrix multiplication kernels are presented implementing the C=C-A×BTC=C-A×BT operation and the C=C-A×BC=C-A×B operation for matrices of size 64×6464×64 elements. For the latter case, the performance of 25.55 Gflop/s is reported, or 99.80% of the peak, using as little as 5.9 kB of storage for code and auxiliary data structures.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Parallel Computing - Volume 35, Issue 3, March 2009, Pages 138–150

نویسندگان

Jakub Kurzak, Wesley Alvaro, Jack Dongarra,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Optimizing matrix multiplication for a short-vector SIMD architecture – CELL processor

دسترسی سریع

ارتباط

English Website