کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
431541 688576 2012 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions
چکیده انگلیسی

Empirical optimizers like ATLAS have been very effective in optimizing computational kernels in libraries. The best choice of parameters such as tile size and degree of loop unrolling is determined in ATLAS by executing different versions of the computation. In contrast, optimizing compilers use a model-driven approach to program transformation. While the model-driven approach of optimizing compilers is generally orders of magnitude faster than ATLAS-like library generators, its effectiveness can be limited by the accuracy of the performance models used. In this paper, we describe an approach where a class of computations is modeled in terms of constituent operations that are empirically measured, thereby allowing modeling of the overall execution time. The performance model with empirically determined cost components is used to select library calls and choose data layout transformations in the context of the Tensor Contraction Engine, a compiler for a high-level domain-specific language for expressing computational models in quantum chemistry. The effectiveness of the approach is demonstrated through experimental measurements on representative computations from quantum chemistry.


► Performance of tensor contraction code depends on layout and DGEMM parameters.
► Dynamic programming algorithm optimizes layout and selects library calls.
► Compile-time performance model uses empirically determined cost components.
► Measurements show this approach is effective on both clusters and multi-cores.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Parallel and Distributed Computing - Volume 72, Issue 3, March 2012, Pages 338–352
نویسندگان
, , , , , ,