کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
432731 689052 2014 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Accelerating sequential programs on commodity multi-core processors
ترجمه فارسی عنوان
تسریع در برنامه های متوالی در مورد پردازنده های چند هسته ای کالا
کلمات کلیدی
پردازنده های چند هسته ای کالا، موازی خط لوله، مکانیسم ارتباط جمعی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی


• We propose an efficient pipeline technique for commodity multi-core processors.
• We propose a clustered inter-core communication mechanism for low overhead.
• We give a clusteredQueue algorithm and find a regular overhead reduction pattern.
• We evaluate the technique on a commodity machine and conclude the effectiveness.

A recently proposed pipelined multithreading (PMT) technique exhibits wide applicability in parallelizing general sequential programs on multi-core processors. However, significant inter-core communication overhead limits PMT performance and prevents its commercial utilization. A simple and effective clustered pipelined multithreading (CPMT) approach is presented to accelerate sequential programs on commodity multi-core processors. This CPMT technique adopts a clustered communication mechanism that can yield very low average communication overhead by eliminating false sharing as well as reducing communication operation and transit delays in the software-only approach. A single-producer/single-consumer concurrent lock-free clusteredQueue algorithm based on a two-level queue structure is also proposed. The accuracy of CPMT is theoretically demonstrated. The performances of the algorithm and CPMT are evaluated on a commodity AMD Phenom four-core processor. The number of enqueue and dequeue times of the algorithm are 20.8 and 23 cycles given an appropriate parameter, respectively. The speedup of CPMT ranges from 13.1% to 119.8% for typical loops extracted from the SPEC CPU 2000 benchmark suite.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Parallel and Distributed Computing - Volume 74, Issue 4, April 2014, Pages 2257–2265
نویسندگان
, , ,