دانلود رایگان مقاله: تسریع در برنامه های متوالی در مورد پردازنده های چند هسته ای کالا

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
432731	689052	2014	9 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Accelerating sequential programs on commodity multi-core processors

ترجمه فارسی عنوان

تسریع در برنامه های متوالی در مورد پردازنده های چند هسته ای کالا

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

پردازنده های چند هسته ای کالا، موازی خط لوله، مکانیسم ارتباط جمعی

Pipeline parallelism

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات

پیش نمایش مقاله

تسریع در برنامه های متوالی در مورد پردازنده های چند هسته ای کالا

چکیده انگلیسی

• We propose an efficient pipeline technique for commodity multi-core processors.
• We propose a clustered inter-core communication mechanism for low overhead.
• We give a clusteredQueue algorithm and find a regular overhead reduction pattern.
• We evaluate the technique on a commodity machine and conclude the effectiveness.

A recently proposed pipelined multithreading (PMT) technique exhibits wide applicability in parallelizing general sequential programs on multi-core processors. However, significant inter-core communication overhead limits PMT performance and prevents its commercial utilization. A simple and effective clustered pipelined multithreading (CPMT) approach is presented to accelerate sequential programs on commodity multi-core processors. This CPMT technique adopts a clustered communication mechanism that can yield very low average communication overhead by eliminating false sharing as well as reducing communication operation and transit delays in the software-only approach. A single-producer/single-consumer concurrent lock-free clusteredQueue algorithm based on a two-level queue structure is also proposed. The accuracy of CPMT is theoretically demonstrated. The performances of the algorithm and CPMT are evaluated on a commodity AMD Phenom four-core processor. The number of enqueue and dequeue times of the algorithm are 20.8 and 23 cycles given an appropriate parameter, respectively. The speedup of CPMT ranges from 13.1% to 119.8% for typical loops extracted from the SPEC CPU 2000 benchmark suite.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Parallel and Distributed Computing - Volume 74, Issue 4, April 2014, Pages 2257–2265

نویسندگان

Yuanming Zhang, Gang Xiao, Takanobu Baba,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : تسریع در برنامه های متوالی در مورد پردازنده های چند هسته ای کالا

دسترسی سریع

ارتباط

English Website