Enhancing data parallelism for Ant Colony Optimization on GPUs

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
431889	688648	2013	10 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

TSP Metaheuristics - الگوریتم‌های فراابتکاری یا فراتکاملی یا فرااکتشافی GPU programming - برنامه نویسی GPU Ant Colony Optimization - بهینه‌سازی گروه مورچه‌ها Performance analysis - تجزیه و تحلیل عملکرد

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات

پیش نمایش صفحه اول مقاله

Enhancing data parallelism for Ant Colony Optimization on GPUs

چکیده انگلیسی

Graphics Processing Units (GPUs) have evolved into highly parallel and fully programmable architecture over the past five years, and the advent of CUDA has facilitated their application to many real-world applications. In this paper, we deal with a GPU implementation of Ant Colony Optimization (ACO), a population-based optimization method which comprises two major stages: tour construction and pheromone update. Because of its inherently parallel nature, ACO is well-suited to GPU implementation, but it also poses significant challenges due to irregular memory access patterns. Our contribution within this context is threefold: (1) a data parallelism scheme for tour construction tailored to GPUs, (2) novel GPU programming strategies for the pheromone update stage, and (3) a new mechanism called I-Roulette to replicate the classic roulette wheel while improving GPU parallelism. Our implementation leads to factor gains exceeding 20x for any of the two stages of the ACO algorithm as applied to the TSP when compared to its sequential counterpart version running on a similar single-threaded high-end CPU. Moreover, an extensive discussion focused on different implementation paths on GPUs shows the way to deal with parallel graph connected components. This, in turn, suggests a broader area of inquiry, where algorithm designers may learn to adapt similar optimization methods to GPU architecture.

► This is the first data-parallelism scheme on GPUs for the ACO tour construction stage.
► We introduce the I-Roulette method to replicate the classic roulette wheel.
► We discuss the implementation of the pheromone update stage on GPUs.
► We offer an in-depth analysis of the ACO on GPUs reaching up to 21x speed-up factor.
► A detailed discussion of performance implications on Fermi architecture.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Parallel and Distributed Computing - Volume 73, Issue 1, January 2013, Pages 42–51

نویسندگان

José M. Cecilia, José M. García, Andy Nisbet, Martyn Amos, Manuel Ujaldón,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Enhancing data parallelism for Ant Colony Optimization on GPUs

دسترسی سریع

ارتباط

English Website