کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
432835 | 689088 | 2011 | 10 صفحه PDF | دانلود رایگان |

The introduction of NVidia’s powerful Tesla GPU hardware and Compute Unified Device Architecture (CUDA) platform enable many-core parallel programming. As a result, existing algorithms implemented on a GPU can run many times faster than on modern CPUs. Relatively little research has been done so far on GPU implementations of discrete optimisation algorithms. In this paper, two approaches to parallel GPU evaluation of the Permutation Flowshop Scheduling Problem, with makespan and total flowtime criteria, are proposed. These methods can be employed in most population-based algorithms, e.g. genetic algorithms, Ant Colony Optimisation, Particle Swarm Optimisation, and Tabu Search. Extensive computational experiments, on Tabu Search for Flowshop with both criteria, followed by statistical analysis, confirm great computational capabilities of GPU hardware. A GPU implementation of Tabu Search runs up to 89 times faster than its CPU counterpart.
► Two efficient GPU (CUDA) implementations of Tabu Search for Flowshop are proposed.
► Two approaches to parallel evaluation for any population-based algorithm are proposed.
► Tabu Search for Flowshop runs up to 89 times faster on GPU than on CPU.
Journal: Journal of Parallel and Distributed Computing - Volume 71, Issue 6, June 2011, Pages 802–811