Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4959014 | Computers & Operations Research | 2017 | 19 Pages |
Abstract
We present a highly scalable parallel solution for the Subset-Sum Problem on Graphics Processing Units (GPUs) that substantially reduces the memory access by the device and, consequently, decreases the total runtime. We test this algorithm only on hard instances, which require the exhaustion of the entire search space, instead of simple random benchmarks. On a GPU with only 1.2Â GB of global memory, we address hard instances with 100,000 items limited to 106 and 200 items limited to 108. Our algorithm achieves excellent runtimes outperforming the best-known practical and parallel algorithms, reaching speed-ups higher than 1000 in the best case compared to its sequential version.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Science (General)
Authors
V.V. Curtis, C.A.A. Sanches,