A fast algorithm for constructing inverted files on heterogeneous platforms

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
433056	689222	2012	11 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Inverted files GPU - واحد پردازش گرافیکی Multicore - چندگانه

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات

پیش نمایش صفحه اول مقاله

A fast algorithm for constructing inverted files on heterogeneous platforms

چکیده انگلیسی

Given a collection of documents residing on a disk, we develop a new strategy for processing these documents and building the inverted files extremely quickly. Our approach is tailored for a heterogeneous platform consisting of multicore CPUs and highly multithreaded GPUs. Our algorithm is based on a number of novel techniques, including a high-throughput pipelined strategy, a hybrid trie and B-tree dictionary data structure, dynamic work allocation to CPU and GPU threads, and optimized CUDA indexer implementation. We have performed extensive tests of our algorithm on a single node (two Intel Xeon X5560 Quad-core CPUs) with two NVIDIA Tesla C1060 GPUs attached to it, and were able to achieve a throughput of more than 262 MB/s on the ClueWeb09 dataset. Similar results were obtained for widely different datasets. The throughput of our algorithm is superior to the best known algorithms reported in the literature even when compared to those run on large clusters.

► A high-throughput pipelined strategy involving parallel parsers and parallel indexers.
► A hybrid trie and B-tree dictionary with fast look-up table and string caches.
► Dynamic allocation of parsed streams to CPU and GPU threads based on sampling.
► Optimized CUDA indexer with effective global and shared memory accesses.
► Performance on single node superior to MapReduce on larger clusters.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Parallel and Distributed Computing - Volume 72, Issue 5, May 2012, Pages 728–738

نویسندگان

Zheng Wei, Joseph JaJa,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

A fast algorithm for constructing inverted files on heterogeneous platforms

دسترسی سریع

ارتباط

English Website