کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
515844 867108 2014 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Compression of double array structures for fixed length keywords
ترجمه فارسی عنوان
فشرده سازی ساختارهای دو آرایه برای کلمات کلیدی طول ثابت
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• A new compression method of the double array is proposed.
• The BASE array is removed from the double array.
• The retrieval and construction algorithms are proposed.
• The space usage of our method is more compact than that of the double array.
• The retrieval speed of our method is almost the same as the double array.

A trie is one of the data structures for keyword matching. It is used in natural language processing, IP address routing, and so on. It is represented by the matrix form, the link form, the double array, and LOUDS. The double array representation combines retrieval speed of the matrix form with compactness of the list form. LOUDS is a succinct data structure using bit-string. Retrieval speed of LOUDS is not faster than that of the double array, but its space usage is smaller. This paper proposes a compressed version of the double array by dividing the trie into multiple levels and removing the BASE array from the double array. Moreover, a retrieval algorithm and a construction algorithm are proposed. According to the presented experimental results for pseudo and real data sets, the retrieval speed of the presented method is almost the same as the double array, and its space usage is compressed to 66% comparing with LOUDS for a large set of keywords with fixed length.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing & Management - Volume 50, Issue 5, September 2014, Pages 796–806
نویسندگان
, , , , ,