کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6856323 1437953 2018 20 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Column-wise compression of open relational data
ترجمه فارسی عنوان
فشرده سازی ستون ها از داده های ارتباط باز
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
In this paper, we provide an empirical analysis on the compression of open data provided in a relational format, such as comma-separated value files. We consider several compression tools and parameter settings. Furthermore, we propose using a novel column-wise compression strategy, where items that have similar properties, are compressed together. We perform a comprehensive analysis on 24 datasets from different domains, such as life sciences, governmental data, finance sector, and public transportation, which cover a wide range of file sizes (from a few MB to several GB). Our results show that the traversal strategy is of paramount importance for achieving high compression ratios; with improvements of up to one order of magnitude. This study further highlights a set of issues for future work on compressing open data.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volumes 457–458, August 2018, Pages 48-61
نویسندگان
, , ,