Article code: 427748 | Journal code: 686551 | Publication year: 2012 | English article: 6-page PDF | Full text: free download
English title of the ISI article
Hash challenges: Stretching the limits of compare-by-hash in distributed data deduplication
Related topics
Engineering and Basic Sciences > Computer Engineering > Computational Theory and Mathematics
English abstract

We propose a technique for reducing communication overheads when sending data across a network. Our technique, called hash challenges, builds on existing deduplication solutions based on compare-by-hash, identifying redundant data chunks while exchanging substantially less meta-data. Hash challenges can be used directly on any existing compare-by-hash protocol, with no significant additional computational complexity. Using real data from reference workloads, we show that hash challenges can save as much as 64% of the meta-data exchanged across the network, relative to plain compare-by-hash. This implies reductions of up to 7% in overall transferred volume, and performance gains of up to 16% with typical asymmetrical broadband connections.


► We propose a novel distributed deduplication technique, called hash challenges.
► Substantial savings in meta-data overhead relative to compare-by-hash.
► Formal analysis confirms advantages in network efficiency.
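
For context, the baseline that the abstract compares against can be sketched as follows. This is a minimal illustration of plain compare-by-hash only, not of the hash challenges technique, whose mechanism the abstract does not describe. The fixed 4-byte chunk size, the use of SHA-1, and the names `chunks`, `digest`, and `compare_by_hash` are assumptions made for the example.

```python
import hashlib

CHUNK_SIZE = 4  # tiny fixed-size chunks, chosen only to keep the example readable


def chunks(data: bytes, size: int = CHUNK_SIZE) -> list[bytes]:
    """Split data into fixed-size chunks (the last one may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def digest(chunk: bytes) -> bytes:
    """Hash one chunk; SHA-1 (20 bytes) is an assumption for this sketch."""
    return hashlib.sha1(chunk).digest()


def compare_by_hash(data: bytes, receiver_store: dict[bytes, bytes]):
    """Simulate one plain compare-by-hash transfer.

    Returns (data rebuilt by the receiver, meta-data bytes sent, payload bytes sent).
    """
    parts = chunks(data)
    hashes = [digest(c) for c in parts]

    # Step 1: the sender transmits one hash per chunk (the meta-data).
    metadata_bytes = sum(len(h) for h in hashes)

    # Step 2: the receiver replies with the hashes it does not already hold.
    by_hash = dict(zip(hashes, parts))
    missing = [h for h in by_hash if h not in receiver_store]

    # Step 3: the sender transmits only the missing chunks (the payload).
    payload_bytes = 0
    for h in missing:
        receiver_store[h] = by_hash[h]
        payload_bytes += len(by_hash[h])

    # The receiver rebuilds the data from its chunk store.
    rebuilt = b"".join(receiver_store[h] for h in hashes)
    return rebuilt, metadata_bytes, payload_bytes
```

In this sketch a hash is sent for every chunk, including chunks the receiver already holds. That per-chunk meta-data is the overhead the abstract reports hash challenges as reducing.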

Publisher
Database: Elsevier - ScienceDirect
Journal: Information Processing Letters - Volume 112, Issue 10, 31 May 2012, Pages 380–385