کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
455829 695575 2015 18 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Differentially private maximal frequent sequence mining
ترجمه فارسی عنوان
معادله پیوسته مکرر خصوصی مکرر خصوصی
کلمات کلیدی
حریم خصوصی دیفرانسیل حداکثر معدن دنباله مکرر، استخراج مکرر مکرر، کاهش طول، آرامش آستانه
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر شبکه های کامپیوتری و ارتباطات
چکیده انگلیسی

In this paper, we study the problem of designing a differentially private algorithm for mining maximal frequent sequences, which can not only achieve high data utility and a high degree of privacy, but also provide high time efficiency. To solve this problem, we present a new differentially private algorithm, which is referred to as DP-MFSM. DP-MFSM consists of three phases: pre-processing phase, expected frequent sequence mining (ESM) phase, and candidate extraction and verification (CEV) phase. Specifically, in the pre-processing phase, we first extract some statistical information from the input database, and use the extracted information to determine the values of some variables which will be used in the ESM phase. Then, in the ESM phase, we randomly partition the input database into several sub-databases, and use a partition-based ESM technique to find expected frequent sequences, which are a subset of candidate frequent sequences and more likely to be frequent. At last, in the CEV phase, we extract candidate maximal frequent sequences from the discovered expected frequent sequences, and use a splitting-based technique to verify which candidates are actually frequent in the input database. Through privacy analysis, we show that our DP-MFSM algorithm is ε-differentially private. Extensive experiments on real-world datasets illustrate that our DP-MFSM algorithm can substantially outperform alternative approaches.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers & Security - Volume 55, November 2015, Pages 175–192
نویسندگان
, , , , ,