کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4955623 1364633 2017 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Availability of datasets for digital forensics - And what is missing
ترجمه فارسی عنوان
در دسترس بودن مجموعه داده های پزشکی قانونی دیجیتال - و چه چیزی گم شده است
کلمات کلیدی
دسترسی، جمع آوری داده ها، مجموعه داده اصل و نسب، آزمایش تولید شده، کاربر تولید شده، مخزن،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر شبکه های کامپیوتری و ارتباطات
چکیده انگلیسی

This paper targets two main goals. First, we want to provide an overview of available datasets that can be used by researchers and where to find them. Second, we want to stress the importance of sharing datasets to allow researchers to replicate results and improve the state of the art. To answer the first goal, we analyzed 715 peer-reviewed research articles from 2010 to 2015 with focus and relevance to digital forensics to see what datasets are available and focused on three major aspects: (1) the origin of the dataset (e.g., real world vs. synthetic), (2) if datasets were released by researchers and (3) the types of datasets that exist. Additionally, we broadened our results to include the outcome of online search results. We also discuss what we think is missing. Overall, our results show that the majority of datasets are experiment generated (56.4%) followed by real world data (36.7%). On the other hand, 54.4% of the articles use existing datasets while the rest created their own. In the latter case, only 3.8% actually released their datasets. Finally, we conclude that there are many datasets for use out there but finding them can be challenging.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Digital Investigation - Volume 22, Supplement, August 2017, Pages S94-S105
نویسندگان
, , ,