کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
424513 685582 2016 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A Data Quality in Use model for Big Data
ترجمه فارسی عنوان
یک مدل کیفیت داده در مدل استفاده برای داده های بزرگ
کلمات کلیدی
کیفیت داده؛ اطلاعات بزرگ؛ اندازه گیری؛ کیفیت در استفاده؛ مدل
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
چکیده انگلیسی


• Data Quality is basic to decide about the suitability of data for intended uses.
• A Data Quality-in-Use Model based on ISO/IEC 25012, 25024 is proposed for Big Data.
• The main concern when assessing the Data Quality-in-Use in Big Data is Adequacy.
• The model accomplishes all the challenges of a Data Quality program for Big Data.
• The results obtained must be understood in the context of each Big Data project.

Beyond the hype of Big Data, something within business intelligence projects is indeed changing. This is mainly because Big Data is not only about data, but also about a complete conceptual and technological stack including raw and processed data, storage, ways of managing data, processing and analytics. A challenge that becomes even trickier is the management of the quality of the data in Big Data environments. More than ever before the need for assessing the Quality-in-Use gains importance since the real contribution–business value–of data can be only estimated in its context of use. Although there exists different Data Quality models for assessing the quality of regular data, none of them has been adapted to Big Data. To fill this gap, we propose the “3As Data Quality-in-Use model”, which is composed of three Data Quality characteristics for assessing the levels of Data Quality-in-Use in Big Data projects: Contextual Adequacy, Operational Adequacy and Temporal Adequacy. The model can be integrated into any sort of Big Data project, as it is independent of any pre-conditions or technologies. The paper shows the way to use the model with a working example. The model accomplishes every challenge related to Data Quality program aimed for Big Data. The main conclusion is that the model can be used as an appropriate way to obtain the Quality-in-Use levels of the input data of the Big Data analysis, and those levels can be understood as indicators of trustworthiness and soundness of the results of the Big Data analysis.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Future Generation Computer Systems - Volume 63, October 2016, Pages 123–130
نویسندگان
, , , , ,