کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
391649 661904 2014 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Mining incomplete data with singleton, subset and concept probabilistic approximations
ترجمه فارسی عنوان
داده های ناقص معدن با تقسیم احتمالاتی تک تک، زیرمجموعه و مفهومی
کلمات کلیدی
تقریبی احتمالاتی، ضمیمههای تقریبی احتمالاتی، تقسیم احتمالاتی یکپارچه، زیرمجموعه و مفهوم، داده های ناقص
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

Rough set theory provides a very useful idea of lower and upper approximations for inconsistent data. For incomplete data these approximations are not unique. In this paper we investigate properties of three well-known generalizations of approximations: singleton, subset and concept. These approximations were recently further generalized as to include an additional parameter αα, interpreted as a probability. In this paper we report novel properties of singleton, subset and concept probabilistic approximations. Additionally, we validated such approximations experimentally. Our main objective was to test which of the singleton, subset and concept probabilistic approximations are the most useful for data mining. Our conclusion is that, for a given incomplete data set, all three approaches should be applied and the best approach should be selected as a result of ten-fold cross validation. Finally, we conducted experiments on complexity of rule sets and the total number of singleton, subset and concept approximations.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 280, 1 October 2014, Pages 368–384
نویسندگان
, , ,