کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
461196 696571 2011 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Adjusting Fuzzy Similarity Functions for use with standard data mining tools
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر شبکه های کامپیوتری و ارتباطات
پیش نمایش صفحه اول مقاله
Adjusting Fuzzy Similarity Functions for use with standard data mining tools
چکیده انگلیسی

Data mining is crucial in many areas and there are ongoing efforts to improve its effectiveness in both the scientific and the business world. There is an obvious need to improve the outcomes of mining techniques such as clustering and other classifiers without abandoning the standard mining tools that are popular with researchers and practitioners alike. Currently, however, standard tools do not have the flexibility to control similarity relations between attribute values, a critical feature in improving mining-clustering results. The study presented here introduces the Similarity Adjustment Model (SAM) where adjusted Fuzzy Similarity Functions (FSF) control similarity relations between attribute values and hence ameliorate clustering results obtained with standard data mining tools such as SPSS and SAS. The SAM draws on principles of binary database representation models and employs FSF adjusted via an iterative learning process that yields improved segmentation regardless of the choice of mining-clustering algorithm. The SAM model is illustrated and evaluated on three common datasets with the standard SPSS package. The datasets were run with several clustering algorithms. Comparison of “Naïve” runs (which used original data) and “Fuzzy” runs (which used SAM) shows that the SAM improves segmentation in all cases.


► Currently standard data mining tools do not support fuzzy data or Fuzzy Similarity Functions (FSF).
► The research proposes a Similarity Adjustment Model (SAM) that employs FSF adjustment via an iterative learning process.
► The SAM model is illustrated and evaluated on three common datasets.
► Evaluation tests show that SAM improves mining and segmentation results.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Systems and Software - Volume 84, Issue 12, December 2011, Pages 2374–2383
نویسندگان
, ,