کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
386000 660876 2011 6 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Thresholds based outlier detection approach for mining class outliers: An empirical case study on software measurement datasets
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
Thresholds based outlier detection approach for mining class outliers: An empirical case study on software measurement datasets
چکیده انگلیسی

Predicting the fault-proneness labels of software program modules is an emerging software quality assurance activity and the quality of datasets collected from previous software version affects the performance of fault prediction models. In this paper, we propose an outlier detection approach using metrics thresholds and class labels to identify class outliers. We evaluate our approach on public NASA datasets from PROMISE repository. Experiments reveal that this novel outlier detection method improves the performance of robust software fault prediction models based on Naive Bayes and Random Forests machine learning algorithms.

Research highlights
► The quality of datasets affect the performance of fault prediction models.
► When outliers are removed, the performance of fault prediction models increase.
► Software metrics thresholds based outlier detection approach is an effective technique to identify outliers.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 38, Issue 4, April 2011, Pages 3440–3445
نویسندگان
, ,