دانلود رایگان مقاله: پیش بینی نقص نرم افزار با استفاده از یادگیری گروهی بر روی ویژگی های انتخاب شده

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
551051	1450775	2015	15 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Software defect prediction using ensemble learning on selected features

ترجمه فارسی عنوان

پیش بینی نقص نرم افزار با استفاده از یادگیری گروهی بر روی ویژگی های انتخاب شده

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

پیش بینی نقص، یادگیری گروهی کیفیت نرم افزار، انتخاب ویژگی، عدم تعادل داده، افزونگی ویژگی / همبستگی

Data imbalance Feature selection - انتخاب ویژگی Defect prediction - پیش بینی نقص Software quality - کیفیت نرم افزار Ensemble learning - یادگیری گروهی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر تعامل انسان و کامپیوتر

پیش نمایش مقاله

پیش بینی نقص نرم افزار با استفاده از یادگیری گروهی بر روی ویژگی های انتخاب شده

چکیده انگلیسی

• Propose an ensemble-learning algorithm for the software defect classification problem.
• Evaluate the performance of the proposed scheme using software detect datasets.
• Show the efficiency of feature selection techniques to handle feature redundancy.
• Enhance the proposed learning model to achieve robustness against data imbalance.
• Validate the “bad” nature of some standard software defect datasets.

ContextSeveral issues hinder software defect data including redundancy, correlation, feature irrelevance and missing samples. It is also hard to ensure balanced distribution between data pertaining to defective and non-defective software. In most experimental cases, data related to the latter software class is dominantly present in the dataset.ObjectiveThe objectives of this paper are to demonstrate the positive effects of combining feature selection and ensemble learning on the performance of defect classification. Along with efficient feature selection, a new two-variant (with and without feature selection) ensemble learning algorithm is proposed to provide robustness to both data imbalance and feature redundancy.MethodWe carefully combine selected ensemble learning models with efficient feature selection to address these issues and mitigate their effects on the defect classification performance.ResultsForward selection showed that only few features contribute to high area under the receiver-operating curve (AUC). On the tested datasets, greedy forward selection (GFS) method outperformed other feature selection techniques such as Pearson’s correlation. This suggests that features are highly unstable. However, ensemble learners like random forests and the proposed algorithm, average probability ensemble (APE), are not as affected by poor features as in the case of weighted support vector machines (W-SVMs). Moreover, the APE model combined with greedy forward selection (enhanced APE) achieved AUC values of approximately 1.0 for the NASA datasets: PC2, PC4, and MC1.ConclusionThis paper shows that features of a software dataset must be carefully selected for accurate classification of defective components. Furthermore, tackling the software data issues, mentioned above, with the proposed combined learning model resulted in remarkable classification performance paving the way for successful quality control.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information and Software Technology - Volume 58, February 2015, Pages 388–402

نویسندگان

Issam H. Laradji, Mohammad Alshayeb, Lahouari Ghouti,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : پیش بینی نقص نرم افزار با استفاده از یادگیری گروهی بر روی ویژگی های انتخاب شده

دسترسی سریع

ارتباط

English Website