Parametric methods for comparing the performance of two classification algorithms evaluated by k-fold cross validation on multiple data sets

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
4969648	1449982	2017	44 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

k-fold cross validation Sampling distribution - توزیع نمونه گیری Parametric method - روش پارامتری Classification - طبقه بندی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو

پیش نمایش صفحه اول مقاله

Parametric methods for comparing the performance of two classification algorithms evaluated by k-fold cross validation on multiple data sets

چکیده انگلیسی

A popular procedure for identifying which one of two classification algorithms has a better performance is to test them on multiple data sets, and the accuracies resulting from k-fold cross validation are aggregated to draw a conclusion. Several nonparametric methods have been proposed for this purpose, while parametric methods will be a better choice to determine the superior algorithm when the assumptions for deriving sampling distributions can be satisfied. In this paper, we consider every accuracy estimate resulting from the instances in a fold or a data set as a point estimator instead of a fixed value to derive the sampling distribution of the point estimator for comparing the performance of two classification algorithms. The test statistics for both data-set and fold averaging levels are proposed, and the ways to calculate their degrees of freedom are also presented. Twelve data sets are chosen to demonstrate that our parametric methods can be used to effectively compare the performance of two classification algorithms on multiple data sets. Several critical issues in using our parametric methods and the nonparametric ones proposed in a previous study are then discussed.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 65, May 2017, Pages 97-107

نویسندگان

Tzu-Tsung Wong,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Parametric methods for comparing the performance of two classification algorithms evaluated by k-fold cross validation on multiple data sets

دسترسی سریع

ارتباط

English Website