دانلود رایگان مقاله: دستهبندی ماشینهای برنده در مقیاس بزرگ با کاهش داده های خام

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
409025	679052	2016	9 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Large-scale support vector machine classification with redundant data reduction

ترجمه فارسی عنوان

دستهبندی ماشینهای برنده در مقیاس بزرگ با کاهش داده های خام

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

ماشین بردار پشتیبانی، طبقه بندی، خوشه بندی کاهش داده های کاهش یافته

Clustering - خوشه بندی Classification - طبقه بندی Support vector machine - ماشین بردار پشتیبانی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

دستهبندی ماشینهای برنده در مقیاس بزرگ با کاهش داده های خام

چکیده انگلیسی

Large-scale image classification has shown great importance in object recognition and image retrieval as the vast amounts of social multimedia sharing on the networks. While the time and memory requirements for SVM training surge with an increase in the sample size, which makes SVM impractical even for a moderate problem as the number of training data reaches to the extent of hundreds of thousands. To solve this problem, many specially designed algorithms are proposed such as clustering-based SVM training which attempts to remove the clustered data points that lie far away from support vectors. In this paper, we further explore that there exist clustered and scattered data points in a cluster. The clustered data points that lie around the clustering centroid are the dense data points, which are in the inner layer of a cluster. Those data points are viewed as having no SVs and removed. While the scattered data points are the sparse data points in the outside layer of a cluster. Those data points are viewed as having SVs and thus reserved. The Fisher Discriminant Ratio is employed to determine a boundary between the clustered and scattered data points in one cluster, which is computed based on the distance densities of data points to the cluster centroid. The redundant clustered data points in clusters are thus removed to speed up SVM training process. Several experimental results show that our proposed method has good classification accuracy while training time is significantly reduced. The training time in our proposed method only accounts for about 17 percent of the time in LIBSVM on the large data set of Covertype.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 172, 8 January 2016, Pages 189–197

نویسندگان

Xiang-Jun Shen, Lei Mu, Zhen Li, Hao-Xiang Wu, Jian-Ping Gou, Xin Chen,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : دستهبندی ماشینهای برنده در مقیاس بزرگ با کاهش داده های خام

دسترسی سریع

ارتباط

English Website