دانلود رایگان مقاله: اثرات داده های بدون برچسب بر خطای طبقه بندی در تجزیه و تحلیل عاملی نرمال

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
1148314	1489774	2014	18 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Effects of unlabeled data on classification error in normal discriminant analysis

ترجمه فارسی عنوان

اثرات داده های بدون برچسب بر خطای طبقه بندی در تجزیه و تحلیل عاملی نرمال

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

داده های نیمه برچسب دار، داده های بدون برچسب، داده های گم شده، کارایی نسبی آستانه، تبعیض عادی، داده های غیر عادی، یادگیری نیمه نظارتی

Nonnormal data Unlabeled data - داده های بدون برچسب Missing data - داده های گم شده Asymptotic relative efficiency - راندمان نسبی نسبی Semi-supervised learning - یاگیری نیمه‌نظارتی

موضوعات مرتبط

مهندسی و علوم پایه ریاضیات ریاضیات کاربردی

پیش نمایش مقاله

اثرات داده های بدون برچسب بر خطای طبقه بندی در تجزیه و تحلیل عاملی نرمال

چکیده انگلیسی

• We propose a framework of normal discriminant analysis for partially labeled data.
• We differentiate between feature independent and dependent labeling mechanisms.
• We define a criterion ARE to compare partially labeled data and labeled data alone.
• We derive and compute the ARE for (non)normal data under two mechanisms.
• Noninformative unlabeled data can increase classification error.

Semi-supervised learning, i.e., the estimation of parameters based on both labeled and unlabeled data, is widely believed to be effective in constructing a boundary in classification problems. The present paper investigates whether this belief is true in the case of normal discrimination in terms of the classification error for normal and nonnormal data. For this investigation, we use the framework of missing-data analysis because data consisting of labeled and unlabeled individuals can be regarded as missing data. Based on this framework, we introduce two labeling mechanisms: feature-independent labeling and feature-dependent labeling. For each of these labeling mechanisms, we analytically derive the asymptotic relative efficiency based on the labeled data alone and based on both the labeled and unlabeled data. Numerical computations reveal that (i) under the feature-independent labeling mechanism, unlabeled data tend to contribute to the improvement of the classification error even for nonnormal data and (ii) under the feature-dependent labeling mechanism, unlabeled data from both normal and nonnormal distributions are helpful when the labeled data are informative, but unlabeled data can augment the classification error when the labeled data are not informative. Finally, we describe some future areas of research.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Statistical Planning and Inference - Volume 147, April 2014, Pages 66–83

نویسندگان

Keiji Takai, Kenichi Hayashi,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : اثرات داده های بدون برچسب بر خطای طبقه بندی در تجزیه و تحلیل عاملی نرمال

دسترسی سریع

ارتباط

English Website