Article code: 6940746
Journal code: 1450018
Publication year: 2018
English article: 8 pages, PDF
Full-text version: free download
English title of the ISI article
A binning formula of bi-histogram for joint entropy estimation using mean square error minimization
Related topics
Engineering and Basic Sciences; Computer Engineering; Computer Vision and Pattern Recognition
Abstract
Histograms have been used extensively as a simple tool for nonparametric probability density function estimation. In practice, however, the accuracy of some histogram-derived quantities, such as the marginal entropy (ME), the joint entropy (JE), or the mutual information (MI), depends on the number of bins chosen for the histogram. In this paper, we investigate the binning problem of the bi-histogram for JE estimation. By minimizing a theoretical mean square error (MSE) of the JE estimate, we derive a new formula for the optimal number of bins of the bi-histogram for continuous random variables. This novel JE estimator is used in MI estimation to avoid the accumulation of error in the joint MI between the class variable and the feature subset during feature selection. In a synthetic Gaussian feature selection problem, only the proposed method retrieves the exact number of relevant features that explain the class variable, when compared with a competing univariate estimator based on a binning formula proposed for ME estimation. In speech and speaker recognition applications, the proposed method selects a limited number of features that guarantees approximately the same, or even a better, recognition rate than using the full feature set.
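To make the quantity discussed in the abstract concrete, here is a minimal sketch of a plug-in joint entropy estimate computed from a bi-histogram (2-D histogram). It is not the paper's method: the paper's optimal-bin formula is not reproduced on this page, so the number of bins is left as a free parameter. The function name `joint_entropy_hist` and the choice of equal-width bins are assumptions made for illustration.

```python
import numpy as np


def joint_entropy_hist(x, y, bins):
    """Plug-in estimate of the differential joint entropy h(X, Y) in nats.

    Illustrative sketch only: `bins` (bins per axis) is chosen by the caller,
    not by the optimal-bin formula derived in the paper.
    """
    # Equal-width 2-D histogram over the observed range of the samples.
    counts, x_edges, y_edges = np.histogram2d(x, y, bins=bins)
    p = counts / counts.sum()  # empirical cell probabilities

    # All cells share the same area because the bins are equal-width.
    cell_area = (x_edges[1] - x_edges[0]) * (y_edges[1] - y_edges[0])

    # h ~ -sum p_ij * log(p_ij / cell_area); empty cells contribute zero.
    nonzero = p > 0
    return -np.sum(p[nonzero] * np.log(p[nonzero])) + np.log(cell_area)


# Usage: two independent standard Gaussians, whose true joint entropy
# is log(2 * pi * e) nats.
rng = np.random.default_rng(0)
x = rng.standard_normal(20_000)
y = rng.standard_normal(20_000)
h_estimate = joint_entropy_hist(x, y, bins=20)
h_true = np.log(2 * np.pi * np.e)
```

Repeating the call with very few or very many bins shows how the estimate drifts from `h_true`, which is the dependence on bin count that the abstract describes.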
Publisher
Database: Elsevier - ScienceDirect
Journal: Pattern Recognition Letters - Volume 101, 1 January 2018, Pages 21-28