دانلود رایگان مقاله: تعیین خصوصیات آماری قابل توجه از جنگل های تصادفی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
407539	678146	2015	10 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Inferring statistically significant features from random forests

ترجمه فارسی عنوان

تعیین خصوصیات آماری قابل توجه از جنگل های تصادفی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

انتخاب ویژگی، گروه های درختی، آزمونهای اهمیت تجزیه و تحلیل داده های با ابعاد بزرگ

Feature selection - انتخاب ویژگی High-dimensional data analysis - تجزیه و تحلیل اطلاعات با ابعاد بزرگ Significance tests - تست های اهمیت

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

تعیین خصوصیات آماری قابل توجه از جنگل های تصادفی

چکیده انگلیسی

Embedded feature selection can be performed by analyzing the variables used in a Random Forest. Such a multivariate selection takes into account the interactions between variables but is not straightforward to interpret in a statistical sense. We propose a statistical procedure to measure variable importance that tests if variables are significantly useful in combination with others in a forest. We show experimentally that this new importance index correctly identifies relevant variables. The top of the variable ranking is largely correlated with Breiman׳s importance index based on a permutation test. Our measure has the additional benefit to produce p-values from the forest voting process. Such p-values offer a very natural way to decide which features are significantly relevant while controlling the false discovery rate. Practical experiments are conducted on synthetic and real data including low and high-dimensional datasets for binary or multi-class problems. Results show that the proposed technique is effective and outperforms recent alternatives by reducing the computational complexity of the selection process by an order of magnitude while keeping similar performances.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 150, Part B, 20 February 2015, Pages 471–480

نویسندگان

Jérôme Paul, Pierre Dupont,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : تعیین خصوصیات آماری قابل توجه از جنگل های تصادفی

دسترسی سریع

ارتباط

English Website