دانلود رایگان مقاله: مهندسی ویژگی های خودکار برای مدل های رگرسیون با یادگیری ماشین: محاسبات تکاملی و آمار ترکیبی

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
6856862	1437971	2018	27 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Automatic feature engineering for regression models with machine learning: An evolutionary computation and statistics hybrid

ترجمه فارسی عنوان

مهندسی ویژگی های خودکار برای مدل های رگرسیون با یادگیری ماشین: محاسبات تکاملی و آمار ترکیبی

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

مهندسی ویژگی، فراگیری ماشین، رگرسیون نمادین، برنامه نویسی کایزن، رگرسیون خطی، برنامه نویسی ژنتیک، ترکیبی،

Genetic programming - برنامه نویسی ژنتیکی Hybrid - ترکیبی Linear regression - رگرسیون خطی symbolic regression - رگرسیون نمادین Feature engineering - مهندسی ویژگی Machine learning - یادگیری ماشین

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

مهندسی ویژگی های خودکار برای مدل های رگرسیون با یادگیری ماشین: محاسبات تکاملی و آمار ترکیبی

چکیده انگلیسی

Symbolic Regression (SR) is a well-studied task in Evolutionary Computation (EC), where adequate free-form mathematical models must be automatically discovered from observed data. Statisticians, engineers, and general data scientists still prefer traditional regression methods over EC methods because of the solid mathematical foundations, the interpretability of the models, and the lack of randomness, even though such deterministic methods tend to provide lower quality prediction than stochastic EC methods. On the other hand, while EC solutions can be big and uninterpretable, they can be created with less bias, finding high-quality solutions that would be avoided by human researchers. Another interesting possibility is using EC methods to perform automatic feature engineering for a deterministic regression method instead of evolving a single model; this may lead to smaller solutions that can be easy to understand. In this contribution, we evaluate an approach called Kaizen Programming (KP) to develop a hybrid method employing EC and Statistics. While the EC method builds the features, the statistical method efficiently builds the models, which are also used to provide the importance of the features; thus, features are improved over the iterations resulting in better models. Here we examine a large set of benchmark SR problems known from the EC literature. Our experiments show that KP outperforms traditional Genetic Programming - a popular EC method for SR - and also shows improvements over other methods, including other hybrids and well-known statistical and Machine Learning (ML) ones. More in line with ML than EC approaches, KP is able to provide high-quality solutions while requiring only a small number of function evaluations.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volumes 430â431, March 2018, Pages 287-313

نویسندگان

VinÃcius Veloso de Melo, Wolfgang Banzhaf,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : مهندسی ویژگی های خودکار برای مدل های رگرسیون با یادگیری ماشین: محاسبات تکاملی و آمار ترکیبی

دسترسی سریع

ارتباط

English Website