دانلود رایگان مقاله: استراتژی و توزیع یادگیری اصول ماشین در داده های بزرگ

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
478807	1364853	2016	17 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Strategies and Principles of Distributed Machine Learning on Big Data

ترجمه فارسی عنوان

استراتژی و توزیع یادگیری اصول ماشین در داده های بزرگ

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

فراگیری ماشین؛ هوش مصنوعی داده های بزرگ؛ مدل بزرگ؛ سیستم های توزیع شده؛ اصول؛ تئوری؛ داده-موازی؛ مدل-موازی

Principles - اصول Theory - تئوری Distributed systems - سیستم توزیع شده Data-parallelism - موازی داده ها Machine learning - یادگیری ماشین

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)

پیش نمایش مقاله

استراتژی و توزیع یادگیری اصول ماشین در داده های بزرگ

چکیده انگلیسی

ABSTRACTThe rise of big data has led to new demands for machine learning (ML) systems to learn complex models, with millions to billions of parameters, that promise adequate capacity to digest massive datasets and offer powerful predictive analytics (such as high-dimensional latent features, intermediate representations, and decision functions) thereupon. In order to run ML algorithms at such scales, on a distributed cluster with tens to thousands of machines, it is often the case that significant engineering efforts are required—and one might fairly ask whether such engineering truly falls within the domain of ML research. Taking the view that “big” ML systems can benefit greatly from ML-rooted statistical and algorithmic insights—and that ML researchers should therefore not shy away from such systems design—we discuss a series of principles and strategies distilled from our recent efforts on industrial-scale ML solutions. These principles and strategies span a continuum from application, to engineering, and to theoretical research and development of big ML systems and architectures, with the goal of understanding how to make them efficient, generally applicable, and supported with convergence and scaling guarantees. They concern four key questions that traditionally receive little attention in ML research: How can an ML program be distributed over a cluster? How can ML computation be bridged with inter-machine communication? How can such communication be performed? What should be communicated between machines? By exposing underlying statistical and algorithmic characteristics unique to ML programs but not typically seen in traditional computer programs, and by dissecting successful cases to reveal how we have harnessed these principles to design and develop both high-performance distributed ML software as well as general-purpose ML frameworks, we present opportunities for ML researchers and practitioners to further shape and enlarge the area that lies between ML and systems.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Engineering - Volume 2, Issue 2, June 2016, Pages 179–195

نویسندگان

Eric P. Xing, Qirong Ho, Pengtao Xie, Dai Wei,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : استراتژی و توزیع یادگیری اصول ماشین در داده های بزرگ

دسترسی سریع

ارتباط

English Website