کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6903198 1446751 2018 29 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A distributed evolutionary multivariate discretizer for Big Data processing on Apache Spark
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
پیش نمایش صفحه اول مقاله
A distributed evolutionary multivariate discretizer for Big Data processing on Apache Spark
چکیده انگلیسی
Nowadays the phenomenon of Big Data is overwhelming our capacity to extract relevant knowledge through classical machine learning techniques. Discretization (as part of data reduction) is presented as a real solution to reduce this complexity. However, standard discretizers are not designed to perform well with such amounts of data. This paper proposes a distributed discretization algorithm for Big Data analytics based on evolutionary optimization. After comparing with a distributed discretizer based on the Minimum Description Length Principle, we have found that our solution yields more accurate and simpler solutions in reasonable time.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Swarm and Evolutionary Computation - Volume 38, February 2018, Pages 240-250
نویسندگان
, , , ,