کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
7111772 1460840 2017 14 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Job schedulers for Big data processing in Hadoop environment: testing real-life schedulers using benchmark programs
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی کنترل و سیستم های مهندسی
پیش نمایش صفحه اول مقاله
Job schedulers for Big data processing in Hadoop environment: testing real-life schedulers using benchmark programs
چکیده انگلیسی
At present, big data is very popular, because it has proved to be much successful in many fields such as social media, E-commerce transactions, etc. Big data describes the tools and technologies needed to capture, manage, store, distribute, and analyze petabyte or larger-sized datasets having different structures with high speed. Big data can be structured, unstructured, or semi structured. Hadoop is an open source framework that is used to process large amounts of data in an inexpensive and efficient way, and job scheduling is a key factor for achieving high performance in big data processing. This paper gives an overview of big data and highlights the problems and challenges in big data. It then highlights Hadoop Distributed File System (HDFS), Hadoop MapReduce, and various parameters that affect the performance of job scheduling algorithms in big data such as Job Tracker, Task Tracker, Name Node, Data Node, etc. The primary purpose of this paper is to present a comparative study of job scheduling algorithms along with their experimental results in Hadoop environment. In addition, this paper describes the advantages, disadvantages, features, and drawbacks of various Hadoop job schedulers such as FIFO, Fair, capacity, Deadline Constraints, Delay, LATE, Resource Aware, etc, and provides a comparative study among these schedulers.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Digital Communications and Networks - Volume 3, Issue 4, November 2017, Pages 260-273
نویسندگان
, , ,