Sneha. M, Shoney Sebastian and Rajeshwari C N
Cloud computing is emerging as a new trend to store and analyze data, Hadoop is a very good and efficient framework that can be used to process the data through map reduce programming paradigm. This paper provides the insight for the rise of big data and what role does the hadoop plays to handle this. The structure and architecture view of hadoop with all the components and Yarn architecture is discussed. Scheduling is allot the set of jobs to process in the corresponding slots based on the few job scheduling algorithms, comparative study of all the scheduling algorithms are made which would be helpful while processing the data.
Map Reduce, Yarn, Hadoop, slots, HDFS.