Back close

Optimizing the Performance of Hadoop Clusters through Efficient Cluster Management Techniques

Publication Type : Journal Article

Publisher : (2018) International Journal of Engineering and Technology(UAE)

Source : (2018) International Journal of Engineering and Technology(UAE), 7 (2), pp. 19-22

Url : https://www.scopus.com/record/display.uri?eid=2-s2.0-85047848896&origin=resultslist

Keywords : Big data, Hadoop, Heterogeneous clusters, Map Reduce, Yarn, Zookeeper

Campus : Mysuru

School : School of Arts and Sciences

Department : Computer Science

Year : 2018

Abstract : The necessity for processing the huge data has become a critical task in the age of Internet, even though data processing has evolved into a next generation level still data processing and information extraction has many problems to solve. With the increase in data size retrieving useful information with a given span of time is a herculean task. The most optimal solution that has been adopted is usage of distributed computing environment supporting data processing involving suitable model architecture with large complex structure. Although processing has achieved good amount of improvement, efficiency, energy utilization and accuracy has been compromised. The research aims to propose an efficient environment for data processing with optimized energy utilization and increased performance. Hadoop environment common and popular among big data processing platform has been chosen as base for enhancement. Creating a multi node Hadoop cluster architecture on top of which an efficient cluster monitor is setup and an algorithm to manage efficiency of the cluster is formulated. Cluster monitor is incorporated with Zoo keeper, Yarn (Node and resource manager). Zoo keeper does the monitoring of cluster nodes of the distributed system and identifies critical performance problems. Yarn plays a vital role in managing the resources efficiently and controlling the nodes with the help of hybrid scheduler algorithm. Thus this integrated platform helps in monitoring the distributed cluster as well as improving the performance of the overall Big Data processing. © 2018 Authors.

Cite this Research Publication : Shraddha Bollamma, K.S., Manishankar, S., Vishnu, M.V. Optimizing the performance of hadoop clusters through efficient cluster management techniques (2018) International Journal of Engineering and Technology(UAE), 7 (2), pp. 19-22, 2018

Admissions Apply Now