
HDFS-based parallel and scalable pattern mining using clouds for incremental data.

Publication Type : Journal Article

Publisher : International Journal of Computer Aided Engineering and Technology

Source : International Journal of Computer Aided Engineering and Technology, 13(1-2), pp.28-45

Url : https://www.inderscience.com/info/inarticle.php?artid=108102

Campus : Chennai

School : School of Engineering

Department : Computer Science and Engineering

Year : 2020

Abstract : Increased internet usage has led to the migration of large amounts of data to cloud environments, which use Hadoop and the MapReduce framework to manage various mining applications in a distributed setting. Earlier research in distributed mining consisted of solving complex problems using distributed computational techniques and new algorithmic designs. However, as the nature of the data and user requirements become more complex and demanding, the existing distributed algorithms fail in multiple aspects. In our work, a new distributed frequent pattern algorithm, named Hadoop-based parallel frequent pattern mining (HPFP), is proposed to utilise clusters efficiently and to mine repeated patterns from large databases effectively. The empirical evaluation shows that the HPFP algorithm improves the performance of the mining operation by increasing the level of parallelism and execution efficacy. HPFP achieves complete parallelism and delivers superior performance in HDFS compared with existing distributed pattern mining algorithms.

Cite this Research Publication : Sountharrajan, S., Suganya, E., Aravindhraj, N., Sankarananth, S. and Rajan, C., 2020. HDFS-based parallel and scalable pattern mining using clouds for incremental data. International Journal of Computer Aided Engineering and Technology, 13(1-2), pp.28-45.
