Publication Type : Journal Article
Publisher : International Journal of Computer Aided Engineering and Technology
Source : International Journal of Computer Aided Engineering and Technology, 13(1-2), pp.28-45
Url : https://www.inderscience.com/info/inarticle.php?artid=108102
Campus : Chennai
School : School of Engineering
Department : Computer Science and Engineering
Year : 2020
Abstract : Abstract: Increased usage of internet led to the migration of large amount of data to the cloud environment which uses Hadoop and map reduce framework for managing various mining applications in distributed environment. Earlier research activity in distributed mining comprises of solving complex problems using distributed computational techniques and new algorithmic designs. But as the nature of the data and user requirement becomes more complex and demanding, the existing distributed algorithms fails in multiple aspects. In our work, a new distributed frequent pattern algorithm, named Hadoop-based parallel frequent pattern mining (HPFP) has been proposed to optimally utilise the clusters efficiently and mine repeated patterns from large databases very effectively. The empirical evaluation shows that HPFP algorithm improves the performance of mining operation by increasing the level of parallelism and execution efficacy. HPFP achieves complete parallelism and delivers superior performance to become an efficient algorithm in HDFS, than existing distributed pattern mining algorithms.
Cite this Research Publication : Sountharrajan, S., Suganya, E., Aravindhraj, N., Sankarananth, S. and Rajan, C., 2020. HDFS-based parallel and scalable pattern mining using clouds for incremental data. International Journal of Computer Aided Engineering and Technology, 13(1-2), pp.28-45.