Back close

A mapreduce model for distributed self organizing map using apache spark

Publication Type : Journal Article

Publisher : Journal of Advanced Research in Dynamical and Control Systems, Institute of Advanced Scientific Research, Inc .

Source : Journal of Advanced Research in Dynamical and Control Systems, Institute of Advanced Scientific Research, Inc., Volume 10, Number 12, p.1229-1238 (2018)

Url : https://www.scopus.com/inward/record.uri?eid=2-s2.0-85064170575&partnerID=40&md5=d7fe2daedd2969a1feabfebac8e9a144

Campus : Amritapuri

School : Department of Computer Science and Engineering, School of Engineering

Center : AI (Artificial Intelligence) and Distributed Systems

Department : Computer Science

Verified : No

Year : 2018

Abstract : Training Neural network models for clustering is computationally expensive in distributed environment. This paper presents a Self Organizing Map (SOM) Model suitable for clustering based on data parallelism. Though the proposed approach is a generic MapReduce prototype for any neural network model, this paper extends the prototype to leverage into SOM model. Our technique differs from the existing technique in training and agglomeration of the results. MapReduce using Apache Spark is more efficient than Apache Hadoop, due to Resilient Distributed Datasets (RDDs). Experimentation on real data sets on Spark platform shows the feasibility of the proposed approach. Further, comparison with Distributed KMeans based on the metrics like Purity and Scalability shows efficacy of the proposed solution. © 2018, Institute of Advanced Scientific Research, Inc. All rights reserved.

Cite this Research Publication : A. Mukundan and Sandhya Harikumar, “A mapreduce model for distributed self organizing map using apache spark”, Journal of Advanced Research in Dynamical and Control Systems, vol. 10, pp. 1229-1238, 2018

Admissions Apply Now