Back close

Course Detail

Course Name Mining of Massive Datasets
Course Code 24CSC536
Program Integrated M. Sc. Mathematics and Computing
Credits 3
Campus Coimbatore

Syllabus

Basics of Data Mining – Computational Approaches – Statistical Limits on Data Mining – Bonferroni’s Principle – MapReduce – Distributed File Systems . MapReduce . Algorithms Using MapReduce .

Extensions to MapReduce. Finding Similar Items – Applications of Near-Neighbor Search – Shingling of Documents – Similarity-Preserving Summaries of Sets – Locality-Sensitive Hashing for Documents – Distance Measures

Mining Data Streams: The Stream Data Model – Sampling Data in a Stream – Filtering Streams. Link Analysis: PageRank – Efficient Computation of PageRank – Topic-Sensitive PageRank – Link Spam. Frequent Itemsets : The Market-Basket Model – Market Baskets and the A-Priori Algorithm – Handling Larger Datasets in Main Memory. Clustering: Introduction to Clustering Techniques – Hierarchical Clustering – K-means Algorithms – CURE algorithm.

Recommendation Systems: A Model for Recommendation Systems – Content-Based Recommendations – Collaborative Filtering – Dimensionality Reduction. Mining Social- Network Graphs: Social Networks as Graphs – Clustering of Social-Network Graphs – Direct Discovery of Communities – Partitioning of Graphs – Finding Overlapping Communities – Simrank. Dimensionality Reduction: Eigenvalues and Eigenvectors of Symmetric Matrices- Principal-Component Analysis – Singular-Value Decomposition . Large-Scale Machine Learning – Machine-Learning Model – Perceptrons – Support-Vector Machines .

Text Books / References

  1. Jure Leskovec , Anand Rajaraman, Jeffrey David Ullman, Mining of Massive Datasets, Cambridge University Press, 2014.
  2. Tom White, Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale , O’Reilly Media; 4 edition ,

DISCLAIMER: The appearance of external links on this web site does not constitute endorsement by the School of Biotechnology/Amrita Vishwa Vidyapeetham or the information, products or services contained therein. For other than authorized activities, the Amrita Vishwa Vidyapeetham does not exercise any editorial control over the information you may find at these locations. These links are provided consistent with the stated purpose of this web site.

Admissions Apply Now