How MapReduce Works – Anatomy of a MapReduce Job Run, Failures, Shuffle and Sort, Task Execution MapReduce Types and Formats – MapReduce Types, Input Formats, output formats, MapReduce Features- Counters, Sorting, Joins, Side Data Distribution
Simple analytics using MapReduce, Calculating frequency distributions and sorting using MapReduce, Calculating histograms using MapReduce, Calculating scatter plots using MapReduce. Hierarchical clustering, Clustering algorithm to large dataset, classification using Navie bayes classifier, other applications.