Big Data Analytics - Amrita Vishwa Vidyapeetham

Course Detail

Course Name	Big Data Analytics
Course Code	23CSE352
Program	B. Tech. in Computer Science and Engineering (CSE)
Credits	3
Campus	Amritapuri ,Coimbatore,Bengaluru, Amaravati, Chennai

Syllabus

PROFESSIONAL ELECTIVES

Electives Electives in Data Science

Unit I

Hadoop ecosystem in Brief –Basic Paradigm and system architecture, MapRedand HDFS, Making a small Hadoop cluster –Iterative and non-Iterative batch processing, Data stores, HBASE, HIVE, PIG-New generation Big data using Functional Programming in Scala: Basic Syntax-type inference and static types-function types and value types, closures.

Unit II

Immutability and immutable types-generic type Parameters-Recursive arbitrary collections –ConsList -Iterative arbitrary collections-Arrays-Tail recursion-factorial example-functional abstractions with examples-square root, fixed point, sequence summations. Higher order functions-MapReduce Template-Pattern Matching syntax. Similar higher order (Cons) List operations on arbitrary Collections-filter, fold, partition, span. Basic entity classes and objects in Scala.

Unit III

Apache Spark: -ResilientDistributed Datasets -Creating RDDs, Lineage and Fault tolerance, DAGs, Immutability, task division and partitions, transformations and actions, lazy evolutions and optimization -Formatting and housing data from spark RDDs–Persistence. Setting up a standalone Spark cluster-: spark-shell, basic API, Modules-Core, Key/Value pairs and other RDD features, MLlib-examples for bi-class SVM and logistic regression.

Objectives and Outcomes

Course Objectives

To provide in-depth knowledge about big data Technologies and tools used for big data.
To facilitate students to learn to implement and work on tools to handle large volumes of data in parallel and distributed environments.
To throw light on retrieval and analysis of unstructured data using NOSQL databases.
To impart in-depth knowledge of Spark and Spark MLlib.

Course Outcomes

CO1: Understand fundamental concepts of Big Data and its technologies.

CO2: Apply concepts of MapReduce framework for optimization.

CO3: Analyze appropriate NoSQL database techniques for storing and processing large volumes of structured and unstructured data.

CO4: Apply data analytics solutions using Hadoop ecosystems and Spark.

CO5: Explore modern big data processing packages for Machine learning.

CO-PO Mapping

PO/PSO	PO1	PO2	PO3	PO4	PO5	PO6	PO7	PO8	PO9	PO10	PO11	PO12	PSO1	PSO2
CO	PO1	PO2	PO3	PO4	PO5	PO6	PO7	PO8	PO9	PO10	PO11	PO12	PSO1	PSO2
CO1	3	2	1							1			3	2
CO2	3	3	2	2	3	2	2	2	2	2			3	2
CO3	3	3	2	2	3	2	2	2	2	2			3	2
CO4	3	3	2	2	3	2	2	2	2	2			3	2
CO5	2			2	3		2	3	2	2			3	2

Evaluation Pattern

Evaluation Pattern: 70:30

Assessment	Internal	External
Midterm	20
*Continuous Assessment (Theory) (CAT)	10
*Continuous Assessment (Lab) (CAL)	40
**End Semester		30 (50 Marks; 2 hours exam)

*CAT – Can be Quizzes, Assignments, and Reports

*CAL – Can be Lab Assessments, Project, and Report

**End Semester can be theory examination/ lab-based examination/ project presentation

Text Books / References

Textbook(s)

Holden Karau, Andy Konwinski, Patrick Wendell and MateiZaharia, “Learning Spark: Lightning-Fast Big Data Analysis”, 1st Edition,2015.

Reference(s)

Cay S. Horstmann, “Scala for the Impatient”, 2nd Edition,2017.

Bill Chambers and MateiZaharia, “Spark: The Definitive Guide: Big Data Processing Made Simple”, 1st Edition,2018.

Martin Odersky, Lex Spoon and Bill Venners, “Programming in Scala: A Comprehensive Step-by-Step Guide”, 3rd Edition,2008.

Holden Karau and Rachel Warren, “High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark”, 1st Edition,2017.

Tom White, “Hadoop: The Definitive Guide”, 4th Edition, 2015.

DISCLAIMER: The appearance of external links on this web site does not constitute endorsement by the School of Biotechnology/Amrita Vishwa Vidyapeetham or the information, products or services contained therein. For other than authorized activities, the Amrita Vishwa Vidyapeetham does not exercise any editorial control over the information you may find at these locations. These links are provided consistent with the stated purpose of this web site.

About Amrita Vishwa Vidyapeetham

Rankings

Accreditation

Governance

Chancellor

Leadership

Press Media

Newsletters

Amritapuri
Campus

Amaravati
Campus

Bengaluru
Campus

Chennai
Campus

Coimbatore
Campus

Faridabad
Campus

Kochi
Campus

Mysuru
Campus

Nagercoil
Campus

Haridwar

Research

Centers

Patents

Course

Course Detail

Syllabus

PROFESSIONAL ELECTIVES

Unit I

Unit II

Unit III

Objectives and Outcomes

Evaluation Pattern

Text Books / References

Interests

Programs

Research

About Amrita

Resources

Locations

Reports

About Amrita Vishwa Vidyapeetham

Amritapuri Campus

Amaravati Campus

Bengaluru Campus

Chennai Campus

Coimbatore Campus

Faridabad Campus

Kochi Campus

Mysuru Campus

Nagercoil Campus

Haridwar

Research

Programs

From the news

Others

Course

Course Detail

Syllabus

PROFESSIONAL ELECTIVES

Unit I

Unit II

Unit III

Objectives and Outcomes

Evaluation Pattern

Text Books / References

Amritapuri
Campus

Amaravati
Campus

Bengaluru
Campus

Chennai
Campus

Coimbatore
Campus

Faridabad
Campus

Kochi
Campus

Mysuru
Campus

Nagercoil
Campus