Professional Electives
Other Branches
Course Name | Introduction to Big Data Analytics |
Course Code | 23CSE434 |
Program | B. Tech. in Computer Science and Engineering (CSE) |
Credits | 3 |
Campus | Amritapuri ,Coimbatore,Bengaluru, Amaravati, Chennai |
Other Branches
Introduction: Overview of DBMS, File vs DBMS, elements of DBMS, Relational Data Model: Introduction to relational model, Structure of relational mode, domain, keys, tuples to relational models. SQL – table creation, relationships, basic queries DML and DDL, Joins, Grouping.
Introduction to Big Data: Types of Digital Data – Characteristics of Data – Evolution of Big Data – Definition of Big Data – Challenges with Big Data-3Vs of Big Data -Terminologies in Big Data – CAP Theorem – BASE Concept. NoSQL: Types of Databases – Advantages – NewSQL – SQL vs. NOSQL vs NewSQL. Introduction to Hadoop: Features – Advantages – Versions – Overview of Hadoop Eco systems – Hadoop distributions – Hadoop vs. SQL – RDBMS vs. Hadoop – Hadoop Components – Architecture – HDFS – Map Reduce: Mapper – Reducer – – Map Reduce: Mapper – Reducer – Combiner – Partitioner. Hadoop 2 (YARN): Architecture – Interacting with Hadoop Eco systems.
No SQL databases: Cassandra: Introduction – Features – Data types – CQLSH – Key spaces – CRUD operations – Collections – Counter – TTL – Alter commands – Import and Export – Querying System tables.
Course Objectives
Course Outcomes
CO1: Understand fundamental concepts of Databases and SQL.
CO2: Apply SQL for data storage and retrieval.
CO3: Understand fundamental concepts of Big Data and its technologies.
CO4: Apply Map reduce programming for big data.
CO5: Analyze appropriate NoSQL database techniques for storing and processing large volumes of structured and
unstructured data.
CO-PO Mapping
PO/PSO |
PO1 |
PO2 |
PO3 |
PO4 |
PO5 |
PO6 |
PO7 |
PO8 |
PO9 |
PO10 |
PO11 |
PO12 |
PSO1 |
PSO2 |
CO |
||||||||||||||
CO1 |
3 |
2 |
1 |
3 |
2 |
|||||||||
CO2 |
1 |
3 |
2 |
3 |
2 |
|||||||||
CO3 |
2 |
2 |
1 |
3 |
3 |
2 |
||||||||
CO4 |
3 |
2 |
2 |
3 |
3 |
2 |
||||||||
CO5 |
2 |
3 |
3 |
2 |
Evaluation Pattern: 70:30
Assessment |
Internal |
End Semester |
MidTerm Exam |
20 |
|
Continuous Assessment – Theory (*CAT) |
10 |
|
Continuous Assessment – Lab (*CAL) |
40 |
|
**End Semester |
30 (50 Marks; 2 hours exam) |
*CAT – Can be Quizzes, Assignments, and Reports
*CAL – Can be Lab Assessments, Project, and Report
**End Semester can be theory examination/ lab-based examination/ project presentation
Textbook(s)
Seema Acharya, Subhashini Chellappan. “Big Data and Analytics”, Wiley Publication; 2015.
Reference(s)
Hurwitz JS, Nugent A, Halper F, Kaufman M. “Big data for dummies”. John Wiley & Sons; 2013.
White T. “Hadoop: The definitive guide”, O’Reilly Media, Inc.”; 2012.
Bradberry R, Lubow E. “Practical Cassandra: a developer’s approach”. Addison-Wesley; 2013.
DISCLAIMER: The appearance of external links on this web site does not constitute endorsement by the School of Biotechnology/Amrita Vishwa Vidyapeetham or the information, products or services contained therein. For other than authorized activities, the Amrita Vishwa Vidyapeetham does not exercise any editorial control over the information you may find at these locations. These links are provided consistent with the stated purpose of this web site.