Back close

Course Detail

Course Name Introduction to Big Data Analytics
Course Code 23CSE434
Program B. Tech. in Computer Science and Engineering (CSE)
Credits 3
Campus Amritapuri ,Coimbatore,Bengaluru, Amaravati, Chennai

Syllabus

Professional Electives

Other Branches

Unit I

Introduction: Overview of DBMS, File vs DBMS, elements of DBMS, Relational Data Model: Introduction to relational model, Structure of relational mode, domain, keys, tuples to relational models. SQL – table creation, relationships, basic queries DML and DDL, Joins, Grouping.

Unit II

Introduction to Big Data: Types of Digital Data – Characteristics of Data – Evolution of Big Data – Definition of Big Data – Challenges with Big Data-3Vs of Big Data -Terminologies in Big Data – CAP Theorem – BASE Concept. NoSQL: Types of Databases – Advantages – NewSQL – SQL vs. NOSQL vs NewSQL. Introduction to Hadoop: Features – Advantages – Versions – Overview of Hadoop Eco systems – Hadoop distributions – Hadoop vs. SQL – RDBMS vs. Hadoop – Hadoop Components – Architecture – HDFS – Map Reduce: Mapper – Reducer – – Map Reduce: Mapper – Reducer – Combiner – Partitioner. Hadoop 2 (YARN): Architecture – Interacting with Hadoop Eco systems.

Unit III

No SQL databases: Cassandra: Introduction – Features – Data types – CQLSH – Key spaces – CRUD operations – Collections – Counter – TTL – Alter commands – Import and Export – Querying System tables.

Objectives and Outcomes

Course Objectives

  • The aim of this course is to provide an introduction to big data technologies and tools used for big data.
  • Basics of relational databases and its implementation strategy using SQL are discussed in the first phase.
  • The second phase discusses on concepts big data and its architecture, storage and processing of data in parallel and distributed system.
  • In the last phase retrieval and analysis of unstructured data are done using NOSQL databases.

Course Outcomes

CO1: Understand fundamental concepts of Databases and SQL.

CO2: Apply SQL for data storage and retrieval.

CO3: Understand fundamental concepts of Big Data and its technologies.

CO4: Apply Map reduce programming for big data.

CO5: Analyze appropriate NoSQL database techniques for storing and processing large volumes of structured and

unstructured data.

CO-PO Mapping

PO/PSO

PO1

PO2

PO3

PO4

PO5

PO6

PO7

PO8

PO9

PO10

PO11

PO12

PSO1

PSO2

CO

CO1

3

2

1

                 

3

2

CO2

1

3

2

                 

3

2

CO3

 

2

2

1

3

             

3

2

CO4

 

3

2

2

3

             

3

2

CO5

     

2

3

             

3

2

Evaluation Pattern

Evaluation Pattern: 70:30

Assessment

Internal

End Semester

MidTerm Exam

20

 

Continuous Assessment – Theory (*CAT)

10

 

Continuous Assessment – Lab (*CAL)

40

 

**End Semester

 

30 (50 Marks; 2 hours exam)

*CAT – Can be Quizzes, Assignments, and Reports

*CAL – Can be Lab Assessments, Project, and Report

**End Semester can be theory examination/ lab-based examination/ project presentation

Text Books / References

Textbook(s)

Seema Acharya, Subhashini Chellappan. “Big Data and Analytics”, Wiley Publication; 2015.

Reference(s)

Hurwitz JS, Nugent A, Halper F, Kaufman M. “Big data for dummies”. John Wiley & Sons; 2013.

White T. “Hadoop: The definitive guide”, O’Reilly Media, Inc.”; 2012.

Bradberry R, Lubow E. “Practical Cassandra: a developer’s approach”. Addison-Wesley; 2013.

DISCLAIMER: The appearance of external links on this web site does not constitute endorsement by the School of Biotechnology/Amrita Vishwa Vidyapeetham or the information, products or services contained therein. For other than authorized activities, the Amrita Vishwa Vidyapeetham does not exercise any editorial control over the information you may find at these locations. These links are provided consistent with the stated purpose of this web site.

Admissions Apply Now