Introduction: Scaling with Traditional Databases – Need for NoSQL – First Principles – Desired Properties – Lambda Architecture. Batch Layer: Big data model – properties – fact-based modeling – graph schemas – Apache Thrift,
Data Storage on the Batch Layer – Requirements – Solutions – Distributed File Systems and Partitioning – Hadoop basics, Computing on the Batch Layer – Algorithms – Scalability – MapReduce, Batch Layer Architecture and Algorithms – Design Overview and Workflow, Ingesting New Data, Normalization.
Serving Layer: Performance Metrics, Requirements and Design, ElephantDB. Speed Layer: Realtime Views, Cassandra basics, Query and Stream Processing, Apache Storm.