Syllabus
Unit I
Introduction, Causality and Experiments, Data Preprocessing: Data cleaning, Data reduction, Data transformation, Data discretization. Visualization and Graphing: Visualizing Categorical Distributions, Visualizing Numerical Distributions, Overlaid Graphs, plots, and summary statistics of exploratory data analysis, Randomness, Probability, Introduction to Statistics, Sampling, Sample Means and Sample Sizes.
Unit II
Descriptive statistics – Central tendency, dispersion, variance, covariance, kurtosis, five point summary, Distributions, Bayes Theorem, Error Probabilities; Permutation Testing, Statistical Inference; Hypothesis Testing, Assessing Models, Decisions and Uncertainty, Comparing Samples, A/B Testing, P-Values, Causality.
Unit III
Estimation, Prediction, Confidence Intervals, Inference for Regression, Classification , Graphical Models, Updating Predictions.