Syllabus
Unit I
Introduction to ML, Goals and applications of machine learning. Aspects of developing a learning system: training and testing data. Types of learning: Supervised, Unsupervised, and Reinforcement. Linear regression, variance, bias, gradient descent, R2, Ridge and Lasso regression.
Unit II
Logistic regression, decision boundary, classification parameters: Accuracy, precision, recall, F-measure, RoC curve. Bayesian learning: Probability theory and Bayes rule. Naive Bayes learning algorithm. Regression Decision trees: classification, entropy, information gain, ginni index and regression tree – random forest
Unit III
Perceptron and backpropagation neural network – k-nearest neighbor rule. Support vector machine: multicategory generalizations, Kernels for learning non-linear functions. ADA Boost classifier. Feature engineering and feature selection. PCA and LDA
Unit IV
Unsupervised learning. Clustering: Learning from unclassified data. Clustering. Hierarchical Agglomerative Clustering. k-means partitional clustering. Expectation maximization (EM) for soft clustering. Semi-supervised learning with EM using labeled and unlabled data