Prerequisite: Programming for Social Data Science I & II, Research Methods for Policy Studies I & II.
Summary: This course offers an introduction to machine learning tailored for research in social sciences. In the world where the volume of social data is rapidly expanding, mastering machine learning techniques becomes imperative to extract actionable insights for informed policy-making. Machine learning integrates insights from artificial intelligence, probability theory, and statistical inference to automate tasks like pattern recognition and prediction. We’ll explore supervised and unsupervised learning techniques, focusing on the variety of their applications in social research. Ethical considerations surrounding automated analysis and decision-making will be discussed, including their potential to mitigate or exacerbate human biases. Key topics include the bias-variance tradeoff, model selection, cross-validation, regularization, and dimension reduction. Techniques covered range from linear regression variations to tree-based methods and introductory neural networks. Unsupervised methods like principal component analysis and clustering techniques will also be examined. By the end of the course, students will be well acquainted with some of the state-of-the-art toolkits of machine learning and be able to apply them in their own projects.