TTIC 31250 - An Introduction to the Theory of Machine Learning (Spring 2022)

Mon/Wed 1:30-2:50pm, TTIC 530

Instructor: Avrim Blum, TA: Falcon Dai

Avrim Blum's office hours: Mon 3:00-4:00pm, TTIC 439
Falcon Dai's office hours: Wed 3:00-4:00pm, on Zoom


Course description: This course will cover some of the basic theory underlying machine learning and the process of generalizing from data. We will talk about both the PAC model for batch learning (learning from one set of data with the aim of producing a predictor that performs well on new data) and models for learning from feedback over time (online learning). We will discuss important fundamental concepts including overfitting, uniform convergence, formal notions of Occam's razor, VC-dimension, and regularization, as well as several classic algorithms including the Perceptron algorithm, SVMs, algorithms for combining expert advice, and boosting. We will also discuss limited-feedback (bandit) algorithms, connections between learning and game theory, and formal guarantees on privacy. This will be a proof-oriented course: our focus will be on proving performance guarantees for algorithms that aim to generalize from data, as well as on understanding what kinds of guarantees we can hope to prove.

Prerequisites: The main prerequisites are comfort with a proof-oriented course, having taken an algorithms class, and comfort with basic probabilistic modeling and reasoning. For example: suppose 1000 programs are submitted to a stock-market prediction challenge, and we find that one of them has correctly predicted whether the market will go up or down in each of the next 10 weeks; should we feel confident that this program is a good one? Comfort with thinking about points and vectors in high-dimensional space is a plus.
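The intended answer is no: a program that guesses up or down at random each week matches 10 weeks in a row with probability 2^-10 = 1/1024, so among 1000 pure guessers we should expect about one such streak by chance alone. Here is a minimal sketch of that calculation in Python (just an illustration of the kind of reasoning assumed):

```python
# Sketch of the calculation behind the stock-market example above.
# Each week the market goes up or down; a program that guesses
# uniformly at random is right 10 weeks in a row with prob. (1/2)^10.

p_streak = 0.5 ** 10    # = 1/1024, about 0.001 for a single guesser
n_programs = 1000

# Expected number of pure guessers (out of 1000) with a perfect streak:
expected_streaks = n_programs * p_streak             # ~ 0.98

# Probability that at least one of 1000 independent guessers streaks:
p_some_streak = 1 - (1 - p_streak) ** n_programs     # ~ 0.62

print(f"P(single guesser streaks) = {p_streak:.4f}")
print(f"Expected perfect streaks  = {expected_streaks:.2f}")
print(f"P(at least one streak)    = {p_some_streak:.2f}")
```

So the observed streak is roughly what chance alone would produce, and by itself gives little reason for confidence in that one program; this multiple-comparisons effect is closely related to the overfitting issues the course will formalize.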

Coursework: We will have 5 homework assignments, a small course project (a 4-5 page writeup based on exploring a theoretical idea, trying some experiments, or reading and explaining a recent research paper), and a take-home final worth one to two homework assignments. In addition, students will be asked to help with grading one of the assignments. The course project is due on the day of the last class. The take-home final can be taken at your convenience during finals week.


Homeworks etc.


Lecture Notes and Tentative Plan

  1. 03/28: Introduction. The PAC model, Overfitting, and Occam's razor. See also Sections 5.1-5.3 of BHK.
  2. 03/30: Online Learning: Mistake-Bounds, Combining Experts, Multiplicative Weights, Regret Minimization. See also Sections 5.5.1, 5.5.2, and 5.7 of BHK and Sections 4.1-4.3 of this book chapter.
  3. 04/04: The Perceptron Algorithm, Margins, and intro to Kernels. See also Sections 5.5.3, 5.5.4, and 5.6 of BHK.
  4. 04/06: [No class today]
  5. 04/11: [Class starts at 2:00pm today] Support Vector Machines. See also this tutorial on SVMs and kernel methods by Hofmann, Schölkopf, and Smola.
  6. 04/13: Uniform Convergence and VC-Dimension I. See also Section 5.9 of BHK.
  7. 04/18: Uniform Convergence and VC-Dimension II. See also Section 5.9 of BHK.
  8. 04/20: Rademacher Bounds. See also the monograph by Bousquet, Boucheron, and Lugosi.
  9. 04/25: Boosting. See also Section 5.10 of BHK.
  10. 04/27: Statistical Query Model I. See also this recent survey article.
  11. 05/02: Statistical Query Model II. For additional developments, see this paper by Vitaly Feldman.
  12. 05/04: Computational Hardness Results for Learning.
  13. 05/09: Learning and Game Theory I.
  14. 05/11: [No class today]
  15. 05/16: Learning and Game Theory II. See also this book chapter on Learning, Regret Minimization, and Equilibria.
  16. 05/18: Bandit algorithms.
  17. 05/23: Learning Finite-State Environments.
  18. 05/25: Differential Privacy and Learning. [Course projects due today]

Recommended texts/readings: