CS182: Introduction to Machine Learning

Prof. Ziping Zhao, ShanghaiTech University, Fall Term 2021-22.

Course Descriptions

Machine learning (ML) is the science of making computer artifacts improve their performance without requiring humans to program their behavior explicitly. Machine learning has accomplished successes in a wide variety of challenging applications, ranging from computational molecular biology to computer vision to social web analysis.

CS182 Introduction to Machine Learning is an undergraduate-level introductory course in machine learning. It is not only suitable for students who want to apply principled machine learning techniques competently to their application-oriented research areas, but is also suitable for students pursuing or planning to pursue research in machine learning or other related areas that focus on model and algorithm development.

(Developed based on the course materials by Ethem Alpaydin.)

Announcements

Piazza: https://piazza.com/shanghaitech.edu.cn/fall2021/cs182
Gradescope: https://www.gradescope.com/courses/324113

Prerequisites

Compulsory: Linear Algebra, Calculus, Probability and Statistics, Programming.
Recommended Postrequisites: Matrix Analysis and Computations, Convex Optimization, Machine Learning.

Textbooks and Optional References

Textbooks

Ethem Alpaydin, Introduction to Machine Learning (4th Edition), The MIT Press (ACML book series), 2020.
Christopher Bishop, Pattern Recognition and Machine Learning, Springer, 2006.
Richard O. Duda, Peter E. Hart, and David G. Stork, Pattern Classification (2nd Edition), Wiley, 2000.

References

Aaron Courville, Ian Goodfellow, and Yoshua Bengio, Deep Learning, The MIT Press (ACML book series), 2016.
Richard S. Sutton and Andrew G. Barto, Reinforcement Learning: An Introduction (2nd Edition), The MIT Press (ACML book series), 2018.
Tom Mitchell, Machine Learning, McGraw Hill, 1997.
Trevor Hastie, Robert Tibshirani, and Jerome Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction (2nd Edition), Springer, 2009.
Zhihua Zhou, Machine Learning, Tsinghua University Press, 2016. (a ref. in Chinese)
Hang Li, Statistical Learning Methods (2nd Edition), Tsinghua University Press, 2019. (a ref. in Chinese)
Shai Shalev-Shwartz and Shai Ben-David, Understanding Machine Learning: From Theory to Algorithms, Cambridge University Press, 2014.
Mehryar Mohri, Afshin Rostamizadeh, and Ameet Talwalkar, Foundations of Machine Learning (2nd Edition), The MIT Press, 2018.

AI Labs

DeepMind; OpenAI; MetaAI/FAIR; BlackRockAI; IBMAI.

ML Practice Platform

Kaggle; Tianchi.

Schedule (Subject to Change)

Topics

Topic 0: Overview
Topic 1: ML Introduction; Mathematical Foundations of ML (Linear Algebra, Probability and Statistics, Optimization Theory, Information Theory, etc.)
Topic 2: Bayesian Decision Theory
Topic 3: Parameter Estimation for Generative Models
Topic 4: Linear Discrimination Models
Topic 5: Feedforward Neural Networks
Topic 6: Support Vector Machines
Topic 7: Model Assessment and Selection
Topic 8: Deep Learning Models
Topic 9: Recurrent Neural Networks
Topic 10: Clustering and Mixture Models
Topic 11: Nonparametric Methods
Topic 12: Decision Trees
Topic 13: Dimensionality Reduction
Topic 14: Matrix Factorization
Topic 15: Probabilistic Graphical Models
Topic 16: Ensemble Learning
Topic 17: Reinforcement Learning
Guest Lecture: Deep Reinforcement Learning and Game AI (Dr. Junxiao Song from inspirAI) [slides]
Topic 18: Review

Selected Topics

S-Topic 1: Probabilistic Graphical Models
S-Topic 2: Probabilistic Topic Models
S-Topic 3: Hidden Markov Models
S-Topic 4: State Space Models
S-Topic 5: Reinforcement Learning
S-Topic 6: Numerical Optimization in ML

Note: All course materials are available on Piazza.com.

Assessment

30% assignments, 40% final exam, 30% final project.

Academic Integrity Policy

Group study and collaboration on problem sets are encouraged, as working together is a great way to understand new materials. Students are free to discuss the homework problems with anyone under the following conditions:

Students must write down their own solutions. Plagiarism is never allowed. Similar answers, MATLAB codes, etc., found in HWs will invite you into suspected plagiarism investigation.
Students must list the names of their collaborators (i.e., anyone with whom the assignment was discussed).
Students can not use old solution sets from other classes under any circumstances, unless the instructor grants special permission.

Students are encouraged to read the ShanghaiTech Policy on Academic Integrity.