ComS 573: Machine Learning
Department of Computer Science
Iowa State University
Spring 2008

 


STUDY GUIDE

The material to be covered each week and the assigned readings (along with online lecture notes, if available) are included on this page. The links to lecture notes will not be in place usually until a week after the lecture. The assigned readings are divided into required and recommended readings. You will be responsible for the material covered in the lectures and the assigned required readings. You are strongly encouraged to explore the recommended readings.


Survival Tips


Week 1 (starting January 14, 2008)

Overview of the course

Review of probability theory, Information theory.

Required Readings

Assignments

Recommended Readings for those unfamiliar with probability theory

Recommended Java Readings for those unfamiliar with Java.

Week 2 (starting January 21, 2008)

Review of Information theory (continued).

Bayesian Decision Theory.

Required Readings

Recommended Readings


Week 3 (starting January 28, 2008)

Probability density estimation: parameter estimation problem. Maximum-likelihood parameter estimation, Bayesian parameter estimation, parameter estimation for normal density, parameter estimation for discrete variables.

Naive Bayes Classifier: classifying text documents.

Required Readings

Assignments

Recommended Readings


Week 4 (starting February 4, 2008)

Evaluation of classifiers. Performance measures: Accuracy, Precision, Recall, ROC curves.

Evaluation of classifiers -- estimation of performance measures; confidence interval calculation for estimates; cross-validation; comparing two hypotheses; hypothesis testing; comparing two learning algorithms.
 

Required Readings

Assignments

Recommended Readings


Week 5 (starting February 11, 2008)

The decision tree classifier: decision tree learning algorithm (Quinlan's ID3 algorithm and extensions C4.5); The problem of overfitting, missing data.

Linear models for classification: Linear Discriminant Functions, Perceptrons
 

Required Readings

Recommended Readings


Week 6 (starting February 18, 2008)

Linear models for classification: Linear Discriminant Functions, Perceptrons, Perceptron Learning algorithm, Multi-category classification, Winner-Take-All (WTA) networks (linear machines).

Regression: Linear regression, Least Mean Squared (LMS) Error Criterion,  Gradient descent algorithm, LMS solution for classification.

Fisher Linear discriminant
 

Required Readings

Assignments

Recommended Readings


Week 7 (starting February 25, 2008)

Probabilistic generative models and linear classifiers

Discriminative models for classification, logistic regression models,  training logistic regression, regularized logistic regression

Instance-Based Learning, k-Nearest neighbor learning
 

Required Readings

Recommended Readings


Week 8 (starting March 3, 2008)

Multilayer neural networks: feedforward operations, expressive power of multilayer networks, Backpropagation algorithm, Regularization, stopping criterion, Practical techniques for improving backpropagation, , error functions for classification. Function optimization algorithms---conjugate gradient and quasi-Newton algorithms

Required Readings

Assignments

Recommended Readings


Week 9 (starting March 10, 2008)

Support Vector Machines: Dual Representation of the Perceptron Algorithm, Learning in feature spaces, Kernel Functions, Kernel-Induced Feature Spaces, Maximum Margin Classifiers, constrained optimization problem, Learning as optimization, Support Vectors, Non-Separable Case, The Soft-Margin Classifier

Required Readings

Assignments

Recommended Readings

Additional Information


Spring Break


Week 10 (starting March 24, 2008)

Bayesian Networks: Syntax and Semantics, D-separation.

Required Readings

Additional Information


Week 11 (starting March 31, 2008)

Bayesian Networks: modeling

Required Readings

Assignments

Additional Information


Week 12 (starting April 7, 2008)

Bayesian Networks: inference, learning

Required Readings

Recommended Readings

Additional Information


Week 13 (starting April 14, 2008)

Bayesian Networks: learning

Ensemble Classifiers: Bagging, The Adaboost Algorithm

Required Readings

Recommended Readings


Week 14 (starting April 21, 2008)

Ensemble Classifiers: Error correcting output coding

Mixture Models: Clustering, the EM algorithm, the K-means clustering algorithm

Required Readings

Recommended Readings