CSCA 5622: Introduction to Machine Learning: Supervised Learning

Preview this course in the non-credit experience today!

  Work you complete in the non-credit experience will transfer to the for-credit experience when you upgrade and pay tuition. See How It Works for details.

Cross-listed with DTSA 5509

  • Course Type: Breadth
  • Specialization: Machine Learning: Theory & Hands-On Practice with Python
  • Instructor: Dr. Geena Kim
  • Prior knowledge needed:
    • Programming languages: Basic to intermediate experience with Python, Jupyter Notebook
    • Math: Basic level Probability and Statistics, Linear Algebra
    • Technical requirements: Windows, macOS, or Linux; Jupyter Notebook

  View on Coursera

Learning Outcomes

  • Use modern machine learning tools and Python libraries.
  • Explain how to deal with linearly inseparable data.
  • Compare logistic regression’s strengths and weaknesses.
  • Explain what a decision tree is and how it splits nodes.

Course Grading Policy

Each assignment's percentage of the course grade is shown in parentheses.

Week 1
  • Week 1 Quiz (3%)
  • Week 1 Programming Assignment: Data Cleaning and EDA (4%)
  • Week 1 Peer Review: Data Cleaning and EDA (5%)

Week 2
  • Week 2 Quiz (3%)
  • Week 2 Peer Review: Multiple Linear Regression (6%)
  • Week 2 Programming Assignment: Multiple Linear Regression (3%)

Week 3
  • Week 3 Quiz (3%)
  • Week 3 Programming Assignment: Logistic Regression (5%)
  • Week 3 Peer Review: Logistic Regression (2%)

Week 4
  • Week 4 Quiz (3%)
  • Week 4 Programming Assignment: Non-parametric Models (5%)
  • Week 4 Peer Review: Non-parametric Models (3%)

Week 5
  • Week 5 Quiz (3%)
  • Week 5 Programming Assignment: Ensembles (4%)
  • Week 5 Peer Review: Ensembles (4%)

Week 6
  • Week 6 Quiz (3%)
  • Week 6 Programming Assignment: SVM Lab (5%)
  • Week 6 Peer Review: SVM Lab (3%)

Week 7
  • CSCA 5622 Supervised Learning Final Project (33%)

Total: 100%

Course Content

Week 1 (Duration: 7 hours)

This week, we will build our supervised machine learning foundation. Data cleaning and Exploratory Data Analysis (EDA) might not seem glamorous, but the process is vital for guiding your real-world data projects. The chances are that you have heard of linear regression before. With the buzz around machine learning, perhaps it seems surprising that we are starting with such a standard statistical technique. In "How Not to Be Wrong: The Power of Mathematical Thinking", Jordan Ellenberg refers to linear regression as "the statistical technique that is to social science as the screwdriver is to home repair. It's the one tool you're pretty much going to use, whatever the task". Linear regression is an excellent starting place for solving problems with a continuous outcome. Hopefully, this week will help you appreciate how much you can accomplish with a simple model like this.
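
As a preview of how little code a first model takes, here is a minimal sketch of simple linear regression (assuming scikit-learn and synthetic data of our own; this is not the course's lab code):

    # Fit a simple linear regression on synthetic data (illustrative only).
    import numpy as np
    from sklearn.linear_model import LinearRegression

    rng = np.random.default_rng(0)
    X = rng.uniform(0, 10, size=(100, 1))              # one explanatory variable
    y = 3.0 * X[:, 0] + 2.0 + rng.normal(0, 1, 100)    # true model: y = 3x + 2 + noise

    model = LinearRegression().fit(X, y)
    print(model.coef_, model.intercept_)               # should recover roughly 3.0 and 2.0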

Week 2 (Duration: 6 hours)

This week we build on last week's foundation and work with more complex linear regression models. After this week, you will be able to create linear models with multiple explanatory variables, including categorical ones. Mathematically and syntactically, multiple linear regression models are a natural extension of the simpler linear regression models we learned last week. One difference to keep in mind is that our data space is now 3D instead of 2D, which has implications for tasks like creating meaningful visualizations. It is also essential to understand how to interpret the coefficients. Machine learning involves strategically iterating on and improving a model; in this week's lab and Peer Review, you will identify weaknesses in linear regression models and strategically improve on them. Hopefully, as you progress through this specialization, you will get better and better at this iterative process.
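
Here is a minimal sketch of a multiple linear regression with one numeric and one categorical explanatory variable (the toy data and column names are our own illustration, assuming pandas and scikit-learn):

    # Multiple linear regression with a dummy-encoded categorical variable.
    import pandas as pd
    from sklearn.linear_model import LinearRegression

    df = pd.DataFrame({
        "sqft":  [800, 1200, 1500, 2000, 900, 1700],
        "city":  ["A", "A", "B", "B", "A", "B"],
        "price": [150, 210, 280, 360, 165, 310],
    })
    X = pd.get_dummies(df[["sqft", "city"]], drop_first=True)  # city -> one 0/1 column
    model = LinearRegression().fit(X, df["price"])

    # Each coefficient is the expected change in price for a one-unit change
    # in that variable, holding the other variables fixed.
    print(dict(zip(X.columns, model.coef_)), model.intercept_)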

Week 3 (Duration: 6 hours)

Even though the name logistic regression might suggest otherwise, we will be shifting our attention from regression tasks to classification tasks this week. Logistic regression is a particular case of a generalized linear model. Like linear regression, logistic regression is a widely used statistical tool and one of the foundational tools for your data science toolkit. There are many real-world applications for classification tasks, including the financial and biomedical realms. In this week's lab, you will see how this classic algorithm will help you predict whether a biopsy slide from the famous Wisconsin Breast Cancer dataset shows a benign or malignant mass.
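
As a hedged sketch of that workflow, the example below uses scikit-learn's built-in copy of the Wisconsin Breast Cancer data; the course lab may load and preprocess the data differently:

    # Logistic regression on the Wisconsin Breast Cancer dataset.
    from sklearn.datasets import load_breast_cancer
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import StandardScaler

    X, y = load_breast_cancer(return_X_y=True)   # y: 0 = malignant, 1 = benign
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    # Standardizing the features helps the solver converge.
    clf = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
    clf.fit(X_train, y_train)
    print("test accuracy:", clf.score(X_test, y_test))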

Week 4 (Duration: 6 hours)

This week we will learn about non-parametric models. k-Nearest Neighbors (kNN) makes sense on an intuitive level. The decision tree is a supervised learning model that can be used for either regression or classification tasks. In Module 2, we learned about the bias-variance tradeoff, and we've kept that tradeoff in mind as we've moved through the course. Highly flexible tree models can capture complex, non-linear relationships, but they are prone to overfitting. This week and next, we will explore strategies like pruning to avoid overfitting with tree-based models. In this week's lab, you will build a kNN classifier for the famous MNIST dataset and then build a spam classifier using a decision tree model. Once again, we will appreciate the power of simple, understandable models.
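
Below is a small sketch of both model families (assuming scikit-learn; we use its small digits dataset as a stand-in for MNIST, and a max_depth limit as a simple stand-in for pruning):

    # k-NN classifier and a depth-limited decision tree on the digits data.
    from sklearn.datasets import load_digits
    from sklearn.model_selection import train_test_split
    from sklearn.neighbors import KNeighborsClassifier
    from sklearn.tree import DecisionTreeClassifier

    X, y = load_digits(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    knn = KNeighborsClassifier(n_neighbors=5).fit(X_train, y_train)
    # Limiting depth restrains the tree's flexibility and curbs overfitting.
    tree = DecisionTreeClassifier(max_depth=10, random_state=0).fit(X_train, y_train)
    print("kNN:", knn.score(X_test, y_test), "tree:", tree.score(X_test, y_test))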

Week 5 (Duration: 6 hours)

Last week, we learned about tree models. For all their benefits, tree models have some weaknesses that are difficult to overcome, chief among them a tendency to overfit. This week we will learn about ensemble methods that counter that tendency. In many machine learning competitions, the winning entries use an ensemble approach, aggregating predictions from multiple tree models. You will start with random forests and bagging, a technique that trains the same algorithm on different subsamples of the training data. Then you will learn about boosting, an ensemble method in which models train sequentially, and study two essential boosting algorithms: AdaBoost and Gradient Boosting.
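
The sketch below compares the three ensemble families named above on synthetic data with default hyperparameters; the dataset and settings are our own illustrative choices, assuming scikit-learn:

    # Bagging-style and boosting ensembles, cross-validated on synthetic data.
    from sklearn.datasets import make_classification
    from sklearn.ensemble import (AdaBoostClassifier, GradientBoostingClassifier,
                                  RandomForestClassifier)
    from sklearn.model_selection import cross_val_score

    X, y = make_classification(n_samples=500, n_features=20, random_state=0)
    models = [RandomForestClassifier(random_state=0),       # bagging + random feature subsets
              AdaBoostClassifier(random_state=0),           # sequential boosting
              GradientBoostingClassifier(random_state=0)]   # gradient boosting
    for model in models:
        scores = cross_val_score(model, X, y, cv=5)
        print(type(model).__name__, round(scores.mean(), 3))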

Week 6 (Duration: 7 hours)

Congratulations. You've reached the last week of course material. This week we will be exploring another advanced topic, Support Vector Machines. Don't let the name intimidate you. This week, we will work through understanding this powerful supervised learning model. Hopefully, you will build an intuitive understanding of essential concepts like the difference between hard and soft margins, the kernel trick, and hyperparameter tuning.
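
To make the soft-margin and tuning ideas concrete, here is a small illustrative sketch (assuming scikit-learn; the dataset and grid values are our own, not the course's):

    # Soft-margin RBF-kernel SVM tuned with a small grid search.
    from sklearn.datasets import make_moons
    from sklearn.model_selection import GridSearchCV
    from sklearn.svm import SVC

    X, y = make_moons(n_samples=300, noise=0.25, random_state=0)  # not linearly separable
    param_grid = {"C": [0.1, 1, 10],        # soft-margin penalty strength
                  "gamma": [0.1, 1, 10]}    # RBF kernel width
    search = GridSearchCV(SVC(kernel="rbf"), param_grid, cv=5).fit(X, y)
    print(search.best_params_, round(search.best_score_, 3))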

Week 7: Final Project (Duration: 30 hours)

For the final project, you will identify a supervised learning problem and perform EDA and model analysis on it. You will submit three deliverables, worth 140 points in total. The instructions include a summary of the criteria you will use to guide your submission and to review others’ submissions. This is a peer-reviewed project.

Notes

  • Cross-listed Courses: Courses offered under two or more programs. They are considered equivalent when evaluating progress toward degree requirements, and you may not earn credit for more than one version of a cross-listed course.
  • Page Updates: This page is periodically updated. Course information on the Coursera platform supersedes the information on this page. Click the View on Coursera button above for the most up-to-date information.