CSCA 5632: Unsupervised Algorithms in Machine Learning

Get a head start on program admission

Preview this course in the non-credit experience today!
Start working toward program admission and requirements right away. Work you complete in the non-credit experience will transfer to the for-credit experience when you upgrade and pay tuition. See How It Works for details.

Cross-listed with DTSA 5510

Important Update: Machine Learning Specialization Changes

We are excited to inform you the current Machine Learning: Theory and Hands-On Practice with Python Specialization (taught by Professor Geena Kim) is being retired and will be replaced with a new and improved version (to be taught by Professor Daniel Acuna) that reflects the latest advancements in the field. The last opportunity to sign up for the current version will now be November 28, 2025. The new version will be available Spring 1, 2026.

Course Type: Breadth (MS-CS) Pathway|Breadth (MS-AI)

Specialization: Machine Learning: Theory & Hands-On Practice with Python

Instructor: Dr. Daniel Acuna

Prior knowledge needed:

Programming languages: Basic to intermediate level experience of Python, Jupyter Notebook
Math: Basic knowledge of Probability and Statistics, Linear Algebra
Technical requirements: Windows or Mac, Linux, Jupyter Notebook

View on Coursera

Learning Outcomes

Explain the goals, challenges, and appropriate use cases of unsupervised learning.
Discover and interpret structure in data using clustering methods.
Apply dimensionality reduction techniques to analyze and visualize high-dimensional data.
Address missing data and recommender system problems using matrix completion techniques.

Course Grading Policy

Course Grading and AI Usage Policy

Check this course's reading on Assessment Expectations for what is allowed by the AI Usage Policy classification(s) listed below. If you are unsure of whether a particular use is approved, please reach out to your Course Facilitator before submitting your assignment.

Assignment	Percentage of Grade	AI Usage Policy
Quizzes (5)	40% (8% each)	Conditional
Programming Assignments (5)	40% (8% each)	Conditional
Final Exam	20%	No AI Use

Course Content

Week 1 | Unsupervised Learning Basics & Exploratory Data Analysis

Duration: 5 hours

Welcome to Introduction to Machine Learning: Unsupervised Learning. In this first module, you will explore how machine learning can uncover hidden patterns in data, without relying on labeled outcomes. You will learn how unsupervised learning differs from supervised learning, and why the absence of a “correct answer” makes interpretation both powerful and challenging. Through Principal Component Analysis (PCA), you will discover how to reduce the dimensionality of complex datasets while preserving the most important variation. You will compute principal components, interpret explained variance, and visualize high-dimensional data in two dimensions. By the end of this module, you will have a hands-on understanding of how PCA can reveal structure in seemingly chaotic data.

Week 2 | Principal Component Analysis (PCA)

Duration: 3 hours

Now that you understand the basics of Principal Component Analysis, this module focuses on how to apply it thoughtfully. You will learn how to decide how many components to retain by examining the proportion of variance explained and interpreting scree plots. You will also explore how to interpret principal component loadings to understand what each component reveals about the original features. Through hands-on practice, you will see how PCA can be used not only for visualization but also as a powerful pre-processing step before supervised learning. By the end of this module, you will be able to reduce dimensionality with purpose and insight.

Week 3 | K-Means Clustering

Duration: 3 hours

This module introduces you to the world of clustering, where the goal is to uncover natural groupings in data without any labels. You will learn how the k-means algorithm partitions observations into clusters based on similarity, and how it iteratively refines those groupings by updating centroids. Along the way, you will grapple with the challenge of choosing the right number of clusters and explore heuristic tools like the elbow method. Through hands-on work, you will evaluate clustering results and interpret what each group represents in context. Clustering is as much an art as it is a science, and this module will help you build intuition for both.

Week 4 | Hierarchical Clustering

Duration: 3 hours

In this module, you will explore hierarchical clustering—a method that builds nested groupings without requiring you to choose the number of clusters in advance. You will learn how the agglomerative approach works step by step and how to interpret dendrograms to uncover meaningful structure in your data. Unlike K-means, hierarchical clustering offers a full picture of how observations relate to one another at different levels of similarity. You will also examine how scaling and distance metrics can influence clustering results, and why evaluating clusters is often more subjective than definitive. This module encourages you to think critically about what makes a clustering solution useful, not just mathematically valid.

Week 5 | Matrix Completion, Missing Values, and Recommender Systems

Duration: 3 hours

This module introduces low-rank matrix completion as a principled approach to handling missing data and powering recommender systems. You will learn how PCA can be used as a matrix approximation tool to reconstruct missing entries, implement an iterative completion algorithm, and validate model choices via masking. A compact case study demonstrates practical trade-offs with small p, and the module concludes by mapping the same ideas to user–item recommendation with attention to preprocessing, evaluation, scale, and ethics.

Week 6 | Final Exam (For-Credit Experience Only)

Duration: 1.5 hours

Final Exam Format: In-course exam

This module contains materials for the final exam. If you've upgraded to the for-credit version of this course, please make sure you review the additional for-credit materials in the Introductory module and anywhere else they may be found.

This exam has 50 questions which cover the contents from the entire course. An 80% or higher is considered passing.

The time estimate for this exam is 90 minutes and there is a time limit of 90 minutes per attempt.

Please note that this exam allows only two attempts. You will only be able to submit once per timed attempt. Highest grade recorded.

Notes

Cross-listed Courses: Courses that are offered under two or more programs. Considered equivalent when evaluating progress toward degree requirements. You may not earn credit for more than one version of a cross-listed course.
Page Updates: This page is periodically updated. Course information on the Coursera platform supersedes the information on this page. Click the View on Coursera button above for the most up-to-date information

Learning Outcomes

Course Grading Policy

Course Grading and AI Usage Policy

Course Content

Notes

Departments

Programs

Affiliates & Partners

Search

Other ways to search:

CSCA 5632: Unsupervised Algorithms in Machine Learning

Learning Outcomes

Course Grading Policy

Course Grading and AI Usage Policy

Course Content

Notes

Departments

Programs

Affiliates & Partners