DTSA 5505 Data Mining Methods

Same as CSCA 5512

Specialization: Data Mining Foundations and Practice
Instructor: Dr. Qin (Christine) Lv, Associate Professor of Computer Science
Prior knowledge needed: Familiarity of functionalities in Python, basic idea on data structures and algorithms and concepts of probability

Learning Outcomes

Successful completion of this course demonstrate your achievement of the following learning outcomes for the MS-DS program:

Identify the core functionalities of data modeling in the data mining pipeline
Apply techniques that can be used to accomplish the core functionalities of data modeling and explain how they work.
Evaluate data modeling techniques, determine which is most suitable for a particular task, and identify potential improvements.

Course Content

Module 1 | Frequent Pattern Analysis

Duration 6h

This module starts with an overview of data mining methods, then focuses on frequent pattern analysis, including the Apriori algorithm and FP-growth algorithm for frequent itemset mining, as well as association rules and correlation analysis.

Module 2 | Classification

Duration 6h

This module introduces supervised learning, classification, prediction, and covers several core classification methods including decision tree induction, Bayesian classification, support vector machines, neural networks, and ensemble methods. It also discusses classification model evaluation and comparison.

Module 3 | Clustering

Duration 6h

This module introduces unsupervised learning, clustering, and covers several core clustering methods including partitioning, hierarchical, grid-based, density-based, and probabilistic clustering. Advanced topics for high-dimensional clustering, bi-clustering, graph clustering, and constraint-based clustering are also discussed.

Module 4 | Outlier Analysis

Duration 5h

This module discusses three different types of outliers (global, contextual, and collective) and how different methods may be used to identify and analyze such outliers. It also covers some advanced methods for mining complex data, as well as the research frontiers of the data mining field.

Module 5 | Final Exam

Duration 3h

You will complete a proctored exam worth 20% of your grade made up of multiple choice questions. You must attempt the final in order to earn a grade in the course. If you've upgraded to the for-credit version of this course, please make sure you review the additional for-credit materials in the Introductory module and anywhere else they may be found.

Note: This page is periodically updated. Course information on the Coursera platform supersedes the information on this page. Click View on Coursera button above for the most up-to-date information.

Search

Other ways to search:

DTSA 5505 Data Mining Methods

Learning Outcomes

Course Content