Instructor |
Professor Mauro Sozio
|
Teaching assistant |
Mr. Zhiyong Wu
|
Syllabus |
Data mining is the automatic discovery of statistically interesting and
potentially useful patterns from large amounts of data. The goal of the
course is to study the main methods used today for data mining and on-line
analytical processing. Topics include Data Mining Architecture; Data
Preprocessing; Mining Association Rules; Classification; Clustering; On-Line
Analytical Processing (OLAP); Data Mining Systems and Languages; Advanced
Data Mining (Web, Spatial, and Temporal data). |
Introduction by Instructor |
Advances in data collection and generation technologies are producing
massive amounts of data from which valuable information and knowledge
can be derived. In this course we study various data mining techniques,
which are powerful tools for data analysts to process data and to
extract from it interesting patterns and models. These models allow new
scientific discoveries and intelligent business decisions be made. |
Learning Outcomes |
|
Pre-requisites |
Nil |
Compatibility |
Nil |
Topics covered |
|
Assessment |
|
Course materials |
Prescribed textbook:
- Introduction to Data Mining, by Tan, Steinbach, and
Kumar, Addison Wesley, 2006.
- Mining of Massive Datasets, J. Leskovec, A.
Rajaraman, J. D. Ullman, Cambridge 2014 (Optional).
|
Session dates |
|
Add/drop |
18 January, 2021 - 4 February, 2021 |
Quota |
110 [For Engineering TPG students] |