| Instructor | 
			Dr. Mauro Sozio
			  | 
	| Teaching assistant | 
			Mr. Hui Li
		 | 
  
    | Syllabus | Data mining is the automatic discovery of statistically interesting and 
	potentially useful patterns from large amounts of data. The goal of the 
	course is to study the main methods used today for data mining and on-line 
	analytical processing. Topics include Data Mining Architecture; Data 
	Preprocessing; Mining Association Rules; Classification; Clustering; On-Line 
	Analytical Processing (OLAP); Data Mining Systems and Languages; Advanced 
	Data Mining (Web, Spatial, and Temporal data). | 
  
    | Introduction by Instructor | Advances in data collection and generation technologies are producing 
		massive amounts of data from which valuable information and knowledge 
		can be derived. In this course we study various data mining techniques, 
		which are powerful tools for data analysts to process data and to 
		extract from it interesting patterns and models. These models allow new 
		scientific discoveries and intelligent business decisions be made. | 
  
    | Learning Outcomes |  | 
  
    | Pre-requisites | Nil | 
  
    | Compatibility | Nil | 
  
    | Topics covered |  | 
  
    | Assessment |  | 
  
    | Course materials | Prescribed textbook: 
					Introduction to Data Mining, by Tan, Steinbach, and 
					Kumar, Addison Wesley, 2006.Mining of Massive Datasets, J. Leskovec, A. 
					Rajaraman, J. D. Ullman, Cambridge 2014 (Optional). | 
	| Session dates |  | 
 
    | Add/drop | 15 January, 2018 - 19 March, 2018 | 
 
    | Quota | 100 |