STAT462-12S2 (C) Semester Two 2012

Data Mining

0.1250 EFTS
09 Jul 2012 - 11 Nov 2012



STAT318 and STAT462 are courses in Data Mining, suited to anyone with an interest in analysing data. In these courses we introduce you to the statistical analysis of large datasets for both classification and association purposes.
We cover the analysis of both numeric and qualitative data and make use of the professional software package MATLAB.
We will show you how to use MATLAB to enter, manipulate and analyse data.

The courses will:
• introduce data mining.
• introduce advanced data analysis techniques, including classification and regression trees, ROC curves and the FP-growth algorithm.
• introduce the use of the statistics computer package MATLAB.

For a full list of 2012 Honours courses, please refer to the Department of Mathematics and Statistics Honours Booklet.

Learning Outcomes

• describe and conduct appropriate statistical modelling techniques for large datasets
• interpret the model results in such a way that a non-user of statistics can understand
• use MATLAB competently
• write a scientific and technical report

University Graduate Attributes

This course will provide students with an opportunity to develop the Graduate Attributes specified below:


Subject to approval of the Head of School.

Streams    Day        Time             Where                       Notes
Stream 01  Monday     10:00am-11:00am  Erskine 446                 3 Sep - 14 Oct
           Wednesday  1:00pm-2:00pm    Erskine 446                 3 Sep - 14 Oct

Streams    Day        Time             Where                       Notes
Stream 01  Thursday   1:00pm-2:00pm    Erskine 442 (Computer Lab)  9 Jul - 19 Aug, 3 Sep - 14 Oct

Course Coordinator / Lecturer

Carl Scarrott


Blair Robertson


Assessment Due Date Percentage 
Internal Assessment - TBA 60%
Final Examination 40%

Assignments give you practice in analysing data and presenting results in a written report.
The project will give you the opportunity to acquire presentation skills.

The lectures are complemented by computer labs where you will be guided in conducting appropriate analysis and modelling.

Examination and Formal Tests

Exam Friday 02 Nov 2012 2:30pm-5:30pm  


Recommended reading:
Tan, Steinbach and Kumar 2006. Introduction to Data Mining. 769pp.

This book is on restricted loan in the Library.

Indicative Fees

Domestic fee $788.00

* Fees include New Zealand GST and do not include any programme level discount or additional course related expenses.

For further information see Mathematics and Statistics.
