Data Mining Techniques

  • Course code: X_400108
  • Period: Period 5
  • Credits: 6.0
  • Language of tuition: English
  • Faculty: Faculteit der Exacte Wetenschappen
  • Coordinator: dr. M. Hoogendoorn
  • Examinator: dr. M. Hoogendoorn
  • Lecturers: dr. M. Hoogendoorn
  • Teaching method(s): Lecture
  • Level: 500

Course objective

The aim of the course is that students acquire data mining knowledge and
skills that they can apply in a business environment. How the aims are
to be achieved: Students will acquire knowledge and skills mainly
through the following: an overview of the most common data mining
algorithms and techniques (in lectures), a survey of typical and
interesting data mining applications, and practical assignments to gain
"hands on" experience. The application of skills in a business
environment will be simulated through various assignments of the course.

Course content

The course will provide a survey of basic data mining techniques and
their applications for solving real life problems. After a general
introduction to Data Mining we will discuss some "classical" algorithms
like Naive Bayes, Decision Trees, Association Rules, etc., and some
recently discovered methods such as boosting, Support Vector Machines,
and co-learning. A number of successful applications of data mining will
also be discussed: marketing, fraud detection, text and Web mining,
possibly bioinformatics. In addition to lectures, there will be an
extensive practical part, where students will experiment with various
data mining algorithms and data sets. The grade for the course will be
based on these practical assignments (i.e., there will be no final

Form of tuition

Lectures (h) and compulsory practical work (pra). Lectures are planned
to be interactive: there will be small questions, one-minute
discussions, etc.

Type of assessment

Practical assignments (i.e. there is no exam). There will be two
assignments done in groups of three. There is a possibility to get a
grade without doing these assignments: to do a real research project
instead (which will most likely to involve more work, but it can also be
more rewarding). For the regular assignments the first assignment counts
for 40% and the second for 60%. The grade of both assignments needs to
be sufficient to pass the course.

Course reading

Ian H. Witten, Eibe Frank, Mark A. Hall, Data Mining: Practical Machine
Learning Tools and Techniques (Third Edition). Morgan Kaufmann, January
ISBN 978-0-12-374856-0

Recommended background knowledge

Kansrekening and Statistiek or Algemene Statistiek (knowledge of
statistics and probabilities) or equivalent. Recommended: Machine

Target audience

mBA, mCS, mAI, mBio

© Copyright VU University Amsterdam