de | en

Data Mining and Knowledge Discovery

Module IN2030

This Module is offered by TUM Department of Informatics.

This module handbook serves to describe contents, learning outcome, methods and examination type as well as linking to current dates for courses and module examination in the respective sections.

Basic Information

IN2030 is a semester module in English language at Master’s level which is offered in winter semester.

This Module is included in the following catalogues within the study programs in physics.

  • Catalogue of non-physics elective courses
Total workloadContact hoursCredits (ECTS)
90 h 30 h 3 CP

Content, Learning Outcome and Preconditions

Content

- data sources, characteristics, and errors
- data preprocessing and filtering
- data visualization
- data projections (principal component analysis, multidimensional scaling, Sammon mapping, auto associator)
- data transformation and feature selection
- correlation and regression
- forecasting
- classification (Bayes, discriminance, support vector machine, nearest neighbor, learning vector quantization, decision trees)
- clustering (sequential, protype based, fuzzy, relational, heuristic)

Learning Outcome

On successful completion of the module, students
- understand the different types of data and relations;
- understand, apply, and evaluate data preparation, analysis, and visualization methods;
- understand, apply, and evaluate linear and nonlinear correlation, regression and forecasting methods;
- are able to compare classification and clustering, and to understand, apply, and evaluate the corresponding methods;
- are able to select, apply, and evaluate suitable data mining methods for given applications.
The main didactic goal is to introduce students to a variety of methods and provide them with the basic notions necessary to extend their knowledge by accessing the literature on their own. The work that the students must invest to achieve this goal corresponds the 3 credits assigned to the module.

Preconditions

Basic mathematics

Courses, Learning and Teaching Methods and Literature

Courses and Schedule

Learning and Teaching Methods

The module consists of a lecture. The lecture content is communicated via lectures and presentations. The students shall be motivated to study the literature, to work on exercise problems, and to become familiar with the content.

Media

Lecture notes, slides, board

Literature

- Runkler: Data Analytics, Springer
- Tan, Steinbach, Kumar: Introduction to Data Mining. Addison Wesley
- Dunham: Data Mining - Introductory and Advanced Topics. Prentice Hall.
- Theodoridis, Koutroumbas: Pattern Recognition. Academic Press

Module Exam

Description of exams and course work

The exam takes the form of a written 60 minutes test. Questions allow to assess the acquaintance with the different types of data, relations and algorithms of data mining, and to assess the ability to select, apply, and evaluate suitable data mining methods (correlation, regression, forecasting, classification, clustering).

Exam Repetition

The exam may be repeated at the end of the semester.

Top of page