Data Mining and Knowledge Discovery

Module IN2030

This Module is offered by TUM Department of Informatics.

This module handbook serves to describe contents, learning outcome, methods and examination type as well as linking to current dates for courses and module examination in the respective sections.

Basic Information

IN2030 is a semester module in English language at Master’s level which is offered in winter semester.

This Module is included in the following catalogues within the study programs in physics.

  • Catalogue of non-physics elective courses
Total workloadContact hoursCredits (ECTS)
90 h 30 h 3 CP

Content, Learning Outcome and Preconditions

Content

• data sources, characteristics, and errors • data preprocessing and filtering • data visualization • data projections (principal component analysis, multidimensional scaling, Sammon mapping, auto associator) • data transformation and feature selection • correlation and regression • forecasting • classification (Bayes, discriminance, support vector machine, nearest neighbor, learning vector quantization, decision trees) • clustering (sequential, protype based, fuzzy, relational, heuristic)

Learning Outcome

: On successful completion of the module, students • understand the different types of data and relations; • understand, apply, and evaluate data preparation, analysis, and visualization methods; • understand, apply, and evaluate linear and nonlinear correlation, regression and forecasting methods; • are able to compare classification and clustering, and to understand, apply, and evaluate the corresponding methods; • are able to select, apply, and evaluate suitable data mining methods for given applications. The main didactic goal is to introduce students to a variety of methods and provide them with the basic notions necessary to extend their knowledge by accessing the literature on their own. The work that the students must invest to achieve this goal corresponds the 3 credits assigned to the module.

Preconditions

Basic mathematics

Courses, Learning and Teaching Methods and Literature

Courses and Schedule

ArtSWSTitelDozent(en)Termine
VO 2 Data Mining and Knowledge Discovery (IN2030) Montag, 08:30–10:00

Learning and Teaching Methods

The module consists of a lecture. The lecture content is communicated via lectures and presentations. The students shall be motivated to study the literature, to work on exercise problems, and to become familiar with the content.

Media

Lecture notes, slides, board

Literature

• Runkler: Data Analytics, Springer • Tan, Steinbach, Kumar: Introduction to Data Mining. Addison Wesley • Dunham: Data Mining - Introductory and Advanced Topics. Prentice Hall. • Theodoridis, Koutroumbas: Pattern Recognition. Academic Press

Module Exam

Description of exams and course work

The exam takes the form of a written test. Questions allow to assess the acquaintance with the different types of data, relations and algorithms of data mining, and to assess the ability to select, apply, and evaluate suitable data mining methods (correlation, regression, forecasting, classification, clustering).

Exam Repetition

There is a possibility to take the exam at the end of the semester.

Condensed Matter

When atoms interact things can get interesting. Fundamental research on the underlying properties of materials and nanostructures and exploration of the potential they provide for applications.

Nuclei, Particles, Astrophysics

A journey of discovery to understanding our world at the subatomic scale, from the nuclei inside atoms down to the most elementary building blocks of matter. Are you ready for the adventure?

Biophysics

Biological systems, from proteins to living cells and organisms, obey physical principles. Our research groups in biophysics shape one of Germany's largest scientific clusters in this area.