This website is no longer updated.

As of 1 October 2022, the Faculty of Physics has been merged into the TUM School of Natural Sciences, whose website is https://www.nat.tum.de/. For more information, read "Conversion of Websites".

Applied Machine Learning - Deep Learning for Multimedia

Course 0000004263 in SS 2019

General Data

Course Type lecture with integrated exercises
Semester Weekly Hours 3 SWS
Organisational Unit Chair of Data Processing (Prof. Diepold)
Lecturers Christian Keimel

Further Information

Courses, together with exams, are the building blocks of modules. Please keep in mind that information on the contents, learning outcomes and, especially, examination conditions is given at the module level only; see section "Assignment to Modules" above.

additional remarks 1. Deep Learning for Multimedia: Content generated for human consumption in the form of video, text, or audio is unstructured from a machine perspective, since the information it contains is not readily available for processing. Information extraction from unstructured data therefore describes how to extract the salient information from generic content in order to generate a descriptive, structured representation. The metadata created this way can then be processed further automatically, in particular to build models that explain or predict samples, e.g. in recommendation systems.

The aim of this lecture is therefore to introduce the methods, algorithms, and underlying machine learning concepts for extracting information from unstructured audio, visual, and textual content using state-of-the-art algorithms, especially deep-learning-based algorithms and architectures, e.g. CNN, autoencoder, LSTM. In addition, existing frameworks and libraries (e.g. Keras, scikit-learn), and how to use them with the audio, visual, and textual content encountered in (multi-)media applications and services, will be discussed.

The following topics will be covered:
- Why information extraction?
- Introduction to deep learning
- Image/video content
  - Object recognition
  - Face recognition
  - Character recognition (OCR)
  - Quality of Experience (QoE)
- Audio/textual content
  - Automatic speech recognition (ASR)
  - Natural language processing (NLP)
- Python ecosystem of frameworks/libraries for information extraction

Selected topics will be examined in more depth during the lecture and the team-oriented semester project.
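To make the idea of information extraction concrete, here is a minimal sketch of turning unstructured text into a small structured record. It is not taken from the course material: it deliberately uses only the Python standard library and a simple word-frequency heuristic rather than deep learning, and the function names, the stop-word list, and the sample text are all hypothetical choices made for illustration.

```python
import re
from collections import Counter

# Hypothetical stop-word list, chosen only for this illustration.
STOP_WORDS = {"a", "an", "and", "in", "is", "of", "on", "the", "to"}


def tokenize(text):
    """Lower-case the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def extract_keywords(text, top_k=3):
    """Return the top_k most frequent tokens that are not stop words."""
    counts = Counter(t for t in tokenize(text) if t not in STOP_WORDS)
    return [word for word, _ in counts.most_common(top_k)]


def to_metadata(doc_id, text):
    """Build a small structured record (metadata) from unstructured text."""
    return {
        "id": doc_id,
        "n_tokens": len(tokenize(text)),
        "keywords": extract_keywords(text),
    }


# Assumed sample input: a short product review as unstructured content.
review = (
    "The camera is great. The camera battery is weak, "
    "and the battery drains fast."
)
record = to_metadata("doc-1", review)
print(record)
```

A record of this kind could then feed a downstream model, for instance a recommender. In practice, the learned representations named in the description (e.g. from a CNN or LSTM) would presumably take the place of the frequency heuristic used here.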
Links E-Learning course (e. g. Moodle)
TUMonline entry