Lehrveranstaltungen sind neben Prüfungen Bausteine von Modulen. Beachten Sie daher, dass Sie Informationen zu den Lehrinhalten und insbesondere zu Prüfungs- und Studienleistungen in der Regel nur auf Modulebene erhalten können (siehe Abschnitt "Zuordnung zu Modulen" oben).
ergänzende Hinweise |
This lab course covers various advanced Machine Learning techniques, mainly in the area of classification, for processing speech from audio signals. It is a programming course in which ML concepts are applied to audio corpora in Python. Proficiency in any object oriented language is sufficient as Python is amazingly easy to use.
Covered areas (a.o.):
- Voice activity detection
- Speaker detection
- Emotion detection
- Channel compensation
Covered methods (a.o.):
- Features extraction: Mel Frequency Cepstral Coefficients (MFCC)
- Generic classifiers: Random Forests, Support Vector Machines, Gaussian Mixture Models
- Classifiers for audio: GMM-UBMs and MAP adaption, supervector GMMs
- Factor Analysis for channel compensation: Joint Factor Analysis, I-Vectors
- Performance metrics |
Links |
TUMonline-Eintrag
|