Dear subscribers of the colloquium newsletter,

we are happy to inform you about the next date of our Communication Technology Colloquium.

Wednesday, April 10, 2024
Speaker: Henning Konermann
Time: 2:30 p.m.
Location: hybrid - Lecture room 4G and

https://rwth.zoom.us/j/61215027648?pwd=MTJvayt5bkdka04raWZVempPZGE0Zz09

                    Meeting-ID: 612 1502 7648
                    Passwort: 380386

Master-Lecture:

Investigations on Phase-Aware Speech Enhancement Using Deep Neural Networks

Speech enhancement aims to improve speech quality and intelligibility by removing noise from noisy speech signals. Currently Machine Learning (ML) based speech enhancement has become mainstream and is used in hundreds of millions of devices. This is crucial in various applications, from telecommunications to hearing aids. Historically, the phase component was considered unimportant for this task when using the analysis-modification-synthesis approach. However, with the rise of ML and, in particular, Deep Neural Networks (DNNs), these technologies have become increasingly important in recent times. This thesis presents an in-depth study of phase-aware speech enhancement using DNNs, initially focusing on the theoretical benefits of integrating phase information into the speech enhancement process through oracle experiments. A significant emphasis of this work is on the recently proposed Consistent-Inconsistent Phase (CIP) approach, discussing its advantages and disadvantages to phase estimation. Traditional magnitude estimation, with and without additional phase information, serves as the baseline for comparison. It has been demonstrated that CIP offers theoretical advantages over pure phase estimation and could, in theory, perform equally well as magnitude estimation without additional phase information while adopting the noisy phase for synthesis. However, the practical implementation does not fully realize its theoretical potential when validating the theoretical results by replicating the experiments with state-of-the-art DNNs. Solely in the context of background noise removal, a combination of magnitude and CIP estimation proves clear superiority to other techniques evaluated in this study. The estimation of the CIP emerges as a viable alternative to direct estimation of the clean phase, especially in noise-dominated signals.

All interested parties are cordially invited, registration is not required.

General information on the colloquium, as well as a current list of dates of the Communication Technology Colloquium can be found at:
https://www.iks.rwth-aachen.de/aktuelles/kolloquium


Simone Sedgwick
Secretariat
Institute of Communication Systems(IKS)
Prof. Dr.-Ing. Peter Jax
RWTH Aachen University
Muffeter Weg 3a, 52074 Aachen, Germany
+49 241 80 26956(phone)
+49 241 80 22254(fax) 
sedgwick@iks.rwth-aachen.de
https://www.iks.rwth-aachen.de/