Dear subscribers of the colloquium newsletter,
we are happy to inform you about the next date of our Communication Technology Colloquium.
Monday, November 25, 2024
Speaker: Eylül Ercandogu
Time: 2:00 p.m.
Location: hybrid - Lecture room 4G and
https://rwth.zoom.us/j/61215027648?pwd=MTJvayt5bkdka04raWZVempPZGE0Zz09
Meeting-ID: 612 1502 7648
Passwort: 380386
Master-Lecture:
Low-Complexity Noise Reduction for Hearables
In everyday life, noise interference significantly affects speech quality and intelligibility in various applications, such as hearing aids, earbuds, and mobile communication devices. These devices, known for their compactness and limited processing power, face challenges in implementing complex signal-processing algorithms. This work focuses on designing and evaluating efficient, real-time speech enhancement models tailored for low-complexity hearables. We adopted the Grouped Temporal Convolutional Recurrent Network (GTCRN), which reduces computational demands through efficient design elements. The real-time adaptation, RT-GTCRN, achieves competitive performance with minor quality trade-offs, making it well-suited for deployment on low-power devices. Additionally, we explored the axial self-attention mechanism to capture long-range dependencies efficiently. To improve speech quality, we incorporated advanced loss functions, including perceptual losses, which led to significant improvements in perceptual speech quality metrics. To enhance model generalization, we investigated data augmentation techniques and evaluated their impact across inter- and intra-corpus assessments. These data augmentation methods demonstrated promising improvements in handling diverse acoustic scenarios. Furthermore, knowledge distillation techniques were explored to enhance model efficiency by maintaining a small model while improving its performance. While the method demonstrated promise, further optimization could be beneficial to bridge the performance gap between the teacher and student models. The RT-GTCRN model was successfully implemented on the GAP9 processor, yielding promising results despite the observation of minor artifacts during implementation. These findings highlight potential areas for future refinement.
All
interested parties are cordially invited, registration is not
required.
Best regards
Anett Schindler Secretariat Institute of Communication Systems(IKS) Prof. Dr.-Ing. Peter Jax RWTH Aachen University Muffeter Weg 3a, 52074 Aachen, Germany +49 241 80 26958(phone) +49 241 80 22254(fax) schindler@iks.rwth-aachen.de https://www.iks.rwth-aachen.de/