Communication Technology Collquium at IKS
Dear subscribers of the Colloquium Newsletter, we are happy to inform you about the next date of our Communication Technology Colloquium. *Wednesday, 30. November 2022** **Speaker:* Frederick Pietschmann *Time*: 14:00 *Location:* Lecture room 4G and https://rwth.zoom.us/j/97904157921?pwd=SWpsbDl0MWhrWjY1ZkZaeFRoYmErZz09 Meeting-ID: 979 0415 7921 Passwort: 481650 *Master-Lecture*: Perceptual Optimization and Evaluation of a Binaural Signal Modification Algorithm With individualized binaural signals, it is possible to reproduce auditory scenes such that the signal is perceived similar to the real scene. However, perceptual similarity is no longer achieved when the binaural signal doesn’t fully adapt to different listeners and different orientations of the listener’s head. To address these problems, a perceptually motivated algorithm referred to as the Binaural Cue Adaptation (BCA) system has been developed at the Institute of Communication Systems. The BCA system is capable of adding both interactivity and individualization to existing binaural signals, thereby achieving a higher degree of perceptual similarity to a corresponding real auditory scene. In this thesis, a perceptual optimization of the existing BCA system is conducted in that new approaches for some components of the algorithm are proposed, all parametrization options are identified and the overall best parametrization is chosen. To identify the best parametrization, both an isolated analysis of individual components is conducted and a perceptually motivated optimization procedure for a full system analysis is proposed and implemented. Finally, a perceptual evaluation based on the result of the perceptual optimization is realized. For this, two listening tests with a total number of 17 participants are conducted – one for a normal and one for a highly reverberant scenario. The results of these listening tests suggest that signals produced by the optimized BCA system achieve a high degree of perceptual plausibility for both reverberation scenarios, with an averaged 2AFC probability to detect a BCA-generated signal of 0.563 for the normal scenario and 0.604 for the highly reverberant scenario. All interested parties are cordially invited, registration is not required. General information on the colloquium, as well as a current list of dates of the Communication Technology Colloquium can be fount at: https://www.iks.rwth-aachen.de/aktuelles/kolloquium -- Irina Esser Institute of Communication Systems (IKS) RWTH Aachen University Muffeter Weg 3a, 52074 Aachen, Germany +49 241 80 26958 (phone) esser@iks.rwth-aachen.de http://www.iks.rwth-aachen.de/
Dear subscribers of the Colloquium Newsletter, we are happy to inform you about the next date of our Communication Technology Colloquium. *Thursday, 29. Juni 2023** **Speaker:* Maximilian Kentgens *Time*: 11:30 *Location:* Lecture room 4G *Doctoral-Lecture*:*Signal Processing Concepts for User Movement in Scene-Based Spatial Audio* This dissertation addresses prospective immersive communication, telepresence, and multimedia systems, in which a user moves around virtually in a remote or recorded acoustic scene. More specifically, the problem of sound field translation of a single-perspective higher-order Ambisonics (HOA) acoustic scene representation is considered. Recent standardization activities and ongoing research have considered two competing approaches for immersive spatial audio formats: object-based audio on the one hand and scene-based audio on the other hand, each of which has different advantages and disadvantages. Object-based audio is the preferred choice for artist- or computer-generated acoustic scenes. In contrast, scene-based audio is better suited to capture an actual acoustic scene in its entirety, which makes it well suited in the above-mentioned applications. Scene-based audio is commonly realized using HOA, i.e., the spherical-harmonics-based (SH) representation of the sound field. This approach allows the direct acquisition of real-world content using spherical microphone arrays. In contrast to object-based audio that inherently supports user interaction in six degrees of freedom (6DoF), HOA recordings allow for user rotation only (3DoF). HOA lacks a built-in possibility to adapt the listener’s position in space. This dissertation aims to alleviate this shortcoming of scene-based audio by novel HOA signal processing methods exploiting signal characteristics and properties of the human auditory system. State-of-the-art methods for HOA sound field translation in literature are either limited to very small displacements or laboratory conditions with almost no reverberation. Others require additional microphones or are limited to first-order Ambisonics. In contrast, the present dissertation considers the application of user movement in immersive audio with HOA signal content recorded at a single position in space under adverse conditions with reverberation. Methodologically, the problem is approached via a theoretical elaboration of signal processing concepts based on acoustic considerations and psychoacoustic evidence. The main focus is on the mathematical and simulative analysis and the comparison of the proposed approaches. The work is rounded off by a perceptual study showing the potential of the methods under realistic boundary conditions. All interested parties are cordially invited, registration is not required. General information on the colloquium, as well as a current list of dates of the Communication Technology Colloquium can be fount at: https://www.iks.rwth-aachen.de/aktuelles/kolloquium Simone Sedgwick Secretariat Institute of Communication Systems(IKS) Prof. Dr.-Ing. Peter Jax RWTH Aachen University Muffeter Weg 3a, 52074 Aachen, Germany +49 241 80 26956(phone) +49 241 80 22254(fax) sedgwick@iks.rwth-aachen.de https://www.iks.rwth-aachen.de/
Dear subscribers of the Colloquium Newsletter, we are happy to inform you about the next date of our Communication Technology Colloquium. *Wednesday, 18. October 2023** **Speaker:* Johannes Fabry *Time*: 11:00 a.m. *Location:* Lecture room 4G *Doctoral-Lecture*:***Least-Squares Methods for Individualization of Hearables with Active Noise Cancellation*** *Motivation, Goal and Task of the Dissertation* According to the World Health Organization (WHO), noise exposure is a major contributor to public health problems. Long-term exposure to occupational or recreational noise leads to noise-induced hearing loss or tinnitus, whereas environmental noise causes stress, sleep disturbances, chronic high annoyance, and cardiovascular diseases.Active noise cancellation (ANC) in headphones effectively supplements passive hearing protection to drastically reduce the perceived loudness of ambient noise, especially at lower frequencies. An ANC headphone plays back a cancellation signal via its loudspeaker, which destructively interferes with passively attenuated ambient noise. The cancellation signal is based on information from reference sensors integrated into the headphone.Due to strict processing power and battery constraints, the state of the art of ANC in headphones consists of either time-invariant or simple adaptive filters. The time-invariant filter approach implements a one-size-fits-all ANC solution. Consequently, the active attenuation of ambient noise varies strongly between users depending on the individual fit of the headphone. On the other hand, simple adaptive filters, such as the least mean squares algorithm, exhibit a slow convergence and tracking speed. Thus, the system cannot adjust to rapid changes in the ambient noise.The first goal of this dissertation is to develop methods that yield a more reliable ANC performance with respect to the fit of a headphone for individual users. The methods should allow a calibration of ANC headphones in the field as well as a configuration of the ANC transfer characteristic. The second goal is to develop efficient adaptive algorithms that yield a close-to-optimal ANC performance, are suitable for relevant audio codecs, and require almost no prior knowledge about the acoustic system. *Major Scientific Contributions* The first major contribution is a method to calibrate an ANC headphone in the field. It considers the individual fit of a user by estimating the acoustic primary and secondary path, which characterize the acoustic properties of a headphone and fit and from which an individualized feed-forward filter can be designed. A novel secondary path estimator explicitly considers the interference of a measurement signal with ambient sound. Compared to other methods it considerably increases the accuracy of the secondary path estimate. Since a direct measurement of the primary path in the field is not suitable due to lacking control over the ambient sound field, we developed an estimator for the primary path based on the coherency between the primary and secondary path. Evaluations on a vast dataset of acoustic paths for different users, which was created in the course of this work, show significant improvements of the active attenuation for an individualized filter compared to a one-size-fits-all feed-forward filter.The second major contribution introduces active acoustic equalization (AAE) as a framework for designing feed-forward filters that result in an arbitrary target transfer function for a given ANC headphone. Thereby, users can adjust how they hear ambient sound to their personal liking. We analyze the influence of the secondary path delay as well as the desired attenuation on the actual transfer function and propose effective measures to improve the accuracy of the filter design with respect to the desired transfer function. Since only the magnitude spectrum of the transfer function is relevant for the perception of AAE, we furthermore propose a frequency domain filter optimization that considers the energy spectrum of a desired transfer function. A novel cost function explicitly considers the nonlinear perception of magnitude and frequency of the human auditory system. By the example of a hear-through, we show how the proposed optimization results in a better median performance compared to the state of the art.The third major contribution is the development of a system-theoretic model and of adaptive algorithms for feed-forward ANC. The model explicitly considers self-induced disturbances that couple into the reference sensors, such as footfall or chewing sounds. We then introduce the Kalman filter as the optimal, unbiased estimator for the model, highlight the importance of accurate process parameter estimates, and subsequently introduce respective estimators for the measurement noise, process noise, and fading factor. Furthermore, we derive the novel composed Kalman filter (CKF) as an efficient implementation of the time domain Kalman filter. The CKF assumes that the Kalman filter’s state error covariance matrix is subject to a band-matrix-like structure. We validate this assumption by deriving a stationary solution for this matrix. For reasonable settings, the CKF's computational complexity is almost an order of magnitude lower whilst its performance is comparable to that of the original Kalman filter. We also propose a numerically stable implementation of the CKF based on a UD factorization and experimentally validate its robustness using a quasi fixed point arithmetic. Lastly, we propose an online secondary path estimator, which, compared to the state of the art, improves the frequency-dependent signal-to-noise ratio of the measurement signal whilst rendering it inaudible to users of an ANC headphone. The process parameter estimators, the CKF, as well as the online secondary path estimator are evaluated and compared under laboratory as well as realistic conditions, whereby various measured acoustic paths and disturbances, ambient sounds, and varying directions of arrival are considered. All interested parties are cordially invited, registration is not required. General information on the colloquium, as well as a current list of dates of the Communication Technology Colloquium can be fount at: https://www.iks.rwth-aachen.de/aktuelles/kolloquium Simone Sedgwick Secretariat Institute of Communication Systems(IKS) Prof. Dr.-Ing. Peter Jax RWTH Aachen University Muffeter Weg 3a, 52074 Aachen, Germany +49 241 80 26956(phone) +49 241 80 22254(fax) sedgwick@iks.rwth-aachen.de https://www.iks.rwth-aachen.de/
participants (2)
-
Irina Esser
-
Simone Sedgwick