Open this publication in new window or tab >>2024 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]
Listeners with normal-hearing often overlook their ability to comprehend speech in noisy environments effortlessly. Our brain’s adeptness at identifying and amplifying attended voices while suppressing unwanted background noise, known as the cocktail party problem, has been extensively researched for decades. Yet, many aspects of this complex puzzle remain unsolved and listeners with hearing-impairment still struggle to focus on a specific speaker in noisy environments. While recent intelligent hearing aids have improved noise suppression, the problem of deciding which speaker to enhance remains unsolved, leading to discomfort for many hearing aid users in noisy environments.
In this thesis, we explore the complexities of the human brain in challenging auditory environments. Two datasets are investigated where participants were tasked to selectively attend to one of two competing voices, replicating a cocktail-party scenario. The auditory stimuli trigger neurons to generate electrical signals that propagate in all directions. When a substantial number of neurons fire simultaneously, their collective electrical signal becomes detectable by small electrodes placed on the head. This method of measuring brain activity, known as electroencephalography (EEG), holds potential to provide feedback to the hearing aids, enabling adjustments to enhance attended voice(s).
EEG data is often noisy, incorporating neural responses with artifacts such as muscle movements, eye blinks and heartbeats. In the first contribution of this thesis, we focus on comparing different manual and automatic artifact-rejection techniques and assessing their impact on auditory attention decoding (AAD).
While EEG measurements offer high temporal accuracy, spatial resolution is inferior compared to alternative tools like magnetoencephalography (MEG). This difference poses a considerable challenge for source localization with EEG data. In the second contribution of this thesis, we demonstrate anticipated activity in the auditory cortex using EEG data from a single listener, employing Neuro-Current Response Functions (NCRFs). This method, previously evaluated only with MEG data, holds significant promise in hearing aid development.
EEG data may involve both linear and nonlinear components due to the propagation of the electrical signals through brain tissue, skull, and scalp with varying conductivities. In the third contribution, we aim to enhance source localization by introducing a binning-based nonlinear detection and compensation method. The results suggest that compensating for some nonlinear components produces more precise and synchronized source localization compared to original EEG data.
In the fourth contribution, we present a novel domain adaptation framework that improves AAD performances for listeners with initially low classification accuracy. This framework focuses on classifying the direction (left or right) of attended speech and shows a significant accuracy improvement when transporting poor data from one listener to the domain of good data from different listeners.
Taken together, the contributions of this thesis hold promise for improving the lives of hearing-impaired individuals by closing the loop between the brain and hearing aids.
Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2024. p. 42
Series
Linköping Studies in Science and Technology. Licentiate Thesis, ISSN 0280-7971 ; 1993
National Category
Signal Processing
Identifiers
urn:nbn:se:liu:diva-202438 (URN)10.3384/9789180756327 (DOI)9789180756310 (ISBN)9789180756327 (ISBN)
Presentation
2024-05-16, Ada Lovelace, B Building, Campus Valla, Linköping, 13:15 (English)
Opponent
Supervisors
2024-04-152024-04-152024-04-15Bibliographically approved