N-HANS: A neural network-based toolkit for in-the-wild audio enhancement
Open Access
- 3 June 2021
- journal article
- research article
- Published by Springer Science and Business Media LLC in Multimedia Tools and Applications
- Vol. 80 (18), 28365-28389
- https://doi.org/10.1007/s11042-021-11080-y
Abstract
The unprecedented growth of noise pollution over the last decades has raised an always increasing need for developing efficient audio enhancement technologies. Yet, the variety of difficulties related to processing audio sources in-the-wild, such as handling unseen noises or suppressing specific interferences, makes audio enhancement a still open challenge. In this regard, we present (the Neuro-Holistic Audio-eNhancement System), a Python toolkit for in-the-wild audio enhancement that includes functionalities for audio denoising, source separation, and —for the first time in such a toolkit—selective noise suppression. The architecture is specially developed to automatically adapt to different environmental backgrounds and speakers. This is achieved by the use of two identical neural networks comprised of stacks of residual blocks, each conditioned on additional speech- and noise-based recordings through auxiliary sub-networks. Along to a Python API, a command line interface is provided to researchers and developers, both of them carefully documented. Experimental results indicate that achieves great performance w. r. t. existing methods, preserving also the audio quality at a high level; thus, ensuring a reliable usage in real-life application, e. g., for in-the-wild speech processing, which encourages the development of speech-based intelligent technology.Keywords
Funding Information
- Universität Augsburg
This publication has 64 references indexed in Scilit:
- Librispeech: An ASR corpus based on public domain audio booksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- A Regression Approach to Speech Enhancement Based on Deep Neural NetworksIEEE/ACM Transactions on Audio, Speech, and Language Processing, 2014
- Supervised deep learning with auxiliary networksPublished by Association for Computing Machinery (ACM) ,2014
- Understanding noise stress-induced cognitive impairment in healthy adults and its implications for schizophreniaNoise and Health, 2014
- The diverse environments multi-channel acoustic noise database: A database of multichannel environmental noise recordingsThe Journal of the Acoustical Society of America, 2013
- Noise Pollution: A Modern PlagueSouthern Medical Journal, 2007
- Selective signal cancellation for multiple-listener audio applications using eigenfiltersIEEE Transactions on Multimedia, 2003
- A survey of urban noise annoyance in a large Brazilian city: the importance of a subjective analysis in conjunction with an objective analysisEnvironmental Impact Assessment Review, 2003
- Strategy-selective noise reduction for binaural digital hearing aidsSpeech Communication, 2003
- Annoyance from transportation noise: relationships with exposure metrics DNL and DENL and their confidence intervals.Environmental Health Perspectives, 2001