Audiogmenter: a MATLAB toolbox for audio data augmentation
Open Access
- 22 September 2021
- journal article
- research article
- Published by Emerald in Applied Computing and Informatics
- Vol. ahead-of-p (ahead-of-p)
- https://doi.org/10.1108/aci-03-2021-0064
Abstract
Purpose: Create and share a MATLAB library that performs data augmentation algorithms for audio data. This study aims to help machine learning researchers to improve their models using the algorithms proposed by the authors. Design/methodology/approach: The authors structured our library into methods to augment raw audio data and spectrograms. In the paper, the authors describe the structure of the library and give a brief explanation of how every function works. The authors then perform experiments to show that the library is effective. Findings: The authors prove that the library is efficient using a competitive dataset. The authors try multiple data augmentation approaches proposed by them and show that they improve the performance. Originality/value: A MATLAB library specifically designed for data augmentation was not available before. The authors are the first to provide an efficient and parallel implementation of a large number of algorithms.Keywords
This publication has 8 references indexed in Scilit:
- Deep convolutional network for animal sound classification and source attribution using dual audio recordingsThe Journal of the Acoustical Society of America, 2019
- Domestic Cat Sound Classification Using Transfer LearningInternational Journal of Fuzzy Logic and Intelligent Systems, 2018
- Automated bird acoustic event detection and robust species classificationEcological Informatics, 2017
- Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound ClassificationIEEE Signal Processing Letters, 2017
- Classifying environmental sounds using image recognition networksProcedia Computer Science, 2017
- The Large Time-Frequency Analysis Toolbox 2.0Lecture Notes in Computer Science, 2014
- Gradient-based learning applied to document recognitionProceedings of the IEEE, 1998
- Principal warps: thin-plate splines and the decomposition of deformationsIEEE Transactions on Pattern Analysis and Machine Intelligence, 1989