A General Flexible Framework for the Handling of Prior Information in Audio Source Separation
Top Cited Papers
- 17 October 2011
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Audio, Speech, and Language Processing
- Vol. 20 (4), 1118-1133
- https://doi.org/10.1109/tasl.2011.2172425
Abstract
Most audio source separation methods are developed for a particular scenario characterized by the number of sources and channels and the characteristics of the sources and the mixing process. In this paper, we introduce a general audio source separation framework based on a library of structured source models that enable the incorporation of prior knowledge about each source via user-specifiable constraints. While this framework generalizes several existing audio source separation methods, it also allows to imagine and implement new efficient methods that were not yet reported in the literature. We first introduce the framework by describing the model structure and constraints, explaining its generality, and summarizing its algorithmic implementation using a generalized expectation-maximization algorithm. Finally, we illustrate the above-mentioned capabilities of the framework by applying it in several new and existing configurations to different source separation problems. We have released a software tool named Flexible Audio Source Separation Toolbox (FASST) implementing a baseline version of the framework in Matlab.Keywords
This publication has 39 references indexed in Scilit:
- Blind Separation and Dereverberation of Speech Mixtures by Joint OptimizationIEEE Transactions on Audio, Speech, and Language Processing, 2010
- The 2010 Signal Separation Evaluation Campaign (SiSEC2010): Audio Source SeparationLecture Notes in Computer Science, 2010
- Bayesian Inference for Nonnegative Matrix Factorisation ModelsComputational Intelligence and Neuroscience, 2009
- Efficient model-based speech separation and denoising using non-negative subspace analysisInternational Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2008
- Grouping Separated Frequency Components by Estimating Propagation Model Parameters in Frequency-Domain Blind Source SeparationIEEE Transactions on Audio, Speech, and Language Processing, 2007
- Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness CriteriaIEEE Transactions on Audio, Speech, and Language Processing, 2007
- Independent Vector Analysis for Convolutive Blind Speech SeparationPublished by Springer Science and Business Media LLC ,2007
- Analysis of Musical Instrument Sounds by Source-Filter-Decay ModelPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Blind Separation of Speech Mixtures via Time-Frequency MaskingIEEE Transactions on Signal Processing, 2004
- Maximum likelihood for blind separation and deconvolution of noisy signals using mixture modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002