A General Flexible Framework for the Handling of Prior Information in Audio Source Separation

17 October 2011

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Audio, Speech, and Language Processing

Vol. 20 (4), 1118-1133
https://doi.org/10.1109/tasl.2011.2172425

Abstract

Most audio source separation methods are developed for a particular scenario characterized by the number of sources and channels and the characteristics of the sources and the mixing process. In this paper, we introduce a general audio source separation framework based on a library of structured source models that enable the incorporation of prior knowledge about each source via user-specifiable constraints. While this framework generalizes several existing audio source separation methods, it also makes it possible to devise and implement new efficient methods that have not yet been reported in the literature. We first introduce the framework by describing the model structure and constraints, explaining its generality, and summarizing its algorithmic implementation using a generalized expectation-maximization algorithm. Finally, we illustrate these capabilities by applying the framework in several new and existing configurations to different source separation problems. We have released a software tool named Flexible Audio Source Separation Toolbox (FASST) implementing a baseline version of the framework in Matlab.

This publication has 39 references indexed in Scilit.

Cited by 175 articles