Source separation with scattering Non-Negative Matrix Factorization
- 1 April 2015
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 1876-1880
- https://doi.org/10.1109/icassp.2015.7178296
Abstract
This paper presents a single-channel source separation method that extends the ideas of Nonnegative Matrix Factorization (NMF). We interpret the approach of audio demixing via NMF as a cascade of a pooled analysis operator, given for example by the magnitude spectrogram, and a synthesis operators given by the matrix decomposition. Instead of imposing the temporal consistency of the decomposition through sophisticated structured penalties in the synthesis stage, we propose to change the analysis operator for a deep scattering representation, where signals are represented at several time resolutions. This new signal representation is invariant to smooth changes in the signal, consistent with its temporal dynamics. We evaluate the proposed approach in a speech separation task obtaining promising results.Keywords
This publication has 20 references indexed in Scilit:
- Discriminatively trained recurrent neural networks for single-channel speech separationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2014
- Supervised non-euclidean sparse NMF via bilevel optimization with applications to speech enhancementPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2014
- Static and Dynamic Source Separation Using Nonnegative Factorizations: A unified viewIEEE Signal Processing Magazine, 2014
- Invariant Scattering Convolution NetworksIEEE Transactions on Pattern Analysis and Machine Intelligence, 2013
- Audio Imputation Using the Non-negative Hidden Markov ModelLecture Notes in Computer Science, 2012
- Algorithms for Nonnegative Matrix Factorization with the β-DivergenceNeural Computation, 2011
- Online dictionary learning for sparse codingPublished by Association for Computing Machinery (ACM) ,2009
- Wind Noise Reduction using Non-Negative Sparse Coding2007 IEEE Workshop on Machine Learning for Signal Processing, 2007
- An audio-visual corpus for speech perception and automatic speech recognitionThe Journal of the Acoustical Society of America, 2006
- Learning the parts of objects by non-negative matrix factorizationNature, 1999