Source separation with scattering Non-Negative Matrix Factorization

1 April 2015

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 1876-1880
https://doi.org/10.1109/icassp.2015.7178296

Abstract

This paper presents a single-channel source separation method that extends the ideas of Nonnegative Matrix Factorization (NMF). We interpret the approach of audio demixing via NMF as a cascade of a pooled analysis operator, given for example by the magnitude spectrogram, and a synthesis operator given by the matrix decomposition. Instead of imposing the temporal consistency of the decomposition through sophisticated structured penalties in the synthesis stage, we propose to replace the analysis operator with a deep scattering representation, where signals are represented at several time resolutions. This new signal representation is invariant to smooth changes in the signal, consistent with its temporal dynamics. We evaluate the proposed approach on a speech separation task, obtaining promising results.
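
The analysis/synthesis cascade described in the abstract can be illustrated with a minimal sketch of the conventional NMF baseline. This is not the scattering-based method the paper proposes: the analysis operator here is a plain magnitude spectrogram. The sketch assumes Euclidean multiplicative updates (Lee and Seung), one dictionary learned per source, and a Wiener-style soft mask. All function names and parameters (`magnitude_spectrogram`, `nmf`, `separate`, `n_fft`, `hop`, `rank`) are hypothetical and chosen only for illustration.

```python
import numpy as np

def magnitude_spectrogram(x, n_fft=256, hop=128):
    """Pooled analysis operator: windowed FFT followed by a modulus."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)], axis=1
    )
    return np.abs(np.fft.rfft(frames, axis=0))  # shape (n_fft // 2 + 1, n_frames)

def nmf(V, rank, n_iter=200, W=None, seed=0, eps=1e-9):
    """Synthesis operator: V ~= W @ H with W, H >= 0 (Euclidean cost).

    If a dictionary W is supplied it is kept fixed and only H is estimated;
    `rank` is ignored in that case.
    """
    rng = np.random.default_rng(seed)
    update_W = W is None
    if update_W:
        W = rng.random((V.shape[0], rank)) + eps
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        # Multiplicative updates preserve non-negativity of W and H.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        if update_W:
            W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def separate(V_mix, W1, W2, n_iter=200, eps=1e-9):
    """Decompose a mixture on the concatenated dictionaries, then soft-mask."""
    W = np.hstack([W1, W2])
    _, H = nmf(V_mix, W.shape[1], n_iter=n_iter, W=W)
    k = W1.shape[1]
    V1 = W1 @ H[:k]  # part of the mixture explained by source 1
    V2 = W2 @ H[k:]  # part of the mixture explained by source 2
    mask = V1 / (V1 + V2 + eps)  # Wiener-style soft mask in [0, 1]
    return mask * V_mix, (1.0 - mask) * V_mix
```

In this baseline each spectrogram frame is decomposed independently, so nothing ties neighbouring frames together. That is the gap the abstract addresses by moving temporal structure into the analysis stage rather than into penalties on the decomposition.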
