Deep Scattering Spectrum
- 29 May 2014
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Signal Processing
- Vol. 62 (16), 4114-4128
- https://doi.org/10.1109/tsp.2014.2326991
Abstract
A scattering transform defines a locally translation invariant representation which is stable to time-warping deformation. It extends MFCC representations by computing modulation spectrum coefficients of multiple orders, through cascades of wavelet convolutions and modulus operators. Second-order scattering coefficients characterize transient phenomena such as attacks and amplitude modulation. A frequency transposition invariant representation is obtained by applying a scattering transform along log-frequency. State-the-of-art classification results are obtained for musical genre and phone classification on GTZAN and TIMIT databases, respectively.Keywords
Other Versions
Funding Information
- ANR 10-BLAN-0126
- ERC Invariant Class 320959
This publication has 33 references indexed in Scilit:
- Sound Texture Perception via Statistics of the Auditory Periphery: Evidence from Sound SynthesisNeuron, 2011
- LIBSVMACM Transactions on Intelligent Systems and Technology, 2011
- IRLbotACM Transactions on the Web, 2009
- Efficient auditory codingNature, 2006
- Musical genre classification of audio signalsIEEE Transactions on Speech and Audio Processing, 2002
- Auditory images. How complex sounds are represented in the auditory system.Acoustical Science and Technology, 2000
- Speaker-independent phone recognition using hidden Markov modelsIEEE Transactions on Acoustics, Speech, and Signal Processing, 1989
- Signal estimation from modified short-time Fourier transformIEEE Transactions on Acoustics, Speech, and Signal Processing, 1984
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentencesIEEE Transactions on Acoustics, Speech, and Signal Processing, 1980
- An iterative technique for the rectification of observed distributionsThe Astronomical Journal, 1974