Heart sound classification based on equal scale frequency cepstral coefficients and deep learning
- 15 February 2023
- journal article
- research article
- Published by Walter de Gruyter GmbH in Biomedizinische Technik/Biomedical Engineering
- Vol. 68 (3), 285-295
- https://doi.org/10.1515/bmt-2021-0254
Abstract
Heart disease is a serious medical condition that can be fatal, so measures for its early detection and prevention are critical. The Mel-scale frequency cepstral coefficients (MFCC) feature has been widely used in the early diagnosis of heart abnormalities and has achieved promising results. During MFCC feature extraction, a set of Mel-scale triangular overlapping filters is applied, which brings the frequency response more in line with human auditory perception. However, the frequency content of heart sound signals has no specific relationship with the human auditory system, so the Mel scale may not be suitable for processing them. To overcome this issue and obtain a more objective feature that adapts better to practical use, we propose an equal scale frequency cepstral coefficients (EFCC) feature, obtained by replacing the Mel-scale filter set with a set of equally spaced triangular overlapping filters. We further designed classifiers combining convolutional neural network (CNN), recurrent neural network (RNN) and random forest (RF) layers, which can extract both the spatial and temporal information of the input features. We evaluated the proposed algorithm on our own database and on the PhysioNet Computing in Cardiology (CinC) 2016 Challenge Database. Results from ten-fold cross-validation show that the EFCC-based features give considerably better performance and robustness than the MFCC-based features when classifying heart sounds from novel patients. The algorithm could further be used in wearable medical devices to monitor the heart status of patients in real time with high precision, which is of great clinical importance.
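The core idea in the abstract, replacing the Mel-scale filter set with equally spaced triangular overlapping filters, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The function names (`equal_scale_filterbank`, `dct_ii`, `efcc`), the 2000 Hz sampling rate, the 26 filters, the 512-point FFT and the 13 coefficients are all assumptions chosen for illustration, and the remaining steps (log filter energies followed by a DCT) follow the conventional MFCC pipeline rather than anything stated in the abstract.

```python
import numpy as np

def equal_scale_filterbank(n_filters, n_fft, sample_rate, f_min=0.0, f_max=None):
    """Triangular overlapping filters whose edges are equally spaced in Hz."""
    if f_max is None:
        f_max = sample_rate / 2.0
    # n_filters + 2 edge points, linearly spaced in Hz.
    # (An MFCC filter bank would space these points on the Mel scale instead.)
    edges = np.linspace(f_min, f_max, n_filters + 2)
    fft_freqs = np.linspace(0.0, sample_rate / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_filters, fft_freqs.size))
    for i in range(n_filters):
        left, center, right = edges[i], edges[i + 1], edges[i + 2]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        # Triangle: the lower of the two slopes, floored at zero.
        fb[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb

def dct_ii(x, n_coeffs):
    """Unnormalized DCT-II along the last axis, keeping the first n_coeffs terms."""
    n = x.shape[-1]
    k = np.arange(n_coeffs)[:, None]
    m = np.arange(n)[None, :]
    basis = np.cos(np.pi * k * (2 * m + 1) / (2 * n))
    return x @ basis.T

def efcc(frame_power, fb, n_coeffs=13):
    """Cepstral coefficients of one frame's power spectrum using the filter bank fb."""
    energies = frame_power @ fb.T
    log_energies = np.log(energies + 1e-10)  # small offset avoids log(0)
    return dct_ii(log_energies, n_coeffs)

# Usage on a synthetic frame (random noise stands in for a heart sound segment).
fb = equal_scale_filterbank(n_filters=26, n_fft=512, sample_rate=2000)
rng = np.random.default_rng(0)
frame = np.hanning(512) * rng.standard_normal(512)
power = np.abs(np.fft.rfft(frame)) ** 2
coeffs = efcc(power, fb)
```

In this sketch the only difference from a conventional MFCC front end is the `np.linspace` call that places the filter edges: every filter has the same bandwidth in Hz, so low and high frequencies are resolved equally instead of following an auditory scale.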