The Effect of Speech Fragmentation and Audio Encodings on Automatic Parkinson’s Disease Recognition

Open Access

1 January 2022

journal article
research article
Published by Scientific Research Publishing, Inc. in Journal of Biomedical Science and Engineering

Vol. 15 (01), 6-25
https://doi.org/10.4236/jbise.2022.151002

Abstract

Parkinson’s disease is a neurological disease that is incurable according to current clinical knowledge. Early detection and appropriate treatment are therefore of primary importance. Speech is one of the biomarkers that enable the detection of Parkinson’s disease. Numerous studies rely on recordings from controlled environments; fewer use real-world conditions. The present study examined three factors: recording fragmentation (paragraph, sentence, time-based), audio encoding (Pulse-Code Modulation [PCM], GSM-Full Rate [FR], G.723.1), and majority voting on 8 kHz recordings using multiple classifiers. Support Vector Machine (SVM), Long Short-Term Memory (LSTM), i-vector and x-vector classifiers were evaluated, with SVM serving as the baseline. The highest accuracy and F1-score were achieved with i-vector models. Although the different encodings generally reduced Parkinson’s disease recognition performance, the decline was within 2% - 3% in the best case. Fragmentation did not yield a clear outcome, though some classifiers performed with very similar efficiency across the differently fragmented sets. Majority voting produced a slight increase in classification performance compared with using no aggregation.
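
The majority-voting step mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each recording is split into fragments, that a classifier assigns one label to each fragment, and that the recording-level decision is the most frequent fragment label. The function name `majority_vote`, the labels "PD" and "HC", and the `tie_label` parameter are hypothetical choices for illustration; the abstract does not specify how the study handled ties or encoded labels.

```python
from collections import Counter


def majority_vote(fragment_labels, tie_label=None):
    """Aggregate per-fragment predictions into one recording-level label.

    fragment_labels: non-empty sequence of hashable labels, one per fragment.
    tie_label: label returned when the two most frequent labels are tied
               (an assumption of this sketch; if None, the tied label that
               appeared first in the input is returned).
    """
    if not fragment_labels:
        raise ValueError("at least one fragment prediction is required")

    # most_common() sorts by count, keeping first-seen order among ties.
    ranked = Counter(fragment_labels).most_common()
    top_label, top_count = ranked[0]

    # Detect a tie between the two most frequent labels.
    if len(ranked) > 1 and ranked[1][1] == top_count and tie_label is not None:
        return tie_label
    return top_label


if __name__ == "__main__":
    # Hypothetical fragment-level outputs for a single recording.
    predictions = ["PD", "HC", "PD", "PD", "HC"]
    print(majority_vote(predictions))
```

With an odd number of fragments and two classes a tie cannot occur; for even counts, the tie-breaking rule is a design decision that affects sensitivity and specificity and should be fixed before evaluation.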

