Raw Speech Waveform Based Classification of Patients with ALS, Parkinson’s Disease and Healthy Controls Using CNN-BLSTM

Jhansi Mallela, Aravind Illa, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh


Analysis of speech waveform through automated methods in patients with Amyotrophic Lateral Sclerosis (ALS), and Parkinson’s disease (PD) can be used for early diagnosis and monitoring disease progression. Many works in the past have used different acoustic features for the classification of patients with ALS and PD with healthy controls (HC). In this work, we propose a data-driven approach to learn representations from raw speech waveform. Our model comprises of 1-D CNN layer to extract representations from raw speech followed by BLSTM layers for the classification tasks. We consider 3 different classification tasks (ALS vs HC), (PD vs HC), and (ALS vs PD). We perform each classification task using four different speech stimuli in two scenarios: i) trained and tested in a stimulus-specific manner, ii) trained on data pooled from all stimuli, and test on each stimulus separately. Experiments with 60 ALS, 60 PD, and 60 HC show that the frequency responses of the learned 1-D CNN filters are low pass in nature, and the center frequencies lie below 1kHz. The learned representations form raw speech perform better than MFCC which is considered as baseline. Experiments with pooled models yield a better result compared to the task-specific models.


 DOI: 10.21437/Interspeech.2020-2221

Cite as: Mallela, J., Illa, A., Belur, Y., Atchayaram, N., Yadav, R., Reddy, P., Gope, D., Ghosh, P.K. (2020) Raw Speech Waveform Based Classification of Patients with ALS, Parkinson’s Disease and Healthy Controls Using CNN-BLSTM. Proc. Interspeech 2020, 4586-4590, DOI: 10.21437/Interspeech.2020-2221.


@inproceedings{Mallela2020,
  author={Jhansi Mallela and Aravind Illa and Yamini Belur and Nalini Atchayaram and Ravi Yadav and Pradeep Reddy and Dipanjan Gope and Prasanta Kumar Ghosh},
  title={{Raw Speech Waveform Based Classification of Patients with ALS, Parkinson’s Disease and Healthy Controls Using CNN-BLSTM}},
  year=2020,
  booktitle={Proc. Interspeech 2020},
  pages={4586--4590},
  doi={10.21437/Interspeech.2020-2221},
  url={http://dx.doi.org/10.21437/Interspeech.2020-2221}
}