Evaluation of train and test performance of machine learning algorithms and Parkinson diagnosis with statistical measurements

Avuclu E., Elen A.

MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, cilt.58, ss.2775-2788, 2020 (SCI İndekslerine Giren Dergi) identifier identifier identifier

  • Cilt numarası: 58 Konu: 11
  • Basım Tarihi: 2020
  • Doi Numarası: 10.1007/s11517-020-02260-3
  • Sayfa Sayıları: ss.2775-2788


Parkinson's disease is a neurological disorder that causes partial or complete loss of motor reflexes and speech and affects thinking, behavior, and other vital functions affecting the nervous system. Parkinson's disease causes impaired speech and motor abilities (writing, balance, etc.) in about 90% of patients and is often seen in older people. Some signs (deterioration of vocal cords) in medical voice recordings from Parkinson's patients are used to diagnose this disease. The database used in this study contains biomedical speech voice from 31 people of different age and sex related to this disease. The performance comparison of the machine learning algorithms k-Nearest Neighborhood (k-NN), Random Forest, Naive Bayes, and Support Vector Machine classifiers was performed with the used database. Moreover, the best classifier was determined for the diagnosis of Parkinson's disease. Eleven different training and test data (45 x 55, 50 x 50, 55 x 45, 60 x 40, 65 x 35, 70 x 30, 75 x 25, 80 x 20, 85 x 15, 90 x 10, 95 x 5) were processed separately. The data obtained from these training and tests were compared with statistical measurements. The training results of the k-NN classification algorithm were generally 100% successful. The best test result was obtained from Random Forest classifier with 85.81%. All statistical results and measured values are given in detail in the experimental studies section.