Diverse training data boosts EEG-based Parkinson's detection accuracy to 94%

May 1, 2026 · 2 min read · Research

A new study from researchers at the University of South Dakota and Oregon Health & Science University demonstrates that training machine learning models on electroencephalography (EEG) data from diverse populations dramatically improves the reliability of Parkinson's disease detection. The work, published on arXiv, establishes a framework for creating EEG biomarkers that work across different clinical settings.

The research

Nicholas Rasmussen and colleagues analyzed EEG recordings from five independent cohorts of people with Parkinson's disease and healthy controls. Rather than assuming data from different hospitals or recording devices are interchangeable, the team systematically tested all possible cross-population training combinations — 75 directional evaluations in total. Their "population-aware" approach used an n-gram expansion strategy to ensure no population-specific artifacts biased the results.

When models were trained on data from a single cohort and tested on another, accuracy often dropped significantly. However, training on multiple diverse cohorts yielded much more robust models. The best model achieved 94.1% accuracy on held-out cohorts, and the stability of selected EEG biomarkers improved as training population diversity increased. A theoretical analysis based on mixture risk optimization and hypothesis space contraction explained why: multi-population training forces the model to learn disease-relevant neural patterns rather than site-specific noise.

The nested cross-validation design with integrated channel selection ensured that biomarker identification was prospective and not contaminated by population leakage — a common pitfall in previous studies.

Why it matters

Parkinson's disease affects over 10 million people worldwide, and early, accurate diagnosis remains challenging. EEG is a non-invasive, low-cost tool, but its clinical adoption has been limited by inconsistent results across clinics. This study provides a principled method to develop EEG-based biomarkers that truly generalize. For the average person, it means that future diagnostic tools may be more reliable regardless of where or how the EEG is collected, reducing false positives and missed diagnoses.

What you can do

While you cannot directly apply this research, you can support cognitive health through regular brain training and staying informed about evidence-based diagnostics. If you're curious about your own cognitive baseline, consider taking a scientifically validated IQ test.

Source: arXiv q-bio.NC

Curious about your own brain? Take our free adaptive IQ test or try 306 brain training levels.