A team from Google Research and Duke University develops gradient boosting models trained on mobile app–collected surveys, functional tests, and wearable signals to forecast high-severity MS symptoms up to three months ahead.

Key points

  • Implementation of a mobile app to capture weekly self-reported MS symptoms, bi-weekly functional tests, and wearable signals over three years.
  • Training and validation of five models (logistic regression, MLP, GBC, RNN, TCN) on 713 users, with GBC achieving AUROCs up to 0.899 on a 20% blind test set.
  • Feature ablation reveals past symptom trajectory as top predictor, while passive signals and functional tests also contribute to multi-modal forecasting.
  • Subgroup analyses demonstrate consistent predictive performance across MS subtypes and age categories.
  • Calibration via Brier scores confirms reliable probability estimates for clinical decision support.

Why it matters: Early forecasting of MS symptom flares via a scalable mobile platform could guide proactive interventions and improve patient outcomes.

Q&A

  • What data does the MS Mosaic app collect?
  • Why use gradient boosting over deep learning?
  • How is symptom severity labeled?
  • What performance metrics were achieved?
  • Can this approach apply to other chronic diseases?
Copy link
Facebook X LinkedIn WhatsApp
Share post via...


Read full article

Machine Learning for Chronic Disease Monitoring

Introduction
Machine learning (ML) offers powerful tools to analyze complex health data streams and forecast clinical events. In chronic diseases like multiple sclerosis (MS), symptoms can evolve subtly between clinic visits. By combining continuous digital data collection with predictive algorithms, ML enables early detection of symptom flares and guides proactive care.

Data Sources and Digital Health Tools

Modern smartphones and wearable devices capture diverse physiological and behavioral signals. Common data sources include:

  • Self-reported symptom surveys: Simple scales (e.g., 5-point Likert) for patients to record symptom changes daily.
  • Functional assessments: App-based tests like finger tapping, walking speed, and memory challenges scheduled bi-weekly.
  • Passive wearables: Continuous metrics such as step count, sleep patterns, and heart rate from smartwatches or fitness bands.

Feature Engineering in Longitudinal Models

Time-series data must be aggregated into model-friendly features. Key steps include:

  1. Sliding windows: Segment data into fixed-length intervals (e.g., 7 days) to capture temporal context.
  2. Statistical summaries: Compute medians, variances, and trends for each feature within windows.
  3. Label creation: Define prediction horizons (e.g., 3 months) and generate binary outcomes based on symptom thresholds.

Model Selection and Performance

Tree-ensemble methods like Gradient Boosting Classifiers (GBC) often excel on tabular medical data by handling missing values and non-linearities. Deep learning architectures (RNN, TCN) can capture sequential dependencies but may require larger datasets. Key performance measures include:

  • AUROC: Assesses discrimination between flare vs. non-flare instances.
  • Precision-Recall: Evaluates performance under class imbalance.
  • Brier Score: Measures calibration of predicted probabilities.

Clinical Implications and Next Steps

Accurate short-term forecasting of MS symptom severity allows clinicians to:

  • Schedule interventions (e.g., physical therapy) before symptom deterioration.
  • Adjust medications proactively to mitigate flares.
  • Provide personalized feedback through digital dashboards.

Future research will refine predictive models, integrate additional biomarkers, and validate digital-health frameworks in diverse chronic disorders.

Performance of machine learning models for predicting high-severity symptoms in multiple sclerosis