A team from Shantou University and Peking University applied five machine learning algorithms, including logistic regression and SHAP explanations, to CHARLS health data, building four-year fall risk models for middle-aged and older adults with and without pain.
Key points
- Logistic regression model achieved highest AUC-ROC (0.732 for pain, 0.692 for non-pain) among five ML algorithms on CHARLS data.
- SHAP analysis revealed shared predictors (fall history, height) and exclusive features like WBC, platelets, functional limitations for pain cohort versus cognitive function and environment for non-pain.
- LASSO feature selection identified 24 variables in the pain model and 27 in non-pain, enabling interpretable and targeted fall risk profiling.
Why it matters: This interpretable ML approach pinpoints unique fall risk factors, improving precision prevention and personalized care for older adults with and without pain.
Q&A
- What is CHARLS data?
- Why use SHAP for model interpretation?
- Why did logistic regression outperform complex models?
- What is the SPPB test?