A team from Indonesia’s National Cardiovascular Center and Universitas Indonesia applies XGBoost machine learning to screen hypertension using 11 non-laboratory variables—family history, age, waist circumference, BMI, occupation, education, sex, smoking, physical activity, diet, and alcohol consumption. They trained models on 204,315 participants with cross-validation and validated externally on 63,895 individuals, achieving 97% sensitivity and 0.75 AUC, demonstrating an efficient non-invasive screening method.
Key points
- XGBoost model trained on 204,315 Indonesian participants using 11 non-lab risk factors achieves 97% sensitivity and 0.748 AUC in external validation.
- Incorporating continuous variables for age, waist circumference, and BMI improves discrimination compared to categorical encoding, boosting ML performance substantially.
- Family history of hypertension, age, waist circumference, BMI, and occupation intensity rank as the top five predictive contributors in the ML screening model.
Why it matters: This non-invasive, high-sensitivity ML screening approach can accelerate hypertension detection in resource-limited settings, reducing undiagnosed cases and associated cardiovascular risks.
Q&A
- What are non-laboratory risk factors?
- Why is XGBoost effective for hypertension screening?
- What does AUC indicate in model performance?
- How can this model be applied in telemedicine?