Machine Learning Outperforms CHA₂DS₂-VASc for Atrial Fibrillation Stroke Risk

Novel machine learning models predict 1-year stroke risk in newly diagnosed atrial fibrillation with dramatic improvements over the standard CHA₂DS₂-VASc score, offering clinicians better tools for personalized anticoagulation decisions.

Background

Stroke prevention in atrial fibrillation relies on the CHA₂DS₂-VASc risk score, but this decade-old tool has limitations. Researchers evaluated whether machine learning using age, comorbidities, and medications could provide more accurate risk stratification for 1-year stroke events in newly diagnosed patients.

Key Findings

  • Logistic regression and XGBoost models achieved internal AUROC of 0.914-0.915 versus 0.614-0.621 for CHA₂DS₂-VASc
  • External validation in 2,542 independent patients confirmed performance with AUROC of 0.877-0.886
  • At a risk threshold of 0.2, ML models identified over 100 additional high-risk patients per 1,000 without false positive increases
  • Models demonstrated equitable performance across genders with no significant differences
  • Long-term follow-up showed high-risk ML patients had better outcomes with DOAC therapy

Why It Matters

These interpretable models integrate into clinical decision support systems for real-world hospital deployment. Superior discrimination and calibration enable personalized DOAC initiation strategies, potentially improving stroke prevention in newly diagnosed atrial fibrillation patients.

Limitations

The study was conducted in an Asian healthcare system; validation across diverse populations and healthcare settings remains important. The models use standard clinical variables; additional biomarkers might enhance predictions further.

Original paper: Interpretable machine learning models for stroke risk prediction in patients with newly diagnosed atrial fibrillation. — NPJ digital medicine. 10.1038/s41746-026-02470-3