Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
A new study comparing large language models with traditional machine learning reveals that while LLMs show promise for predicting bone cement leakage after spine surgery, they fall short for more complex complications and lack clinical readiness.
Percutaneous kyphoplasty treats osteoporotic vertebral compression fractures, but postoperative complications—particularly bone cement leakage (BCL) and new vertebral fractures (NVF)—can significantly impact patient outcomes. Predictive models could help surgeons identify high-risk patients preoperatively.
Researchers compared two state-of-the-art LLMs (GPT-5 and DeepSeek R1) with traditional machine learning models and spine surgeon predictions:
These findings challenge assumptions that advanced LLMs universally outperform traditional approaches. LLMs appear to have selective value in clinical decision support—useful for specific tasks but not replacements for conventional ML or surgeon expertise.
The single-center design limits generalizability. Authors emphasize that LLMs currently lack the maturity and reliability for clinical implementation. Further validation is necessary before real-world deployment in surgical practice.
Original paper: Comparative performance of LLMs and machine learning in predicting complications after percutaneous kyphoplasty for osteoporotic vertebral compression fractures. — NPJ digital medicine. 10.1038/s41746-026-02588-4