Vision-language model for report generation and outcome prediction in CT pulmonary angiogram

Title

AI Model for CT Scan Reports and Outcome Prediction

One-Sentence Summary

Researchers developed an AI framework that integrates vision and language models to analyze CT pulmonary angiogram scans, generating structured diagnostic reports and predicting patient survival outcomes for pulmonary embolism.

Overview

This study addresses the challenge of interpreting Computed Tomography Pulmonary Angiography (CTPA) for pulmonary embolism (PE), a process that can be complex and time-consuming. The authors created an agent-based AI framework that combines Vision-Language Models (VLMs) and Large Language Models (LLMs) to automate key aspects of the diagnostic workflow. Trained and validated on over 69,000 CTPA studies from three large, multi-institutional datasets, the framework performs three main tasks. First, it classifies 32 PE-related abnormalities from CT scans. Second, it generates structured, clinically relevant radiology reports. Third, it predicts patient survival by integrating imaging features, clinical data, and AI-generated diagnostic text. The model demonstrated strong performance, achieving an Area Under the Receiver Operating Characteristic curve (AUROC) for abnormality classification of up to 0.788. For survival prediction, the multimodal model achieved a concordance index of 0.863, outperforming the standard Pulmonary Embolism Severity Index (PESI) score.
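For readers unfamiliar with the concordance index reported above, the following is a minimal sketch of Harrell's C-index in plain Python. It is not the authors' evaluation code: the function name, argument names, and toy data are all illustrative assumptions. The metric is the fraction of comparable patient pairs in which the patient who died earlier was assigned the higher predicted risk; 0.5 corresponds to random ranking and 1.0 to perfect ranking.

```python
from itertools import combinations


def concordance_index(times, events, risk_scores):
    """Harrell's C-index: share of comparable pairs that are ranked correctly.

    times       -- observed follow-up time per patient
    events      -- 1 if death was observed, 0 if the patient was censored
    risk_scores -- model output, higher means higher predicted risk
    (All names here are hypothetical, chosen for illustration.)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that `a` is the patient with the shorter time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # Skip pairs with tied times or where the earlier time is censored,
        # because the true ordering of the two outcomes is then unknown.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data, invented for illustration only.
times = [5, 10, 12, 20]
events = [1, 0, 1, 1]
risk = [0.9, 0.4, 0.1, 0.6]
print(concordance_index(times, events, risk))
```

In this toy example four pairs are comparable and three are ranked correctly, so the function returns 0.75. Read the same way, the reported value of 0.863 means the multimodal model ordered roughly 86% of comparable patient pairs correctly.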

Novelty

The novelty of this work lies in its integrated, multi-task approach tailored specifically to PE diagnosis. Previous models have focused on either general image captioning or single-task prediction. This framework instead unifies three distinct clinical needs (fine-grained abnormality detection, structured report generation, and multimodal outcome prediction) in a single, cohesive pipeline. A key innovation is the abnormality-guided reporting strategy, in which the system first identifies specific findings and then uses them to generate a focused, structured report. This mimics the systematic reasoning of a radiologist and moves beyond generic descriptions toward clinically actionable text.
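The abnormality-guided idea can be sketched as a two-step hand-off: classifier scores are thresholded into a list of findings, and that list becomes an explicit part of the instruction given to the report-writing model. The sketch below is an assumption-laden illustration, not the paper's implementation. The `Finding` class, both function names, the 0.5 threshold, the prompt wording, and the example labels and scores are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Finding:
    name: str
    probability: float


def select_findings(probabilities, threshold=0.5):
    """Keep abnormalities whose predicted probability meets the threshold.

    The threshold value is an arbitrary choice for this sketch.
    """
    kept = [Finding(n, p) for n, p in probabilities.items() if p >= threshold]
    # Most confident findings first, so the report leads with them.
    return sorted(kept, key=lambda f: f.probability, reverse=True)


def build_report_prompt(findings):
    """Turn the classifier output into an explicit instruction for a report model."""
    if findings:
        listing = "\n".join(f"- {f.name} (p={f.probability:.2f})" for f in findings)
    else:
        listing = "- none above threshold"
    return (
        "Write a structured CTPA report.\n"
        "Address each flagged abnormality below:\n"
        f"{listing}"
    )


# Toy classifier output; labels and values are invented for illustration.
scores = {"pulmonary embolism": 0.91, "pleural effusion": 0.62, "cardiomegaly": 0.18}
prompt = build_report_prompt(select_findings(scores))
print(prompt)
```

Because the flagged findings appear verbatim in the prompt, a reader can check which classifier outputs shaped the report text, which is one plausible way a design like this supports traceability.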

My Perspective

I find the agent-based architecture particularly insightful. Instead of relying on a single monolithic model, the framework decomposes the complex task of CTPA interpretation into specialized sub-tasks handled by different AI agents: a classifier for detection and a VLM-LLM combination for reporting. This modular approach is a pragmatic strategy for tackling multifaceted medical problems. It also improves transparency, because the output of one agent (the abnormality predictions) serves as an explicit input to the next, which makes the model's reasoning easier to trace. This step toward explainability matters for building clinical trust and for moving beyond the "black box" reputation of some AI systems.

Potential Clinical / Research Applications

In a clinical setting, this framework could serve as a valuable assistant for radiologists. By automatically generating draft reports, it could streamline workflows, reduce turnaround times for critical PE diagnoses, and improve the consistency of reporting across different physicians. The survival prediction module could aid in risk stratification, helping clinicians identify high-risk patients who may benefit from more aggressive treatment. For research, the framework provides a versatile template that could be adapted for other diseases and imaging modalities. For instance, the same agent-based methodology could be applied to detect cancer metastases or characterize interstitial lung disease on chest CTs, fostering the development of specialized, end-to-end diagnostic AI tools.
