Multimodal Cancer Prognosis via Clinical Prompt Integration

Original Title: Multimodal deep learning for cancer prognosis prediction with clinical information prompts integration

Journal: NPJ digital medicine

DOI: 10.1038/s41746-025-02257-y

Overview

Survival analysis is a critical component of oncological care, providing the scientific basis for treatment planning and outcome evaluation. While multimodal deep learning has advanced this field by integrating pathology images and genomic data, clinical records are frequently underutilized due to their discrete and low-dimensional nature. This study introduces SurvPGC, a framework designed to bridge this gap by transforming clinical characteristics into high-dimensional embeddings using text templates and a language foundation model. The researchers validated SurvPGC using data from The Cancer Genome Atlas, specifically focusing on liver hepatocellular carcinoma, breast invasive carcinoma, and colorectal adenocarcinoma. The model utilizes a dual-path cross-attention mechanism to facilitate interaction between pathology images, genomic pathways, and clinical prompts. Experimental results indicate that SurvPGC achieves superior performance compared to existing state-of-the-art methods. For instance, the model reached a C-index of 0.701 in both liver and breast cancer datasets, and 0.676 in the colorectal dataset, demonstrating its ability to effectively rank patient risks across diverse cancer types.

Novelty

The primary innovation of this research lies in the methodology used to integrate low-dimensional clinical data with high-dimensional imaging and genomic features. Unlike traditional approaches that simply concatenate risk scores or standardized features, SurvPGC employs natural language processing techniques to create descriptive text prompts from tabular clinical data. By using the CONCH foundation model, these prompts are converted into rich embeddings that allow for deep participation in the model’s training process. Furthermore, the study implements a bidirectional cross-attention fusion strategy. This allows the model to not only exchange information between modalities but also to visualize how different data types influence the focus on specific regions within whole slide images. The results show that clinical information often highlights broader tissue areas, including lymphocytes and stroma, which provides a complementary perspective to the specific focus of genomic data.

Potential Clinical / Research Applications

This research has direct implications for clinical decision support systems. By integrating easily accessible clinical data with complex pathology and genomic information, SurvPGC can provide precise survival estimates at various time points, such as one, three, or five years. In a research setting, the attention visualization tools can help pathologists identify new morphological biomarkers associated with specific clinical profiles or genomic pathways. Furthermore, the framework’s ability to handle diverse data sources makes it a candidate for personalized medicine, where it could assist in tailoring therapeutic interventions based on a comprehensive, multimodal understanding of a patient’s specific disease progression and overall health status.

Similar Posts

  • Ensuring Health Equity in the Medical AI Revolution

    Original Title: Keeping Health Equity at the Forefront of the Artificial Intelligence Revolution in Medicine and Health Journal: JAMA health forum DOI: 10.1001/jamahealthforum.2025.6477 Overview OverviewThe rapid deployment of artificial intelligence in healthcare offers potential for increased efficiency and improved health outcomes. However, significant concerns exist regarding its impact on health equity. Historically, technological innovations have often benefited advantaged populations first, a phenomenon known as the 'inverse equity hypothesis'. Evidence from studies across 89 low- and middle-income countries demonstrates that without deliberate strategies, new technologies widen existing health gaps. Digital health tools frequently sustain inequities related to socioeconomic status, race, and geographic location. For instance, individuals with lower socioeconomic status are…

  • Harnessing protein language model for structure-based discovery of highly efficient and robust PET hydrolases

    Title AI-Driven Discovery of Efficient PET Hydrolases One-Sentence Summary This study introduces a computational pipeline using a protein language model and structure-based search to discover a novel, highly efficient, and thermostable PET hydrolase from nature. Overview Polyethylene terephthalate (PET) plastic waste poses a significant environmental problem. While some enzymes, known as PET hydrolases (PETases), can break down PET, their performance is often limited. This research introduces VenusMine, a computational pipeline designed to discover new and more effective PETases. The process began by using the known structure of an existing enzyme, IsPETase, as a template to search for structurally similar proteins from vast biological databases. A protein language model (PLM) was…

  • Evaluating AI and Human Performance in Spinal Surgery SSI

    Original Title: A Commentary on "Artificial Intelligence-Based Prediction Model for Surgical Site Infection in Metastatic Spinal Disease: a Multicenter Development and Validation Study" Journal: International journal of surgery (London, England) DOI: 10.1097/JS9.0000000000003123 Overview The commentary evaluates a multicenter study that developed a gradient boosting machine learning model to predict surgical site infection in metastatic spinal disease. The original research aimed to provide individualized risk stratification using prospectively collected data. A key feature was a performance comparison between the model and five experienced spine surgeons with ten to fifteen years of experience. The results showed a significant statistical difference: the artificial intelligence achieved an area under the receiver operating characteristic curve…

  • Efficient Evidential Deep Learning for Atomic Potentials

    Original Title: Evidential deep learning for interatomic potentials Journal: Nature communications DOI: 10.1038/s41467-025-67663-y Overview Molecular dynamics simulations are essential for understanding physical and chemical processes at the atomic level. Machine learning interatomic potentials have emerged as an efficient tool to achieve the accuracy of first-principles calculations with significantly reduced computational costs. However, the reliability of these models depends on the quality of training data. Uncertainty quantification is required to identify when a model encounters unfamiliar atomic configurations. Conventional methods like ensemble modeling require training multiple networks, which is computationally expensive and slow. This paper introduces the evidential interatomic potential, a framework that provides robust uncertainty estimates through a single forward…

  • Adoption of Artificial Intelligence in Health Care

    Original Title: Adoption of Artificial Intelligence in the Health Care Sector Journal: JAMA health forum DOI: 10.1001/jamahealthforum.2025.5029 Overview This study investigated the adoption of artificial intelligence (AI) in the US health care sector compared to other industries from September 2023 to May 2025. Using data from the US Census Bureau's Business Trends and Outlook Survey, which included responses from about 119,300 health care firms, the researchers analyzed biweekly trends in AI use. They found that while AI adoption in health care is increasing, it remains lower than in other sectors. In 2025, 8.3% of health care firms reported using AI, compared to 23.2% in information services and 19.2% in professional,…

  • Expert Consensus on Sonazoid CEUS for Liver Lesions

    Original Title: Expert consensus regarding the clinical application of liver contrast-enhanced US with Sonazoid (Sonazoid CEUS) Journal: International journal of surgery (London, England) DOI: 10.1097/JS9.0000000000003510 Overview This document presents an expert consensus on the clinical use of Sonazoid contrast-enhanced ultrasound for managing focal liver lesions. Sonazoid is a second-generation agent that functions as both a blood pool and a Kupffer-cell agent, with a phagocytic rate of 99 percent. Unlike pure blood-pool agents, it provides a stable post-vascular phase that lasts for approximately sixty minutes, enabling thorough liver scans. The consensus covers surveillance, diagnosis of hepatocellular carcinoma, detection of metastases, and interventional guidance. In high-risk patients, Sonazoid improves the detection of…

Leave a Reply

Your email address will not be published. Required fields are marked *

CAPTCHA