Multimodal AI for Predicting IVF Pregnancy Outcomes

Original Title: Multimodal intelligent prediction model for in vitro fertilization

Journal: NPJ digital medicine

DOI: 10.1038/s41746-025-02331-5

Overview

This study introduces VaTEP, a multimodal deep learning framework that integrates time-lapse system videos of developing embryos with tabular clinical data. Developed and validated using data from 9,786 participants across three medical centers, VaTEP predicts three clinical outcomes: fetal heartbeat presence, singleton versus multiple pregnancy, and miscarriage versus live birth. Using a multi-task learning approach, the system optimizes these predictions simultaneously. Results show the model achieved an area under the curve (AUC) of 0.8000 for fetal heartbeat, 0.8823 for singleton versus multiple pregnancy, and 0.9258 for live birth versus miscarriage. These values exceeded the performance of senior embryologists. Analysis identified maternal age, anti-Müllerian hormone levels, and endometrial thickness as significant variables informing the model's decisions. The framework provides a quantitative tool for embryo selection accounting for both embryonic development and maternal physiology.

Novelty

The novelty lies in the integrated end-to-end architecture and specific pre-training tasks for enhanced video representation. Unlike models that treat video and clinical data separately, this approach uses a cross-attention mechanism for deep interaction between modalities. A technical contribution is the use of two pre-training tasks: video reconstruction and embryo developmental phase prediction. These allow the encoder to learn spatiotemporal patterns and biological milestones before fine-tuning for outcomes. The model also uses a multiple frame sampling strategy to capture information from the entire developmental sequence efficiently. Expanding prediction targets to include multiple pregnancy risks and live birth outcomes represents a comprehensive approach. This multi-task framework enables feature sharing across related clinical endpoints, improving generalization compared to single-task systems.

Potential Clinical / Research Applications

Clinical and research applications include using this technology as a standardized decision-support tool to reduce multiple pregnancies. By identifying embryos with the highest live-birth potential, clinicians can confidently recommend single embryo transfers, minimizing risks like preterm birth. In research, the model's identification of influential variables, such as hormone levels, helps scientists understand the interaction between embryonic quality and uterine receptivity. The framework could be adapted to other medical tasks involving temporal data, such as monitoring fetal development or analyzing endoscopic videos. Since the model uses accessible clinical data and standard imaging, it could be deployed in resource-limited settings where expensive genetic testing is unavailable, helping to standardize care quality across different regions.

Similar Posts

  • SR-DLR for Coronary CT Angiography Stenosis Assessment

    Original Title: Super-Resolution Deep Learning Reconstruction for Coronary CT Angiography: Coronary Stenosis Assessment and CAD-RADS Reclassification Journal: Radiology DOI: 10.1148/radiol.252163 Overview Coronary CT angiography is a standard noninvasive tool for evaluating coronary artery disease, yet its accuracy is often limited by spatial resolution, particularly in the presence of calcified lesions. This study evaluates a super-resolution deep learning reconstruction algorithm, trained on ultra-high-resolution CT data, to improve stenosis quantification. Using invasive coronary angiography as the reference standard, researchers prospectively analyzed 204 participants with 605 plaques across ten medical centers. The results demonstrate that the super-resolution approach significantly improves diagnostic performance compared to traditional hybrid iterative reconstruction. At the lesion level, the…

  • Supervised Contrastive Learning for Lacune Detection in MRI

    Original Title: Biomarkers Journal: Alzheimer's & dementia : the journal of the Alzheimer's Association DOI: 10.1002/alz70856_099645 Overview Lacunes are small, deep brain infarcts that indicate vascular disease and increase the risk of cognitive decline. Detecting these features manually is time-consuming and prone to error due to their small size and similarity to other structures like perivascular spaces. This study presents a deep learning framework designed to automate the segmentation of lacunes using 2D T2-FLAIR MRI scans. The researchers utilized a dataset of 427 images, which underwent preprocessing to segment intracranial volume and white matter hyperintensities. The core architecture employed is an Attention U-Net. To address the challenge of imbalanced data…

  • Role of stem-like cells in chemotherapy resistance and relapse in pediatric T-cell acute lymphoblastic leukemia

    Title T-ALL relapse linked to stem-like cancer cells One-Sentence Summary This study identifies a subpopulation of quiescent, stem-like leukemia cells that resists chemotherapy and expands at relapse in pediatric T-cell acute lymphoblastic leukemia, linking their presence at diagnosis to higher treatment failure risk. Overview While treatment for pediatric T-cell acute lymphoblastic leukemia (T-ALL) has improved, relapse remains a major challenge with poor outcomes. To understand the mechanisms of relapse, this study performed single-cell RNA sequencing on leukemia cells from 18 pediatric T-ALL patients. The analysis compared samples taken at initial diagnosis with those taken at relapse. The investigation identified a distinct subpopulation of T-ALL cells with stem-like characteristics in 11…

  • AI-based Alzheimer’s Detection via Retinal OCT Imaging

    Original Title: Biomarkers Journal: Alzheimer's & dementia : the journal of the Alzheimer's Association DOI: 10.1002/alz70856_100619 Overview Alzheimer's disease presents a significant global health challenge, with early detection being a priority for effective intervention. This study investigates the use of retinal optical coherence tomography (OCT) as a non-invasive biomarker for Alzheimer's disease. The researchers developed deep learning models designed to analyze both en face images and conventional analysis reports, including retinal nerve fiber layer (RNFL), macular thickness, and ganglion cell-inner plexiform layer (GCIPL) data. The primary dataset for training and internal validation consisted of 3,228 paired OCT reports and images from 1,239 subjects, comprising individuals with Alzheimer's dementia and cognitively…

  • AI-driven preclinical disease risk assessment using imaging in UK biobank

    AI Predicts Disease Risk Using MRI Scans One-Sentence Summary This study demonstrates that artificial intelligence models can predict the three-year risk of developing various diseases before clinical symptoms appear by integrating information from whole-body MRI scans with non-imaging health data. Overview Identifying individuals at high risk for disease before symptoms emerge is crucial for effective prevention. This research explores the use of artificial intelligence (AI) to assess preclinical disease risk by combining different types of health data. Using the large-scale UK Biobank dataset, the authors developed and evaluated AI models to predict the 3-year risk for seven conditions: cardiovascular disease (CVD), pancreatic disease, liver disease, cancer, chronic obstructive pulmonary disease…

  • Amyloid and Vascular Subtypes in Alzheimer’s Disease

    Original Title: Biomarkers Journal: Alzheimer's & dementia : the journal of the Alzheimer's Association DOI: 10.1002/alz70856_100574 Overview Alzheimer’s disease is a heterogeneous condition often occurring alongside cerebral small vessel disease. This study examines 262 individuals across two cohorts: the longitudinal TRIAD cohort, representing a low burden of small vessel disease, and the MITNEC-C6 cohort, which includes real-world patients with mixed dementia and moderate-to-severe vascular lesions. Using a deep learning segmentation tool and the Subtype and Stage Inference algorithm, the research team identified distinct imaging-derived subtypes based on amyloid deposition, white matter hyperintensities, perivascular spaces, and diffusion markers. The study tracked 202 individuals at baseline, with follow-ups at two and three…

Leave a Reply

Your email address will not be published. Required fields are marked *

CAPTCHA