Multimodal AI for Predicting IVF Pregnancy Outcomes

Original Title: Multimodal intelligent prediction model for in vitro fertilization

Journal: NPJ digital medicine

DOI: 10.1038/s41746-025-02331-5

Overview

This study introduces VaTEP, a multimodal deep learning framework that integrates time-lapse videos of developing embryos with tabular clinical data. Developed and validated on data from 9,786 participants across three medical centers, VaTEP predicts three clinical outcomes: presence of a fetal heartbeat, singleton versus multiple pregnancy, and miscarriage versus live birth. A multi-task learning approach optimizes the three predictions simultaneously. The model achieved an area under the curve (AUC) of 0.8000 for fetal heartbeat, 0.8823 for singleton versus multiple pregnancy, and 0.9258 for live birth versus miscarriage, exceeding the performance of senior embryologists. Analysis identified maternal age, anti-Müllerian hormone levels, and endometrial thickness as significant variables informing the model's decisions. The framework provides a quantitative tool for embryo selection that accounts for both embryonic development and maternal physiology.
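For readers less familiar with the AUC figures quoted above, the following is a minimal sketch of how an AUC can be computed from binary labels and model scores. It uses the pairwise-ranking definition (the probability that a randomly chosen positive case scores higher than a randomly chosen negative one, with ties counted as half). The function name `roc_auc` and the toy labels and scores are illustrative assumptions; they are not taken from the study, and this is not the authors' evaluation code.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: iterable of 0/1 ground-truth outcomes (1 = positive case)
    scores: iterable of model scores, higher = more likely positive
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical): 3 positive and 2 negative cases.
toy_labels = [1, 0, 1, 0, 1]
toy_scores = [0.9, 0.3, 0.6, 0.7, 0.8]
toy_auc = roc_auc(toy_labels, toy_scores)  # 5 of 6 positive/negative pairs are ordered correctly
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which is the scale on which the reported values of 0.8000, 0.8823, and 0.9258 should be read.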

Novelty

The novelty lies in an integrated end-to-end architecture and in pre-training tasks designed to improve the video representation. Unlike models that treat video and clinical data separately, this approach uses a cross-attention mechanism so that the two modalities interact directly. A key technical contribution is the use of two pre-training tasks: video reconstruction and embryo developmental phase prediction. These allow the encoder to learn spatiotemporal patterns and biological milestones before it is fine-tuned on clinical outcomes. The model also uses a multiple frame sampling strategy to capture information from the entire developmental sequence efficiently. Extending the prediction targets to include multiple pregnancy risk and live birth outcomes covers more of the clinical course than a single endpoint would. The multi-task framework enables feature sharing across related clinical endpoints, improving generalization compared to single-task systems.
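To make the idea of cross-attention between modalities concrete, here is a minimal sketch of single-head scaled dot-product cross-attention in plain Python. It assumes that clinical-feature tokens act as queries and video-frame tokens act as keys and values; that direction, the absence of learned projection matrices, the function names, and the toy vectors are all simplifying assumptions for illustration. The summary above does not specify how VaTEP configures its attention layers, so this should not be read as the authors' implementation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product cross-attention (single head, no learned weights).

    queries: list of d-dim vectors from one modality (assumed: clinical tokens)
    keys, values: lists of vectors from the other modality (assumed: frame tokens)
    Returns (outputs, weights): one output vector and one weight row per query.
    """
    d = len(keys[0])
    outputs, weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        w = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        out = [sum(wj * v[c] for wj, v in zip(w, values)) for c in range(len(values[0]))]
        outputs.append(out)
        weights.append(w)
    return outputs, weights


# Toy example (hypothetical values): one clinical token attends over three frame tokens.
clinical_tokens = [[1.0, 0.0]]
frame_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
fused, attn = cross_attention(clinical_tokens, frame_tokens, frame_tokens)
```

In this toy case the first and third frames match the query equally well and receive equal, larger weights than the second frame, so the fused vector is dominated by their content. A trained model would add learned query, key, and value projections and typically several attention heads.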

Potential Clinical / Research Applications

Clinically, the model could serve as a standardized decision-support tool for reducing multiple pregnancies. By identifying embryos with the highest live-birth potential, it can support recommendations for single embryo transfer, lowering risks such as preterm birth. In research, the model's identification of influential variables, such as hormone levels, helps scientists understand the interaction between embryonic quality and uterine receptivity. The framework could also be adapted to other medical tasks involving temporal data, such as monitoring fetal development or analyzing endoscopic videos. Because the model relies on accessible clinical data and standard imaging, it could be deployed in resource-limited settings where expensive genetic testing is unavailable, helping to standardize care quality across regions.

