Deep indel mutagenesis reveals a checkerboard regulatory architecture for exon splicing and enables prediction of therapeutic oligonucleotide targets.

Original Title: Deep indel mutagenesis reveals the regulatory and modulatory architecture of alternative exon splicing

Journal: Nature communications

DOI: 10.1038/s41467-025-62957-7

Mapping Splicing Regulation with Indel Mutagenesis

Overview

Alternative splicing is a fundamental process that allows a single gene to produce multiple protein variants. Errors in this process are linked to numerous human diseases. This study investigates how small genetic insertions and deletions (indels), a common type of mutation, affect alternative splicing. Using a method called deep indel mutagenesis (DIM), researchers systematically tested thousands of variants—including all possible single-nucleotide substitutions and deletions—within the 63-nucleotide-long *FAS* exon 6. This high-throughput approach allowed them to create a detailed functional map of the sequences that enhance or silence splicing, revealing a complex regulatory architecture.

Novelty

While the effects of single base substitutions on splicing have been studied, this work provides a comprehensive analysis of indels. The systematic deletion scanning revealed that regulatory elements are not isolated but are arranged in an alternating “checkerboard” pattern of enhancers and silencers that covers most of the exon’s length. The study also uncovered a specific mechanism for exon repression where a core component of the splicing machinery, U2 snRNP, binds to cryptic intron-like sequences located within the exon itself. This binding competes with the legitimate splicing signals, contributing to the exon being skipped from the final messenger RNA.

My Perspective

The discovery of a dense, checkerboard-like regulatory landscape challenges a simpler view of exons containing only a few discrete control elements. From my perspective, this architecture suggests that alternative splicing is a highly tunable system. The balance between many closely-spaced, opposing signals may allow for subtle, graded responses to cellular signals, rather than a simple on-or-off switch. This fine-tuning is likely critical for establishing precise ratios of protein isoforms. Furthermore, the strong correlation shown between the effects of deletions and antisense oligonucleotides (AONs) is insightful. It suggests that the primary mechanism of action for many AONs is simply steric hindrance—blocking access for regulatory proteins—which is effectively mimicked by deleting the binding site itself. This provides a functional rationale for why *in silico* deletion scanning is a promising strategy for therapeutic design.

Potential Clinical / Research Applications

The findings have direct applications in developing new therapeutics. The study demonstrates that the effects of deletions can predict the effectiveness of antisense oligonucleotides (AONs), a class of drugs used to correct splicing defects. The correlation between the effects of 21-nucleotide deletions and corresponding AONs was strong (Spearman’s rho = 0.75). Based on this principle, the authors created DANGO, a genome-wide computational resource. DANGO predicts the impact of 21-nucleotide deletions across all human exons, effectively serving as a map to identify promising target sites for AONs in diseases like spinal muscular atrophy or Duchenne muscular dystrophy. This resource could significantly streamline the pre-clinical design of AON-based therapies by prioritizing the most potent targets. For researchers, the DIM methodology provides a powerful tool to dissect the regulatory code of any alternatively spliced exon, helping to understand how patient-derived mutations might lead to disease.

Similar Posts

  • Efficient Evidential Deep Learning for Atomic Potentials

    Original Title: Evidential deep learning for interatomic potentials Journal: Nature communications DOI: 10.1038/s41467-025-67663-y Overview Molecular dynamics simulations are essential for understanding physical and chemical processes at the atomic level. Machine learning interatomic potentials have emerged as an efficient tool to achieve the accuracy of first-principles calculations with significantly reduced computational costs. However, the reliability of these models depends on the quality of training data. Uncertainty quantification is required to identify when a model encounters unfamiliar atomic configurations. Conventional methods like ensemble modeling require training multiple networks, which is computationally expensive and slow. This paper introduces the evidential interatomic potential, a framework that provides robust uncertainty estimates through a single forward…

  • AI, Mobile Tech, and Social Media for Health in Africa

    Original Title: Scoping review of artificial intelligence via mobile technology and social media for health in Africa Journal: Nature communications DOI: 10.1038/s41467-025-64766-4 Overview This scoping review investigates the integration of artificial intelligence with mobile technology and social media to address health challenges in Africa. Following the PRISMA approach, researchers screened 469 articles published between 2014 and 2023, ultimately synthesizing 116 papers with a focused analysis of 29 studies. The results indicate that these digital tools are primarily utilized for infectious disease monitoring and diagnosis. Specifically, malaria was the subject of 17.2% of the studies, while COVID-19 accounted for 13.8%. Other conditions frequently studied include Ebola at 10.3%, cervical cancer at…

  • Plexin-B2 in CTC Clustering and Breast Cancer Metastasis

    Original Title: Computational ranking identifies Plexin-B2 in circulating tumor cell clustering with monocytes in breast cancer metastasis Journal: Nature communications DOI: 10.1038/s41467-025-62862-z Overview Circulating tumor cell (CTC) clusters are significantly more effective at seeding metastases than single CTCs, but the molecular mechanisms driving their formation are not fully understood. This study employed a computational ranking system, integrating proteomic data from breast tumors and cell lines with clinical survival data, to identify key proteins involved in this process. The analysis pinpointed Plexin-B2 (PLXNB2) as a top candidate associated with poor patient outcomes. In clinical samples, high PLXNB2 expression was enriched in CTC clusters and correlated with unfavorable overall survival (Hazard Ratio…

  • Role of stem-like cells in chemotherapy resistance and relapse in pediatric T-cell acute lymphoblastic leukemia

    Title Stem-like cells in pediatric T-ALL relapse One-Sentence Summary This study uses single-cell RNA sequencing to identify a subpopulation of quiescent, stem-like leukemia cells in pediatric T-cell acute lymphoblastic leukemia that resists chemotherapy and expands at relapse. Overview Relapse in pediatric T-cell acute lymphoblastic leukemia (T-ALL) is associated with a poor prognosis, often driven by the development of chemotherapy resistance. To investigate the underlying cellular mechanisms, researchers performed longitudinal single-cell RNA sequencing on patient-derived xenograft (PDX) samples from 18 pediatric patients, including 13 with matched samples from both diagnosis and relapse. The analysis revealed a distinct subpopulation of T-ALL cells exhibiting stem-like features in 11 of the 18 cases. This…

  • Supervised Contrastive Learning for Lacune Detection in MRI

    Original Title: Biomarkers Journal: Alzheimer's & dementia : the journal of the Alzheimer's Association DOI: 10.1002/alz70856_099645 Overview Lacunes are small, deep brain infarcts that indicate vascular disease and increase the risk of cognitive decline. Detecting these features manually is time-consuming and prone to error due to their small size and similarity to other structures like perivascular spaces. This study presents a deep learning framework designed to automate the segmentation of lacunes using 2D T2-FLAIR MRI scans. The researchers utilized a dataset of 427 images, which underwent preprocessing to segment intracranial volume and white matter hyperintensities. The core architecture employed is an Attention U-Net. To address the challenge of imbalanced data…

  • Non-coding genetic elements of lung cancer identified using whole genome sequencing in 13,722 Chinese

    Title Lung Cancer’s Non-Coding Genetic Drivers One-Sentence Summary A whole-genome sequencing study of 13,722 Chinese individuals identifies common and rare non-coding genetic variants associated with lung cancer, implicating novel genes and regulatory pathways. Overview This study investigated the genetic basis of lung cancer in the Chinese population, focusing on non-coding regions of the genome that regulate gene activity. Researchers performed whole-genome sequencing on 13,722 individuals and analyzed both common and rare genetic variants. For common variants, the analysis confirmed associations with known genes like TP63 and, through a transcriptome-wide association study (TWAS), linked the expression of eight genes to lung cancer risk. The analysis of rare variants, which are less…

Leave a Reply

Your email address will not be published. Required fields are marked *

CAPTCHA