Scalable Protein Stability Prediction via Generative Models

Original Title: Generalizable and scalable protein stability prediction with rewired protein generative models

Journal: Nature communications

DOI: 10.1038/s41467-025-67609-4

Overview

Protein stability, typically measured by changes in Gibbs free energy (ΔΔG), is a fundamental property that dictates protein function and engineering potential. Accurately predicting how mutations influence this stability remains a significant challenge due to the scarcity of high-quality experimental data and the intricate nature of three-dimensional molecular interactions. This research introduces SPURS, a deep learning framework designed to address these limitations by integrating two distinct types of protein generative models. Specifically, it combines the evolutionary patterns captured by the protein language model ESM2 with the geometric constraints learned by the inverse folding model ProteinMPNN. By training on a filtered Megascale dataset containing over 270,000 measurements across nearly 300 proteins, the framework achieves a median Spearman correlation of 0.83, outperforming existing benchmarks. The system is designed for high efficiency, allowing the prediction of all possible single-point mutations for a protein in a single computational pass.

Novelty

The primary innovation of this work is the rewiring strategy that connects sequence-based and structure-based models through a lightweight adapter module. Unlike previous methods that simply concatenate model outputs or fine-tune entire massive architectures, SPURS updates only 9.9 million parameters, which is approximately 1.5% of the total parameter count. This parameter-efficient approach preserves the pre-trained evolutionary knowledge while specializing the model for stability prediction. Furthermore, the architecture introduces an all-mutants-per-pass inference paradigm, reducing the computational cost from a linear scale to a constant scale relative to the number of possible mutations. The framework also extends beyond single-point substitutions by incorporating a dedicated epistasis decoder. This component specifically models non-additive effects between multiple amino acid substitutions, allowing the system to accurately predict the stability of higher-order mutants that many existing additive models fail to capture.

Potential Clinical / Research Applications

This framework has significant implications for clinical research and protein engineering. In clinical settings, it enables the systematic analysis of the human proteome to distinguish between benign and pathogenic missense variants. The researchers demonstrated that 68% of pathogenic variants are significantly destabilizing, compared to only 19% of benign ones, providing a quantitative tool for disease variant interpretation. In the field of protein engineering, the high efficiency of the model supports large-scale screening for stabilizing mutations, a task previously hindered by computational bottlenecks. Additionally, by serving as an informative prior, SPURS enhances fitness prediction in low-N scenarios where experimental data is extremely limited. This capability is particularly useful for optimizing therapeutic proteins or industrial enzymes where only a few dozen variants can be tested in a laboratory setting. The model can also assist in identifying functional hotspots, such as binding interfaces and active sites, by detecting residues where mutations cause functional loss out of proportion to stability changes.

Similar Posts

  • LLMs for De-identifying Sensitive Health Information

    Original Title: Leveraging large language models for the deidentification and temporal normalization of sensitive health information in electronic health records Journal: NPJ digital medicine DOI: 10.1038/s41746-025-01921-7 Overview OverviewSharing electronic health records (EHRs) for research is vital but requires the removal of sensitive health information (SHI) to protect patient privacy. This process, known as de-identification, also involves temporal normalization, which standardizes date and time expressions to preserve a coherent patient timeline. This paper evaluates the effectiveness of large language models (LLMs) for these two tasks. It presents a detailed analysis based on the SREDH/AI CUP 2023 competition, which challenged 291 teams to develop systems for SHI recognition and temporal normalization using…

  • Role of stem-like cells in chemotherapy resistance and relapse in pediatric T-cell acute lymphoblastic leukemia

    Title Stem-like cells in pediatric T-ALL relapse One-Sentence Summary This study uses single-cell RNA sequencing to identify a subpopulation of quiescent, stem-like leukemia cells in pediatric T-cell acute lymphoblastic leukemia that resists chemotherapy and expands at relapse. Overview Relapse in pediatric T-cell acute lymphoblastic leukemia (T-ALL) is associated with a poor prognosis, often driven by the development of chemotherapy resistance. To investigate the underlying cellular mechanisms, researchers performed longitudinal single-cell RNA sequencing on patient-derived xenograft (PDX) samples from 18 pediatric patients, including 13 with matched samples from both diagnosis and relapse. The analysis revealed a distinct subpopulation of T-ALL cells exhibiting stem-like features in 11 of the 18 cases. This…

  • A Multisociety Syllabus for AI in Radiology Education

    Original Title: Teaching AI for Radiology Applications: A Multisociety-Recommended Syllabus from the AAPM, ACR, RSNA, and SIIM Journal: Radiology. Artificial intelligence DOI: 10.1148/ryai.250137 Overview This paper presents a recommended syllabus for artificial intelligence (AI) education in radiology, developed through a collaboration of four major U.S. societies: the American Association of Physicists in Medicine (AAPM), the American College of Radiology (ACR), the Radiological Society of North America (RSNA), and the Society for Imaging Informatics in Medicine (SIIM). The framework addresses the growing need for standardized competencies as AI tools become more common in medical imaging. It defines the required knowledge for four distinct professional roles, or “personas”: users of AI systems…

  • Immune Response in Pig-to-Human Heart Xenografts

    Original Title: Characterizing the Immune Response in Pig-to-human Heart Xenografts Using a Multimodal Diagnostic System Journal: Circulation DOI: 10.1161/CIRCULATIONAHA.125.074971 Overview This study aimed to characterize the early immune response in genetically modified pig hearts transplanted into humans. Researchers analyzed biopsies from two 10-gene-edited pig hearts 66 hours after transplantation into brain-dead human recipients. They employed a multimodal diagnostic approach that integrated traditional histology, electron microscopy, gene expression profiling, and advanced imaging. The latter used multiplex immunofluorescence combined with a deep learning algorithm for automated cell quantification. The key findings were that the xenografts showed mild microvascular inflammation dominated by innate immune cells, specifically neutrophils (CD15+) and macrophages (CD68+), with an…

  • Interpretable Survival Analysis for Alzheimer’s Progression

    Original Title: Basic Science and Pathogenesis Journal: Alzheimer's & dementia : the journal of the Alzheimer's Association DOI: 10.1002/alz70855_107083 Overview This research addresses the challenge of predicting the progression of Alzheimer’s disease and related dementias using survival analysis. While deep learning models offer high predictive performance, their complex architectures often obscure the biological factors driving their outputs. To resolve this, the authors introduce the Neural Additive Deep Clustering Survival Machines (NADCSM) framework. This model utilizes data from the Alzheimer’s Disease Neuroimaging Initiative, specifically focusing on AV45 Florbetapir PET imaging, genotyping, and demographic information to track the transition from mild cognitive impairment to early Alzheimer’s disease. The framework models survival times…

  • Staging of Alzheimer’s disease progression in Down syndrome using mixed clinical and plasma biomarker measures with machine learning

    論文「ダウン症候群におけるアルツハイマー病進行の機械学習を用いた臨床・血漿バイオマーカー混合指標によるステージング」の要約 タイトル 機械学習がダウン症候群におけるアルツハイマー病をステージング 1文での要約 本研究では、機械学習モデルを用いてダウン症候群の成人における認知機能と血漿バイオマーカーの変化の順序を明らかにし、一般集団と同様の前臨床期アルツハイマー病の進行パターンを明らかにしました。 概要 ダウン症候群(DS)の人は、遺伝的にアルツハイマー病(AD)のリスクが高いですが、その前臨床段階における一連の事象は完全には解明されていません。本研究は、まだ臨床的な認知症を発症していないDSの成人57名を対象に、ADの進行を調査しました。研究者らは、イベントベースモデル(EBM)と呼ばれる教師なし機械学習技術を用いて、認知機能テストと血漿バイオマーカー(アミロイドベータ(Aβ)42/40比、リン酸化タウ(p-tau)、神経フィラメント軽鎖(NfL)、グリア線維性酸性タンパク質(GFAP)など)の横断的データを分析しました。このモデルは、病理学的変化が起こる最も可能性の高い順序を推定しました。結果として、最も早期に検出された事象は血漿Aβ42/40比の低下であり、その直後に記憶能力の低下が続くことが示されました。続いて、神経変性マーカー(NfLおよびp-tau)に変化が生じ、これは実行機能および視覚運動機能の低下に先行していました。この一連の変化の最後の事象は、神経炎症マーカーであるGFAPの変化でした。さらなる分析により、39歳から52歳がこれらの血漿バイオマーカーが最も急速に変化する期間であることが特定されました。 新規性 本研究の主な貢献は、DSにおける前臨床ADをステージングするために、認知機能評価と血漿バイオマーカーの両方を統合するイベントベースモデルを適用した点にあります。これまでの研究では、これらのモダリティを別々に検討することが多かったのに対し、本研究はこの特定の集団において、両者をデータ駆動型の手法で組み合わせた最初の研究の一つです。このマルチモーダルなアプローチは、単一の種類のデータを分析するよりも、より全体的で頑健な疾患進行のタイムラインを提供します。臨床的マーカーと生物学的マーカーを組み合わせた順序付けを行うことで、このモデルは単純な相関関係を超え、横断的データに基づいて病理学的事象の起こりうる時間的順序を確立し、前臨床段階のカスケードのより詳細な全体像を提示します。 私の視点 データ駆動型のアプローチであるEBMの利用は特に説得力があると感じます。なぜなら、このモデルはアミロイドカスケード仮説のような既存の仮説をデータに押し付けないからです。モデルが独自に決定した変化の順序は、結果的にカスケード仮説と一致しており、DSの文脈におけるこの枠組みの妥当性を強固なものにしています。しかし、このモデルのバイアスのかからない性質は、もしデータが異なるパターンを支持していれば、それを検出できたであろうことを意味し、複雑な疾患を研究する上でのこのような予断を持たないアプローチの価値を浮き彫りにします。さらに、横断的データから縦断的な進行を推測できる能力は、特に長期的な研究が困難な集団にとって強力なツールです。この手法は疾患の軌跡を描くための実用的なテンプレートを提供しますが、その知見は縦断的データセットによる検証によってさらに確固たるものとなるでしょう。 臨床・研究への応用の可能性 この研究成果は、臨床実践と研究の両方に直接的な示唆を与えます。臨床的には、EBMによって提供される個別のステージングは、特定の前臨床段階でどのマーカーが異常を示す可能性が高いかを特定することにより、早期診断を向上させる可能性があります。研究面では、39歳から52歳の間というバイオマーカー変化の重要な期間を特定したことで、臨床試験のデザインを最適化するための強力な根拠がもたらされます。疾患修飾薬の試験では、治療効果を観察できる可能性を最大化するために、この年齢層の参加者を特異的に募集することが考えられます。また、脳脊髄液分析やPETイメージングよりも侵襲性が低くアクセスしやすい血漿バイオマーカーに依存することで、DS集団における将来のAD予防試験の実現可能性と拡張性を向上させることもできるでしょう。

Leave a Reply

Your email address will not be published. Required fields are marked *

CAPTCHA