AI-Powered Speech Analysis for Alzheimer’s Detection

Original Title: Biomarkers

Journal: Alzheimer's & dementia : the journal of the Alzheimer's Association

DOI: 10.1002/alz70856_107467

Overview

The study investigates the utility of spontaneous speech as a non-invasive biomarker for Alzheimer's disease by developing an automated analysis pipeline. Utilizing the ADReSS 2020 Challenge dataset, which comprises audio recordings from 108 training participants and 48 testing participants performing the Cookie Theft picture description task, the researchers explored the transition from raw audio to diagnostic classification. The methodology involved transcribing audio using commercial tools like OpenAI Whisper and AssemblyAI, followed by the generation of semantic vector embeddings using large language models. These embeddings were then used to train machine learning classifiers, including Support Vector Machines, Logistic Regression, and Random Forest models. The results indicated that individuals with Alzheimer's disease exhibited a higher frequency of filler words and reduced vocabulary diversity. The Support Vector Machine model demonstrated the highest performance, achieving an accuracy of 0.84 and a precision of 0.83.

Novelty

The technical contribution of this work lies in the integration of modern large language model vector embeddings into the diagnostic pipeline for cognitive impairment. Unlike traditional methods that rely heavily on manual transcription or basic linguistic features such as part-of-speech tags and syntax, this approach leverages automated transcription services to create a scalable workflow. The study specifically demonstrates that vector embeddings capture complex semantic and contextual patterns that are often missed by conventional linguistic analysis. A significant finding is that the inclusion of these high-dimensional embeddings led to a substantial improvement in the receiver operating characteristic values of the classification models. Furthermore, the research illustrates that while linguistic features provide some value, the automated embedding process serves as a powerful predictor for identifying Alzheimer's disease from spontaneous speech.

Potential Clinical / Research Applications

This automated pipeline has significant potential for integration into telehealth platforms and remote monitoring systems. By providing a cost-effective and non-invasive means of screening, it could facilitate earlier detection of cognitive decline in primary care settings where specialized neuropsychological testing is not always available. In a research context, this methodology can be applied to large-scale longitudinal studies to track the progression of Alzheimer’s disease through changes in speech patterns over time. The open-source nature of the project code encourages further collaboration and refinement within the scientific community. Additionally, the framework could be expanded into a multimodal diagnostic tool by incorporating other data types, such as acoustic features or physiological sensor data, to further enhance the sensitivity and specificity of the classification process.


Posted

in

by

Tags:

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *

CAPTCHA