HyperAIHyperAI

Command Palette

Search for a command to run...

Generative AI ermöglicht den Aufbau struktureller Hirnnetzwerke aus fMRI-Daten durch symmetrisches Diffusion Learning.

Qiankun Zuo Bangjun Lei Wanyu Qiu Changhong Jing Jin Hong Shuqiang Wang

Zusammenfassung

Die Abbildung von funktioneller Konnektivität (FC) auf strukturelle Konnektivität (SC) kann die multimodale Fusion von Gehirnnetzwerken erleichtern und potenzielle Biomarker für klinische Anwendungen identifizieren. Allerdings ist es schwierig, die zuverlässigen nichtlinearen Abbildungsbeziehungen zwischen SC und funktioneller Magnetresonanztomographie (fMRI) direkt zu überbrücken. In diesem Beitrag wird ein neuartiges, auf einem symmetrischen diffusionsbasierten generativen adversariellen Netzwerk (DiffGAN-F2S) beruhendes fMRI-zu-SC-Modell vorgeschlagen, das SC aus fMRI-Daten des Gehirns innerhalb eines einheitlichen Rahmens vorhersagt. Konkret nutzt das vorgeschlagene DiffGAN-F2S Denoising Diffusion Probabilistic Models (DDPMs) und adversariales Lernen, um durch wenige Schritte von fMRI ausgehend eine symmetrische und hochfidelie SC effizient zu generieren. Durch die Gestaltung von Dual-Channel Multi-Head Spatial Attention (DMSA) und Graph Convolutional Modulen erfasst der symmetrische Graph-Generator zunächst globale Beziehungen zwischen direkt und indirekt verbundenen Gehirnregionen und modelliert anschließend die lokalen Interaktionen zwischen Gehirnregionen. Damit können die komplexen Abbildungsbeziehungen zwischen fMRI und symmetrischer struktureller Konnektivität aufgedeckt werden. Darüber hinaus wird ein räumlich verbundener Konsistenzverlust entworfen, um den Generator so zu restringieren, dass globale und lokale topologische Informationen erhalten bleiben, was eine präzise Vorhersage der symmetrischen SC ermöglicht. Tests auf dem öffentlichen Datensatz der Alzheimer's Disease Neuroimaging Initiative (ADNI) zeigen, dass das vorgeschlagene Modell effektiv empirisch erhaltene Konnektivität aus vierdimensionalen Bilddaten generieren kann und im Vergleich zu anderen verwandten Modellen überlegene Leistung bei der SC-Vorhersage erzielt. Zudem identifiziert das vorgeschlagene Modell die überwiegende Mehrheit der wichtigen Gehirnregionen und Verbindungen, die aus empirischen Methoden abgeleitet wurden, und bietet somit eine alternative Möglichkeit zur Fusion multimodaler Gehirnnetzwerke und zur Analyse klinischer Hirnerkrankungen.

One-sentence Summary

The authors propose DiffGAN-F2S, a symmetric diffusion generative adversarial network that directly predicts structural connectivity from fMRI in one stage. By integrating dual-channel attention and a spatially connected consistency loss, this model outperforms prior methods in accuracy and efficiency for analyzing neurodegenerative diseases like Alzheimer's.

Key Contributions

  • The paper introduces DiffGAN-F2S, a symmetric diffusive generative adversarial network that predicts structural connectivity from fMRI data by integrating denoising diffusion probabilistic models with adversarial learning to generate high-fidelity graphs in a unified framework.
  • A dual-channel multi-head spatial attention mechanism combined with graph convolutional modules is designed to capture global relations among direct and indirect brain regions while modeling local interactions to uncover complex non-linear mapping relations.
  • Experiments on the ADNI dataset demonstrate that the method effectively preserves empirical connectivity patterns and outperforms existing models in structural connectivity prediction, successfully identifying critical brain regions and connections for clinical analysis.

Introduction

Modeling the relationship between functional connectivity derived from fMRI and structural connectivity is vital for uncovering pathological mechanisms in neurodegenerative diseases like Alzheimer's. Prior approaches often rely on inefficient two-stage pipelines that first compute functional connectivity before predicting structure, while also suffering from training instability and mode collapse in generative models. The authors leverage a novel symmetric diffusive generative adversarial network called DiffGAN-F2S to predict structural connectivity directly from fMRI in a unified, end-to-end framework. By integrating denoising diffusion probabilistic models with adversarial learning and a dual-channel multi-head spatial attention mechanism, their method efficiently generates high-fidelity symmetric graphs while preserving critical global and local topological information.

Dataset

  • Dataset Composition and Sources: The authors utilize data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) public repository, selecting 240 patients divided equally into two categories: 120 normal controls and 120 subjects with mild cognitive impairment (MCI), which includes both early and late stages.

  • Key Details for Each Subset:

    • Imaging Modalities: Every subject underwent scanning using a 3T magnetic resonance instrument for both functional MRI (fMRI) and Diffusion Tensor Imaging (DTI).
    • DTI Specifications: Acquisition parameters included a repetition time (TR) of 3.4 s to 17.5 s, an echo time (TE) of 56 ms to 105 ms, and 6 to 126 gradient directions.
    • fMRI Specifications: The TR varied between 0.607 s and 3.0 s, while the TE ranged from 30 ms to 32 ms.
  • Data Processing and Preprocessing:

    • DTI Pipeline: The authors processed DTI data using the PANDA toolbox and the anatomical automatic labeling (AAL90) atlas to generate an empirical structural connectivity (SC) matrix (A0A_0A0) with dimensions of 90×9090 \times 9090×90.
    • fMRI Pipeline: Instead of standard procedures, the team used the aal.nii atlas file to convert raw fMRI data into ROI-based time series without additional parameters, resulting in a primary sample matrix (FFF) of size 90×18790 \times 18790×187.
    • Matrix Definition: In the 90×18790 \times 18790×187 fMRI matrix, the 90 rows correspond to the ROIs defined by the AAL90 atlas, and each element represents the mean signal intensity of a specific ROI at a given time point.
  • Model Usage: The processed datasets serve as the foundation for testing the model's performance in predicting structural connectivities and comparing them against empirical SCs across different diffusive steps and subject groups.

Method

The authors propose the DiffGAN-F2S model to predict Structural Connectivity (SC) from brain fMRI data. The overall framework is illustrated in the first image, which depicts the transformation from raw brain data to fMRI, processed by the proposed model to yield structural connectivity.

The architecture consists of two primary components: a symmetric diffusive process and a conditional denoising process. As shown in the figure below, the diffusive process models the truth distribution of noisy SC by persistently injecting Gaussian noise into the empirical SC matrix A0\mathbf{A}_0A0 over TTT steps until it becomes a Gaussian matrix AT\mathbf{A}_TAT.

In the denoising process, the brain fMRI serves as a condition to guide the prediction of the clean SC A0\mathbf{A}_0'A0. To reduce computational cost, the denoising steps are compressed to T/dT/dT/d steps, where each step utilizes a conditional generative adversarial network. The detailed structure of a single denoising step involves a Symmetric Graph Generator and a Connectivity Discriminator.

The Symmetric Graph Generator employs a Non-parametric modulator (NPM) to transform raw fMRI into preliminary ROI-based time series without learnable parameters. These features, along with the noisy sample, pass through LLL layers containing a Dual-channel multi-head spatial attention (DMSA) module and GCN-based modules. The DMSA module is designed to capture global relations among directly and indirectly connected brain regions. The specific implementation of this attention mechanism is detailed in the following figure.

The module computes Query (QQQ), Key (K1,K2K_1, K_2K1,K2), and Value (VVV) vectors, where K1K_1K1 and K2K_2K2 represent direct and indirect connected ROI features derived from one-layer GCNs. The attention values are calculated separately for direct and indirect connections and combined to update the ROI features. Following the attention mechanism, a Posterior Computing Module (PCD) translates the features into a symmetrically blurred A˙0\dot{\mathbf{A}}_0A˙0 and predicts the noisy SC at the current step via posterior sampling.

The Connectivity Discriminator distinguishes whether the noisy SC originates from the generator or the empirical method. It utilizes three GCN layers where the edge represents the noisy SC and the node feature is a one-hot vector corresponding to the 90 brain regions defined by the AAL90 atlas. Temporal embeddings are inserted as bias terms into the discriminating computation.

To optimize the model, three loss functions are devised: the denoising adversarial loss to constrain distribution consistency, the mean absolute error (MAE) loss to measure edge strength differences, and the spatially connected consistency loss to capture local and global graphical properties. The spatially connected consistency loss includes terms for overall similarity and betweenness centrality to ensure accurate prediction of intrinsic structural patterns.

Experiment

  • The model transforms fMRI into structural connectivity in a single stage, validating its ability to denoise Gaussian noise into empirical connectivity while preserving both global and local patterns.
  • Comparative experiments against seven existing models demonstrate superior accuracy and topological consistency, with predicted results showing high correlation to empirical data and significantly lower errors in graph metrics.
  • Connectivity analysis confirms the model reliably identifies disease-relevant brain regions and abnormal connections associated with Mild Cognitive Impairment, matching empirical findings in the majority of key regions.
  • Ablation studies verify that the adversarial loss, spatially connected consistency loss, symmetric design, and specific attention mechanisms are critical for maintaining distribution consistency and capturing global topological features.
  • Efficiency tests show that the skipping step mechanism balances prediction speed and accuracy, reducing inference time to approximately 3 seconds without significant loss in performance.

KI mit KI entwickeln

Von der Idee bis zum Launch – beschleunigen Sie Ihre KI-Entwicklung mit kostenlosem KI-Co-Coding, sofort einsatzbereiter Umgebung und bestem GPU-Preis.

KI-gestütztes kollaboratives Programmieren
Sofort einsatzbereite GPUs
Die besten Preise

HyperAI Newsletters

Abonnieren Sie unsere neuesten Updates
Wir werden die neuesten Updates der Woche in Ihren Posteingang liefern um neun Uhr jeden Montagmorgen
Unterstützt von MailChimp