Command Palette
Search for a command to run...
L'IA générative permet la construction de réseaux cérébraux structuraux à partir de l'IRMf grâce à l'apprentissage par Diffusion symétrique.
L'IA générative permet la construction de réseaux cérébraux structuraux à partir de l'IRMf grâce à l'apprentissage par Diffusion symétrique.
Qiankun Zuo Bangjun Lei Wanyu Qiu Changhong Jing Jin Hong Shuqiang Wang
Résumé
La cartographie de la connectivité fonctionnelle (FC) vers la connectivité structurelle (SC) peut faciliter la fusion de réseaux cérébraux multimodaux et révéler des biomarqueurs potentiels à des fins cliniques. Cependant, établir directement des relations de mapping non linéaires fiables entre la SC et l'imagerie par résonance magnétique fonctionnelle (fMRI) constitue un défi majeur. Dans cet article, nous proposons un modèle novateur, DiffGAN-F2S, fondé sur un réseau antagoniste génératif à diffusion symétrique, permettant de prédire la SC à partir de données fMRI cérébrales au sein d'un cadre unifié. Plus précisément, DiffGAN-F2S exploite les modèles probabilistes de diffusion de débruitage (DDPM) et l'apprentissage antagoniste pour générer de manière efficace une SC symétrique et de haute fidélité à partir de données fMRI en quelques étapes. Grâce à la conception de modules d'attention spatiale multi-têtes à double canal (DMSA) et de modules de convolution graphique, le générateur de graphe symétrique capture d'abord les relations globales entre les régions cérébrales directement et indirectement connectées, puis modélise les interactions locales au sein des régions cérébrales. Cette architecture permet de révéler les relations de mapping complexes entre la fMRI et la connectivité structurelle symétrique. Par ailleurs, une fonction de perte de cohérence connectée spatialement est conçue pour contraindre le générateur à préserver les informations topologiques globales et locales, afin d'assurer une prédiction précise de la SC symétrique. Les tests réalisés sur l'ensemble de données public de l'Alzheimer's Disease Neuroimaging Initiative (ADNI) démontrent que le modèle proposé génère efficacement une connectivité préservant la SC empirique à partir de données d'imagerie quadri-dimensionnelles et présente des performances supérieures en prédiction de SC par rapport à d'autres modèles connexes. En outre, ce modèle permet d'identifier la grande majorité des régions cérébrales et des connexions importantes dérivées de la méthode empirique, offrant ainsi une voie alternative pour fusionner les réseaux cérébraux multimodaux et analyser les maladies cérébrales cliniques.
One-sentence Summary
The authors propose DiffGAN-F2S, a symmetric diffusion generative adversarial network that directly predicts structural connectivity from fMRI in one stage. By integrating dual-channel attention and a spatially connected consistency loss, this model outperforms prior methods in accuracy and efficiency for analyzing neurodegenerative diseases like Alzheimer's.
Key Contributions
- The paper introduces DiffGAN-F2S, a symmetric diffusive generative adversarial network that predicts structural connectivity from fMRI data by integrating denoising diffusion probabilistic models with adversarial learning to generate high-fidelity graphs in a unified framework.
- A dual-channel multi-head spatial attention mechanism combined with graph convolutional modules is designed to capture global relations among direct and indirect brain regions while modeling local interactions to uncover complex non-linear mapping relations.
- Experiments on the ADNI dataset demonstrate that the method effectively preserves empirical connectivity patterns and outperforms existing models in structural connectivity prediction, successfully identifying critical brain regions and connections for clinical analysis.
Introduction
Modeling the relationship between functional connectivity derived from fMRI and structural connectivity is vital for uncovering pathological mechanisms in neurodegenerative diseases like Alzheimer's. Prior approaches often rely on inefficient two-stage pipelines that first compute functional connectivity before predicting structure, while also suffering from training instability and mode collapse in generative models. The authors leverage a novel symmetric diffusive generative adversarial network called DiffGAN-F2S to predict structural connectivity directly from fMRI in a unified, end-to-end framework. By integrating denoising diffusion probabilistic models with adversarial learning and a dual-channel multi-head spatial attention mechanism, their method efficiently generates high-fidelity symmetric graphs while preserving critical global and local topological information.
Dataset
-
Dataset Composition and Sources: The authors utilize data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) public repository, selecting 240 patients divided equally into two categories: 120 normal controls and 120 subjects with mild cognitive impairment (MCI), which includes both early and late stages.
-
Key Details for Each Subset:
- Imaging Modalities: Every subject underwent scanning using a 3T magnetic resonance instrument for both functional MRI (fMRI) and Diffusion Tensor Imaging (DTI).
- DTI Specifications: Acquisition parameters included a repetition time (TR) of 3.4 s to 17.5 s, an echo time (TE) of 56 ms to 105 ms, and 6 to 126 gradient directions.
- fMRI Specifications: The TR varied between 0.607 s and 3.0 s, while the TE ranged from 30 ms to 32 ms.
-
Data Processing and Preprocessing:
- DTI Pipeline: The authors processed DTI data using the PANDA toolbox and the anatomical automatic labeling (AAL90) atlas to generate an empirical structural connectivity (SC) matrix (A0) with dimensions of 90×90.
- fMRI Pipeline: Instead of standard procedures, the team used the
aal.niiatlas file to convert raw fMRI data into ROI-based time series without additional parameters, resulting in a primary sample matrix (F) of size 90×187. - Matrix Definition: In the 90×187 fMRI matrix, the 90 rows correspond to the ROIs defined by the AAL90 atlas, and each element represents the mean signal intensity of a specific ROI at a given time point.
-
Model Usage: The processed datasets serve as the foundation for testing the model's performance in predicting structural connectivities and comparing them against empirical SCs across different diffusive steps and subject groups.
Method
The authors propose the DiffGAN-F2S model to predict Structural Connectivity (SC) from brain fMRI data. The overall framework is illustrated in the first image, which depicts the transformation from raw brain data to fMRI, processed by the proposed model to yield structural connectivity.

The architecture consists of two primary components: a symmetric diffusive process and a conditional denoising process. As shown in the figure below, the diffusive process models the truth distribution of noisy SC by persistently injecting Gaussian noise into the empirical SC matrix A0 over T steps until it becomes a Gaussian matrix AT.

In the denoising process, the brain fMRI serves as a condition to guide the prediction of the clean SC A0′. To reduce computational cost, the denoising steps are compressed to T/d steps, where each step utilizes a conditional generative adversarial network. The detailed structure of a single denoising step involves a Symmetric Graph Generator and a Connectivity Discriminator.
The Symmetric Graph Generator employs a Non-parametric modulator (NPM) to transform raw fMRI into preliminary ROI-based time series without learnable parameters. These features, along with the noisy sample, pass through L layers containing a Dual-channel multi-head spatial attention (DMSA) module and GCN-based modules. The DMSA module is designed to capture global relations among directly and indirectly connected brain regions. The specific implementation of this attention mechanism is detailed in the following figure.

The module computes Query (Q), Key (K1,K2), and Value (V) vectors, where K1 and K2 represent direct and indirect connected ROI features derived from one-layer GCNs. The attention values are calculated separately for direct and indirect connections and combined to update the ROI features. Following the attention mechanism, a Posterior Computing Module (PCD) translates the features into a symmetrically blurred A˙0 and predicts the noisy SC at the current step via posterior sampling.
The Connectivity Discriminator distinguishes whether the noisy SC originates from the generator or the empirical method. It utilizes three GCN layers where the edge represents the noisy SC and the node feature is a one-hot vector corresponding to the 90 brain regions defined by the AAL90 atlas. Temporal embeddings are inserted as bias terms into the discriminating computation.
To optimize the model, three loss functions are devised: the denoising adversarial loss to constrain distribution consistency, the mean absolute error (MAE) loss to measure edge strength differences, and the spatially connected consistency loss to capture local and global graphical properties. The spatially connected consistency loss includes terms for overall similarity and betweenness centrality to ensure accurate prediction of intrinsic structural patterns.
Experiment
- The model transforms fMRI into structural connectivity in a single stage, validating its ability to denoise Gaussian noise into empirical connectivity while preserving both global and local patterns.
- Comparative experiments against seven existing models demonstrate superior accuracy and topological consistency, with predicted results showing high correlation to empirical data and significantly lower errors in graph metrics.
- Connectivity analysis confirms the model reliably identifies disease-relevant brain regions and abnormal connections associated with Mild Cognitive Impairment, matching empirical findings in the majority of key regions.
- Ablation studies verify that the adversarial loss, spatially connected consistency loss, symmetric design, and specific attention mechanisms are critical for maintaining distribution consistency and capturing global topological features.
- Efficiency tests show that the skipping step mechanism balances prediction speed and accuracy, reducing inference time to approximately 3 seconds without significant loss in performance.