HyperAIHyperAI

Command Palette

Search for a command to run...

Generative AI Enables Structural Brain Network Construction from fMRI via Symmetric Diffusion Learning

Qiankun Zuo Bangjun Lei Wanyu Qiu Changhong Jing Jin Hong Shuqiang Wang

초록

기능적 연결성(FC)에서 구조적 연결성(SC)으로의 매핑은 다중 모달 뇌 네트워크 융합을 촉진하고 임상적 함의를 가진 잠재적 바이오마커를 발견하는 데 기여할 수 있습니다. 그러나 구조적 연결성(SC)과 기능적 자기공명영상(fMRI) 간의 신뢰할 수 있는 비선형 매핑 관계를 직접적으로 연결하는 것은 여전히 도전 과제로 남아 있습니다. 본 논문에서는 fMRI 데이터로부터 SC를 예측하기 위해 통합된 프레임워크 내에서 새로운 대칭 확산 생성 적대 신경망 기반 fMRI-to-SC(DiffGAN-F2S) 모델을 제안합니다. 구체적으로, 제안된 DiffGAN-F2S 는 확산 확률 모델(Denoising Diffusion Probabilistic Models, DDPMs)과 적대적 학습을 활용하여 fMRI로부터 소수의 단계를 거쳐 대칭성이 보장되고 고충실도의 SC 를 효율적으로 생성합니다. 이중 채널 멀티헤드 공간 어텐션(Dual-Channel Multi-Head Spatial Attention, DMSA) 모듈과 그래프 합성곱 모듈을 설계함으로써, 대칭 그래프 생성기는 직접 및 간접적으로 연결된 뇌 영역 간의 글로벌 관계를 먼저 포착한 후, 지역적 뇌 영역 상호작용을 모델링합니다. 이를 통해 fMRI 와 대칭적 구조적 연결성 간의 복잡한 매핑 관계를 규명할 수 있습니다. 또한, 생성기가 글로벌 및 로컬 위상 정보를 보존하도록 제약하여 정확한 대칭 SC 예측을 가능하게 하는 공간 연결 일관성 손실(Spatially Connected Consistency Loss)을 고안하였습니다. 공개 데이터셋인 알츠하이머병 신경영상 이니셔티브(Alzheimer's Disease Neuroimaging Initiative, ADNI)에서 수행한 실험 결과, 제안된 모델은 4 차원 영상 데이터로부터 경험적 SC 가 보존된 연결성을 효과적으로 생성할 수 있으며, 기존 관련 모델 대비 SC 예측 성능에서 우수한 결과를 보였습니다. 더 나아가, 본 모델은 경험적 방법에서 도출된 주요 뇌 영역 및 연결의 대부분을 식별할 수 있어, 다중 모달 뇌 네트워크 융합 및 임상적 뇌 질환 분석을 위한 대안적 접근법을 제공합니다.

One-sentence Summary

The authors propose DiffGAN-F2S, a symmetric diffusion generative adversarial network that directly predicts structural connectivity from fMRI in one stage. By integrating dual-channel attention and a spatially connected consistency loss, this model outperforms prior methods in accuracy and efficiency for analyzing neurodegenerative diseases like Alzheimer's.

Key Contributions

  • The paper introduces DiffGAN-F2S, a symmetric diffusive generative adversarial network that predicts structural connectivity from fMRI data by integrating denoising diffusion probabilistic models with adversarial learning to generate high-fidelity graphs in a unified framework.
  • A dual-channel multi-head spatial attention mechanism combined with graph convolutional modules is designed to capture global relations among direct and indirect brain regions while modeling local interactions to uncover complex non-linear mapping relations.
  • Experiments on the ADNI dataset demonstrate that the method effectively preserves empirical connectivity patterns and outperforms existing models in structural connectivity prediction, successfully identifying critical brain regions and connections for clinical analysis.

Introduction

Modeling the relationship between functional connectivity derived from fMRI and structural connectivity is vital for uncovering pathological mechanisms in neurodegenerative diseases like Alzheimer's. Prior approaches often rely on inefficient two-stage pipelines that first compute functional connectivity before predicting structure, while also suffering from training instability and mode collapse in generative models. The authors leverage a novel symmetric diffusive generative adversarial network called DiffGAN-F2S to predict structural connectivity directly from fMRI in a unified, end-to-end framework. By integrating denoising diffusion probabilistic models with adversarial learning and a dual-channel multi-head spatial attention mechanism, their method efficiently generates high-fidelity symmetric graphs while preserving critical global and local topological information.

Dataset

  • Dataset Composition and Sources: The authors utilize data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) public repository, selecting 240 patients divided equally into two categories: 120 normal controls and 120 subjects with mild cognitive impairment (MCI), which includes both early and late stages.

  • Key Details for Each Subset:

    • Imaging Modalities: Every subject underwent scanning using a 3T magnetic resonance instrument for both functional MRI (fMRI) and Diffusion Tensor Imaging (DTI).
    • DTI Specifications: Acquisition parameters included a repetition time (TR) of 3.4 s to 17.5 s, an echo time (TE) of 56 ms to 105 ms, and 6 to 126 gradient directions.
    • fMRI Specifications: The TR varied between 0.607 s and 3.0 s, while the TE ranged from 30 ms to 32 ms.
  • Data Processing and Preprocessing:

    • DTI Pipeline: The authors processed DTI data using the PANDA toolbox and the anatomical automatic labeling (AAL90) atlas to generate an empirical structural connectivity (SC) matrix (A0A_0A0) with dimensions of 90×9090 \times 9090×90.
    • fMRI Pipeline: Instead of standard procedures, the team used the aal.nii atlas file to convert raw fMRI data into ROI-based time series without additional parameters, resulting in a primary sample matrix (FFF) of size 90×18790 \times 18790×187.
    • Matrix Definition: In the 90×18790 \times 18790×187 fMRI matrix, the 90 rows correspond to the ROIs defined by the AAL90 atlas, and each element represents the mean signal intensity of a specific ROI at a given time point.
  • Model Usage: The processed datasets serve as the foundation for testing the model's performance in predicting structural connectivities and comparing them against empirical SCs across different diffusive steps and subject groups.

Method

The authors propose the DiffGAN-F2S model to predict Structural Connectivity (SC) from brain fMRI data. The overall framework is illustrated in the first image, which depicts the transformation from raw brain data to fMRI, processed by the proposed model to yield structural connectivity.

The architecture consists of two primary components: a symmetric diffusive process and a conditional denoising process. As shown in the figure below, the diffusive process models the truth distribution of noisy SC by persistently injecting Gaussian noise into the empirical SC matrix A0\mathbf{A}_0A0 over TTT steps until it becomes a Gaussian matrix AT\mathbf{A}_TAT.

In the denoising process, the brain fMRI serves as a condition to guide the prediction of the clean SC A0\mathbf{A}_0'A0. To reduce computational cost, the denoising steps are compressed to T/dT/dT/d steps, where each step utilizes a conditional generative adversarial network. The detailed structure of a single denoising step involves a Symmetric Graph Generator and a Connectivity Discriminator.

The Symmetric Graph Generator employs a Non-parametric modulator (NPM) to transform raw fMRI into preliminary ROI-based time series without learnable parameters. These features, along with the noisy sample, pass through LLL layers containing a Dual-channel multi-head spatial attention (DMSA) module and GCN-based modules. The DMSA module is designed to capture global relations among directly and indirectly connected brain regions. The specific implementation of this attention mechanism is detailed in the following figure.

The module computes Query (QQQ), Key (K1,K2K_1, K_2K1,K2), and Value (VVV) vectors, where K1K_1K1 and K2K_2K2 represent direct and indirect connected ROI features derived from one-layer GCNs. The attention values are calculated separately for direct and indirect connections and combined to update the ROI features. Following the attention mechanism, a Posterior Computing Module (PCD) translates the features into a symmetrically blurred A˙0\dot{\mathbf{A}}_0A˙0 and predicts the noisy SC at the current step via posterior sampling.

The Connectivity Discriminator distinguishes whether the noisy SC originates from the generator or the empirical method. It utilizes three GCN layers where the edge represents the noisy SC and the node feature is a one-hot vector corresponding to the 90 brain regions defined by the AAL90 atlas. Temporal embeddings are inserted as bias terms into the discriminating computation.

To optimize the model, three loss functions are devised: the denoising adversarial loss to constrain distribution consistency, the mean absolute error (MAE) loss to measure edge strength differences, and the spatially connected consistency loss to capture local and global graphical properties. The spatially connected consistency loss includes terms for overall similarity and betweenness centrality to ensure accurate prediction of intrinsic structural patterns.

Experiment

  • The model transforms fMRI into structural connectivity in a single stage, validating its ability to denoise Gaussian noise into empirical connectivity while preserving both global and local patterns.
  • Comparative experiments against seven existing models demonstrate superior accuracy and topological consistency, with predicted results showing high correlation to empirical data and significantly lower errors in graph metrics.
  • Connectivity analysis confirms the model reliably identifies disease-relevant brain regions and abnormal connections associated with Mild Cognitive Impairment, matching empirical findings in the majority of key regions.
  • Ablation studies verify that the adversarial loss, spatially connected consistency loss, symmetric design, and specific attention mechanisms are critical for maintaining distribution consistency and capturing global topological features.
  • Efficiency tests show that the skipping step mechanism balances prediction speed and accuracy, reducing inference time to approximately 3 seconds without significant loss in performance.

AI로 AI 구축

아이디어에서 출시까지 — 무료 AI 코코딩, 즉시 사용 가능한 환경, 최적의 GPU 가격으로 AI 개발을 가속화하세요.

AI 협업 코딩
바로 사용 가능한 GPU
최적의 가격

HyperAI Newsletters

최신 정보 구독하기
한국 시간 매주 월요일 오전 9시 에 이번 주의 최신 업데이트를 메일로 발송합니다
이메일 서비스 제공: MailChimp