Metric learning with spectral graph convolutions on brain connectivity networks

04 Jan 2018

Paper: NeuroImage
Code: (old version) github (tensorflow)
Published: Dec 2017

Key Idea:

We propose to learn a graph similarity metric using a siamese graph convolutional neural network (s-GCN) in a supervised setting.

Background knowledge:

Siamese neural networks

A Siamese neural network is a class of neural network architectures that contains two or more identical subnetworks. Identical here means that the subnetworks share the same configuration, parameters and weights, and that parameter updates are mirrored across them. Siamese networks are popular for tasks that involve finding the similarity or a relationship between two comparable things.
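
To make the weight sharing concrete, here is a minimal NumPy sketch. The layer sizes, the single linear layer with ReLU, and the cosine similarity are assumptions chosen for illustration, not the architecture used in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# One set of weights, shared by both branches (hypothetical sizes).
W = rng.normal(size=(4, 3))

def embed(x, W):
    """Shared subnetwork: a single linear layer followed by ReLU."""
    return np.maximum(x @ W, 0.0)

def siamese_similarity(x1, x2, W):
    """Embed both inputs with the same weights, then compare the embeddings."""
    z1, z2 = embed(x1, W), embed(x2, W)
    # Cosine similarity; the small constant avoids division by zero.
    return float(z1 @ z2 / (np.linalg.norm(z1) * np.linalg.norm(z2) + 1e-12))

x_a = rng.normal(size=4)
x_b = rng.normal(size=4)
print(siamese_similarity(x_a, x_b, W))
```

Because both branches call `embed` with the same `W`, any gradient update to `W` affects both branches identically, which is what "mirrored" updating means.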

Degree matrix or diagonal degree matrix

The degree matrix is a diagonal matrix which contains the degree of each vertex, that is, the number of edges attached to it.

Adjacency matrix

An adjacency matrix is a square matrix used to represent a finite graph. The elements of the matrix indicate whether pairs of vertices are adjacent or not in the graph.
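
As a small illustration of the two definitions above, the sketch below builds the adjacency and degree matrices of a toy undirected graph. The edge list is made up for the example.

```python
import numpy as np

# Hypothetical undirected graph on 4 vertices with edges 0-1, 0-2, 1-2, 2-3.
edges = [(0, 1), (0, 2), (1, 2), (2, 3)]
n = 4

A = np.zeros((n, n))
for i, j in edges:
    A[i, j] = 1.0
    A[j, i] = 1.0  # undirected: the matrix is symmetric

# Degree matrix: row sums of A placed on the diagonal.
D = np.diag(A.sum(axis=1))
print(np.diag(D))  # [2. 2. 3. 1.]
```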

Laplacian matrix / Symmetric normalized Laplacian
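
For reference, the standard definitions are $L = D - A$ for the (combinatorial) Laplacian and $L_{sym} = I - D^{-1/2} A D^{-1/2}$ for the symmetric normalized Laplacian, with $A$ the adjacency matrix and $D$ the degree matrix. A minimal sketch, assuming a symmetric adjacency matrix without isolated vertices (the toy graph is made up):

```python
import numpy as np

def laplacians(A):
    """Return (L, L_sym) for a symmetric adjacency matrix A with no isolated vertices."""
    d = A.sum(axis=1)
    L = np.diag(d) - A
    d_inv_sqrt = 1.0 / np.sqrt(d)
    # Scale row i and column j of A by d_i^{-1/2} and d_j^{-1/2}.
    L_sym = np.eye(len(A)) - d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]
    return L, L_sym

A = np.array([[0, 1, 1, 0],
              [1, 0, 1, 0],
              [1, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)
L, L_sym = laplacians(A)
eigvals = np.linalg.eigvalsh(L_sym)
print(eigvals.min(), eigvals.max())  # eigenvalues of L_sym lie in [0, 2]
```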

Chebyshev polynomials of the first kind
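
Chebyshev polynomials of the first kind follow the recurrence $T_0(x)=1$, $T_1(x)=x$, $T_k(x)=2xT_{k-1}(x)-T_{k-2}(x)$, and satisfy $T_k(\cos\theta)=\cos(k\theta)$. The sketch below evaluates the recurrence and checks that identity; the helper name `chebyshev` is just for this example.

```python
import numpy as np

def chebyshev(k, x):
    """Evaluate T_0..T_k at x via the three-term recurrence; returns a list."""
    x = np.asarray(x, dtype=float)
    T = [np.ones_like(x), x]
    for _ in range(2, k + 1):
        T.append(2 * x * T[-1] - T[-2])
    return T[: k + 1]

theta = np.linspace(0, np.pi, 5)
T = chebyshev(3, np.cos(theta))
print(np.allclose(T[3], np.cos(3 * theta)))  # True
```

The same recurrence applies when $x$ is replaced by a rescaled graph Laplacian, which is what makes these polynomials useful for spectral graph convolutions.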

Methodology:

Check this blog

Dataset & preprocess:

Dataset: Autism Brain Imaging Data Exchange (ABIDE) & UK Biobank (UKB)
Preprocessing pipeline:
- ABIDE: Configurable Pipeline for the Analysis of Connectomes (C-PAC), including:
    * skull stripping
    * slice timing correction
    * motion correction
    * global mean intensity normalisation
    * nuisance signal regression
    * band-pass filtering (0.01-0.1 Hz)
    * registration of fMRI images to standard anatomical space (MNI152)
- UKB: pipeline described in Miller2016
ROI:
- ABIDE: Harvard Oxford (HO) atlas (R = 110 cortical and subcortical ROIs)
  - Extract the mean time series for each ROI.
  - Normalise each time series to zero mean and unit variance.
- UKB: 55 ROIs (of 100 spatially independent components, the 55 non-artefactual ones are kept; Miller2016)
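
The normalisation step, and the correlation matrix that later serves as node features, can be sketched as follows. The random data and the number of time points are placeholders; only R = 110 comes from the notes above.

```python
import numpy as np

rng = np.random.default_rng(0)
T_len, R = 200, 110  # hypothetical number of time points; R = 110 ROIs as for ABIDE

ts = rng.normal(size=(T_len, R))  # stand-in for the mean ROI time series

# Normalise each ROI time series to zero mean and unit variance.
ts_z = (ts - ts.mean(axis=0)) / ts.std(axis=0)

# With z-scored columns, the Pearson correlation matrix is a scaled inner product.
corr = ts_z.T @ ts_z / T_len
print(corr.shape)  # (110, 110)
```

Row `corr[i]` is then the feature vector of ROI `i`.
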
Number:
- ABIDE:
  - Number of subjects: N = 871 (from different imaging sites; these 871 met the imaging quality and phenotypic information criteria)
  - ASD: 403
  - Healthy controls: 468
  - Number of sites: 20
- UKB:
  - Number of subjects: N = 2500
  - Male: 1181
  - Female: 1319

Network detail:

Task: measure the similarity between two graphs
Graph:
- Vertex: each ROI is represented by a node $v_i\in\mathcal{V}$
- Input feature: for each ROI, the input feature is the corresponding row of the correlation matrix.
- Edge & weight:
  - Type 1: spatial distance between ROIs, with edge weight $e_{ij}=d(v_i,v_j)=\sqrt{\|v_i-v_j\|^2}$
  - Type 2: mean functional connectivity
  - Edges are selected by k-NN (k-nearest neighbors), k=10
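
A rough sketch of the Type 1 graph: connect each ROI to its k nearest neighbours in space and weight the edge by Euclidean distance. The random coordinates, the helper name `knn_graph`, and the rule used to symmetrise the graph are assumptions; the notes above do not say how asymmetric neighbour relations are resolved.

```python
import numpy as np

def knn_graph(coords, k):
    """Symmetric k-NN adjacency, weighted by Euclidean distance between nodes."""
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    n = len(coords)
    W = np.zeros((n, n))
    for i in range(n):
        order = np.argsort(dist[i])
        nbrs = order[order != i][:k]  # k closest nodes, excluding the node itself
        W[i, nbrs] = dist[i, nbrs]
    # Keep an edge if either endpoint lists the other as a neighbour (assumed rule).
    return np.maximum(W, W.T)

rng = np.random.default_rng(0)
coords = rng.uniform(size=(110, 3))  # placeholder ROI centre coordinates
W = knn_graph(coords, k=10)
print(W.shape, int((W > 0).sum(axis=1).min()))
```
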
Network Structure:
1. GCN (graph convolutional layers):
  1. 2 layers with 64 features each (weights shared between the Siamese branches)
  2. K=3: each convolution takes input from nodes at most K steps away.
2. FC:
  1. One output with sigmoid activation $S(x)={\frac {1}{1+e^{-x}}}={\frac {e^{x}}{e^{x}+1}}$
  2. A binary feature is introduced at the FC layer indicating whether the two subjects of a pair were scanned at the same site or not.
  3. Dropout rate of 0.2/0.5 (ABIDE/UKB) on the FC layer
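
To illustrate why a polynomial filter of order K only sees nodes at most K steps away, here is a sketch of a Chebyshev spectral graph convolution in NumPy. It is not the paper's TensorFlow code: the function name, the summation range k = 0..K, the ReLU, and the tiny feature sizes are assumptions for the example.

```python
import numpy as np

def cheb_graph_conv(X, L_sym, thetas):
    """Order-K Chebyshev filter: sum_k T_k(L_tilde) @ X @ thetas[k], k = 0..K.

    X: (n_nodes, f_in); L_sym: (n_nodes, n_nodes); thetas: (K + 1, f_in, f_out).
    """
    n = L_sym.shape[0]
    lam_max = np.linalg.eigvalsh(L_sym).max()
    # Rescale the spectrum to [-1, 1], the domain of the Chebyshev polynomials.
    L_tilde = 2.0 * L_sym / lam_max - np.eye(n)

    Tx_prev, Tx = X, L_tilde @ X  # T_0(L_tilde) X and T_1(L_tilde) X
    out = Tx_prev @ thetas[0]
    if len(thetas) > 1:
        out = out + Tx @ thetas[1]
    for k in range(2, len(thetas)):
        Tx_prev, Tx = Tx, 2.0 * (L_tilde @ Tx) - Tx_prev
        out = out + Tx @ thetas[k]
    return np.maximum(out, 0.0)  # ReLU, an assumed nonlinearity

# Usage on a 6-node path graph with a single impulse feature at node 0.
n = 6
A = np.diag(np.ones(n - 1), 1) + np.diag(np.ones(n - 1), -1)
d_inv_sqrt = 1.0 / np.sqrt(A.sum(axis=1))
L_sym = np.eye(n) - d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]
X = np.zeros((n, 1))
X[0, 0] = 1.0
thetas = np.random.default_rng(0).normal(size=(4, 1, 2))  # K = 3 gives 4 coefficient matrices
Y = cheb_graph_conv(X, L_sym, thetas)
print(Y.shape)  # (6, 2)
```

With K = 3, the impulse at node 0 cannot reach nodes 4 and 5, so their outputs stay zero after one layer.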
Loss function:
- global loss:
  $J^g=(\sigma^{2+}+\sigma^{2-})+\lambda \max(0,m-(\mu^+-\mu^-))$
  It maximises the mean similarity $\mu^+$ between embeddings belonging to the same class and minimises the mean similarity $\mu^-$ between embeddings belonging to different classes, while also minimising the variance of the pairwise similarities for both matching ($\sigma^{2+}$) and non-matching ($\sigma^{2-}$) pairs of graphs.
- constrained variance loss:
  $J^g=\max(0,\sigma^{2+}-a)+\max(0,\sigma^{2-}-a)+\lambda \max(0,m-(\mu^+-\mu^-))$
  Compared to the global loss, it adds a threshold $a$ on the variances, so they are penalised only when they exceed $a$.
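
Both losses are easy to write down from the formulas. The sketch below is a plain NumPy version computed over one batch of pairwise similarities; the function names, the use of population variance, and the toy values are assumptions.

```python
import numpy as np

def global_loss(sim, match, m=1.0, lam=1.0):
    """Variance of matching and non-matching similarities plus a hinge on the mean gap."""
    pos, neg = sim[match], sim[~match]
    hinge = max(0.0, m - (pos.mean() - neg.mean()))
    return float(pos.var() + neg.var() + lam * hinge)

def constrained_variance_loss(sim, match, m=1.0, lam=1.0, a=None):
    """Same as the global loss, but variances are penalised only above the threshold a."""
    a = m / 2 if a is None else a
    pos, neg = sim[match], sim[~match]
    hinge = max(0.0, m - (pos.mean() - neg.mean()))
    return float(max(0.0, pos.var() - a) + max(0.0, neg.var() - a) + lam * hinge)

# Toy batch: three matching pairs followed by three non-matching pairs.
sim = np.array([0.9, 0.8, 0.7, 0.2, 0.3, 0.1])
match = np.array([True, True, True, False, False, False])

print(round(global_loss(sim, match), 4))                # 0.4133
print(round(constrained_variance_loss(sim, match), 4))  # 0.4
```

In this toy batch the variances are far below $a = m/2$, so the constrained variance loss reduces to the hinge term alone.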
Training detail:
- Adam optimizer: learning rate 0.001, regularization 0.0005/0.05 (ABIDE/UKB)
- Loss function: margin $m=1.0$, weight $\lambda=1.0$, $a=m/2$
- Mini-batch size: 200
Train and test:
- ABIDE:
  1. 871 subjects in total: 720 for training, 151 for testing.
  2. The training set forms 21802 matching and 21398 non-matching graph pairs; the test set forms 5631 matching and 5694 non-matching pairs.
  3. All graphs are fed to the network the same number of times to avoid biases.
  4. Subjects from all 20 sites are included in both the training and test sets.
- UKB:
  1. 5-fold cross-validation
  2. 2500 subjects in total: 2000 for training, 500 for testing in each fold
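
A quick arithmetic check on the ABIDE pair counts: the test pairs add up to exactly the number of unordered pairs of 151 subjects, whereas the training pairs are a much smaller subset of all possible pairs of 720 subjects. The interpretation in the comments is my own reading of the numbers, not a statement from the paper.

```python
from math import comb

n_train, n_test = 720, 151

test_pairs = 5631 + 5694
train_pairs = 21802 + 21398

print(comb(n_test, 2), test_pairs)    # 11325 11325 -> every test pair is used
print(comb(n_train, 2), train_pairs)  # 258840 43200 -> only a subset of training pairs
print(train_pairs * 2 / n_train)      # 120.0 -> average number of pairs per training graph
```

An average of 120 pairs per training graph is consistent with the note that all graphs are fed to the network the same number of times.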
