Metric learning with spectral graph convolutions on brain connectivity networks
04 Jan 2018Paper: NeuroImage
Code: (old version) github (tensorflow)
Published: Dec 2017
Key Idea:
We propose to learn a graph similarity metric using a siamese graph convolutional neural network (s-GCN) in a supervised setting.
Backgroung knowledge:
Siamese neural network is a class of neural network architectures that contain two or more identical subnetworks. identical here means they have the same configuration with the same parameters and weights. Parameter updating is mirrored across both subnetworks. Siamese NNs are popular among tasks that involve finding similarity or a relationship between two comparable things.
Degree matrix or diagonal degree matrix
The degree matrix is a diagonal matrix which contains information about the degree of each vertex—that is, the number of edges attached to each vertex.
An adjacency matrix is a square matrix used to represent a finite graph. The elements of the matrix indicate whether pairs of vertices are adjacent or not in the graph.
Laplacian matrix / Symmetric normalized Laplacian
Chebyshev polynomials of the first kind
Methodology:
Dataset & preprocess:
- Dataset: Autism Brain Imaging Data Exchange (ABIDE) & UK Biobank (UKB)
- Preprocess pipeling:
- ABIDE: Configurable Pipeline for the Analysis of Connectomes (C-PAC)
Including:
* skull striping
* slice timing correction
* motion correction
* global mean intensity normalisation
* nuisance signal regression
* band-pass filtering (0.01-0.1Hz)
* registration of fMRI images to standard anatomical space (MNI152)
-
UKB: Miller2016
- ROI:
- ABIDE: Harvard Oxford (HO) atlas (R = 110 cortical and subcortical ROIs)
- Extract the mean time series for ROI
- Normalised to zero mean and unit variance.
- UKB: 55 (100 spatially independent components, 55 non artefactual. Miller2016)
- ABIDE: Harvard Oxford (HO) atlas (R = 110 cortical and subcortical ROIs)
- Number:
- ABIDE:
Subjects number: N = 871
ASD disease: 403
Healthy controls: 468
Sites number: 20
(from different imaging sites, 871 met the imaging quality and phenotypic information criteria)
- UKB:
Subjects number: N = 2500
Male: 1181
Female: 1319
Network detail:
- Task: measure the similarity between two graph
- Graph:
- Vertex: Each ROI is represent by a node
- Input feature: for each ROI, the input feature is the corresponding row of correlation matrix for that ROI.
- Edge & weight:
- Type 1: Spatial distance as graph for weight
- Type 2: mean functional connectivity as graph
- The edge is determined by k-NN (k-nearest neighbors). k=10
- Network Structure:
- CNN:
- 2 layers with 64 features (shared in Siamese network)
- K=3, convolution takes input at most K steps away from a node.
- FC:
- One output with Sigmoid activation
- A binary feature is introduced at the FC layer indicating whether the subject pair were scanned at the same site or not.
- Dropout 0.2/0.5 (ABIDE/UKB) on FC
- CNN:
- Loss function:
- global loss:
It maximises the mean similarity between embeddings belonging to the same class, minimises the mean similarity between embeddings belonging to different classes . And minimises the variance of pairwise similarities for both matching and non-matching pairs of graphs.
- constrained variance loss:
Compare to global loss, it add a threshold a to the variance.
- global loss:
- Network detail:
- Adam optimizer: 0.001 learning rate and 0.0005/0.05 (ABIDE/UKB) regularization
- Loss function: margin m=1.0, weight lambda=1.0, a=m/2
- mini-batch: 200
- Train and test:
- ABIDE:
- 871 total, 720 train, 151 test.
- train form 21802 matching and 21398 non-matching graph pairs. test form 5631 matching and 5694 non-matching.
- all graphs are fed to the network the same number of times to avoid biases.
- subjects from all 20 sites are included in both training and test sets
- UKB:
- 5 fold cross validation
- 2500 total, 2000 train, 500 test
- ABIDE: