MRI Image Segmentation
MRI Image Segmentation
MRI Image Segmentation
County
Author:
Krishna Dontaraju
3 Methodology 2
4 Problem Formulation 3
4.1 KSVD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
4.2 kernal KSVD . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
5 Dataset 4
6 Implementation 5
7 Results 5
8 Discussion 8
9 Appendix 9
9.1 Proof of D= Φ(Y )A . . . . . . . . . . . . . . . . . . . . . . . . 9
9.2 KOMP algorithm[5] . . . . . . . . . . . . . . . . . . . . . . . . 9
1
Abstract
In this project, two algorithms have been implemented to address
the issue of brain image segmentation. The include a KSVD based
dictionary learning algorithm and its kernelized version.
1 Introduction
Identifying tumors in brain scan is currently being done manually by radiol-
ogists. According to Lee et al [1] the manual segmentation could take upto
3.5 hours. The authors Vaidyanathan et al [2] contend that it is often hard
and unreliable due to multiple interpretation of MRI by different people.
Thus, automating this process of brain tumor segmentation could be benefi-
cial to the diagnosis and treatment of tumors. There are multiple challenges
involved in this process. The tumor tissue and normal tissue have similar
pixel intensities. Also the tumors vary in the shape, size and locations of the
tumors making it further challenging [1].
3 Methodology
The goal of the project is to evaluate the performance of kernel KSVD al-
gorithm over standard DL algorithm. Initially the datasets weree tested on
2
KSVD agorithm described by Aharon et al[4] then on kernel KSVD algorithm
presented by Nguyen et al[].
4 Problem Formulation
4.1 KSVD
Given yi N signals of set Y = [y1 , ...yN ]
The Goal of the algorithm is to minimize the representational error.
kxi k0 ≤ T0 , ∀i = 1...N
where xi i-th column of X, T0 Sparsity level.
This problem is solved in alternating stages of Sparse coding and Dictionary
Update. In the first stage the sparse coefficients are computed using a pursuit
algorithm like OMP. Then the dictionary is updated one atom at a time
based on singular value decomposition. The problem with this algorithm is
it cannot represent non linear data structures. To overcome this problem a
kernal version of the same algorithm has been proposed by Nguyen et al[5]
3
of a pair of signal in feature, is equal to, the output of ”a non-linear positive
semidefinte kernel function” applied on the corresponding signals in the input
space. This means the inner products of the feature space can be calculated
without actually mapping the data into feature space, by just finding the
values of kernel in the input space.
So, now by alternating between sparse coding and dictionary update stages,
we can solve this problem. However, there is another hurdle in solving this
problem. We can not find the reverse mapping between the updated inner
products and the input space directly. So, the dictionary D is split into a fixed
part Φ(Y ) and a variable part A. The variable part is called the ”coefficient
dictionary”[6]. This gives us a new formulation to our problem
5 Dataset
Two datasets have been used to evaluate the performance of the algorithm.
The first dataset contains 60,000 training samples and 10,000 testing samples
of handwritten digits from 0 to 9. The second dataset contains fully anno-
tated MRI scans of 30 patients taken in four different modalities FLAIR, T1,
T1c and T2. This includes 15 patients with High grade glioma and another
fifiteen with low grade glioma. It also contains 50 synthetic MRI scan sets,
developed by a software. The annotations for real MRI scans include 5 labels
indicating ”biological interpretations” of the region[7] which are 1.necrosis,
2.edema, 3.non enhancing tumor, 4.enhancing tumor, 5.everything else. Dif-
ferent modalities provide specific ”biological information”[7] of a particular
region of the brain.
4
6 Implementation
All initial experiments have been performed on MNIST datasets to bench-
mark the algorithms. 50,000 images have been used for training, 10,000 for
validation and remaining 10,000 for testing. Subsequently tests were done on
the brats database. For the purpose of this project, owing to the enormity of
each MRI scan and time taken to evaluate each algorithm, only limited parts
of the database have been utilized. This includes FLAIR MRI of 3 patients
with high grade glioma for training, one patients scan for validation and one
patient scan for testing. Overlapping patches of size 5*5*5=125 around each
voxel are formed as feature vector. Then kmeans was applied on the training
set to get 2000 centroids. These are then input to the algorithms. 4000 vox-
els from each class label of the test MRI were used for testing the algorithm.
For the MNIST dataset Dictionary sizes(K) of 500, 1000, 1500 and sparsity
values(T) of 3, 5, and 10 have been tested. While for the BRATS dataset
dictionary size of 500, 1000 and sparsity value of 5 were tested.
7 Results
Reconstructive accuracy was used as parameter for performance evaluation.
The results of the MNIST dataset have been tabulated in table 1. From the
table we can see that kernal ksvd clearly outperforms ksvd in all the runs.
Increasing dictionary size improved performance of both algorithms, while
with increasing sparsity, the performance flatten after 5. The results of kernal
ksvd have been tabulated in table 2. For ksvd algorithm the total accuracy is
around 50 percent, however, if we take individual class accuracy values into
account, the accuracy of class1 and class 2 together is around 80 percent.
This is expected since, only one modality was used for testing only certain
classes of the tumor would flare up, hence those classes show higher accuracy.
For kernal ksvd only one class seems to form leading to an accuracy of 25
percent. This could mean it requires more tuning of parameters to validate
the results.
5
run ksvd kernel ksvd
K=500,T=3 94.8 97.3
K=500 T=5 96.4 98.4
K=500 T=10 96.3 98.9
K=1000,T=3 94.9 97.8
K=1000 T=5 96.2 98.5
K=1000 T=10 96.4 98.3
6
7
8 Discussion
Due to the heavy computational burden of the algorithm on the system, a
comprehensive evaluations could not be conducted on MRI data. To avoid
this situations faster versions of the algorithms like ”online kmeans” and
”online kernel dictionary learning” need to be explored.
8
9 Appendix
9.1 Proof of D= Φ(Y )A
Using the standard orthogonal decomposition, we can write D∗
D∗ = D̃ + D⊥
where D̃ and D⊥ are matrices whose columns lie within and orthogonal to col-
umn subspace of Φ(Y ), respectively. In other words, D̃ = Φ(Y )AandΦ(Y )T D⊥ =
0
since the second term is a ”positive semidefinite matrix”, it will only increase
the cost function which means D⊥ = 0, D∗ = D̃ = Φ(Y )A is an optimal
solution
9
References
[1] Lee, Jeon, et al. Brain tumor image segmentation using kernel dictionary
learning. Engineering in Medicine and Biology Society (EMBC), 2015
37th Annual International Conference of the IEEE. IEEE, 2015.
[3] Kim, Seung-Jun. ”Online kernel dictionary learning.” In Signal and In-
formation Processing (GlobalSIP), 2015 IEEE Global Conference on, pp.
103-107. IEEE, 2015.
[4] Michal Aharon, Michael Elad, and Alfred Bruckstein. ”K-SVD: An Al-
gorithm for Designing Overcomplete Dictionaries for Sparse Representa-
tion.” IEEE Transactions on Signal Processing 54, no. 11 (2006): 4311-
23.
[5] Van Nguyen, Hien, et al. ”Design of non-linear kernel dictionaries for ob-
ject recognition.” IEEE Transactions on Image Processing 22.12 (2013):
5123-5135.
[6] Golts, Alona, and Michael Elad. ”Linearized kernel dictionary learning.”
IEEE Journal of Selected Topics in Signal Processing 10.4 (2016): 726-
739.
[7] Ayache, N., N. Cordier, and H. Delingette. ”The Multimodal Brain Tu-
mor Image Segmentation Benchmark (BRATS).” (2014).
10