Semantic Segmentation of Brain Tumor on multi-band 3D volumes using non-uniform 3D U-Net

¹Dept. of Computer Science and Engineering, GIMT
²Associate Professor & HoD, GIMT

Abstract

A brain tumor is a mass or growth of abnormal cells in the brain, which may be cancerous or non-cancerous. A tumor can also begin elsewhere in the body and spread to the brain. MR images of the brain are of considerable significance for identifying the outline of the tumor and for establishing its clinical relevance in diagnosis, prognosis, and treatment.

Recent deep learning models have proved effective in various segmentation and medical imaging tasks. Many of them are based on the U-Net network structure, with symmetric encoding and decoding paths for end-to-end segmentation.

In this work, we aim to develop a pipeline built around a baseline 3D U-Net, with adaptations to the training procedure, model structure, and model parameters/hyper-parameters, for semantic segmentation of brain tumors. Furthermore, instead of relying on a single model, multiple variants of the U-Net were trained with tweaked hyper-parameters and encoding/decoding blocks to reduce errors and improve performance. The Brain Tumor Segmentation (BraTS) Challenge 2020 data was chosen as the dataset. Semantic segmentation assigns a class to every pixel of the image, and the U-Net architecture localizes the area of abnormality. Given a multi-band 3D scan of the brain (preferably an MRI scan), the model outputs the corresponding segmented mask of the tumor; the main goal is to semantically segment tumors from the volumetric layers.

U-Net (baseline architecture)

U-Net, which evolved from the traditional convolutional neural network, was first designed and applied in 2015 to process biomedical images. A general convolutional neural network focuses on image classification [20], where the input is an image and the output is one label. Biomedical cases, however, require us not only to determine whether a disease is present but also to localize the area of abnormality. U-Net is dedicated to solving this problem. It is able to localize and distinguish borders because it performs classification on every pixel, so the input and output share the same size.
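The claim that input and output share the same size can be checked with a small shape trace. The following is a minimal sketch, not the architecture used in this work: it assumes 'same'-padded convolutions, 2x2x2 pooling, and 2x upsampling, and the function name, depth, filter counts, and the 128-voxel patch size are hypothetical choices made for illustration. The four input channels and four output classes follow the BraTS convention of four MR modalities and four label values.

```python
def unet_shape_trace(input_shape, n_classes, depth=3, base_filters=16):
    """Trace (D, H, W, C) feature-map shapes through a 3D U-Net-style network.

    Assumptions (not taken from the paper): 'same'-padded convolutions keep the
    spatial size, each encoder level halves it, each decoder level doubles it.
    """
    d, h, w, channels = input_shape
    factor = 2 ** depth
    if any(size % factor for size in (d, h, w)):
        raise ValueError(f"spatial dimensions must be divisible by {factor}")

    trace = [("input", (d, h, w, channels))]
    filters = base_filters

    # Contracting path: convolve, then downsample by 2 and double the filters.
    for level in range(depth):
        trace.append((f"enc{level}", (d, h, w, filters)))
        d, h, w = d // 2, h // 2, w // 2
        filters *= 2

    trace.append(("bottleneck", (d, h, w, filters)))

    # Expansive path: upsample by 2 and halve the filters at each level.
    for level in reversed(range(depth)):
        d, h, w = d * 2, h * 2, w * 2
        filters //= 2
        trace.append((f"dec{level}", (d, h, w, filters)))

    # Final 1x1x1 convolution: one score per class for every voxel.
    trace.append(("output", (d, h, w, n_classes)))
    return trace


if __name__ == "__main__":
    for name, shape in unet_shape_trace((128, 128, 128, 4), n_classes=4):
        print(f"{name:>10}: {shape}")
```

The spatial dimensions of the last entry match those of the first, which is what allows the output to be read as a per-voxel class map aligned with the input volume.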

Proposed Pipeline

The semantic segmentation of brain tumors from MR images, using multi-band channels on 3D volumes, is carried out in eight phases.

Step 1: Understand the dataset (BraTS’20 Challenge data), including the associated medical terminology.
Step 2: Generate, scale, and process the Data.
Step 3: Define variants of the 3D U-Net architecture.
Step 4: Train the Segmentation Models (U-Net).
Step 5: Track the performance of the model variants during training, with hand-tweaked parameters.
Step 6: Analyze the performance of the trained models and their generated outputs.
Step 7: Evaluate with the benchmark parameters and compare the masks.
Step 8: Select the model with the best performance.
Step 8: Select the model with the best performance.
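
Steps 2, 7, and 8 can be illustrated with a short sketch. It rests on several assumptions: the text does not name the scaling method or the benchmark metric, so min-max scaling and the Dice coefficient are used here as common choices, and all function names and the example scores are hypothetical.

```python
import numpy as np


def min_max_scale(volume):
    """Rescale the intensities of one modality to [0, 1] (assumed form of Step 2)."""
    volume = np.asarray(volume, dtype=np.float32)
    lo, hi = volume.min(), volume.max()
    if hi == lo:
        # A constant volume carries no contrast; map it to zeros.
        return np.zeros_like(volume)
    return (volume - lo) / (hi - lo)


def dice_score(pred, truth, label, eps=1e-7):
    """Dice overlap between predicted and reference masks for a single label."""
    p = np.asarray(pred) == label
    t = np.asarray(truth) == label
    intersection = np.logical_and(p, t).sum()
    return float((2.0 * intersection + eps) / (p.sum() + t.sum() + eps))


def mean_dice(pred, truth, labels):
    """Average the per-label Dice scores (assumed benchmark for Step 7)."""
    return float(np.mean([dice_score(pred, truth, lab) for lab in labels]))


def select_best(scores):
    """Return the name of the model variant with the highest score (Step 8)."""
    return max(scores, key=scores.get)


if __name__ == "__main__":
    truth = np.zeros((4, 4, 4), dtype=int)
    truth[1:3, 1:3, 1:3] = 1          # 8 tumor voxels in the reference mask
    pred = np.zeros_like(truth)
    pred[1:3, 1:3, 1:2] = 1           # 4 voxels predicted, all inside the tumor
    print(dice_score(pred, truth, label=1))
    # Hypothetical scores, for illustration only.
    print(select_best({"variant_a": 0.70, "variant_b": 0.80}))
```

Dice is computed per label and then averaged so that a small tumor sub-region is not dominated by the much larger background class.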

Experimentation Details

Training was performed on a cloud GPU (Google Colab) with the default RAM allocation. The implementation was based on the TensorFlow framework. Each model was trained for fifty epochs, with fifty steps per epoch and a batch size of 2. The Adam optimizer [30] was used with an initial learning rate (α) of 0.0001 and no further adjustment during training, since Adam adapts the size of its gradient updates itself and no manual reduction of α is needed. The total training time for all the models was about 20 hours.
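
The self-adjusting behavior of Adam can be seen in a single update step. The sketch below is a plain-Python illustration, not the training code of this work: the β₁, β₂, and ε values are assumed defaults, the constant and function names are hypothetical, and the reading of the configuration as fifty steps per epoch is an interpretation of the text.

```python
import math

# Settings reported in the text (STEPS_PER_EPOCH is an interpretation).
ALPHA = 1e-4
EPOCHS = 50
STEPS_PER_EPOCH = 50
BATCH_SIZE = 2


def adam_step(grad, m, v, t, alpha=ALPHA, beta1=0.9, beta2=0.999, eps=1e-7):
    """One Adam update for a single scalar parameter.

    Returns the step to subtract from the parameter and the updated moments.
    beta1, beta2 and eps are assumed defaults, not values from the paper.
    """
    m = beta1 * m + (1.0 - beta1) * grad          # running mean of gradients
    v = beta2 * v + (1.0 - beta2) * grad * grad   # running mean of squared gradients
    m_hat = m / (1.0 - beta1 ** t)                # bias correction for early steps
    v_hat = v / (1.0 - beta2 ** t)
    step = alpha * m_hat / (math.sqrt(v_hat) + eps)
    return step, m, v


if __name__ == "__main__":
    # Gradients that differ by six orders of magnitude give nearly the same step.
    for grad in (1e-3, 1e3):
        step, _, _ = adam_step(grad, m=0.0, v=0.0, t=1)
        print(f"grad={grad:g} -> step={step:.6e}")

    # Volumes drawn per model over the whole run (with repetition).
    print(EPOCHS * STEPS_PER_EPOCH * BATCH_SIZE)
```

Because the update is normalized by the running gradient magnitude, the step size stays close to α regardless of gradient scale, which is the sense in which no manual reduction of α is required.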
