2. Traditional Brain Tumor Segmentation
Several conventional techniques have been proposed for brain tumor segmentation, including region-focused, edge-focused, and model-focused approaches
[17][19]. While conventional brain tumor segmentation techniques have been widely used in medical imaging, they have several limitations, including sensitivity to image noise, lack of robustness to variations in image intensity and texture, and dependence on expert knowledge for parameter tuning.
The accuracy of conventional techniques depends on the specific application and available resources
[18][20]. Thus, the choice of the most appropriate segmentation technique depends on various factors, including the image quality, the type and location of the tumor, and the clinical objectives. In recent years, deep-learning-based methods, such as CNNs, RNNs, and their variants, have emerged as powerful alternatives to conventional techniques, achieving state-of-the-art performance on various medical imaging tasks, including brain tumor segmentation. The accuracy of these techniques depends on the specific application and available resources. However, these methods require large and diverse datasets for training and the careful selection of appropriate network architectures and hyperparameters.
2.1. Region-Focused Approaches
Region-focused approaches have been widely used in brain tumor segmentation due to their simplicity and computational efficiency. These methods aim to segment brain tumors based on the intensity or texture differences between the tumor and its surrounding tissue. Several region-based approaches have been proposed for brain tumor segmentation, including thresholding, clustering, and region growing methods
[19][21].
Thresholding-based methods involve setting a fixed threshold value to separate the tumor and non-tumor regions
[20][22]. These methods are simple and fast, but may be affected by image noise and variations in image intensity and texture. Clustering-based methods use statistical clustering algorithms, such as fuzzy C-means, k-means, and expectation-maximization algorithms, to partition the image into tumor and non-tumor regions. Clustering-based methods can achieve better segmentation accuracy than thresholding-based methods, but they require the careful selection of clustering parameters.
Region growing methods start with a seed point within the tumor region and expand the region by incorporating adjacent pixels that meet specific criteria. Region growing methods can produce accurate and smooth segmentation results, but may be affected by variations in image intensity and texture, and they are sensitive to the selection of seed points. Several studies have compared the performance of region-based approaches for brain tumor segmentation. Jaglan et al.
[21][23] compared the performance of various thresholding- and clustering-based methods and found that the Otsu thresholding method and the fuzzy C-means clustering method achieved the highest segmentation accuracy. Charutha et al.
[22][24] compared the performance of various region growing methods and found that a fast-marching method with an adaptive threshold achieved the best segmentation results.
Region-focused approaches have been widely used in brain tumor segmentation due to their simplicity and computational efficiency. While these methods have their strengths, they may be negatively affected by variations in image intensity and texture. Thus, they require the careful selection of parameters and domain knowledge. Advances in deep-learning-based methods have shown promising results in overcoming some of the limitations of region-focused approaches.
2.2. Edge-Focused Approaches
Edge-focused approaches aim to identify the boundary between the tumor and surrounding tissue by detecting edges in the image. These methods often use edge detection techniques, such as Canny or Sobel filters, to highlight the edges and then apply a segmentation algorithm to separate the tumor from the background. Edge-focused approaches can achieve better segmentation accuracy than region-based approaches, but may be sensitive to image noise and produce fragmented results. Rajan and Sundar
[23][25] combined K-means clustering, fuzzy C-means, and active contour techniques to enhance segmentation. Similarly, Sheela and Suganthi
[24][26] used an approach to improve accuracy, reduce processing time, and compute tumor volume for effective diagnosis and treatment. The process involves collecting images, calculating tumor area and volume, preprocessing and enhancing the images, adjusting the image intensities, clustering the images, classifying the images, extracting features, and segmenting the images. Each step in the process is important and contributes to the overall quality of the enhanced images. The process has strengths such as its ability to improve the quality of MRI images in several ways, i.e., identifying tumors, providing quantitative information about tumors, and segmenting tumors from the surrounding tissue. However, the process also has weaknesses, such as its complexity, time-consuming nature, reliance on specialized software and hardware, difficulty in automation, and sensitivity to noise in the images.
Several studies have compared the performance of edge-focused approaches for brain tumor segmentation. The performance of various edge detection methods, including Canny, Sobel, and Laplacian filters, showed that the Sobel filter achieved the highest segmentation accuracy. Und et al.
[25][27] compared the performance of edge detection methods combined with different segmentation algorithms, such as region growing and active contour methods, and found that the Sobel filter combined with the active contour method achieved the best segmentation results.
Edge-focused approaches can be computationally intensive due to the need to detect edges in the image. Moreover, it may be affected by variations in image intensity and texture and produce fragmented segmentation results. Deep-learning-based methods, such as CNNs, have shown promising results in overcoming some of these limitations. They can also achieve better segmentation accuracy than region-based approaches, but they may be sensitive to image noise and produce fragmented results. The choice of the most appropriate edge detection method depends on the specific application and available resources. Deep-learning-based techniques have reported very significant results in improving segmentation accuracy and robustness.
2.3. Model-Focused Approaches
Model-focused approaches aim to model the shape, texture, and intensity-based characteristics of brain tumors and their surrounding tissue to improve segmentation accuracy. These methods often use traditional machine learning algorithms, such as random forests, decision trees, and support vector machines, to learn those features that distinguish between tumorous and non-tumorous regions
[23][25]. Model-focused approaches can achieve high segmentation accuracy and robustness subject to the availability of high-quality large datasets for training and may be affected by overfitting, as presented in
Figure 2.
Figure 2. Model-based approach for brain tumor segmentation [26]. Model-based approach for brain tumor segmentation [28].
Several studies have conducted a comprehensive analysis of the model-based techniques used in brain tumor segmentation. Tandoori et al.
[26][28] considered a couple of machine-learning-based models including random forests, decision trees, and support vector machines (SVMs), and reported that SVMs were the best models among all models. Similarly, Jiang et al.
[27][29] presented a DNN model as a hybrid approach of the CNN and conditional random field and reported a very good performance.
Model-focused approaches can achieve high segmentation accuracy, but these have a substantial trade-off concerning the size of the dataset. For better performance, these models need a large high-quality dataset. However, a large dataset is also prone to data overfitting issues. Therefore, significant attention should be paid to the experimentation to obtain the best trade-off between the size of the dataset and the performance of model-focused techniques. In such cases, DL-based methods have revealed very good results in improving segmentation accuracy and robustness.
3. DL-Based Brain Tumor Segmentation
DL-based models, particularly convolutional neural networks, have demonstrated their effectiveness in medical imaging
[28][30]. A traditional set up of a CNN-based DL model is presented in
Figure 3. CNNs are designed to automatically learn the features that distinguish between tumor and non-tumor regions by processing the image using a series of kernels, convolutional and pooling layers, activations, and loss functions. CNNs have the advantage of being able to obtain features from the input, allowing them to capture complex patterns and variations in image intensity and texture
[5].
Figure 3. Neural-network-based model [28]. Neural-network-based model [30].
Several DL-based models have been developed for brain tumor segmentation, including V-Net, U-Net, and their variants. U-Net is a popular CNN architecture that consists of a contracting path to capture the contextual information of the input image and an expanding path to produce the segmentation map. V-Net is an extension of U-Net that includes a 3D CNN for processing volumetric data. These architectures have achieved state-of-the-art performance on various brain tumor segmentation challenges, such as the Multimodal Brain Tumor Segmentation Challenge (BRATS).
3.1. Neural-Network-Based Models
Several network structures, including CNNs and RNNs, have been proposed for brain tumor segmentation. CNNs are a type of deep neural network that has shown remarkable success in various image-processing tasks, including object recognition, segmentation, and classification. CNNs consist of multiple convolutional layers that extract the features of the input image and a fully connected layer that produces the segmentation map
[29][31]. RNNs are another type of neural network that can process sequential data, such as time series or sequences of images. RNNs use feedback connections to retain information about the previous inputs and produce outputs that depend on the current and past inputs. RNNs have been applied to brain tumor segmentation by processing sequences of images and incorporating spatial and temporal information into the segmentation.
Several studies have compared the performance of different neural network structures for brain tumor segmentation. Akilan et al.
[30][32] proposed a 3D CNN-LSTM and found that the LSTM architecture achieved the highest segmentation accuracy. Another study by Zhao et al.
[31][33] proposed a CNN-RNN architecture that combines a 3D MRI brain tumor segmentation to improve segmentation accuracy.
CNNs and RNNs have been widely used for brain tumor segmentation due to their potential to learn intricate features from the input image and process sequential data. The choice of the most appropriate neural network structure depends on the specific application and available resources. Further research is needed to explore the potential of these structures in brain tumor segmentation.
3.2. Specialized Neural-Network-Based Model for Medical Imaging
U-Net and V-Net are two dedicated architectures that have been proposed for medical image segmentation, including brain tumor segmentation. U-Net is a fully convolutional neural network that involves an expanding path and a contracting path. The contracting path captures the context of the input image by reducing the spatial dimensions, while the expanding path produces the segmentation map by increasing the spatial dimensions
[27][29].
Maqsood et al.
[32][34] used fuzzy-logic-based edge detection and U-NET CNN classification. The approach involves preprocessing, edge detection, and feature extraction using wavelet transform, followed by CNN-based classification. The method outperforms other approaches in terms of accuracy and visual quality. Nawaz et al.
[33][35] combine DenseNet77 in the encoder and U-NET in the decoder. The method achieves high segmentation accuracy on ISIC-2017 and ISIC-2018 datasets. Meraj et al.
[34][36] used a quantization-assisted U-Net approach for accurate breast lesion segmentation in ultrasonic images. It combined U-Net segmentation with quantization and Independent Component Analysis (ICA) for feature extraction.
V-Net is an extension of U-Net that uses a 3D convolutional neural network for processing volumetric data. Isensee et al.
[28][30] compared the performance of various deep-learning-based methods, including U-Net, V-Net, and other architectures, and found that U-Net achieved the best segmentation accuracy. The authors introduced a U-Net Cascade approach for large image datasets. Their approach contained two stages. Stage 1 trained a 3D U-Net on downsampled images; it enhanced the results of the image and fed it as inputs for Stage 2. Stage 2 used a second 3D U-Net for patch-based training at full resolution. Their two-stage structure is shown in
Figure 4.
Figure 4. Specialized deep learning approaches [28]. Specialized deep learning approaches [30].
Several studies have compared the performance of U-Net and V-Net with other deep-learning-based methods for brain tumor segmentation. Wang et al.
[35][37] proposed a U-Net-based method that uses a multi-scale feature extraction strategy to improve segmentation accuracy. Other dedicated architectures, such as DeepMedic, BRATS-Net, and Tiramisu-Net, have also been proposed for brain tumor segmentation. DeepMedic is a hybrid CNN that combines a multi-scale architecture with a multi-path architecture to improve segmentation accuracy. BRATS-Net is a CNN architecture that includes a cascaded network and a refinement network for processing multimodal brain images. Tiramisu-Net is a densely connected convolutional network that uses skip connections to capture the features at different scales. Thus, there are several dedicated architectures that have also shown promising results in improving segmentation accuracy and robustness. The choice of the most appropriate architecture depends on the specific application and available resources.
4. Federated Learning for Medical Imaging
FL is a novel machine learning technique that enables multiple parties to collaboratively train a global model without sharing their sensitive data
[36][38]. FL has been widely used in the context of medical imaging, where privacy and security concerns are of paramount importance. In FL, each participating client trains a local model on its data and shares only the model weights with the central server. The central server aggregates the model w and sends the updated global model back to the clients. FL allows medical institutions to collaborate and leverage their data to improve the accuracy and generalization of machine learning models while preserving the privacy and security of the data.
Several studies have demonstrated the effectiveness of FL in medical imaging tasks, including brain tumor segmentation. Khan et al.
[37][39] proposed an FL-based framework for brain tumor segmentation that allowed multiple medical institutions to collaborate on the training of a deep learning model without sharing their data. The FL-based method achieved comparable segmentation accuracy to the centralized learning method while preserving the privacy and security of the data. Arikumar et al.
[38][40] proposed an FL-based method for multimodal brain tumor segmentation that allowed multiple institutions to collaboratively train a global model using both MRI and histology data. The FL-based method achieved higher segmentation accuracy than the traditional centralized learning method while preserving the privacy and security of the data, as shown in
Figure 5.
Figure 5. Federated learning [FL]-based model [37]. Federated learning [FL]-based model [39].
FL is a promising technique for medical tasks, including brain tumor segmentation, that enables multiple institutions to collaborate on the training of deep learning models without compromising the privacy and security of the data. Further research is needed to explore the potential of FL in medical imaging and to address the challenges related to the communication, computation, and heterogeneity of the data.
4.1. Data Confidentiality and Safety Concerns
Data confidentiality and safety concerns are critical issues in medical imaging, especially in the context of brain tumor segmentation, where patient data is highly sensitive and confidential. Deep-learning-based methods require large amounts of patient data for training, and the sharing of such data among multiple institutions raises concerns about data privacy, security, and ownership
[39][41]. Several studies have addressed data confidentiality and safety concerns in medical imaging by proposing secure and privacy-preserving methods for data sharing and machine learning. Sheller et al.
[40][42] proposed a secure data-sharing method that uses homomorphic encryption to enable multiple institutions to collaborate on the training of a machine learning model without revealing sensitive data. It achieved comparable performance to the traditional centralized learning method while preserving the privacy and security of the data. Wang et al.
[35][37] proposed a blockchain-based method for sharing medical data among multiple institutions while maintaining data ownership and privacy. The authors demonstrated that the method provides a secure and transparent mechanism for data sharing and machine learning while preserving the privacy and security of the data.
Moreover, several studies have addressed the legal and ethical implications of data sharing and machine learning in medical imaging. Jiang et al.
[41][43] proposed a framework for responsible data sharing in medical research that takes into account legal, ethical, and social considerations. Data confidentiality and safety concerns are critical issues in medical imaging, especially in the context of brain tumor segmentation. Secure and privacy-preserving methods, such as homomorphic encryption and blockchain, have been proposed to enable data sharing and machine learning while preserving the privacy and security of the data. Further research is needed to explore the potential of these methods in medical imaging and to address the legal and ethical implications of data sharing and machine learning.
4.2. Cooperative Learning for Better Performance
Cooperative learning, also known as collaborative learning, is a method that involves multiple learners working together to achieve a common goal. In the context of brain tumor segmentation, cooperative learning has been used to improve the performance of machine learning models by leveraging the collective intelligence of multiple experts
[42][44]. Several studies have explored the potential of cooperative learning for brain tumor segmentation. Amiri et al.
[43][45] proposed a cooperative learning framework that combines the outputs of multiple deep learning models to improve segmentation accuracy. The authors stated that the cooperative learning framework achieved higher segmentation accuracy than individual models and traditional ensemble learning methods. Witowski et al.
[44][46] proposed a cooperative learning method that involves multiple experts in a crowdsourcing platform to annotate brain tumor images. The authors demonstrated that the cooperative learning method achieved higher annotation accuracy and consistency than individual experts and traditional methods.
Moreover, several studies have investigated the impact of cooperative learning on the interpretation and generalization of machine learning models. Abramoff et al. proposed a cooperative learning method that involves radiologists and deep learning models to jointly interpret brain MRI images. The proposed cooperative learning method achieved higher diagnostic accuracy and interpretability than individual radiologists and deep learning models.
Cooperative learning is a promising method for improving the performance and interpretability of machine learning models in brain tumor segmentation. Further research is needed to explore the potential of cooperative learning in other medical imaging tasks and to address the challenges related to the communication, computation, and heterogeneity of the data.
4.3. Federated Learning Applications
FL is a novel machine learning technique that has been applied in various domains, including healthcare, finance, and the internet of things (IoT). In healthcare, FL has been used for various tasks, such as disease diagnosis, drug discovery, and medical imaging analysis
[45][47]. Chen et al.
[46][48] proposed an FL-based framework for personalized cancer diagnosis that allowed multiple hospitals to collaboratively train a DL model without sharing their patient data. The FL-based method achieved higher accuracy than the traditional centralized learning method while preserving the privacy and security of the data. Dayan et al.
[47][49] proposed a FL-based method for predicting COVID-19 severity using chest X-ray images. The authors showed that the FL-based method achieved higher accuracy than the traditional centralized learning method while preserving the privacy and security of the data.
Moreover, several studies have explored the potential of FL for medical image analysis tasks, such as brain tumor segmentation, retinal image analysis, and skin lesion classification. Zhang et al.
[48][50] present an FL-based method for retinal image analysis that allowed multiple medical institutions to collaborate on the training of a deep learning model without sharing their patient data. The authors demonstrated that the FL-based method achieved higher accuracy than the traditional centralized learning method while preserving data security privacy.
FL is a promising technique for various ML applications, including healthcare and medical imaging analysis. However, further research is needed to explore the potential of FL in other domains and to address the challenges related to the communication, computation, and heterogeneity of the data.