1. The Classification Methods of COVID-19 Based on Supervised Learning
Supervised-learning-based COVID-19 classification methods use supervised learning to extract features and train a deep network model to distinguish COVID-19. This section focuses on the application of VGG, ResNet, DenseNet, and lightweight networks to the COVID-19 classification task, as shown in Table 1.
Table 1.
COVID-19 classification methods based on supervised learning.
Table 1 supports the following observations. Firstly, ResNet is among the most widely used backbones for COVID-19 classification. Of the 16 models surveyed, three are based on VGG (18.75%), five on ResNet (31.25%), three on DenseNet (18.75%), and five on lightweight networks (31.25%). Secondly, research on COVID-19 classification relies mainly on public datasets. The COVID-19 datasets include X-ray datasets, CT datasets, and dual-modality datasets that mix X-ray and CT. About 75% of the datasets are public and 25% are private. Thirdly, 30% of the classification tasks are binary classification and 46.66% are three-class classification. Finally, relatively few models have open-source, publicly available code: 25% of the models have public code, and 75% do not.
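The backbone shares quoted above can be reproduced from the per-backbone model counts given in the text. The snippet below is only an arithmetic check; the dictionary name and helper function are illustrative and not part of the original survey.

```python
from fractions import Fraction

# Per-backbone model counts as stated in the text for Table 1.
counts = {"VGG": 3, "ResNet": 5, "DenseNet": 3, "Lightweight": 5}


def shares(category_counts):
    """Return each category's share of the total, in percent."""
    total = sum(category_counts.values())
    return {
        name: float(Fraction(n, total) * 100)
        for name, n in category_counts.items()
    }


table1_shares = shares(counts)
for name, pct in table1_shares.items():
    print(f"{name}: {pct:.2f}%")
```

The counts sum to 16 models, which is the denominator behind the 18.75% and 31.25% figures.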
1.1. The Classification Methods of COVID-19 Based on VGG
VGG
[61][17] is a deep convolutional network proposed by Simonyan and Zisserman. It consists of five convolutional blocks, and the whole network uses convolutional kernels of a fixed size of 3 × 3. The first two convolutional blocks each use two successive 3 × 3 convolutions, and the last three blocks each use three. The advantages of VGG are its simple structure and the ease with which it can be modified. VGG improves model performance by increasing the number of network layers and thereby learning more refined deep features. The main work using VGG as the backbone network for the COVID-19 classification task is as follows. Ibrahim et al.
[47][1] used X-ray and CT images in VGG19 to provide complementary information for the classification of COVID-19. Elazab et al.
[48][2] extracted the average high-level features of infected and healthy cases using a pre-trained VGG. The extracted features were then used as supervisory signals to retrain VGG, reducing the effect of noise and outliers in the feature space. Danilov et al.
[49][3] achieved the best results with VGG16 among 10 pre-trained networks by using generated attention heat maps to supervise the networks so that they focus on the objects in the images.
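The block structure described above can be turned into a rough parameter count. The sketch below assumes the standard VGG16 configuration (channel widths 64, 128, 256, 512, 512; a 224 × 224 RGB input; three fully connected layers ending in 1000 classes); these values come from the original VGG design rather than from this survey, and the function names are illustrative.

```python
# Assumed VGG16 layout: (number of 3x3 convolutions, output channels) per block.
VGG16_BLOCKS = [(2, 64), (2, 128), (3, 256), (3, 512), (3, 512)]


def conv_params(in_ch, out_ch, k=3):
    """Weights plus biases of one k x k convolution."""
    return k * k * in_ch * out_ch + out_ch


def fc_params(in_features, out_features):
    """Weights plus biases of one fully connected layer."""
    return in_features * out_features + out_features


def vgg16_param_counts(in_ch=3, num_classes=1000):
    """Return (convolutional parameters, fully connected parameters)."""
    conv_total = 0
    ch = in_ch
    for n_convs, out_ch in VGG16_BLOCKS:
        for _ in range(n_convs):
            conv_total += conv_params(ch, out_ch)
            ch = out_ch
    # Five 2x2 poolings reduce an assumed 224x224 input to a 7x7 feature map.
    fc_total = (
        fc_params(ch * 7 * 7, 4096)
        + fc_params(4096, 4096)
        + fc_params(4096, num_classes)
    )
    return conv_total, fc_total


conv_total, fc_total = vgg16_param_counts()
print(conv_total, fc_total, conv_total + fc_total)
```

Under these assumptions, the three fully connected layers hold far more parameters than all thirteen convolutional layers combined, which is why VGG-based models tend to be memory-heavy.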
1.2. The Classification Methods of COVID-19 Based on ResNet
ResNet
[62][18] is a convolutional network with a residual connection structure proposed in 2015. Increasing network depth improves feature extraction capability. However, the vanishing gradient problem causes a sharp drop in model performance when the network is deepened. ResNet uses residual connections to alleviate the vanishing gradient problem in deep neural networks. The residual connections add identity mappings between layers so that a layer's input is passed directly to a later layer. In the COVID-19 classification task, ResNet is used to alleviate the vanishing gradient problem and improve the classification accuracy of the model. Toğaçar et al.
[37][4] proposed a parallel ResNet model that uses ResNet-18, ResNet-50, and ResNet-101 to extract different activation sets and improves classification performance by selecting the dominant activation set. Agrawal et al.
[50][5] added squeeze-and-excitation blocks to residual blocks to enhance lower-level feature representation and avoided overfitting by keeping the number of trainable parameters minimal. Cheng et al.
[51][6] improved classification performance by enriching feature information in a single residual block. Yu et al.
[52][7] used a graph convolutional neural network to model the underlying relationships among ResNet101 features, enhancing the combined features and improving classification performance. Wang et al.
[53][8] proposed a 3D ResNet-based classification network for COVID-19 in CT images. The residual learning block reduces the complexity of the a priori attention mechanism for transferring the pre-trained detection model.
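The effect of the identity mapping on gradients can be illustrated with a toy scalar model. The sketch below assumes each layer is the linear map f(x) = w·x, so a plain layer has derivative w and a residual layer y = x + f(x) has derivative 1 + w; real networks are nonlinear, so this is only an illustration of the mechanism, and the function names are hypothetical.

```python
def plain_gradient(weights):
    """d(output)/d(input) for a chain of plain layers y = w * x."""
    grad = 1.0
    for w in weights:
        grad *= w
    return grad


def residual_gradient(weights):
    """d(output)/d(input) for a chain of residual layers y = x + w * x."""
    grad = 1.0
    for w in weights:
        grad *= 1.0 + w  # the identity path contributes the constant 1
    return grad


small_weights = [0.1] * 20  # 20 layers, each with a small weight
print(plain_gradient(small_weights))     # shrinks toward zero with depth
print(residual_gradient(small_weights))  # stays away from zero
```

With small per-layer weights the plain chain's gradient collapses toward zero, whereas the residual chain keeps a gradient path through the identity term.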
1.3. The Classification Methods of COVID-19 Based on DenseNet
DenseNet
[63][19] is a convolutional network with a densely connected structure proposed by Huang et al. DenseNet connects each layer to all subsequent layers, so each layer receives inputs from all the previous layers
[64][20]. Feature reuse is achieved by concatenating these features as the input to the next module. The dense connections help gradients propagate backward. DenseNet achieves better performance than ResNet in terms of parameters and computational cost. With DenseNet as the backbone network in the COVID-19 classification task, dense connections pass the feature maps of the current layer to all subsequent layers. This feature reuse reduces the number of parameters and alleviates overfitting. Montalbo et al.
[54][9] used DenseNet as a COVID-19 classification network and reduced the number of parameters by reducing the network depth and increasing the network width. Park et al.
[55][10] proposed a DenseNet-based self-attention mechanism that classifies COVID-19 X-ray images by feeding features into a transformer. Li et al.
[56][11] applied spatial attention and channel attention to DenseNet, which enhanced the target lesion features and suppressed less relevant features to improve the COVID-19 classification accuracy.
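Because each layer concatenates the outputs of all earlier layers, the number of input channels grows linearly inside a dense block. The sketch below tracks that growth; the starting width of 64 channels and growth rate of 32 are illustrative assumptions, and the function name is hypothetical.

```python
def dense_block_channels(in_channels, growth_rate, num_layers):
    """Return (input channels seen by each layer, channels leaving the block).

    Every layer emits `growth_rate` new feature maps, which are concatenated
    with everything produced before it.
    """
    layer_inputs = []
    channels = in_channels
    for _ in range(num_layers):
        layer_inputs.append(channels)
        channels += growth_rate  # concatenation adds the new feature maps
    return layer_inputs, channels


layer_inputs, out_channels = dense_block_channels(64, 32, 6)
print(layer_inputs, out_channels)
```

Each layer only has to produce a small number of new feature maps, which is how feature reuse keeps the parameter count low, while the growing concatenated input is the source of the memory cost at larger depths.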
1.4. The Classification Methods of COVID-19 Based on Lightweight Networks
Lightweight COVID-19 classification methods aim to minimize the number of parameters without degrading model performance. Current lightweight networks mainly include SqueezeNet, ShuffleNet, MobileNet, Inception, etc. Model size can be effectively compressed by using lightweight networks or by introducing depthwise separable convolution, group convolution, or channel shuffling. Lightweight methods have received considerable attention from researchers. COVID-19 classification based on lightweight networks can effectively improve training speed and facilitates deployment on mobile devices. Chakraborty et al.
[57][12] designed a lightweight deep neural network, Corona-Nidaan, which uses depthwise separable convolution and multiscale convolution kernels. Das et al.
[58][13] truncated InceptionV3 to obtain a lightweight model and used max pooling and global average pooling to reduce the feature dimension, the number of parameters, and the computational complexity
[65][21]. Ahmed et al.
[1][14] added an attention module with a one-dimensional vector representation on Inception-ResNet-V2 to evaluate this convolutional neural network and detect COVID-19 using CT images. Owais et al.
[59][15] integrated complementary feature information through an ensemble of multiple lightweight network models to detect COVID-19 from CT and X-ray images. Abraham et al.
[60][16] proposed an ensemble of lightweight networks that applies a correlation-based feature selection algorithm to features from SqueezeNet, DarkNet-53, MobileNetV2, Xception, and ShuffleNet to determine the best subset of features and improve accuracy.
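The compression techniques named in this subsection can be compared by counting convolution weights. The sketch below is a minimal illustration with biases ignored; the layer sizes (a 3 × 3 kernel, 128 input channels, 256 output channels, 4 groups) are arbitrary assumptions, and the function names are hypothetical.

```python
def standard_conv_weights(k, c_in, c_out):
    """Weights of a standard k x k convolution."""
    return k * k * c_in * c_out


def depthwise_separable_weights(k, c_in, c_out):
    """One k x k filter per input channel, followed by a 1x1 pointwise conv."""
    return k * k * c_in + c_in * c_out


def group_conv_weights(k, c_in, c_out, groups):
    """Each group convolves only its own slice of the channels."""
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    return k * k * (c_in // groups) * (c_out // groups) * groups


def channel_shuffle(channels, groups):
    """Interleave channels so the next group convolution mixes all groups."""
    per_group = len(channels) // groups
    return [
        channels[g * per_group + i]
        for i in range(per_group)
        for g in range(groups)
    ]


print(standard_conv_weights(3, 128, 256))
print(depthwise_separable_weights(3, 128, 256))
print(group_conv_weights(3, 128, 256, groups=4))
print(channel_shuffle(list(range(6)), groups=2))
```

For this example layer, the depthwise separable form needs roughly one ninth of the weights of the standard convolution, and the group convolution needs one quarter.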
To sum up, the VGG-based COVID-19 classification methods use different scales and alternating training methods during the training process, which allows convergence in fewer epochs and reduces training time. They also show good transferability and strong generalization ability. However, the VGG-based models use three fully connected layers with a very large number of parameters, which requires more memory and computational resources. The ResNet-based methods use residual connections to pass shallow features directly to deeper layers. Residual connectivity alleviates the vanishing gradient and network degradation problems and yields better COVID-19 classification performance. However, the ResNet-based models have many layers, which makes them difficult to optimize, and they contain a lot of redundant information. The DenseNet-based methods use dense concatenation to transfer shallow-layer information directly to deep layers for further use. This reduces redundant information and gives better convergence rates and fewer parameters for the same number of layers. However, as the number of layers increases, features are replicated multiple times, resulting in higher computational and space complexity. The methods based on lightweight networks reduce the number of parameters and the amount of computation to some degree. A few lightweight models are commonly deployed on mobile devices.
2. The Segmentation Methods of COVID-19 Based on Supervised Learning
Image segmentation is an essential task in medical image processing for analyzing medical images and obtaining further diagnostic information. U-Net is an image segmentation network with a classical downsampling and upsampling structure, which was proposed by Ronneberger et al.
[66][22]. U-Net can obtain accurate segmentation results with relatively few training samples. U-Net locates COVID-19 lesion regions using deep features and achieves accurate segmentation results using shallow features. This section summarizes the applications of U-Net to COVID-19 image segmentation in combination with the attention mechanism, multi-scale mechanism, residual connectivity mechanism, and dense connectivity mechanism, as shown in
Table 2.
Table 2 supports the following observations. Firstly, U-Net combined with the attention mechanism and U-Net combined with the multi-scale mechanism are the most frequently adopted approaches for COVID-19 lesion segmentation. Five segmentation models are based on the attention mechanism, accounting for 31.25% of all the U-Net-based methods. Five are based on the multi-scale mechanism (31.25%), four on the residual connection mechanism (25%), and two on the dense connection mechanism (12.5%). Secondly, CT datasets are used by all the U-Net-based COVID-19 segmentation methods, and X-ray is rarely used as a segmentation dataset. Thirdly, most COVID-19 segmentation methods are studied on public datasets, which account for 81.81% of all the datasets. Fourthly, relatively few models have open-source code: 37.5% of the models have public code, and 62.5% do not.
2.1. The Segmentation Methods of COVID-19 Based on Attention Mechanism
Attention-based COVID-19 segmentation methods compute an attention distribution from the input information and obtain a context vector in order to selectively focus on the key information of the lesion region in the COVID-19 image. This approach selects the information most critical to the segmentation task from among redundant information and improves segmentation performance by efficiently selecting segmentation features in COVID-19 images. The attention mechanism has received considerable attention from researchers since it was proposed, and the following work has been done on COVID-19 lesion segmentation. Liu et al.
[67][23] used spatial and channel attention in U-Net to segment COVID-19 lung infections. The problems of unclear boundaries and complex lesions were addressed by combining upper-level and lower-level features. Karthik et al.
[68][24] introduced contour attention in the last two decoders to refine the infected region for COVID-19 lesion segmentation. The noise inherent in the coarse contour region is discarded by combining structural features carrying shape and boundary information with deep semantic feature maps. Zhao et al.
[69][25] applied gated attention between the encoder and decoder to suppress irrelevant information and noise and to refine the upsampled features. Xie et al.
[70][26] proposed a model based on expansive attention applied between encoder and decoder to solve the redundancy problem in high-level and low-level feature channels. Kitrungrotsakul et al.
[71][27] proposed an interactive attention mechanism in U-Net that uses residual attention to emphasize the lesion regions that matter most for segmentation.
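The general idea of computing an attention distribution and a context vector can be sketched as follows. This is a generic dot-product attention written in plain Python under the assumption that queries, keys, and values are small lists of floats; it does not reproduce any of the specific attention modules cited above, and the function names are illustrative.

```python
import math


def softmax(scores):
    """Turn raw scores into an attention distribution that sums to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys, values):
    """Return (attention weights, context vector) for one query."""
    scores = [sum(q * k for q, k in zip(query, key)) for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    context = [
        sum(w * value[d] for w, value in zip(weights, values))
        for d in range(dim)
    ]
    return weights, context


# Two positions: the query matches the first key much more strongly.
weights, context = attend(
    query=[10.0, 0.0],
    keys=[[1.0, 0.0], [-1.0, 0.0]],
    values=[[2.0], [4.0]],
)
print(weights, context)
```

The context vector is dominated by the value at the position whose key best matches the query, which is the sense in which attention selects the relevant features and suppresses the rest.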
2.2. The Segmentation Methods of COVID-19 Based on Multi-scale Mechanism
Multi-scale COVID-19 segmentation methods use multi-scale features obtained from different receptive fields for COVID-19 lesion segmentation. Multi-scale feature extraction reduces the loss of lesion edge and spatial location information and improves segmentation performance by combining information from receptive fields of several scales. Scale-invariant features are learned without loss of information, which improves the segmentation accuracy for weak lesions and boundaries. To address the varying sizes of COVID-19 lesions, blurred boundaries, and the gap between high-level and low-level features, researchers have mainly carried out the following work. Zheng et al.
[72][28] proposed a multiscale discriminative segmentation network, MSD-Net, which uses pyramidal convolution blocks to provide multiscale receptive fields for input feature maps. Pei et al.
[73][29] used grouped convolution in an encoder to achieve multiscale feature extraction. Bose et al.
[74][30] proposed a deep multiscale segmentation network in which the deep multiscale module captures objects of multiple spatial sizes from feature maps at different depths. Yan et al.
[75][31] proposed a 3D segmentation network, COVID-SegNet, which implicitly enhances the contrast and adaptively adjusts intensity on the feature layer to capture effective features of different scales. Wu et al.
[34][32] enhanced segmentation features by aggregating different scale feature maps from different stages to segment COVID-19 lesions.
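One common way to obtain receptive fields of several scales is to run parallel convolutions with different dilation rates, or to stack convolutions. The sketch below computes the resulting receptive field sizes for stride-1 convolutions; the use of dilation is an assumption made for illustration and is not claimed to be the mechanism of any cited model, and the function names are hypothetical.

```python
def effective_kernel(k, dilation):
    """Spatial extent covered by a k x k kernel with the given dilation."""
    return k + (k - 1) * (dilation - 1)


def stacked_receptive_field(layers):
    """Receptive field of stride-1 convolutions applied in sequence.

    `layers` is a list of (kernel size, dilation) pairs.
    """
    rf = 1
    for k, dilation in layers:
        rf += effective_kernel(k, dilation) - 1
    return rf


# Three parallel 3x3 branches with dilation 1, 2, and 4 see different scales.
branch_fields = [effective_kernel(3, d) for d in (1, 2, 4)]
print(branch_fields)

# Stacking 3x3 convolutions also enlarges the receptive field.
print(stacked_receptive_field([(3, 1), (3, 1)]))
print(stacked_receptive_field([(3, 1), (3, 1), (3, 1)]))
```

Combining the outputs of such branches gives each position access to both fine detail and wider context, which is the motivation for multi-scale modules when lesions vary in size.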
2.3. The Segmentation Methods of COVID-19 Based on Residual Connectivity Mechanism
Residual-connection-based COVID-19 segmentation methods pass features across network layers through skip connections, or replace convolutional layers with residual blocks, for COVID-19 lesion segmentation. Introducing the residual mechanism improves feature reuse, ensures the back propagation of gradients, and alleviates the degradation problem caused by deep networks. Hu et al.
[76][33] proposed a COVID-19 segmentation network using ResUnet as the backbone to reduce the semantic gap between high-level and low-level feature maps. Zhou et al.
[77][34] introduced residual connections into a U-Net-based COVID-19 lesion segmentation network to improve segmentation performance by integrating segmentation information from different levels. Yang et al.
[78][35] reduced the contextual semantic gap by concatenating the outputs of three series of convolutional layers through residual connections in the skip connection structure. Chen et al.
[79][36] used a topology of residual connections in U-Net to better learn latent representations of the input CT image, capturing complex features from the original image to segment COVID-19 lesions.
2.4. The Segmentation Methods of COVID-19 Based on Dense Connectivity Mechanism
Dense-connectivity-based COVID-19 segmentation methods use dense connections between layers of the network to achieve feature reuse for COVID-19 lesion segmentation. Dense connectivity can reduce the interdependence between layers and ease the optimization difficulty caused by vanishing gradients in deep networks. Zhao et al.
[80][37] proposed a U-Net++-based COVID-19 segmentation model, SCOAT-Net, which further reduces the semantic gap and produces fine segmentation results through nested dense skip pathways. Raj et al.
[81][38] used two dense networks in place of the traditional convolutions in U-Net to enhance global feature propagation, encourage feature reuse, and accelerate information transfer, improving segmentation accuracy.
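The nested dense skip pathways of a U-Net++-style network can be described by listing which nodes feed each node. The sketch below assumes the usual U-Net++ indexing, where node (i, j) sits at resolution level i and position j along the skip pathway; it only enumerates connections and is not an implementation of SCOAT-Net or any other cited model.

```python
def unetpp_inputs(i, j):
    """Return the nodes that feed node (i, j) in a U-Net++-style grid.

    Assumed convention: i is the resolution level (0 is the finest) and
    j is the position along the dense skip pathway (0 is the encoder).
    """
    if j == 0:
        # Encoder node: fed by the downsampled encoder node one level up.
        return [(i - 1, 0)] if i > 0 else []
    # Dense skip: every earlier node on the same level is concatenated...
    same_level = [(i, k) for k in range(j)]
    # ...together with the upsampled node from the level below.
    return same_level + [(i + 1, j - 1)]


for node in [(0, 0), (0, 1), (0, 2), (0, 3)]:
    print(node, "<-", unetpp_inputs(*node))
```

The number of concatenated inputs grows with j, which reflects both the feature reuse these models rely on and the larger parameter and memory cost of densely connected segmentation networks.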
To sum up, the attention-based COVID-19 segmentation methods have the advantage of ignoring irrelevant feature information and selecting important feature information. However, segmentation models with an added attention mechanism usually have a complex structure, and it is difficult to find a simple, lightweight segmentation model with an attention mechanism. The multi-scale methods obtain features of different sizes by combining multi-scale receptive fields and thus improve the recognition of multi-scale targets. However, using receptive fields of different scales may lose feature continuity. The residual-connectivity methods speed up gradient back propagation, which prevents unstable training and mitigates vanishing gradients. However, they lack the ability to extract segmentation features across the full range of scales. The dense-connectivity methods improve the efficiency of feature information transmission, which helps address the loss of image detail and improves segmentation performance. However, the resulting segmentation models usually have a complex network structure and a large number of parameters.