Insulator Defect Detection: Comparison
Please note this is a comparison between Version 2 by Wendy Huang and Version 1 by deng chong.

Insulators are important components of high-voltage transmission lines: they electrically isolate and mechanically support the conductors. Because of long-term outdoor exposure to sunlight, rain, climate changes, and chemical corrosion, insulators often suffer from self-explosion defects, which break insulator strings, degrade their performance, and threaten the safety and stability of power systems. Insulator inspection methods generally fall into two types. The first is manual inspection, in which workers observe insulators directly to identify defective parts; this method is time-consuming and unsafe. The second is intelligent inspection, in which drones carrying edge detection equipment inspect insulators regularly and locate defective parts effectively. This is the current mainstream inspection method.

  • insulators
  • defect detection
  • deep learning
  • feature extraction
  • R-CNN
  • ResNet
  • EfficientNet

1. Introduction

Currently, insulator defect detection relies mainly on traditional methods and deep learning methods. Traditional detection methods primarily separate insulators from the background based on image features such as size, texture, and color [1]. For example, Tan et al. [2] proposed a fusion algorithm based on insulator contour features and grayscale similarity matching. It extracts the contours of insulator pieces, separates them accurately, and builds a defect detection model from the spacing between insulator pieces and grayscale similarity matching. Liu et al. [3] proposed an edge-based segmentation method for insulator strings. It uses a multi-scale morphological gradient algorithm to extract the edges of insulator strings and determine the largest connected region, which helps address the mis-segmentation of iron caps and umbrella discs caused by edge loss in infrared images of insulator strings. However, these traditional detection methods extract features inefficiently, generalize poorly, and have difficulty recognizing small-scale and high-likelihood objects in images [4].
Because traditional detection methods can no longer meet modern requirements for feature extraction capability and anti-interference ability, many scholars have turned to deep learning methods. For example, Guo et al. [5] used a lightweight target detection network called CenterNet-GhostNet to address the large number of parameters in insulator defect detection models, which makes them difficult to deploy on the edge devices of unmanned aerial vehicles. This network significantly reduces the number of parameters while slightly increasing detection accuracy, thereby improving detection speed. Jia et al. [6] proposed a lightweight detection method called MDD-YOLOv3. The improved YOLOv3 can quickly and accurately recognize and locate insulator defects in complex backgrounds. Li et al. [7] proposed a method that uses multi-scale feature encoding and dual attention fusion to improve the accuracy and speed of detecting insulator defects in transmission lines. It offers a useful reference for accurate insulator defect detection by unmanned aerial vehicles. In summary, compared with traditional manual feature extraction, deep learning-based detection methods extract target features automatically and accurately and generalize better.
In recent years, with the development of “intelligentization” in the power system, combining drone-collected insulator defect data with computer vision technology has become a popular method for intelligent inspection [8,9]. However, deep learning-based object detection networks usually have a large number of parameters and require substantial computational resources for training and inference, which limits their deployment in practical applications. Therefore, the construction of lightweight detection models has become crucial [10,11,12].
Existing deep learning detection methods can be divided into two main categories. One is the two-stage detection model, represented by R-CNN, Faster R-CNN, and Mask R-CNN. These algorithms process an image in two stages: (1) candidate region acquisition and (2) classification and regression of the candidate regions. The other is the single-stage detection model, represented by the YOLO series, which obtains candidate regions and categories simultaneously through joint decoding. Among them, the Faster R-CNN model, as a representative two-stage network, has a more pronounced advantage in high-precision, multi-scale, and small object detection tasks. However, the original Faster R-CNN (ResNet) model has three significant drawbacks. Firstly, its feature extraction capability is relatively poor: the original ResNet cannot effectively extract both high-level semantic information and low-level fine-grained features from images, so deeper feature maps struggle to retain information about small objects. Secondly, the network’s inference speed is slow, because the original model contains a lot of redundant information. Finally, the network’s training hyperparameters are not well optimized. For instance, with the original learning rate settings, training can easily get stuck in local optima, which degrades overall model performance.

2. Faster R-CNN

Defect detection involves two tasks: defect classification and localization. The two-stage Faster R-CNN [13], which achieves high accuracy in object detection tasks, is chosen as the base network structure for the lightweight model. Its working principle is to first identify and locate defective insulators in an image, then enclose them in rectangles, and, finally, mark their categories near the rectangles. Faster R-CNN is a two-stage object detection network proposed by Ren et al. [13], building upon the foundations of R-CNN and Fast R-CNN. As shown in Figure 1, the Faster R-CNN network structure consists of four parts: the backbone network, the Region Proposal Network (RPN), the Region of Interest (RoI) pooling, and the detection network. The backbone network is a ResNet built by stacking 7 × 7 convolutions of stride 2 and 3 × 3 convolutions of stride 2. The RPN is a feature-processing part composed of a 3 × 3 depthwise separable convolution (DWConv) followed by two parallel 1 × 1 convolutions. The detection network consists of two parallel fully connected (FC) layers.
Figure 1. The original Faster R-CNN network structure.
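The RoI pooling part of this structure can be illustrated with a minimal NumPy sketch. It assumes the region of interest is already given in integer feature-map coordinates with an exclusive end; the function name `roi_max_pool` and its arguments are hypothetical and chosen for illustration only, and real implementations also handle coordinate scaling and batching.

```python
import numpy as np

def roi_max_pool(feature_map, roi, output_size=7):
    """Max-pool one region of interest to a fixed output_size x output_size grid.

    feature_map: array of shape (C, H, W).
    roi: (x1, y1, x2, y2) in feature-map coordinates, end-exclusive (assumption).
    """
    x1, y1, x2, y2 = roi
    region = feature_map[:, y1:y2, x1:x2]
    channels, h, w = region.shape

    # Split the region into output_size bins along each axis.
    ys = np.linspace(0, h, output_size + 1)
    xs = np.linspace(0, w, output_size + 1)

    out = np.empty((channels, output_size, output_size), dtype=feature_map.dtype)
    for i in range(output_size):
        y_lo = int(np.floor(ys[i]))
        y_hi = max(int(np.ceil(ys[i + 1])), y_lo + 1)  # at least one row per bin
        for j in range(output_size):
            x_lo = int(np.floor(xs[j]))
            x_hi = max(int(np.ceil(xs[j + 1])), x_lo + 1)  # at least one column
            out[:, i, j] = region[:, y_lo:y_hi, x_lo:x_hi].max(axis=(1, 2))
    return out
```

Whatever the size of the candidate box, the output has a fixed spatial size, which is what lets the fully connected detection layers accept it.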
The entire algorithm process is divided into several parts. First, the backbone network extracts features from preprocessed images by capturing multi-scale information with inter-channel interactions. Then, these features are used as input for the RPN, which generates candidate boxes. The candidate boxes are mapped to the feature map output by the backbone network. The obtained feature matrix is passed through the RoI Pooling layer, resulting in a 7 × 7 feature map. Finally, the detection network utilizes the feature map to obtain class information and bounding box regression parameters. The candidate boxes are adjusted using the bounding box regression parameters to obtain the final target position. To address the low accuracy and slow speed issues of the original model in insulator defect detection, we propose a lightweight defect detection model based on Faster R-CNN-tiny. The aim is to make the original detection model more suitable for future edge deployment requirements.
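The final adjustment step, in which candidate boxes are refined with the bounding box regression parameters, can be sketched as follows. This is a minimal illustration that assumes the common (dx, dy, dw, dh) parameterization used in the R-CNN family, with boxes given as (x1, y1, x2, y2); the function name `decode_boxes` is hypothetical, and the exact parameterization used by the model described here is not stated in the text.

```python
import numpy as np

def decode_boxes(proposals, deltas):
    """Apply (dx, dy, dw, dh) regression parameters to (x1, y1, x2, y2) boxes."""
    proposals = np.asarray(proposals, dtype=float)
    deltas = np.asarray(deltas, dtype=float)

    widths = proposals[:, 2] - proposals[:, 0]
    heights = proposals[:, 3] - proposals[:, 1]
    ctr_x = proposals[:, 0] + 0.5 * widths
    ctr_y = proposals[:, 1] + 0.5 * heights

    # Centers shift in units of the box size; sizes scale exponentially.
    new_ctr_x = ctr_x + deltas[:, 0] * widths
    new_ctr_y = ctr_y + deltas[:, 1] * heights
    new_w = widths * np.exp(deltas[:, 2])
    new_h = heights * np.exp(deltas[:, 3])

    return np.stack(
        [
            new_ctr_x - 0.5 * new_w,
            new_ctr_y - 0.5 * new_h,
            new_ctr_x + 0.5 * new_w,
            new_ctr_y + 0.5 * new_h,
        ],
        axis=1,
    )
```

With all-zero regression parameters the candidate box is returned unchanged, so the detection network only has to learn small corrections to the RPN output.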

3. ResNet

ResNet, short for Deep Residual Network, is a landmark convolutional neural network (CNN) that mitigates the vanishing and exploding gradient problems in deep neural networks. In 2015, ResNet won the ILSVRC (ImageNet Large Scale Visual Recognition Challenge) championship and significantly reduced the error rate in the ImageNet classification task. This is mainly due to ResNet’s “shortcut connections”, also known as “skip connections”. With this connection method, the input of a block is added directly to its output, so features from shallower layers reach deeper layers unchanged and the gradient can be transmitted directly back to the shallower layers. This design makes it possible to train deep networks with dozens or even hundreds of layers.
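The shortcut connection can be illustrated with a small NumPy sketch. As a simplifying assumption, the block below uses two dense (matrix-multiply) layers in place of the convolutions and normalization layers of a real ResNet block; the names `residual_block`, `w1`, and `w2` are hypothetical.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def residual_block(x, w1, w2):
    """Compute relu(F(x) + x), where F is two linear layers with a ReLU between.

    x: array of shape (N, D); w1 and w2: arrays of shape (D, D).
    Dense layers stand in for convolutions here (simplifying assumption).
    """
    out = relu(x @ w1)    # first layer of the residual branch F
    out = out @ w2        # second layer of the residual branch F
    return relu(out + x)  # shortcut: add the block input to the branch output
```

If the residual branch outputs zero, the block reduces to `relu(x)`, which shows why adding layers does not have to make the mapping harder to learn: each block only needs to learn a correction to its input.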

4. EfficientNet

EfficientNet, proposed by Google in 2019, builds models through compound scaling to improve model efficiency. It is composed of one ordinary convolutional layer and sixteen mobile inverted bottleneck convolution (MBConv) modules. The MBConv module is its core component, which mainly draws inspiration from the residual structure of MobileNetv3 [14]. As shown in Figure 2, it has the following functional features. Firstly, a Swish activation function [15] is used instead of a ReLU activation function, because Swish performs better on deep models. Secondly, a squeeze-and-excitation (SE) attention mechanism from SENet [16] is added to each MBConv module to strengthen the extraction of small-scale target features and suppress useless feature information. Thirdly, dropout layers are introduced: when a module has a shortcut branch, the main branch of the whole module is randomly dropped, leaving only the shortcut branch, which makes the network lighter and improves the detection speed of the model.
Figure 2. The MBConv module in the EfficientNet network.
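Two of the MBConv features described above, the Swish activation and the squeeze-and-excitation attention, can be sketched in NumPy. This is an illustration under stated assumptions, not the EfficientNet implementation: the function names and weight shapes are hypothetical, biases are omitted, and the choice of Swish inside the SE branch is an assumption of this sketch.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def swish(x):
    """Swish activation: x * sigmoid(x)."""
    return x * sigmoid(x)

def squeeze_excite(feat, w_reduce, w_expand):
    """Reweight channels of a (C, H, W) feature map with SE attention.

    w_reduce: (C, C_r) and w_expand: (C_r, C) are hypothetical weight matrices.
    """
    squeezed = feat.mean(axis=(1, 2))    # squeeze: global average pool -> (C,)
    hidden = swish(squeezed @ w_reduce)  # bottleneck layer -> (C_r,)
    scale = sigmoid(hidden @ w_expand)   # per-channel weights in (0, 1) -> (C,)
    return feat * scale[:, None, None]   # excite: rescale each channel
```

Channels whose learned weight is close to 0 are suppressed and those close to 1 pass through almost unchanged, which is how the module can emphasize informative features.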
The main difference between ResNet and EfficientNet lies in their network structure and optimization strategies. EfficientNet uses compound scaling to adjust the depth, width, and resolution of the network jointly rather than enlarging a single dimension. This allows EfficientNet to reach a given accuracy with fewer parameters and computations, thereby improving the efficiency of the model. ResNet, on the other hand, mainly addresses the vanishing and exploding gradient problems using residual blocks, with a relatively simple network structure.
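Compound scaling can be made concrete with a short sketch. It assumes the formulation from the EfficientNet paper, in which depth, width, and resolution are scaled by α^φ, β^φ, and γ^φ for a single coefficient φ. The default values α = 1.2, β = 1.1, γ = 1.15 are those reported in that paper for the B0 baseline and are not taken from this entry; the function name is hypothetical.

```python
def compound_scaling(phi, alpha=1.2, beta=1.1, gamma=1.15):
    """Return depth, width, and resolution multipliers for coefficient phi.

    The default alpha, beta, gamma are assumptions taken from the EfficientNet
    paper; they satisfy alpha * beta**2 * gamma**2 ~= 2.
    """
    depth = alpha ** phi        # multiplier on the number of layers
    width = beta ** phi         # multiplier on the number of channels
    resolution = gamma ** phi   # multiplier on the input image size
    # Compute cost grows roughly with depth * width^2 * resolution^2.
    flops_factor = (alpha * beta ** 2 * gamma ** 2) ** phi
    return depth, width, resolution, flops_factor
```

Because the product α · β² · γ² is close to 2, each unit increase in φ roughly doubles the computation while spreading the growth across all three dimensions.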

References

  1. Miao, X.R.; Liu, X.Y.; Chen, J.; Zhuang, S.; Fan, J.; Jiang, H. Insulator Detection in Aerial Images for Transmission Line Inspection Using Single Shot Multibox Detector. IEEE Access 2019, 7, 79945–79956.
  2. Tan, P.; Li, X.F.; Xu, J.M.; Wang, F.J.; Ding, J.; Fang, Y.T.; Ning, Y. Catenary insulator defect detection based on contour features and gray similarity matching. J. Zhejiang Univ. Sci. A 2020, 21, 64–73.
  3. Liu, Y.; Lu, Y.P.; Gao, S.; Bi, X.T.; Yin, Q.G.; Zhu, X.Q.; Yao, J.G. Application of Edge Detection in Infrared Images of Coil-type Suspension Porcelain Insulator. Electro-Ceram. Light. Prot. 2020, 198–203. (In Chinese)
  4. Tao, X.; Zhang, D.P.; Wang, Z.H.; Liu, X.; Zhang, H.; Xu, D. Detection of Power Line Insulator Defects Using Aerial Images Analyzed with Convolutional Neural Networks. IEEE Trans. Syst. Man Cybern. Syst. 2020, 50, 1486–1498.
  5. Guo, J.N.; Du, S.S.; Wang, S.D.; Zhang, X.Y. CenterNet Self-Exploding Detection of Insulators on Transmission Lines Based on Lightweight Feature Fusion. J. Beijing Univ. Aeronaut. Astronaut. 2022, 1–13. (In Chinese)
  6. Jia, X.F.; Yu, Y.Q.; Guo, Y.C.; Huang, Y.R.; Zhao, B.T. Lightweight Detection Method for Insulator Self-Explosion Defects in Aerial Photography. High Volt. Technol. 2023, 49, 294–300. (In Chinese)
  7. Li, L.R.; Chen, P.; Zhang, Y.L.; Zhang, K.; Xiong, W.; Gong, P.C. Insulator Defect Detection Based on Multi-Scale Feature Coding and Dual Attention Fusion. Prog. Laser Photonics 2022, 59, 81–90. (In Chinese)
  8. Li, X.; Zhang, J.; Zhang, L.; Chen, X. Layout Optimization and Multi-scenes Intelligent Inspection Scheme Design Based on Substation Video Monitoring. J. Phys. Conf. Ser. 2023, 2560, 012022.
  9. Wu, J.; Liu, Z.B.; Ren, Q. Detection of Defects in Power Grid Inspection Images Based on Multi-scale Fusion. J. Phys. Conf. Ser. 2022, 2363, 012013.
  10. Zhao, H.; Wan, F.; Lei, G.B.; Xiong, Y.; Xu, L.; Xu, C.; Zhou, W. LSD-YOLOv5: A Steel Strip Surface Defect Detection Algorithm Based on Lightweight Network and Enhanced Feature Fusion Mode. Sensors 2023, 23, 6558.
  11. Li, Q.; Sun, B.Q.; Bir, B. Lite-FENet: Lightweight multi-scale feature enrichment network for few-shot segmentation. Knowl.-Based Syst. 2023, 278, 110887.
  12. Song, K.C.; Wang, H.; Zhao, Y.; Huang, L.; Dong, H.; Yan, Y. Lightweight multi-level feature difference fusion network for RGB-D-T salient object detection. J. King Saud Univ.-Comput. Inf. Sci. 2023, 35, 101702.
  13. Ren, S.Q.; He, K.M.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39.
  14. Liu, Y.M.; Wang, Z.L.; Wang, R.J.; Chen, J.; Gao, H. Flooding-based MobileNet to identify cucumber diseases from leaf images in natural scenes. Comput. Electron. Agric. 2023, 213, 108166.
  15. Taro, S.; Kohei, T.; Shohei, O.; Ito, S. Introducing Swish and Parallelized Blind Removal Improves the Performance of a Convolutional Neural Network in Denoising MR Images. Magn. Reson. Med. Sci. 2021, 20, 410–424.
  16. Hu, J.; Shen, L.; Albanie, S.; Sun, G.; Wu, E.H. Squeeze-and-Excitation Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 42.