Concrete stands as the most widely used construction material globally due to its versatility, encompassing applications ranging from pavement, multifloor structures, and bridges to dams. However, these concrete structures endure structural stress and require close monitoring to prevent accidents and ensure sustainability throughout their complete life cycle. Artificial intelligence (AI) and computer vision (CV) have demonstrated considerable potential in diverse applications within construction engineering, including structural health monitoring (SHM) and inspection processes such as crack and damage detection, as well as rebar exposure. While it is undeniable that CV and deep learning models are transforming the construction industry by offering robust solutions for complex scenarios, there remain numerous challenges pertinent to their applications that require attention. This paper aims to systematically and critically review the literature of the past decade on the application of deep learning models in the construction industry for SHM purposes in concrete structures. The review delves into proposed methodologies and technologies while identifying opportunities and challenges associated with these applications in practice.
1. Introduction
Concrete is the most important and demanding construction material
[1], and concrete structures have been influencing the construction industry for decades
[2]. However, an ever-growing number of concrete structures worldwide are entering the aging phase
[3]. Due to various factors, such as weather and environmental conditions, chemical reactions, and external and internal stresses, concrete structures are often subject to defects such as cracks, efflorescence, spalling, bar exposure, etc., and fail to meet the expected life cycle, aging earlier than expected
[2].
The idea of structural health monitoring (SHM) first emerged in the early 2000s. Although initially, the sole focus of SHM was to monitor concrete bridges, in present times, it is defined as the method of continuously evaluating and assessing the condition and performance of any concrete structures, such as buildings, bridges, dams, pipelines, and other infrastructure, throughout their operational lifespan
[4][5][6]. The objective of SHM is to detect any damage, deterioration, or changes in structural properties that could potentially compromise the safety, functionality, or longevity of the concrete structure and is crucial in maintaining structures in optimal condition
[2]. The traditional methods for the SHM process primarily involve manual inspection, which is heavily dependent on the expertise of the inspector. However, these methods present various challenges, including time-consuming operation, varying subjectivity, or difficulties in inspecting components at elevated heights in tunnels/road pavement in busy traffic conditions
[2][7]. Therefore, there is a pressing need for an innovative and precise inspection approach to effectively monitor the health condition of structures that can overcome the mentioned limitations.
Researchers in the construction engineering field have recognized the immense potential and innovative technological strides resulting from the utilization of deep learning methods
[8][9]. Consequently, numerous initiatives have been undertaken to apply deep learning techniques to structural health monitoring (SHM) of concrete infrastructure
[10]. In the following sections, deep-learning-based research in the SHM domain is delved into, with a specific focus on two facets: (1) damage identification and (2) concrete condition assessment.
2. Damage Identification
At the heart of any SHM system lies its capacity to conduct damage identification. Damage refers to alterations in a material’s physical characteristics caused by ongoing deterioration or a singular event affecting a structure. Such changes have the potential to compromise the performance and structural integrity of the concrete
[11]. One limitation of applying deep learning techniques is that they require a large and annotated database, which is not always available, especially in the concrete research area. However, the application of transfer learning can eliminate this problem, allowing an existing deep learning model to be retrained with smaller amounts of new data
[12]; accordingly, an increasing number of applications of deep learning models in concrete research and SHM have been reported.
For example, Gopalakrishnan et al.
[13] applied transfer learning to a pretrained VGG-16 model for crack detection in hot-mix asphalt and Portland cement concrete-based pavement. Kolar et al.
[14] also applied transfer learning to VGG-16 model to detect safety guardrails to promote on-site safety inspection.
Real-world scenarios often limit the applications of deep learning models at actual construction sites because of lighting and shadow issues. Cha et al.
[15] trained a CNN with a large database of 40k images under various lighting conditions and achieved 98% accuracy in detecting concrete cracks. The
resea
uthorchers later compared the performance of the proposed CNN using Canny and Sobel edge detection methods. Tong et al.
[16] integrated three CNNs to perform recognition, localization, and feature extraction tasks, enabling the 3D reconstruction of hidden pavement cracks with images of cracks collected using ground-penetrating radar (GPR).
Figure 1 demonstrates the proposed pipeline of 3D reconstruction pavement cracks with GPR data. Gibert et al.
[17] combined multiple detectors for automatic inspection of railway tracks.
Figure 1.
Pipeline for the 3D reconstruction models of pavement cracks
.
Assessment of post-disaster damage in concrete to provide valuable insights for necessary follow-up actions is another application of deep learning in the SHM area. Davoudi et al.
[18] applied image segmentation to determine the state of the damage in reinforced concrete beams and slabs. Lattanzi et al.
[19] also applied image segmentation via the MATLAB Image Processing Toolbox for the extraction of features from images of damaged reinforcement columns to estimate the maximum lateral displacement using a regression model. Spalling, a common type of damage in concrete structures, is another area of application of deep learning in SHM practice. For example, Dawood et al.
[20] presented a hybrid model combining image processing and machine learning techniques to identify spalling distress in subway stations. Yeum et al.
[21] applied AlexNet to both collapse classification and spalling detection in post-disaster analysis of concrete structures.
Kim and Cho
[22] introduced a method utilizing unmanned aerial vehicles (UAVs) and R-CNN to detect cracks in old concrete bridges. They applied transfer learning to R-CNN, using crack images to enhance crack detection, and later, image processing was employed to quantify the identified cracks. Kang and Cha
[23] also applied UAV-based damage detection with deep learning; however, they addressed one important issue, which is that UAVs often require a skilled pilot and autonomous flight with GPS in certain complex locations of structures, such as indoors or beneath bridges. The
resea
uthorchers proposed an ultrasonic beacon for UAV navigation in GPS-incompatible environments.
Xue and Li
[24] devised a three-tiered deep learning framework including an FCN, RPN, and position-sensitive region of interest pooling for identification of damage in tunnel linings. Hoang et al.
[25] also compared the performance of a CNN with Sobel and Canny edge detection algorithms, as previously reported by Cha et al.
[15], for a cyclic survey of pavement cracks. Similarly, Dorafshan et al.
[26] compared the performance of four edge detection methods with CNNs in detail for crack detection in concrete. Four common edge detection methods in the spatial domain (Roberts, Prewitt, Sobel, and Laplacian of Gaussian) and two in the frequency domain (Butterworth and Gaussian), as well as the AlexNet model in three modes of training (trained, transfer learning, and without training), were compared, and the
resea
uthorchers concluded that AlexNet showed superiority over other methods.
AlexNet was used by Wang et al.
[27] as well. The
resea
uthorchers utilized both AlexNet and GoogLeNet for the detection of various types of damage to masonry walls in historic structures, using sliding-window techniques to pinpoint concrete damage. Motivated by the ImageNet Challenge, Gao and Mosalam
[28] proposed the concept of Structural ImageNet, with four intended tasks: component identification, spalling detection, damage condition evaluation, and damage type determination in concrete through the application of transfer learning in VGGNet (Visual Geometry Group).
Wu et al.
[29] applied transfer learning to VGG16 and ResNet18 to detect two types of prevalent concrete surface defects, namely cracks and corrosion. Zhang et al.
[30] proposed Faster R-CNN to determine the spatiotemporal information of the vehicles on bridges in order to determine the stress state and traffic densities. Wang and Cheng
[31] proposed DilaSeg-CRF by integrating a CNN with a dense conditional random field (CRF) to improve the segmentation accuracy in sewer pipe defect detection, whereas Li et al.
[32] addressed the issue of data imbalance in sewer damage detection by introducing a hierarchical classification approach to supervise the learning process at different levels. Zha et al.
[33] applied transfer learning to ResNet (deep residual neural network) for eight types post-disaster concrete damage detection: scenario classification, damage detection, spalling detection, material identification, collapse detection, effected component identification, and damage level and type determination, which were categorized into binary or multiclasses according to the conditions. The
resea
uthorchers used the 2018 PEER Hub ImageNet Challenge distributed by the Pacific Earthquake Engineering Research Center to evaluate the proposed methodology.
Jang et al.
[34] used transfer learning in GoogLeNet with hybrid images, combining vision and infrared thermography images to enhance crack detection in concrete structures. The
resea
uthorchers suggested the use of a UAV-mounted hybrid system comprising a vision camera, an infrared camera, and a continuous-wave line laser to capture images, particularly for large-scale structures, then used them for inspection of the respective structures. U-Net, which is famous for applications in biomedical image segmentation, was first applied by Liu et al.
[35] to concrete crack detection and later compared with FCN using evaluation metrics such as precision and the size of the training set. The
resea
uthorchers applied U-Net for localization of concrete cracks under various lighting and background conditions. Khani et al.
[36] investigated the impact of preprocessing on a concrete crack detection pipeline based on a CNN trained with 700 labelled gas turbine images. The
resea
uthorchers concluded that bilateral filtering improves the generalization ability of the suggested framework in cases with cracks on complex structures. Zhang et al.
[37] argued that two-stage detectors such as Faster R-CNN and ResNet-101 have limited practical applications due to their slow speeds. The
resea
uthorchers used a single-stage detector (SSD), YOLOv3 (You Only Look Once), to detect multiple types of concrete bridge damage, such as cracks, pop-outs, spalling, exposed rebar, etc.
Liu et al.
[38] argued that the motion blur from excessive vibration in UAVs limits the accuracy of crack detection in high-rise buildings. The
resea
uthorchers introduced a generative adversarial network (GAN) that incorporates the concept of localized skip connections that recognize the correlation between blurred and sharpened crack images. The proposed method was validated through experiments involving the investigation of skip connections in deblurring and compared with a state-of-the-art deblurring model. Kim et al.
[39] applied transfer learning to Mask R-CNN for automatic concrete damage detection and localization in four classes—cracks, efflorescence, rebar exposure, and spalling—using an instance segmentation approach.
Mondol et al.
[40] applied Faster R-CNN to detect post-disaster damage like surface cracks, exposed rebar, and buckled rebar using image data collected from concrete structures damaged during past earthquakes in Nepal (2015), Taiwan (2016), Ecuador (2016), Erzincan (1992), Duzce (1999), Bingol (2003), Peru (2007), Wenchuan (2008), and Haiti (2010). Deng et al.
[41] introduced LinkASSPNet (LinkNet with atrous spatial pyramid pooling) and conducted a performance comparison with U-Net and LinkNet in the context of concrete bridge surface damage detection. Notably, this
res
tudyearch stands out, as the models were trained on a relatively small dataset. It purports to address the challenge of variations in labeling areas among labelers in pixel-wise image segmentation tasks.
Zheng and Zhang
[42] proposed a crack detection model for concrete based on image segmentation tasks and the FCN, R-CNN, and RFCN (Richer Fully Convolutional Networks) models. The training included a wide range of image data, including images of buildings, bridges, dams, roads, etc. Karaaslan et al.
[43] proposed a combination of an SSD-based VGG-16 model and a modified SegNet, where the former detects regions of interest related to damage, such as cracks or spalling, upon verification by the respective inspector, and the latter then applies segmentation to the damage for further analysis. Miao et al.
[44] proposed U-Net-based Damage-Net for semantic segmentation of seismic damage in reinforced concrete structures, where the
resea
uthorchers adjusted the padding size and stride size to ensure that the input and output size were the same, which is usually not the case in U-Net. The proposed Damage-Net receives its encoder from the convolutional layers of VGG-16, allowing it to adapt transfer learning and to be trained on a comparatively smaller dataset. Based on this architecture, two individual models were proposed: Crack-Net for detecting cracks, and 4Category-Net for identifying four additional damage categories, namely concrete spalling and crushing, reinforcement exposure, buckling, and fracture.
Qiao et al.
[45] proposed EMA-DenseNet, a combination of densely connected convolutional networks (DenseNet) integrated with an expected maximum attention (EMA) module in the last pooling layer for the detection of surface damage in concrete bridges in a set of images collected from multiple bridges located in Zhejiang (China). The
resea
uthorchers claimed that the proposed model performs better than FCN, SegNet, DeepLab v3+, and SDDNet. Huang et al.
[46] proposed a software system for damage detection in subway tunnels by integrating four separate functions: image fusion to splice the images acquired by different cameras, image preprocessing to remove background noise and other preprocessing tasks, damage identification performed by the R-CNN model and a data platform for evaluation by the respective personnel. Arya et al.
[47] proposed a concrete pavement damage dataset consisting of 26,620 data point from multiple countries and investigated how the demographics of the damage data affect the model performance based on a YOLO-v5/YOLO-v4/cascade R-CNN-based ensemble model. Cui et al.
[48] proposed an improved YOLO-v3 model for the detection of erosion damage that achieved up to a 75% mean average precision value.
Pozzer et al.
[49] compared the performance of different models, i.e., VGG-16, ResNet-18, ResNet-50, MobileNet-V2, Xception, etc., in detecting concrete defects such as delamination, cracks, spalling, and patches in thermographic and regular images at varying distances and under varying conditions using semantic segmentation. Andrushia et al.
[50] implied that most research on damage detection in concrete structures does not consider the complex background or environmental effects and therefore proposed a U-Net with an encoder–decoder framework for thermal damage detection in concrete structures in the event of fires.
Munawar et al.
[51] introduced a cycle generative adversarial network (CycleGAN) with 16 convolution layers, providing additional support to refine predictions through guided filtering (GF) and conditional random fields (CRFs). The
resea
uthorchers applied this model to inspect mid- to high-rise concrete structures constructed during the 2000s using segmentation techniques and drones. Zou et al.
[52] proposed a YOLOv4-based approach to the detection of multiple types of damage, including both fine and wide cracks, spalling, exposed and bucking rebars, etc., that was integrated in a graphical user interface (GUI) to streamline the assessment of structural damage in reinforced concrete (RC) buildings following an earthquake. Han et al.
[53] proposed the use of a transfer-learning-based AlexNet and threshold segmentation to precisely locate cracks in concrete structures.
Tanveer et al.
[54] compared and analyzed the performance of five semantic segmentation models (ENet, CGNet, ESNet, DDRNet-Slim23, and DeepLabV3+ (ResNet-50)). These models were categorized as lightweight and heavyweight based on the parameter count. The evaluation focused on on-site damage detection in concrete structures using edge computing devices such as smartphones, tablets, etc. Bai et al.
[55] proposed an EfficientNet-V2-based model for component damage recognition, serving both structural health monitoring (SHM) and post-disaster assessment purposes. They also investigated the relationship between damage type, component damage level, and the structural safety state. Crognale
[56] compared four different image processing techniques, namely Otsu-method thresholding, Markov random field segmentation, the RGB color detection technique, and the K-means clustering algorithm, in corrosion and crack detection based on a case study. Chen et al.
[57] proposed an AlexNet-based multiclass damage detection method for reinforced concrete bridges in high-speed rail systems.
Wan et al.
[58] proposed a BR-DETR model, a concrete bridge damage detection model based on detection transformers (DETR), with deformable Conv2D in place of convolution, as well as with an additional convolutional project attention layer after the self-attention layer. Zhu and Tang
[59] introduced a DeepLabV3+ network architecture with Xception as the backbone to automatically estimate detailed crack information in hydraulic concrete structures. Huang et al.
[60] proposed a Faster R-CNN with Res-Net101 as the backbone for detection of damage like cracks, spalling, and precipitates in hydraulic concrete structures.
3. Damage Quantification
Damage quantification is the next step after damage identification. Concrete damage quantification aims to determine the extent, severity, and specific characteristics of damage, such as cracks, spalling, corrosion, or other forms of deterioration. Although using deep learning for concrete damage quantification is still a relatively new concept in SHM, researchers are continuously generating new ideas to automate the quantification process, given the inherent challenges associated with this topic.
Kim et al.
[61] proposed a UAV-based digital image processing system integrated with imaging and distance-sensing technology to determine the width and length of the cracks in concrete surfaces. Tong et al.
[62] proposed a CNN-based method to calculate the mean texture depth (MTD) of pavement surfaces from 3D scan data, which was tested on four different highways in Shanxi, China. Huang et al.
[63] studied lining damage in tunnels with a rapid detection and assessment analysis system developed by Nanjing HuoYang Hou Mdt InfoTech Ltd. The system includes a multichannel array of high-speed CCD (charged couple device) cameras to obtaining image data, multiple sensors to mitigate the impact of vehicle vibration on the tunnel, a multilayer lighting system, multiple positioning technology (reference object positioning technology + image positioning technology + mileage positioning technology + infrared laser positioning technology), and a computer vision approach for damage identification and analysis. Tayo et al.
[64] presented a device capable of portable crack width calculation in concrete road pavement using pattern recognition based on multiple image processing technologies, such as graying, enhancement, filtering and denoising, binarization, segmentation, etc. Kim and Cho
[65] proposed Mask R-CNN+image processing techniques for successful detection and quantification of concrete cracks with widths of 0.3 mm or more. Wei et al.
[66] applied the same approach to concrete surface bughole segmentation and diameter measurement.
Beckman
[67] applied Faster R-CNN to automatically and simultaneously detect and quantify concrete spalling in multiple locations within the same surface. The
resea
uthorchers used a depth camera to obtain the volume quantifications of the spalling damage. Park et al.
[68] applied YOLO for both concrete crack detection and quantification (i.e., to determine the size of the cracks) in real time. The
resea
uthorchers used laser beams with integrated distance sensors for accurate measurement of the crack size. Bhowmick et al.
[69] applied U-Net-based segmentation for concrete crack localization and binarization to estimate quantitating properties of cracks, like length, width, area, orientation, etc., from video data collected by a camera mounted on a UAV. Flah et al.
[70] applied a deep learning technique to identify both structural and durability-related damage in structural members and assess the condition in a short time span by combing a Keras classifier with Otsu image processing. The proposed method can classify cracks; quantify them in terms of length, width, and angular orientation; and evaluate the severity of the damage.
Yuan et al.
[71] proposed an inspection robot that transforms the quantification of concrete damage from a 2D plane to 3D space with stereo vision and a Mask R-CNN approach. The robot is based on four different sensors, with a monocular camera as a visual sensor, a stereo camera with a sensor for inertial measurement (IMU) of six degrees of freedom that can be mapped for panoramic image stitching, and a LiDAR sensor to measure the distance between the RC structure and the camera. Miao et al.
[72] proposed a GoogLeNet-based transfer learning approach incorporating a novel sliding technique known as neighborhood scanning. This method aims at the detection, segmentation, and quantification of concrete cracks, achieving an average relative error of 14.58% in crack calculation.
Song et al.
[73] introduced a deep learning approach for crack segmentation and quantification utilizing MobileNetV1 and ResNet50, along with DeeplabV3+ and U-Net. MobileNetV1 and ResNet50 handle crack classification, while DeeplabV3+ and U-Net manage panoramic crack segmentation (
Figure 2). The quantitative information of the crack was subsequently acquired by multiplying the actual physical size corresponding to the unit pixel, assuming the length of a single pixel as the unit length. Kumarapu et al.
[74] introduced UAVIC, a system that integrates UAVs with an image processing technique, i.e., digital image correlation. This approach is employed for damage quantification on scaled bridge girders.
Figure 2.
Before and after concrete crack segmentation
.
Bae et al.
[75] proposed a computer-vision-based crack quantification algorithm using decision making based on statistical methods for accurate estimation and quantification of damage based on an image dataset of concrete building structures in South Korea. Li et al.
[76] proposed a ResNet50-based improved You Only Look At CoefficienTs for Edge devices (YolactEdge) combined with digital image processing techniques for damage identification and quantification in hydraulic tunnels.
4. Suggested Future Frameworks
The number of deep-learning-based applications in concrete research is rapidly growing, especially in the SHM area. Numerous applications have been reported with respect to both concrete damage identification and quantification. The application and integration of stereo cameras and sensors, such as LiDAR and laser sensors, have made deep learning applications for damage quantification. Many researchers have applied various image processing techniques rather than integration with depth cameras or sensors. However, concrete cracks are very fine, so whichever system is adopted must precisely quantify a particular property (either length, width, or diameter). AlexNet, GoogleNet, Faster R-CNN, Mask R-CNN, U-Net, VGG, and YOLO models seem to be popular choices for damage identification. However, compared to other industries, construction falls behind in terms of adopting digitalization; therefore the application of deep-learning- and vision-based systems to monitor concrete health in the SHM area is still not sufficient in real practice, mostly due to the following issues:
-
Data shortage: Although transfer learning has made the adaptation of deep learning easier. Raw data often need to go through many stages of post processing, which is very time-consuming and labor-intensive. Also, there is a need for annotated datasets, which are essential for any deep learning training [77]. Most studies have been conducted using private datasets; making such datasets public would open multiple doors for researchers in the SHM domain for multiple applications. Although data augmentation plays an important role in dataset incrementation, applying various transformations to existing data, such as rotating, scaling, flipping, or cropping images, is insufficient for research in the SHM area. An alternative method involves utilizing generative adversarial networks (GANs), where a deep learning model comprising two distinct networks (namely a generator and a discriminator) is employed to generate synthetic image data instead of relying on real-world camera inputs only, as reported in [38][51].
-
Impact of the training data on overfitting: Transfer learning has undeniably simplified the application of deep learning models in structural health monitoring (SHM). However, the persistent challenge of overfitting can arise, particularly in instances where there is a paucity of image data. Deep learning models characterized by multiple layers and millions of parameters demand extensive tuning, as illustrated, for example, by the necessity of adjusting at least 100 million parameters in VGG-16 for crack detection [12]. It is imperative that the training data encompass diverse real-world scenarios, accounting for variations in background, lighting, and weather conditions, to ensure the model’s robustness and applicability.
-
Requirement for high-performance computers: Many deep learning techniques necessitate several days for training due to the extensive calculations involved in computing related training parameters, such as loss functions. Adequate hardware, including high-capacity hard disks, multiple GPUs/CPUs, and substantial memory, is essential for storing these calculations. Researchers should prioritize discovering optimized model structures with fewer parameters, facilitating their seamless adaptation in structural health monitoring (SHM) applications.
-
Dealing with background noise: On the other hand, in addressing various background noises in images, researchers have implemented different morphological changes in the CNN architecture [31][38][41][44][45][51][58] to increase the detection accuracy. However, the source code is typically not publicly available. Researchers should be encouraged to make their source code publicly accessible, enabling other researchers to enhance the architecture further and, consequently, increase its applicability in actual practice. Due to the image resizing requirement of deep learning models to be trained on computers with average computing capacities, generalization abilities are often lost. For example, stains are a common issue in concrete structures and often incorrectly identified as cracks. To solve this issue, stains and similar could be categorized as another class [78] to improve the generalization abilities.
4. Suggested Future Frameworks
The number of deep-learning-based applications in concrete research is rapidly growing, especially in the SHM area. Numerous applications have been reported with respect to both concrete damage identification and quantification. The application and integration of stereo cameras and sensors, such as LiDAR and laser sensors, have made deep learning applications for damage quantification. Many researchers have applied various image processing techniques rather than integration with depth cameras or sensors. However, concrete cracks are very fine, so whichever system is adopted must precisely quantify a particular property (either length, width, or diameter). AlexNet, GoogleNet, Faster R-CNN, Mask R-CNN, U-Net, VGG, and YOLO models seem to be popular choices for damage identification. However, compared to other industries, construction falls behind in terms of adopting digitalization; therefore the application of deep-learning- and vision-based systems to monitor concrete health in the SHM area is still not sufficient in real practice, mostly due to the following issues:
- Data shortage: Although transfer learning has made the adaptation of deep learning easier, . Raw data often need to go through many stages of post processing, which is very time-consuming and labor-intensive. Also, there is a need for annotated datasets, which are essential for any deep learning training [129]. Most studies have been conducted using private datasets; making such datasets public would open multiple doors for researchers in the SHM domain for multiple applications.Although data augmentation plays an important role in dataset incrementation, applying various transformations to existing data, such as rotating, scaling, flipping, or cropping images, is insufficient for research in the SHM area. An alternative method involves utilizing generative adversarial networks (GANs), where a deep learning model comprising two distinct networks (namely a generator and a discriminator) is employed to generate synthetic image data instead of relying on real-world camera inputs only, as reported in [87,100].
- Impact of the training data on overfitting: Transfer learning has undeniably simplified the application of deep learning models in structural health monitoring (SHM). However, the persistent challenge of overfitting can arise, particularly in instances where there is a paucity of image data. Deep learning models characterized by multiple layers and millions of parameters demand extensive tuning, as illustrated, for example, by the necessity of adjusting at least 100 million parameters in VGG-16 for crack detection [61]. It is imperative that the training data encompass diverse real-world scenarios, accounting for variations in background, lighting, and weather conditions, to ensure the model’s robustness and applicability.
- Requirement for high-performance computers: Many deep learning techniques necessitate several days for training due to the extensive calculations involved in computing related training parameters, such as loss functions. Adequate hardware, including high-capacity hard disks, multiple GPUs/CPUs, and substantial memory, is essential for storing these calculations. Researchers should prioritize discovering optimized model structures with fewer parameters, facilitating their seamless adaptation in structural health monitoring (SHM) applications.
- Dealing with background noise: On the other hand, in addressing various background noises in images, researchers have implemented different morphological changes in the CNN architecture [80,87,90,93,94,100,107] to increase the detection accuracy. However, the source code is typically not publicly available. Researchers should be encouraged to make their source code publicly accessible, enabling other researchers to enhance the architecture further and, consequently, increase its applicability in actual practice. Due to the image resizing requirement of deep learning models to be trained on computers with average computing capacities, generalization abilities are often lost. For example, stains are a common issue in concrete structures and often incorrectly identified as cracks. To solve this issue, stains and similar could be categorized as another class [131] to improve the generalization abilities.
Despite the challenges and limitations, the use of deep learning and computer vision technologies holds significant promise in structural health monitoring (SHM) and concrete research. A collaborative effort from researchers, scholars, and engineers in the construction, computer science, and civil engineering domains can establish more effective deep -earning-based SHM inspection systems for both damage identification and quantification, regardless of severity, delicacy of the damage, and the influence of the surrounding environment.
5. Conclusion
The aim of this research was to conduct a systematic review of the utilization of deep learning in the identification and quantification of concrete damage for SHM purposes. This study delved into the concepts and historical development of artificial intelligence (AI), computer vision (CV), and deep learning. With the aim of including the latest advancements in concrete research, the analysis was focused on studies spanning from 2017 to 2023, particularly those addressing vision-based crack identification, categorization, and measurement analysis. Additionally, we addressed four critical issues regarding the application of deep learning in the field of SHM. This research provides helpful insights that can aid in future applications of concrete damage identification and quantification with deep learning and guide researchers and engineers in respective applications.
Despite the challenges and limitations, the use of deep learning and computer vision technologies holds significant promise in structural health monitoring (SHM) and concrete research. A collaborative effort from researchers, scholars, and engineers in the construction, computer science, and civil engineering domains can establish more effective deep -earning-based SHM inspection systems for both damage identification and quantification, regardless of severity, delicacy of the damage, and the influence of the surrounding environment.