GeoAI in Integrated Hydrological and Fluvial Systems Modeling

GeoAI in Integrated Hydrological and Fluvial Systems Modeling: History

View Latest Version

Please note this is an old version of this entry, which may differ significantly from the current revision.

Subjects: Water Resources | Geography, Physical | Engineering, Environmental

Contributor: Carlos Gonzales-Inca ,

Mikel Calle

Danny Croghan

, Ali Torabi Haghighi ,

Hannu Marttila

, Jari Silander , Petteri Alho

Geospatial artificial intelligence (GeoAI) allows the harnessing of big and high-dimensional data to better understand the hydrological processes in a particular system. Specifically, GeoAI provides new data analytic tools to the entire data processing cycle, such as sensor data fusion, hydrological modeling, data assimilation, multi-objective scenario optimization, smart decision support, evaluation of climate change impact, construction of early warning systems, and geo-visualization.

GeoAI
artificial intelligence
hydrological
hydraulic
fluvial
water quality

1. Introduction

Geospatial artificial intelligence (GeoAI) applications in hydrology and fluvial studies are rapidly increasing and replacing the traditional methods. A reason for rapid GeoAI adoption in hydrological sciences might be linked to the progress in collecting big hydrological datasets, using automatic sensors with internet transmission, or the internet of things (IoT). Similarly, the evolution and increase in earth observation satellites (conventional and nanosatellites), unmanned aerial vehicles (UAV), light detection and ranging (LiDAR), and other surveying technologies produce high-resolution geospatial data, allowing better landscape characterization. GeoAI greatly enhances and supports decision making in integrated water resources management (IWRM) and nexus approaches [47]. Figure 1 depicts a GeoAI application model for a smart IWRM support system.

Figure 1. A GeoAI application model for a smart decision support system for integrated water resources management (IWRM). (1) Internet of things (IoT) supports real-time, high-frequency, hydrological monitoring. The data are stored in a cloud platform and accessed by an application programming interface (API). These data can be used for the real-time identification of problems in the system, e.g., a river basin. (2) GeoAI provides data analytic and online real-time modeling tools for hydrological system analysis and prediction. (3) GeoAI also supports multi-objective, multi-scenario optimization modeling, which in turn is the basis of smart decision support systems for IWRM. (4) Geovisualization in web mapping and mobile apps can be used for data dissemination and stakeholder engagements and implementing early warning systems. The smart IWRM system can be closed with the evaluation and adjustment of the IWRM plan and the improvement of the hydrological monitoring system. WQ (automatic water quality monitoring), ADCP (acoustic doppler current profiler for water current velocity measurement and river bathymetry), GW (automatic groundwater monitoring in wells), UAV (unmanned aerial vehicle for very high-resolution land cover mapping and surface elevation models), EOS (earth observation system for environmental condition monitoring), LiDAR (LiDAR survey of high-resolution topography data), and GNSS (use of global navigation satellite systems for ground truth data collection).

2. Hydrological and Hydraulic Modeling

Water flow in the catchments and river networks is a complex and stochastic process, operating in different spatio-temporal scales and characterized by non-stationarity, dynamism, and non-linearity [48,49]. These properties have limited the development of a reliable hydrological and hydraulic prediction model that can be generalized to a large geographical area. The increasing sensor-based, high-frequency (sub-hourly) hydrological data collection and the high spatial and temporal resolution mapping of land cover and topography have enhanced the understanding of hydrological processes. This fact has led to the development of more sophisticated physical-based hydrological models. However, these models are computationally expensive and limited to small-scale applications. Alternatively, several data-driven GeoAI methods have emerged for hydrological and hydraulic classification and prediction at multiple spatial-temporal scales.

2.1. Hydrological System Classification

The classification of the different types of hydrological systems is one of the most widely applied modeling tasks in hydrology and ecohydrology. It aims to find similarities between different hydrological systems, e.g., those based on the hydrological response, the hydromorphological and climatic characteristics, and other variables. Unsupervised GeoAI algorithms, such as K-mean clustering [63] and SOM [64,65], have been applied to catchment classification. Both algorithms organize multidimensional input data through linear and non-linear techniques, depending on the intrinsic similarity of the data themselves. Several studies highlight the SOM nonlinear techniques for producing robust and consistent hydrological classification [66,67,68], even though the classification consistency is highly influenced by the quality of the input variables [69]. Additionally, where training data is available, supervised GeoAI methods have produced highly accurate and biophysically meaningful catchment classification [70,71].

2.2. Hydrological Data Fusion and Geospatial Downscaling

Integrated hydrological modeling requires the extensive data collocation of different components of the hydrological system in various spatial and temporal scales. Therefore, it is necessary to complete data and/or create new data by integrating several datasets from different sources, resolutions, and measurement noisiness [72]. This approach is called data fusion. Data fusion can increase the measurement quality and reliability, estimate unmeasured states, and increase spatial and temporal coverage. Several probabilistic and GeoAI data fusion techniques are available [73,74]. The commonly used GeoAI techniques in data fusion are non-linear Bayesian regression, ANN, RF, and deep learning [73,74,75,76]. These methods provide several advantages in representing non-linear, complex, and lagged relationships in different hydrological datasets. GeoAI data fusion is also applied to automatic data denoising and anomaly detection and remote sensing data fusion [77]. GeoAI data fusion is also used in rain, soil moisture, and discharge data generation. Sist et al. [78] introduce the ANN-based data fusion of multispectral (visible and infrared) satellite data with radar (microwave) satellite data to improve rainy area mapping and the estimation of the precipitation amount. Zhuo and Han [79] used data fusion to generate soil moisture products from satellite data, land surface temperature, and multi-angle surface brightness reflectance and were able to significantly increase the availability of daily soil moisture products. Fehri et al. [80] used the best linear unbiased predictor data fusion technique to generate discharge data from crowdsourced data and existing monitoring systems. There are more examples of using data fusion in data integration in areas other than in improving knowledge, which could be the next step to be further explored.

Environmental geospatial data, particularly remote sensing data, are usually measured at different spatial and temporal scales; high-temporal resolution data are usually measured at coarse (low) spatial resolution, and fine (high) spatial resolution data are obtained with low temporal frequency [77,81]. Therefore, combining the different datasets by downscaling methods is necessary to generate spatio-temporal high-resolution data. GeoAI-based downscaling has shown several advantages. For example, CNN is frequently used for downscaling coarse-resolution to fine-resolution precipitation products, using different static and dynamic variables as predictors [82,83]. These studies have shown that CNN achieves different degrees of accuracy, depending on the precipitation rate and the condition complexity; it has, e.g., lower accuracy in extreme wet conditions [83]. Other studies have shown a higher downscaling accuracy of GeoAI methods by having a spatial component in the model, e.g., spatial RF vs. RF in downscaling daily fractional snow cover [84] and land surface temperature from MODIS data [85,86,87].

2.3. Spatial Prediction of Hydrological Variables

The application of GeoAI in hydrological spatial prediction is diverse; it can be used, for example, in the risk mapping of hydrological extremes such as flood and drought [88,89,90]. In particular, GeoAI is widely applied in flood mapping, using satellite imagery, UAVs, high resolution LiDAR topographic data, and automatic water level sensors [91,92,93]. The common GeoAI algorithms used, e.g., in flood prediction are SVM, RF, ANN, and deep learning [92,93,94]. The selection of the methods is variable and depends on the mapping objective, the system complexity, and the data availability [91]. In areas with limited data and/or complex systems, where nonlinear methods are not easily interpretable, ANFIS soft computing has been applied with good prediction accuracy and strong generalization ability [95]. ANFIS combines data and expert knowledge through a set of fuzzy semantic conditional rules [96,97,98].

Another GeoAI application is the spatial prediction of hydrological model variables, e.g., saturated hydraulic conductivity [99,100] and weather data [101]. This is particularly useful as spatial hydrological variables are not available. Thus, they can only be predicted using points observation and surrogate spatial data such as remote sensing data. GeoAI spatial prediction has shown advantages in modeling nonlinear processes. However, the prediction quality depends on the quality and quantity of the observed data points and the applied GeoAI method [102].

2.4. Hydrological Process Modeling

GeoAI has shown the potential for accurate hydrological modeling, such as for rainfall-runoff, river discharge, soil moisture dynamics, and groundwater table fluctuation [95,103,104]. The non-linear nature of these processes is challenging to model with simple empirical and physical-based models. Therefore, GeoAI methods such as ANNs have proved to be better for modeling complex hydrological processes and forecasting them in the short and long term and in different management scenarios [26]. However, traditional ANNs do not model sequential order data such as time-series data. Therefore, a further development for the temporal dynamics of hydrological sequential events is the RNN and LSTM neural networks. RNN and LSTM use the previous information in the sequence to produce the current output, although RNN is better designed to model short sequences only. In the case of long temporal sequences of the antecedent conditions, LSTM is preferred. LSTM uses an additional ‘memory cell’ compared to RNN to maintain information for long sequences or periods of time [105,106]. This memory cell lets the model learn longer-term dependencies, e.g., the effects of antecedent soil moisture conditions on runoff generation [105,107]. LSTM is advantageous for modeling hydrological processes in regions with strong seasonality, such as a northern climate with varying winter conditions [91,105]. The LTSM model also allows the use of multiple time-series predictors, such as precipitation, temperature, discharge, and time [58,108]. A further extension of LSTM is created by combining it with CNN. In CNN, learning is achieved through convolving an input with filter layers to speed up parameter optimization [107,109]. Combining CNNs and LSTM encodes both the spatial and the temporal information [87,110]. LSTM techniques can also be coupled with other signal-processing algorithms such as wavelet transformation (WT). WT is applied to time-series data decomposition, e.g., the decomposition of high- and low-frequency flow signals, the identification of seasonality and trend, the decomposition of non-stationary signals, and data denoising [30]. Denoised data are used as inputs for the LSTM model [111].

Another approach is to use a physical-based model coupled with GeoAI, e.g., for runoff and flood prediction [19,112,113]. Overall, the output of the physical-based model is used as the input for GeoAI model training. For example, Noori and Kalin [14] used the SWAT model to simulate daily streamflow and estimate baseflow and stormflow, which were used as inputs for ANNs. The benefit of this approach is that once the model is trained, it can perform orders of magnitude faster than the original physical-based models without impairing prediction accuracy [17]. Another benefit of the hybrid modeling is that a trained model, e.g., in catchment hydrological modeling, can achieve better performance for other catchments than the uncalibrated process-based models [105,112].

Overall, most of the GeoAI models achieved higher prediction accuracy than the physical-based hydrological models. However, there are several types of GeoAI algorithms, with different architectures and mathematical formulations (e.g., ANN, CNN, and LSTM) to perform similar tasks. In addition, different types of predictor variables and data sampling sizes are used, making the GeoAI model performance comparison challenging. GeoAI models are less physically interpretable, as they do not explicitly represent the physical laws governing the hydrological processes. Therefore, their causal inference is still limited. GeoAI applications are currently oriented towards hydrological prediction. GeoAI has the potential to provide accurate and timely information which is applicable to large areas, and using data from IoT sensors and cloud computing, it can deliver real-time prediction [114].

2.5. Hydraulic Modeling

The new generation of very-high-resolution river bathymetry has improved the 1D, 2D, and 3D hydraulic modeling of rivers [115,116]. River hydraulic models have been widely used in the estimation of flood extent, water depth and velocity, sediment transport, and the assessment of fluvial morphodynamics [5,11,117]. However, very complex hydraulic models (3D) are data and computationally demanding and restricted to small-scale applications. Hydraulic modeling is sometimes inconsistent and does not represent all the bio-physical processes occurring in the natural fluvial environment [118,119]. In addition, the numerical solving approach of the hydraulic model results in high numerical instability due to sensitivity to the initial and boundary conditions, model structure, and spatial and temporal discretization [120]. Thus, the GeoAI method has emerged as a promising tool for hydraulic modeling in large-scale and natural systems [19,119,121,122]. Emerging deep learning applications in computer fluid dynamics have also shown potential for the modeling of turbulent and complex flow structures [123,124,125]. Additionally, coupling the hydraulic model with the Bayesian GeoAI methods improves hydraulic modeling over a broad range of spatiotemporal scales and physical processes [126].

2.6. Hydrological Data Assimilation

Hydrological data assimilation (DA) is a state estimation theory that assumes that models are an imperfect representation of the system and that hydrological data might contain noise. Both can also contain different types of information and be complementary [127]. DA aims to harness the information in the hydrological model and in the observations to approximate the true state of the system, considering its uncertainty statistically [127,128,129]. DA methods include linear dynamics (e.g., Kalman filter, the most popular state estimation method) and nonlinear dynamics [127]. The DA methods can be related to ML. Data fusion and DA use similar techniques, but the problem formulation differs [130].

In hydrological modeling, the ML-based DA is the most common type of coupling of ML and the physical-based model, the so-called loosely hybrid hydrological model [131]. DA updates the state system predicted by a physical-based model at a given time or place with observational data, using Bayesian approximation such as the ensemble Kalman filter (EnKF) [127] or ML methods, e.g., ANN, RNN, and LSTM [132]. Both the DA and the ML methods solve an inverse problem, expressed as the model y = h(x,w), where h is the model function, x represents the state/feature variable, w is the parameters/weights of the model, and y is the observations/labels in DA/ML, respectively. DA is oriented to find the true state of the system (x) from the observation and ML is commonly oriented to find model parameters or weight (w) from the observation. DA holds w constant to estimate x; ML holds x constant to estimate w; see [133] for a detailed revision.

Many studies have shown that ANN data assimilation outperforms conventional DA, particularly for complex and non-linear response systems [61]. An additional development of ML-based DA methods is the so-called deep DA [132], which trains deep learning neural networks such as LSTM for high dynamic systems. Deep DA has shown potential for accurate prediction for periods or sites where observations are unavailable and conventional DA cannot be applied to reduce the model error [132].

3. Modeling Optimization Problems for Hydrological Model Calibration and Decision Support System

3.1. Hydrological Model Calibration

In hydrological modeling, the inverse modeling approach is widely applied. In inverse modeling, the model features and parameter values are unknown, and those are identified by minimizing the error between the model output and the observed data [134,135]. The model feature identification includes the definition of the main hydrological processes, the mathematical equations representing it, the boundary conditions, and the time regime [136]. The parameter identifications encompass the identifying of the model optimal parameter set values that reproduce the observed data acceptably [136]. In highly parameterized models, identifying the optimal values of the parameters is challenging and represents a substantial part of the modeling work. Usually, there is not a single set of optimal values of parameters that can simulate the observed data well but a set of optimal parameters values that can achieve similar model performance. This modeling phenomenon is called the non-uniqueness or equifinality problem [137]. The hydrological model calibration often requires specialized optimization algorithms, and several ML-based calibration algorithms have been developed to support model calibration.

Hydrological models are often calibrated with a single objective function, although adequate and fast multi-objective optimization techniques exist, which better support the several output variables [141]. There are many optimization algorithms, meta-heuristic and ML-based, for model parameter calibration, such as particle swarm optimization (PSO), grey wolf optimization (GWO), genetic algorithms (GAs), genetic programming (GP), strength Pareto evolutionary algorithms (SPEA), micro-genetic algorithms (micro-GA), and Pareto-archived evolution strategies (PAES). Depending on the selected performance indicators of the model, the best model for hydrologists varied. According to the free lunch theorem [149], this is not expected to change for a while; it proposes that no one model fits all. In any case, all the models performed well. See Yusoff et al. [150] and Ibrahim et al. [45] for a specific review of optimization algorithms.

Meta-heuristic optimization algorithms, which are mostly inspired by the biological/behavioral strategies of animals, provide a good solution to optimization problems, particularly with incomplete or imperfect information or limited computational capacity [151]. An advantage of these algorithms is that they make relatively few assumptions about the optimization problems and reduce the computational demand by randomly sampling a subset of solutions, which otherwise would be too large to be iterated entirely [151]. However, some meta-heuristic algorithms such as PSO may not guarantee that a globally optimal solution will be found, particularly when the number of decision variables or dimensions being optimized is large [45]. The GA is inspired by genetic evolutionary concepts, such as the non-dominated sorted genetic algorithm II (NSGA-II). The genetically adaptive multi-objective method (AMALGAM) [152] has been applied for multi-objective, multi-site calibration and to solve highly non-linear optimization problems [144,153]. AMALGAM is a multi-algorithm that blends the attributes of several optimization algorithms (NSGA-II, PSO, the adaptive metropolis search, and differential evolution) [144]. The GA has been shown to be well-suited for hydrological models, such as the SWAT semi-distributed hydrological models, which cannot be adequately calibrated by gradient-based calibration algorithms [144,153,154]. The objective function for each solution in a GA can be assessed in parallel computation, providing computational efficiency [144]. Additional calibration methods based on deep learning have also been developed, outperforming many of the existing evolutionary and regionalization methods [20,146].

3.2. Decision Support System for Integrated Water Resources Management

Integrated water resources management (IWRM) deals with multiple actors to consensually and communicatively integrate decisions in a hydrological unit to ensure equitable economic development and social welfare while assuring hydrological system sustainability [155]. IWRM demands quality and timely information. Hence, increasing automation with GeoAI-based decision support systems is thought to enhance IWRM [17,156]. Multi-objective and scenario analysis are typical applications of GeoAI techniques in IWRM to find solutions for conflicting objectives, forecast the impact of management strategies, and optimize hydrological system operation [157,158]. There are widespread applications of GeoAI in reservoir and water distribution optimization using ANN [159,160], assembled and deep learning algorithms, and genetic programming [161,162]. Another application is found in building a smart irrigation decision support system [147]. Here, partial least square regression and the adaptive network-based fuzzy inference system (ANFIS) are proposed as reasoning engines for automated decisions. An additional example of artificial intelligence application is the adaptive intelligent dynamic urban water resource planning [158]. It uses Markov’s decision process to tackle complex water management problems, predicting water demand, scheduling management, financial planning, tariff adjustment, and the optimization of water supply operations [158]. Overall, the GeoAI-based IWRM integrates various types of algorithms to perform different tasks, such as prediction and forecasting using various types of geospatial data, and optimization algorithms for management scenarios with multiple objectives. Algorithms such as ANFIS are used for system reasoning to automate the decision support [157,158,163]. ANFIS allows the mimicking of human reasoning and decision-making based on a set of fuzzy IF-THEN rules. ANFIS has the learning capability to approximate nonlinear functions and can self-improve in order to adjust the membership function parameters directly from the data [164].

4. Automatic Water Quality Monitoring and Spatio-Temporal Prediction

4.1. Automatic Water Quality Monitoring

The data collection of water quality with wireless sensor networks and internet of things (IoT) technologies is rapidly increasing and providing very-high-frequency WQ data (sub-hourly) [165,166]. There is evidence that the high-frequency data better represent the dynamics variation of river discharge and sediment and solute fluxes [167]. It enables the early mitigation of floods and drinking water problems [168,169]. High-frequency data can also lead to a more precise and accurate classification of the biochemical status of rivers and lakes [170]. However, such sensors and devices are subject to failures, poor calibration, and inaccurate data recording in certain conditions [171,172]. Therefore, automatic data quality control, error and anomaly detection, sensor drift compensation, and uncertainty assessment are important [171,172,173]. GeoAI showed advantages in managing WQ sensor networks and sensor data fusion, such as fault detection, data correction, and upgrades from different monitoring sensors by data fusion [174]. Additional applications of GeoAI are in the detection, localization, and quantification of pollutant critical sources and critical periods of loading in monitoring networks [175,176]. The most common GeoAI algorithms for WQ sensor fusion are based on Bayesian algorithms, fuzzy set theory, genetic programming, ANN, and LSTM [177,178,179,180].

Many WQ parameters cannot easily be measured in situ and in real time for various reasons, such as high-cost sensors, low sampling rate, multiple processing stages, and the requirement of frequent cleaning and calibration. Therefore, a common practice is the estimation of a particular WQ parameter value based on other surrogate parameters, called soft sensors [181,183,184]. ML techniques showed higher accuracy in implementing soft sensors than conventional regression-based models [181,183,184,192].

The ML method has also shown an advantage in automatic hysteresis pattern analysis using high-frequent water quality data with, e.g., restricted Boltzmann ANN [193]. A more detailed hysteresis pattern classification allows the gaining of new insights into WQ pollutants sources and drivers, the influence of catchment and riverine features, the effect of antecedent conditions, and the influence of changes in rainfall and snowmelt patterns [193].

4.2. Spatio-Temporal Water Quality Prediction

There are diverse applications of the GeoAI methods in WQ spatio-temporal pattern analysis, the classification of WQ, and the prediction of WQ variables and the pollutant loading estimation. A detailed review of the ML application in WQ prediction is found in Rajaee et al. [27], Naloufi et al. [29], and Chen et al. [194]. Commonly used GeoAI for WQ prediction and classification are unsupervised clustering such as k-means, density-based spatial clustering of applications with noise (DBSCAN), and SOM, but also time-series segmentation such as dynamic time warping [195]. Supervised ML classification and prediction algorithms for WQ are RF, SVM, the Bayesian network, and ANN, and deep learning such as LSTM is also frequently used [190,196,197].

High-frequency WQ monitoring data contains noise signals due to random and systematic errors, impairing the WQ prediction accuracy. Hence, combining data denoising techniques such as Fourier and wavelet transform with GeoAI improves WQ prediction. For example, Song et al. [198] found that combining synchro-squeezed wavelet transform and an LSTM network substantially improved the WQ parameter prediction. Similarly, Najah Ahmed et al. [28] integrated wavelet discrete transform with the artificial neuro-fuzzy inference system (WDT-ANFIS) to obtain high-accuracy prediction of river WQ parameters.

Additionally, the WQ data usually have temporal autocorrelation and multi-collinearity between the WQ parameters. To consider these characteristics in the prediction models, Zhou et al. (2020) [199] proposed an ML model based on t-distributed stochastic neighbor embedding (t-SNE) and self-attention bidirectional LSTM (SA-Bi-LSTM), demonstrating substantial WQ prediction improvement. Another promising approach is uniform manifold approximation and projection (UMAP) for multidimensional WQ data ordination and classification. Unlike other dimension reduction methods, UMAP retains a global and local information structure, and the data ordination is bio-physically meaningful [200].

Inland water has naturally high spatial variation. It requires complex spatial prediction models and large datasets. The GeoAI have shown breakthroughs in spatial WQ prediction by combining field observations, remote sensing data, or UAV imagery. For example, using deep learning, RF, genetic algorithm—RF, adaptive boosting (AdaBoost), genetic algorithm—AdaBoost and the genetic algorithm—extreme gradient boosting (GA-XGBoost) [183,194]. However, these models usually demand extensive training data, which are restricted to a few pilot areas or intensely monitored areas.

Another approach in WQ prediction is the application of hybrid models and the integration of physical-based models with GeoAI methods, such as SVM, RF, ANN, and LSTM. Hybrid models usually outperformed physical-based models. For example, Noori et al. [188] found substantial improvement in monthly nitrate, ammonium, and phosphate load prediction when using hybrid SWAT-ANN models. Hybrid models are also helpful for unmonitored catchment predictions [188]. The hybrid model also improves GeoAI explanatory and generalization capability, although some disadvantages observed in the physical-based model, such as extreme values not being well predicted, persisted in the hybrid models. Similarly, the process-guided recurrent neural network (RNN), which combines the biophysical principles of the process-based model and RNN, modeled the seasonal variation of lake phosphorus loading with lower bias and better reproduced the long-term changes of phosphorus loading compared to using the physical-based model and RNN independently [21].

Overall, the GeoAI water quality prediction depends not only on the selected algorithms and settings but also on the WQ parameters, data size, and training data quality for the learning models [183,188,191].

5. Machine Learning in Fluvial Geomorphic and Morphodynamic Mapping

Fluvial geomorphology triggered the quantitative dynamic paradigm [201] as an approach to quantifying and understanding the processes of the fluvial environment [5]. The simultaneous development of techniques such as multispectral satellite images, synthetic aperture radar (SAR), LiDAR, UAV imagery, structure from motion photogrammetry (SfM), multibeam sonar (sound navigation and ranging), among others, has resulted in an unprecedented, seamless characterization and quantification of the fluvial environment and its dynamics [202,203,204]. This geospatial dataset explosion, as in many other disciplines, has resulted in the perfect foundation for applying GeoAI methods in fluvial geomorphology.

The current state-of-the-art of GeoAI in fluvial geomorphology consists of an automatic extraction of fluvial features at a fine scale by integrating larger and multidimensional datasets, using unsupervised classifiers (e.g., K-means, SOM), supervised classifiers (e.g., RF, SVM, ANN, deep learning, CNN), or by combining both methods, e.g., K-means with ANN. Most of the articles were focused on the development of the methods and workflow, the testing of new applications, or the comparison of algorithm performances [205,207,209], rather than the study of fluvial processes and underlying dynamics. These applications of GeoAI provide the basis to the discovery of new fluvial patterns and trends and increase knowledge about fluvial environments (e.g., Ling et al. 2019; Guillon et al. 2020, Heasley et al. 2020) [208,214,217].

Overall, GeoAI outperforms conventional methods of fluvial landform classification, reaching a classification accuracy of over 80%. Most common applications are found in river channels and water body mapping [208,216], the classification of riverine landforms and vegetation successions [213,214,219,220], the estimation of catchment hydrogeomorphic characteristics (e.g., valley bottom, floodplain, and terrace) [212,221], and benthic and fish habitat mapping [207,211,222,223].

Another application of GeoAI is the integration of multiple techniques to provide more accurate and very-high-resolution data for fluvial studies. For example, the fluvial environment is highly dynamic and demands frequent bathymetry surveys to understand the change and morphodynamic drivers in lakes and rivers. Emerging technologies, such as acoustic Doppler current profiler (ADCP), green LiDAR, high-resolution image radiometric model, and 3D cloud points generation with SfM, allow more frequent and accurate bathymetry mapping [203,204]. However, each approach has limitations, e.g., ADCP collects data only from areas where the sensor has passed, and it does not provide continuous spatial scanning. It does not measure near-bank areas, and it is subject to the acoustic side-lobe effect [224]. Photogrammetry and the green LiDAR method are sensitive to water turbidity and light penetration in the water column [225,226]. Therefore, multisource bathymetry modeling using the GeoAI method increases the bathymetric data accuracy and reduces uncertainties due to data quality in change detection. For example, ADCP data, image radiometric-based water depth, and SfM depth data can be integrated using U-Net convolutional neural networks [218,227].

The GeoAI approach, when using multi-temporal remote sensing data, allows the mapping of a broader fluvial landscape and its change, thereby revealing spatiotemporal scales of fluvial morphodynamics, as in e.g., Van Iersel et al. [228], Hemmelder et al. [229], and Boothroyd et al. [230]. There are different GeoAI approaches for automatic change detection using multi-temporal images such as generative adversarial networks (GAN), autoencoder, CNN, and others, as presented by Shi et al. [231].

Although GeoAI has been rapidly adopted in fluvial geomorphological studies, a wide spectrum of workflows and software is found; many GeoAI approaches seem to be under development and in the testing stage. Therefore, without a general, consistent, and robust workflow among them, it is difficult to generalize and compare the GeoAI methods performance and overall accuracies, as well as the study results.

The current limitations of GeoAI methods in fluvial studies are that the classification quality is highly dependent on expert knowledge. The unsupervised classification output is often inconsistent, and the cluster classes do not have direct geomorphic or fluvial process meaning and need a post-classification labeling. Supervised GeoAI classifiers require a large training sampling, and the training data quality is highly dependent on expert knowledge. In addition, many of the studies using GeoAI to classify fluvial landform or river typologies have been conducted in areas where an extensive quantity of previous studies and data collection exists [212,214]. Therefore, its application in poorly sampled areas is somewhat limited.

In many cases, GeoAI is enhanced with the use of fine-scale fluvial geomorphic mapping, e.g., LiDAR or UAV-based images, which are still restricted to pilot areas, mostly in Western countries. In addition, several different landform class names are used to rename fine-scale fluvial landforms, and therefore, a standardized fluvial landform taxonomy is lacking [232].

Another limitation of supervised GeoAI applications is the misclassification of elements out of the GeoAI training range, as presented, e.g., in Carbonneau et al. [205]. Moreover, the use of very different methods for assessing the GeoAI algorithm’s performance and accuracy may lead to inconsistencies in the validity of results, e.g., map cross-tabulation often uses limited validation points rather than areal-based reference data, due to the lack of geomorphological reference maps at a very fine scale. Another issue with regard to performance and accuracy assessments is the use of scalar error statistics, such as root mean square error, which may not be reliable in fluvial mapping. Here the resulting error is a complex combination of random and systematic components, and the isotropy and stationary assumptions do not apply to the fluvial process [233]. It is also heavily influenced by a small percentage of classification errors, which lead to incorrect rankings of overall model performances or to prediction error [206]. Therefore, a more consistent and comparable GeoAI-based fluvial mapping accuracy assessment is needed.

This entry is adapted from the peer-reviewed paper 10.3390/w14142211

© Text is available under the terms and conditions of the Creative Commons Attribution (CC BY) license; additional terms may apply. By using this site, you agree to the Terms and Conditions and Privacy Policy.