Data Used in Urban Flooding Management: Comparison
Please note this is a comparison between Version 2 by Camila Xu and Version 3 by Camila Xu.

Data-driven approaches to urban flooding management require a comprehensive understanding of how heterogenous data are leveraged in tackling this problem.

  • data
  • urban waterlogging
  • urban flooding

1. Introduction

Emergency managers interpret disasters as recurring events whose control focuses on four phases: mitigation, preparedness, response, and recovery [1]. During the entire cycle, it is critical to share data and knowledge for effective decision making. On the one hand, disasters are usually a continuous and changeable process, with no clear boundaries between the phases. This calls for continuous monitoring and data sharing across the phases [2]. On the other hand, there are multiple individuals and organizations with different backgrounds and expertise, which requires communication and collaboration across different levels and locations [3].
The acquisition, storage, and elaboration of large-scale, multi-modal data has become more affordable due to advancement and diffusion of smart city technologies, such as Internet of Things (IoT) solutions, sensor networks, and cloud computing [4]. Urban flooding research largely acknowledges that the combination of data and case-based reasoning can provide relevant insight into natural disaster reduction (e.g., [5][6][7][8]). Despite these premises, a comprehensive understanding is still missing on how heterogeneous data should be leveraged in urban flooding management.
Most of the existing systems offer singular functions that are designed to satisfy specific user needs, however, may not meet needs of other user communities. One of the reasons is that the related tasks are diverse, and the data required for analysis are highly heterogeneous in form and interdisciplinary and distributed in nature [8]. In addition, the links between the tasks and data are unclear, which makes it difficult to decide on appropriate data to be collected for a specific task.
Data-driven approaches to urban flooding management require the consideration of task–data configurations. Although several studies have acknowledged the importance of such a goal, they dealt with the problem at a conceptual level or used ontology to model the tasks and data but without specifying what data are required and how tasks are associated [2][9].

2. Data Used in Urban Flooding Management

The data are categorized based on the subject nature as hydrological data, topographic data, urban planning data, traffic data, disaster damage data, census data, human perception and behavior data, and parameter data. Such data are collected through various sources, including curated sources, aerial images, radar images, physical sensors, social media, open web datasets, web news (excluding social media), and surveys and interviews. Data of the same category can be acquired from multiple sources.

2.1. Hydrological Data

Hydrological data often include data related to precipitation and watercourse evaporation in the context of urban flooding. Precipitation or rainfall is typically measured by intensity and duration. For intensity, most estimates an average from historical data, and the timespan for consideration can be annual [10], daily [10], or hourly [11]. For duration, minute-based [12][13], hourly [14][15][16][17][18][19], daily [20][21][22], and multi-day-based [10] models are all reported. It is also noticed all studies made use of historical data, and real-time precipitation are only addressed in theoretical studies on data and system integration models [5][8][23]. Watercourse data describe the watercourse network [10][15] or dynamic data about the water flows [21][24]. It is used in those studies where the areas of concern contain significant water bodies, such as rivers, canals, and lakes. Evaporation data describe the speed of which surface water vapors. It is used in only a handful of studies [16][20], since evaporation is recognized to play a minor role. Hydrological data are predominantly collected from curated sources, particularly those managed by administrative bodies (e.g., weather stations, research institutions). Evaporation rates are simply treated as constant variables based on scientific parameter data.

2.2. Topographic Data

Topographic data in the context of urban flooding predominantly refer to data recording the elevation and form of terrain that is divided into units of areas. The majority simply dividing a geographical area into equal sized grids, and the granularity of the grids varies, ranging from several square meters to hundreds of square meters to square kilometers. Topographic data are widely used in inundation simulation and prediction (e.g., [13][25]) and risk analysis [16][26]. They are typically extracted from curated sources, such as Digital Elevation Models (DEM) built on various data inputs (e.g., high-resolution aerial images), and contour maps. Topographical data may come from authoritative organizations and government agencies [22][27] and contour maps. A couple of studies used open web datasets, such as OpenStreetMap [11].

2.3. Urban Planning Data

Urban planning data refer to a broad range of data describing the development and design of land use and the built environment in a human populated area. They are often coupled with topographic data [18][28]. Popularly-used urban planning data are summarized as below. Drainage network data describe the layout of the drainage/sewage system in a city, and are mainly used to calculate discharge capacity and surface water flows. While some use specific parameters (the locations of manholes, pipe length, junction depth, conduit size, diameters, and pipe materials) to calculate the discharge at different locations [17][21][28][29], the majority use measurements from design standards [13][14][20][28][30]. Drainage network data are predominantly collected from curated sources. Drainage monitoring data refer to data about discharge flow within the drainage network, often collected in real-time. These can be used for flood monitoring [23]. Such data needs to be collected through physical sensors. Catchment areas data describe how an area is subdivided into smaller units, which are arguably not directly encoded in any sources but defined on an ad hoc basis. A common approach is to define granular areas into equal sized shapes, such as the grid/cell/block system [11][20] or based on locations of manholes [28]. Huang et al. created areas of irregular shapes and sizes based on ecological and hydrological rules [24]. Luan et al. [29] used ArcGIS to digitize the properties of the confluence nodes of the drainage pipeline network in order to identify the flooded locations. Land use determines the capacity of draining excess surface water by natural means, such as infiltration, and are primarily used for cause analysis [10], inundation simulation and prediction [20][31], and risk analysis [30]. Land use definitions are often ad hoc and case-driven. For example, Wu et al. [32] identified three types: agricultural, residential and industrial, and transport; Hou and Du [27] highlighted water body, green land and unused land; Yu and Coulthard [20] only distinguished urban from rural land; Hu et al. [12] defined six types: open land, low-density residence, green/garden area, high density residence, road, and lake. Land use data are primarily used for cause analysis [10], inundation simulation and prediction [20][31], and risk analysis [30], and are typically gathered from curated sources, such as administrative bodies, and can be analyzed based on satellite images [10][13] or radar images [33]. Point-of-Interest (POI) data describe public facilities, carrying information about their different degrees of attracting the crowd [34]. Zhang et al. hypothesized that different types of POIs (e.g., green area vs. stadiums) may be useful indications of land use and therefore can inform risk analysis [11]. Ferligoj identified not only common POIs (e.g., schools) but also those that may affect evacuation planning (e.g., hospitals) [35]. POI data can be collected from open web datasets [11][34] or curated sources [35]. Road network and public transport are both data related to the transportation system, and are widely used in inundation simulation and prediction [11], risk analysis [16], flood monitoring [36], and response and evacuation planning [35].

2.4. Traffic Data

Traffic data describe the movement of transportations in a human populated area. They record information, such as the volume, speed, direction, and location of traffic. In theory, they are particularly useful for risk analysis [16], flood monitoring [36], and response and evacuation modelling; however, they are rarely used. She et al. used GPS data uploaded by taxis to estimate traffic flows during rainstorms and predicted flooded streets based on the changes in traffic movement [36]. Su et al. used a traffic simulation model that takes input of a series of parameters, such as volume, speed, and traffic signal operation data [16]. Traffic data can be collected via physical sensors (i.e., GPS) [36] or curated sources [16].

2.5. Disaster Damage Data

Disaster damage data describe the extent of physical damage caused by urban flooding, and the economical and societal loss. The extent of physical damage is often described in terms of flooded areas and severity. These usually record the exact locations (e.g., streets, buildings, or as precise as geo-coordinates), and parameters, such as the area size, water depth, and duration. Such data can be obtained by analyzing textual and imagery data or geo-coordinate data in social media posts, and the analysis often involves image recognition, text analysis, or manual processing. Such data are often collected for flood monitoring [37] and are used in a wide range of tasks, including in inundation simulation and prediction [11], cause analysis [10], risk analysis [30], response and evacuation planning [38][39][40], and trend analysis [24]. Data for assessing economic and societal loss are less. Chang and Huang proposed an integrated ecological and economic system to evaluate the ‘emergy’ values of vulnerability [41]. Quan reported unitary costs (CNY/m2) for replacing certain residence building structures [30]; while Han et al. [21] related different levels of water depth to traffic conditions measured by vehicle discharge per hour. Damage data can be sourced from a wide range of channels. In addition to curated sources typically maintained by government administrative bodies [8][10][20][41][42], there is also wide use of aerial images from satellites [24][32][42] and UAVs [43][44], radar images [33], physical sensors [45][46], social media [11][26][42][47][48][49], and web news [25][26][34].

2.6. Census Data

Census data describe the population of an administrative area and may include (but are not limited to) the size and density of a population, demographics, social economic status, and household composition. Census data are often needed to quantify vulnerability of an area during urban flooding in risk analysis, to inform response and evacuation planning, or to evaluate the damage. For example, Ferligoj used the population density of Buenos Aires to quantify access to public facilities (e.g., public transport and hospitals) [35]. Similar work can be found in [26][34]. Census data are predominantly collected from curated sources, typically government administrative data, such as China City Statistics Yearbook [50]. Some of these have been made available as open web datasets (e.g., the UK open census data).

2.7. Human Perception and Behaviour Data

Human perception and behavior data describe people’s perceptions about urban flooding issues and understandings of how they behave during flooding incidents. Such data can benefit various tasks, such as policy analysis and cause analysis [51], and response and evacuation planning [39]. Human perception and behavior data are difficult to observe directly [52] and can be collected through surveys and interviews [39]. Social media also provides information on emotions, thoughts, and behaviors [42][47].

2.8. Parameter Data

Parameter data are those acting as configuration variables that are internal to a model, and are often found as arbitrary, ad hoc parameters in computational models or decision analysis models. For example, Chang et al. used parameters, such as equipment type, unit rent, average operating cost, and the unit penalty for shortage, in evaluating flood emergency plans [38]. Chen et al. evaluated evacuation plans by simulation, in which vehicles (e.g., ambulance and emergency communication vehicles) were assigned different degrees of mobility in terms of the number of grids they move at each single turn [53]. Concerning evacuation planning, Ding et al. defined the costs of different sizes of rescue team based on the labor cost, equipment rental cost, and material consumption [54]. Earlier in Section 3.1, some studies used the runoff coefficient as a constant parameter in their inundation simulation models. The parameter values are typically estimated by considering scenarios that represent the possible realistic situations or learned from the statistics [38][54].


  1. Federal Emergency Management Agency. Information Technology Architecture, Version 2.0: The Road to e-FEMA (Volume 1). 2001. Available online: (accessed on 28 June 2021).
  2. Qiu, L.; Du, Z.; Zhu, Q.; Fan, Y. An integrated flood management system based on linking environmental models and disaster-related data. Environ. Model. Softw. 2017, 91, 111–126.
  3. Xia, W.; Becerra-Fernandez, I.; Gudi, A.; Rocha-Mier, J. Emergency management task complexity and knowledge-sharing strategies. Cutter IT J. 2011, 24, 20–25.
  4. Sinha, A.; Kumar, P.; Rana, N.P.; Islam, R.; Dwivedi, Y.K. Impact of internet of things (IoT) in disaster management: A task-technology fit perspective. Ann. Oper. Res. 2019, 283, 759–794.
  5. Shao, W.; Zhang, H.; Liu, J.; Yang, G.; Chen, X.; Yang, Z.; Huang, H. Data integration and its application in the sponge city construction of CHINA. Procedia Eng. 2016, 154, 779–786.
  6. Wu, X.; Yang, X.; Li, L.; Wang, G. Review and prospect of the emergency management of urban rainstorm waterlogging based on big data fusion. Chin. Sci. Bull. 2017, 62, 920–927. (In Chinese)
  7. Yu, F.; Li, X.; Han, X. Risk response for urban water supply network using case-based reasoning during a natural disaster. Saf. Sci. 2018, 106, 121–139.
  8. Wu, Z.; Shen, Y.; Wang, H.; Wu, M. An ontology-based framework for heterogeneous data management and its application for urban flood disasters. Earth Sci. Inform. 2020, 13, 377–390.
  9. Zlatanova, S.; De Vries, M.E.; Van Oosterom, P.J.M. Ontology-based query of two dutch topographic data sets: An emergency response case. In Proceedings of the Core Spatial Databases-Updating, Maintenance and Services-from Theory to Practice, Haifa, Israel, 15–17 March 2010. IAPRS, XXXVIII (4-8-2/W9).
  10. Wu, X.; Yu, D.; Chen, Z.; Wilby, R. An evaluation of the impacts of land surface modification, storm sewer development, and rainfall variation on water-logging risk in Shanghai. Nat. Hazards 2012, 63, 305–323.
  11. Zhang, N.; Chen, H.; Chen, J.; Chen, X. Social media meets big urban data: A case study of urban waterlogging analysis. Comput. Intell. Neurosci. 2016, 2016, 1–10.
  12. Hu, M.; Sayama, T.; Zhang, X.; Tanaka, K.; Takara, K.; Yang, H. Evaluation of low impact development approach for mitigating flood inundation at a watershed scale in China. J. Environ. Manag. 2017, 193, 430–438.
  13. Chen, J.; Hill, A. A GIS-based model for urban flood inundation. J. Hydrol. 2009, 373, 184–192.
  14. Fan, X.; Matsumoto, T. GIS-Based Social Cost-Benefit Analysis on Integrated Urban Water Management in China: A Case Study of Sponge City in Harbin. J. Manag. Stud. 2019, 11, 5527.
  15. Shi, Y.; Shi, C.; Xu, S.; Sun, A.; Wang, J. Exposure assessment of rainstorm waterlogging on old-style residences in Shanghai based on scenario simulation. Nat. Hazards 2010, 53, 259–272.
  16. Su, B.; Huang, H.; Li, Y. Integrated simulation method for waterlogging and traffic congestion under urban rainstorms. Nat. Hazards 2016, 81, 23–40.
  17. Xu, H.; Ma, C.; Lian, J.; Xu, K.; Chaima, E. Urban flooding risk assessment based on an integrated k-means cluster algorithm and improved entropy weight method in the region of Haikou, China. J. Hydrol. 2018, 563, 975–986.
  18. Bao, S.; Kim, C.; Ai, W.; Lai, Z.; Wang, J. Urban water-log simulation and prediction based on multi-agent systems. In Proceedings of the 13th International Conference on GeoComputation, Richardson, TX, USA, 20–23 May 2015; pp. 317–325.
  19. Xue, F.; Huang, M.; Wang, W.; Zou, L. Numerical simulation of urban water-logging based on flood area model. Adv. Meteorol. 2016, 1, 3940707.
  20. Yu, D.; Coulthard, T. Evaluating the importance of catchment hydrological parameters for urban surface water flood modelling using a simple hydro-inundation model. J. Hydrol. 2015, 524, 385–400.
  21. Han, S.; Xie, Y.; Li, D.; Li, P.; Sun, M. Risk analysis and management of urban rainstorm water logging in Tianjin. J. Hydrodyn. 2006, 18, 552–558.
  22. Sarkar, S.; Rahman, A.; Esraz-Ul-Zannat, M.; Islam, F. Simulation-based modelling of urban waterlogging in Khulna City. J. Water Clim. Chang. 2021, 12, 566–579.
  23. Ma, Q.; Yang, B.; Wang, J. Application of Internet of Things in Urban Waterlogging Prevention Management System. Adv. Internet Things 2017, 7, 1–9.
  24. Huang, C.; Chen, Y.; Wu, J. Mapping spatio-temporal flood inundation dynamics at large river basin scale using time-series flow data and MODIS imagery. Int. J. Appl. Earth Obs. Geo-Inf. 2014, 26, 350–362.
  25. Zeng, Z.; Lan, J.; Hamidi, A.; Zou, S. Integrating Internet media into urban flooding susceptibility assessment: A case study in China. Cities 2020, 101, 102697.
  26. Wu, Z.; Shen, Y.; Wang, H. Assessing Urban Areas Vulnerability to Flood Disaster Based on Text Data: A Case Study in Zhengzhou City. Sustainability 2019, 11, 4548.
  27. Hou, J.; Du, Y. Spatial simulation of rainstorm waterlogging based on a water accumulation diffusion algorithm. Geomat. Nat. Hazards Risk 2020, 11, 71–87.
  28. Meng, X.; Zhang, M.; Wen, J.; Du, S.; Xu, H.; Wang, L.; Yang, Y. A Simple GIS-Based Model for Urban Rainstorm Inundation Simulation. Sustainability 2019, 11, 2830.
  29. Luan, Q.; Zhang, K.; Liu, J.; Wang, D.; Ma, J. The application of Mike Urban model in drainage and waterlogging in Lincheng county, China. Proc. Int. Assoc. Hydrol. Sci. 2018, 379, 381–386.
  30. Quan, R. Rainstorm waterlogging risk assessment in central urban area of Shanghai based on multiple scenario simulation. Nat. Hazards 2014, 73, 1569–1585.
  31. Bandyopadhyay, M.; Singh, V. Agent-based geosimulation for assessment of urban emergency response plans. J. Geosci. 2018, 11, 165.
  32. Ticehurst, K.; Guerschman, J.; Chen, T. The strengths and limitations in using daily MODIS data for identifying flood events. Remote Sens. 2014, 6, 11791–11809.
  33. Dourado, F.; Fernandes, A. RADAR images supporting rescue and recovery actions for Landslide and flood disasters: A Rio de Janeiro State case study. In Landslide Science for a Safer Geoenvironment; Springer: Cham, Germany, 2014; pp. 551–555.
  34. Lin, T.; Liu, X.; Song, J.; Zhang, G.; Jia, Y.; Tu, Z.; Zheng, Z.; Liu, C. Urban waterlogging risk assessment based on internet open data: A case study in China. Habitat Int. 2018, 71, 88–96.
  35. Ferligoj, Y. Urban Impact Assessment and Emergency Response to Flooding in Buenos Aires, Argentina. Master’s Thesis, University of Canterbury, Christchurch, New Zealand, 2018.
  36. She, S.; Zhong, H.; Fang, Z.; Zheng, M.; Zhou, Y. Extracting Flooded Roads by Fusing GPS Trajectories and Road Network. ISPRS Int. J. Geo-Inf. 2019, 8, 407.
  37. Jiang, R.; Yang, S.; Xie, J. Three-dimensional visualization emergency management information system of urban waterlogging. Comput. Eng. 2019, 45, 46–51. (In Chinese)
  38. Chang, M.; Tseng, Y.; Chen, J. A scenario planning approach for the flood emergency logistics preparation problem under uncertainty. Transp. Res. Part E Logist. Transp. Rev. 2007, 43, 737–754.
  39. Simonovic, S.; Ahmad, S. Computer-based Model for Flood Evacuation Emergency Planning. Nat. Hazards 2005, 34, 25–51.
  40. Jiang, J.; Zhang, P. Route optimization of urban waterlogging rescue based on improved ant colony optimization. J. Comput. Appl. 2014, 34, 2103–2106. (In Chinese)
  41. Chang, L.; Huang, S. Assessing urban flooding vulnerability with an emergy approach. Landsc. Urban Plan. 2015, 143, 11–24.
  42. Rosser, J.; Leibovici, D.; Jackson, M. Rapid flood inundation mapping using social media, remote sensing, and topographic data. Nat. Hazards 2017, 87, 103–120.
  43. Feng, Q.; Liu, J.; Gong, J. Urban flood mapping based on unmanned aerial vehicle remote sensing and random forest classifier-a case of Yuyao, China. Water 2015, 7, 1437–1455.
  44. Perks, M.; Russell, A.; Large, A. Technical note: Advances in flash flood monitoring using unmanned aerial vehicles (UAVs). Hydrol. Earth Syst. Sci. 2016, 20, 4005–4015.
  45. Liu, Y.; Du, M.; Jing, C.; Cai, G. Design and implementation of monitoring and early warning system for system for urban roads waterlogging. In Computer and Computing Technologies in Agriculture; Springer: Cham, Switzerland, 2015; Volume VIII, pp. 610–615.
  46. Jiang, J.; Liu, J.; Cheng, C.; Huang, J.; Xue, A. Automatic Estimation of Urban Waterlogging Depths from Video Images Based on Ubiquitous Reference Objects. Remote Sens. 2019, 11, 587.
  47. Choi, S.; Bae, B. The real-time monitoring system of social big data for disaster management. In Computer Science and Its Applications; Springer: Berlin/Heidelberg, Germany, 2015; pp. 809–815.
  48. Zhang, N.; Zheng, G.; Chen, H.; Chen, X.; Chen, J. Monitoring urban waterlogging disasters using social sensors. In Proceedings of the Chinese Semantic Web and Web Science Conference, Wuhan, China, 8–12 August 2014.
  49. Wang, Y.; Wang, T.; Ye, X.; Zhu, J.; Lee, J. Using social media for emergency response and urban sustainability: A case study of the 2012 Beijing rainstorm. Sustainability 2016, 8, 25.
  50. Sun, S.; Zhai, J.; Li, Y.; Huang, D.; Wang, G. Urban waterlogging risk assessment in well-developed region of Eastern China. Phys. Chem. Earth 2020, 115, 102824.
  51. Wang, Y.; Sun, M.; Song, B. Public perceptions of and willingness to pay for sponge city initiatives in China. Resour. Conserv. Recycl. 2017, 122, 11–20.
  52. Wang, B.; Loo, B.; Zhe, F.; Xi, G. Urban resilience from the lens of social media data: Responses to urban flooding in Nanjing, China. Cities 2020, 106, 102884.
  53. Chen, P.; Zhang, J.; Sun, Y.; Liu, X. Wargame simulation theory and evaluation method for emergency evacuation of residents from urban waterlogging disaster area. Int. J. Environ. Res. Public Health 2016, 13, 1260.
  54. Ding, J.; Cai, J.; Guo, G.; Cheng, C. An emergency decision-making method for urban rainstorm water-logging: A China study. Sustainability 2018, 10, 3453.