Taxonomy of ML Methods for Urban Applications

Contributor: Sesil Koutra , Christos S. Ioakimidis

Machine Learning (ML), at the intersection of informatics and statistics, is a promising route to more evidence-based decisions and can fill gaps left by existing technological tools and instruments for spatiotemporal requirements. As ML has moved beyond conventional modeling techniques, big data management offers considerable potential for addressing complex city problems and capturing their dynamics at the crossroads of modern urban planning challenges. Generally speaking, ML methods are categorized by the type of ‘learning’.

case-study analysis
machine learning
urban planning
spatiotemporal
smart planning
smart city

1. Introduction

Digitalization is attracting growing interest in all fields of daily life, favored by increasing computational capacity and the emergence of efficient algorithmic processes that facilitate data mining. In line with this, Machine Learning (ML), at the intersection of informatics and statistics, is a promising route to more evidence-based decisions ^[1] and can fill gaps left by existing technological tools and instruments for spatiotemporal requirements. Bhavsar et al. ^[2] define ML as a collection of data-driven models that automate data analysis through significant patterns, while the first attempts to develop machines that imitate living behavior date to the 1930s with Ross ^[3]. In 1959, Samuel described the concept as the ‘field of study that provides computers with the ability to learn without being further programmed’ ^[4].

As living laboratories in a multidimensional context with tremendous environmental and social challenges, cities are increasingly involved in these applications, especially those oriented towards the complex ambitions of sustainability, resilience and climate adaptation, among others, and they deal with a considerable mass of data (^[2]^[3]). At the same time, rapid urbanization and the degradation of quality of life (QoL) put pressure on planners to channel growth and provide monitoring strategies, while traditional methods (e.g., surveys) are time-consuming and yield insufficient outcomes. Advancements in urban geography and related sciences, commonly geographical information systems (GIS) tools, use simulations to evaluate and analyze the complex interactions in a city, but with limited efficiency in simulating scenarios of future growth ^[4]^[5], and visualize spatial, demographic and other relevant data to benefit from digital innovations and patterns. These are the key transformations needed for the abovementioned roadmaps.

Combined with technology and software advancements, ML and the field of Artificial Intelligence (AI) are being prioritized and are becoming increasingly essential for cities’ operations towards smart solutions, e.g., the optimization of energy performance or waste management (^[6]^[7]). They are widely adopted for diverse tasks of the digital society while reducing human effort ^[8], reflecting the achievements in data acquisition and the practical use of algorithmic approaches ^[9]. Many scholars have already stressed the importance of ML for accurate predictions and correlations between spatial indicators (e.g., ^[10]^[11]).

As ML has moved beyond conventional modeling techniques, big data management offers considerable potential for addressing complex city problems and capturing their dynamics at the crossroads of modern urban planning challenges ^[12]^[13]. Broadly speaking, ML consists of a group of different models and patterns able to minimize error through repeated processes of data collection, analysis and monitoring ^[14]. Based on existing definitions, ML is a set of techniques that automatically detect patterns in data and use them to predict or to support decision-making under a significant level of uncertainty ^[15]. Hence, ML comprises methods that lead to evidence-based processes meeting the standards and quality required by a complex problem. Its rapid evolution and growth, together with its emerging potential, are expected to match the challenges of modern cities in several fields (mobility, energy, etc.). More and more cities are becoming part of this dynamic, which concerns the drivers of urban functionalities or decision-making processes, optimizing performance and leading to automation. Overall, existing ML demonstrations on urban and spatial problems address spatiotemporal subjects ^[16]; however, their implications have not yet been fully explored, despite their large repertoire ^[17].

Despite the technological achievements and the progress in ML uses, data availability remains a difficult issue, and data are not equally distributed across the world. The lack of standards, privacy and confidentiality concerns, differing spatial granularities, the lack of synergies and the heterogeneous nature of the data all hinder ML operation. In addition, applications of ML algorithms in specific fields, such as geography, show how difficult it is to benchmark the relevant studies, owing to the type of data used for the ML analyses or to missing parameters ^[18].

2. Taxonomy of ML Methods for Urban Applications

Artificial Intelligence is broadly divided into different parts, namely knowledge representation, genetic algorithms, Artificial Neural Networks (ANN), data mining, etc. The fields of urban planning and engineering are set to expand globally owing to their strong, fast-growing relationship to data mining, especially in the smart cities field ^[19].

Regardless of the ML type and use, the quantity and type of data affect accuracy and efficiency and determine whether a path towards the solution and alternative developments is possible. In this process, Bhavsar et al. ^[2] underline the importance of problem definition for the appropriate application of ML methods (Figure 1). In reality, problem identification is a complex process that depends on different factors, such as data mining, user skills and perceptions, etc.

Figure 1. Machine learning algorithm steps.

Generally speaking, ML methods are categorized by the type of ‘learning’. The most commonly known are as follows ^[2]:

2.1. Supervised Learning

Supervised learning methods fit a function (or an algorithm) that computes outputs from given, labeled data (e.g., the number of dwellings per ha). This information is used in an automated process that minimizes the prediction error, expressed as the difference between the real (data) values and the computed values. Examples of this type of ML are classification problems, such as binary classification (True or False), and regression problems.
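
As a minimal sketch of the supervised setting described above, the following Python example fits a straight line by ordinary least squares and reports the prediction error as the mean squared difference between real and computed values. The data are synthetic and purely illustrative: the input is the number of dwellings per ha, and the target (an annual energy demand per ha) as well as the names `fit_line` and `mean_squared_error` are assumptions introduced here, not taken from the source.

```python
# Synthetic, illustrative data (not from the source):
# input  = dwellings per hectare
# target = hypothetical annual energy demand (MWh per hectare)
density = [10.0, 20.0, 30.0, 40.0, 50.0]
demand = [36.0, 64.0, 96.0, 124.0, 155.0]


def fit_line(xs, ys):
    """Ordinary least squares for the model y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def mean_squared_error(xs, ys, slope, intercept):
    """Average squared difference between real and computed values."""
    return sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys)) / len(xs)


slope, intercept = fit_line(density, demand)
mse = mean_squared_error(density, demand, slope, intercept)
print(f"demand ~ {slope:.2f} * density + {intercept:.2f} (MSE = {mse:.2f})")
```

The labeled pairs play the role of the "given information"; the fitted line is the function that computes outputs, and the mean squared error is one common way to express the prediction error that training tries to minimize.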

2.2. Unsupervised Learning

Unsupervised learning methods, on the other hand, depend only on unlabeled data and aim to identify hidden patterns in them. Examples of this category are clustering, which groups data based on similarities, and association, which identifies trends related to a specific problem.
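
Clustering can be sketched with k-means (Lloyd's algorithm), one of several possible grouping methods and not one the source singles out. In the example below, the neighbourhood data (dwellings per ha, share of green space in %) are made up for illustration, and the initial centroids are supplied by hand so that the run is deterministic; real applications usually initialize them randomly or with a dedicated scheme.

```python
import math


def kmeans(points, centroids, iterations=10):
    """Lloyd's algorithm with caller-supplied initial centroids."""
    labels = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                new_centroids.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members))
                )
            else:
                new_centroids.append(centroids[k])  # keep an empty cluster as is
        if new_centroids == centroids:
            break  # converged: centroids no longer move
        centroids = new_centroids
    return labels, centroids


# Hypothetical neighbourhoods: (dwellings per ha, green space share in %).
neighbourhoods = [(12, 40), (15, 38), (14, 42), (80, 10), (85, 12), (78, 8)]
initial = [neighbourhoods[0], neighbourhoods[3]]  # hand-picked for determinism

labels, centroids = kmeans(neighbourhoods, initial)
print(labels)
print(centroids)
```

No labels are given to the algorithm; the two groups (low-density green areas and dense built-up areas) emerge only from the similarity between points.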

2.3. Machine Learning Algorithms: An Overview

However, classifying ML methods for urban developments requires a thorough analysis of a set of attributes. Although there are many areas of focus, a major driver of ML use is land use and land cover, which strongly supports sustainable development. Nonetheless, despite the rise of smart cities and related concepts and the advancement of big data, there is little evidence regarding classification, simulation or prediction ^[20]; this section is a step towards developing this ground.

Murphy ^[21] proposes three main types of ML methods: supervised (predictive) learning, which identifies a mapping from inputs to outputs given a specific set of input-output pairs; unsupervised learning, where only the inputs are given; and reinforcement learning, which is less commonly used and explains how to act when given occasional rewards (Figure 2).
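
Of the three types, reinforcement learning is the least familiar, so a small illustration may help. The sketch below uses an epsilon-greedy multi-armed bandit, a very simple form of reinforcement learning chosen here for brevity and not described in the source. The three "plans" and their rewards are hypothetical, and the rewards are fixed rather than noisy to keep the example easy to follow.

```python
import random


def epsilon_greedy(reward_fn, n_actions, steps=200, epsilon=0.1, seed=0):
    """Learn action values from rewards alone, with occasional exploration."""
    rng = random.Random(seed)
    estimates = [0.0] * n_actions  # current value estimate per action
    counts = [0] * n_actions       # how often each action was tried
    for t in range(steps):
        if t < n_actions:
            action = t  # try every action once first
        elif rng.random() < epsilon:
            action = rng.randrange(n_actions)  # explore a random action
        else:
            action = max(range(n_actions), key=lambda a: estimates[a])  # exploit
        reward = reward_fn(action)
        counts[action] += 1
        # Incremental mean of the rewards observed for this action.
        estimates[action] += (reward - estimates[action]) / counts[action]
    return estimates, counts


# Hypothetical, deterministic rewards for three candidate plans
# (an assumption for illustration; real rewards are usually noisy).
PLAN_REWARD = [0.2, 0.8, 0.5]

estimates, counts = epsilon_greedy(lambda a: PLAN_REWARD[a], n_actions=3)
best_plan = max(range(3), key=lambda a: estimates[a])
print(best_plan, estimates, counts)
```

Unlike the supervised case, the learner is never told which plan is correct; it only receives a reward after each choice and gradually favors the action with the highest estimated value.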

Figure 2. Taxonomy of ML common practices ^[21].

Emerging methods such as convolutional neural networks (CNN) have proved efficient in extracting features from spatial data ^[22], and recurrent neural networks (RNN) are promising approaches for accurate urban simulations. Examples of successful applications are found in the survey by Chen et al. ^[23]^[24], which covers road extraction from both 2D optical remote sensing images and 3D point clouds, the data commonly acquired for roads. The same study describes a comprehensive approach to morphological feature-based tools for road shape features, including support vector machines (SVM) ^[25]. Chen et al. also provide three classifications for road area extraction, based either on traditional methods for identifying road features (e.g., Lu et al. ^[26], Perciano et al. ^[27], etc.) or on deep learning ^[28].

Ensemble-based methods, such as random forests (RF) and similar methods, are increasingly used in problem–solution studies of smart urban forms (e.g., ^[29]^[30]^[31]^[32]). More generally, ML methods are regarded as a promising means of achieving smarter and more inclusive urban configurations in the fabric of modern cities ^[33].

2.4. Decision-Making Urban Planning Processes

Decision-making processes are fundamental to urban planning strategies: they rely on simplified representations of reality that enable decisions and interactions and allow planners to adjust or modify them in vitro using parametric proposals. Decision-support systems (DSS) facilitate the integration of models and enable interactions between the diverse parameters, so that solutions can be adjusted or tested and their consequences evaluated, leading to desirable and viable solutions. For the special case of predictive modeling, ML has been used to identify urban patterns and related indicators. A quick look at the existing literature and the Scopus database correlations identifies 585 documents on ML and decision-making processes published in the United States and China, as presented in Figure 3.

Figure 3. Correlations of ML and decision-making processes, Scopus database.

Among the decision tree algorithms, CART (Classification and Regression Tree) and ID3 (Iterative Dichotomiser 3) are the most commonly used for land-use classification, acting as a random subset of the predicting parameters. Random forest (RF), for its part, supports both classification and regression analyses and handles a large volume of indicators ^[34]. In ML applications, DSS processes are usually represented as predictive models of urban form that support decisions and improve the design of those forms (Figure 4) ^[33].
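
To illustrate how such trees choose a split, the sketch below computes the two standard criteria: Gini impurity, used by CART for classification, and entropy-based information gain, used by ID3. The land-cover samples are hypothetical. Note one simplification: ID3 in its original form splits on categorical attributes, so applying a numeric threshold with the entropy criterion here is an assumption made only to compare the two measures on the same data.

```python
import math
from collections import Counter


def gini(labels):
    """Gini impurity: the CART classification criterion."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def entropy(labels):
    """Shannon entropy in bits: the basis of ID3's information gain."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def split_gain(values, labels, threshold, impurity):
    """Impurity reduction obtained by splitting on value <= threshold."""
    left = [lab for v, lab in zip(values, labels) if v <= threshold]
    right = [lab for v, lab in zip(values, labels) if v > threshold]
    if not left or not right:
        return 0.0
    n = len(labels)
    weighted = (len(left) / n) * impurity(left) + (len(right) / n) * impurity(right)
    return impurity(labels) - weighted


def best_threshold(values, labels, impurity):
    """Try each observed value (except the largest) as a split point."""
    candidates = sorted(set(values))[:-1]
    return max(candidates, key=lambda t: split_gain(values, labels, t, impurity))


# Hypothetical samples: built-up share of a parcel (%) and its land-cover class.
built_share = [5, 8, 12, 40, 55, 60]
land_cover = ["green", "green", "green", "built", "built", "built"]

print(best_threshold(built_share, land_cover, gini))
print(best_threshold(built_share, land_cover, entropy))
```

A full tree repeats this search recursively on each resulting subset, and a random forest trains many such trees on resampled data and random subsets of the indicators before combining their votes.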

Figure 4. A cycle of ML application for urban form decision support system ^[33].

This entry is adapted from the peer-reviewed paper 10.3390/land12010083

References

Jordan, M.I.; Mitchell, T.M. Machine Learning: Trends, perspectives and prospects. Science 2015, 349, 255–260.
Al-Garadi, M.A.; Mohamed, A.; Al-Ali, A.K.; Du, X.; Guizani, M. A survey of machine and deep learning methods for Internet of Things security. IEEE Commun. Surv. Tutor. 2020, 22, 1646–1685.
Bhavsar, P.; Safro, I.; Bouaynaya, N.; Polikar, R.; Dera, D. Machine Learning in Transportation Data Analytics. In Data Analytics for Intelligent Transportation Systems; Elsevier: Amsterdam, The Netherlands, 2017; pp. 283–306.
Ross, T. The synthesis of intelligence-its implications. Psychol. Rev. 1938, 45, 185.
Choung, Y.J.; Kim, J.M. Study of the relationship between urban expansion and PM10 concentration using multi-temporal spatial datasets and the machine learning technique: Case study for Daegu, South Korea. Appl. Sci. 2019, 9, 1098.
Fecht, D.; Beale, L.; Briggs, D. A GIS-based urban simulation model for environmental health analysis. Environ. Model. Softw. 2014, 58, 1–11.
Samuel, A.L. Some studies in Machine Learning using the game of checkers. IBM J. Res. Dev. 1959, 3, 210–229.
Li, X.; Cheng, S.; Lv, Z.; Song, H.; Jia, T.; Lu, N. Data analytics of urban fabric metrics for smart cities. Futur. Gener. Comput. Syst. 2020, 107, 871–882.
Schwab, K. The Fourth Industrial Revolution, 1st ed. 2016. Available online: https://law.unimelb.edu.au/__data/assets/pdf_file/0005/3385454/Schwab-The_Fourth_Industrial_Revolution_Klaus_S.pdf (accessed on 12 November 2022).
Chaturvedi, V.; De Vries, W. Machine Learning Algorithms for Urban Land Use Planning: A review. Urban Sci. 2021, 5, 68.
Shaikh Hameed, P.; Mohd Nor, N.; Nallagownden, P.; Elamvazuthi, I.; Ibrahim, T. A review on optimized control systems for building energy and comfort management of smart sustainable buildings. Renew. Sustain. Energy Rev. 2014, 34, 409–429.
Neri, E.; Coppola, F.; Miele, V.; Bibbolino, C.; Grassi, R. Artificial Intelligence: Who is responsible for the diagnosis? Radiol. Med. 2020, 125, 517–521.
Samardzic-Petrovic, M.; Kovacevic, M.; Bajat, B.; Dragicevic, S. Machine learning techniques for modelling short term land-use change. ISPRS Int. J. Geo-Inf. 2017, 6, 387.
Batty, M. Big data and the city. Built Environ. 2016, 42, 321–337.
Hagenauer, J.; Omrani, H.; Helbich, M. Assessing the performance of 38 Machine Learning models: The case of land consumption rates in Bavaria, Germany. Int. J. Geogr. Inf. Sci. 2019, 33, 1399–1419.
Robert, C. Machine Learning, a Probabilistic Perspective. Chance 2014, 27, 62–63.
Gomez, J.A.; Patino, J.E.; Duque, J.C.; Passos, S. Spatiotemporal modeling of urban growth using machine learning. Remote Sens. 2020, 12, 109.
Lim, T.S.; Loh, W.Y.; Shih, Y.S. A comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms. Mach. Learn. 2000, 40, 203–228.
Casali, Y.; Yonca Aydin, N.; Comes, T. Machine learning for spatial analyses in urban areas: A scoping review. Sustain. Cities Soc. 2022, 85, 104050.
Varshney, H.; Khan, R.A.; Khan, U.; Verma, R. Approaches of Artificial Intelligence and Machine Learning in Smart Cities: A critical review. IOP Conf. Ser. Mater. Sci. Eng. 2021, 1022, 012019.
Gao, J.; O’Neill, B.C. Data-driven spatial modeling of global long-term urban land development: The select model. Environ. Model. Softw. 2019, 119, 458–471.
Murphy, K. Machine Learning: A Probabilistic Perspective. 2012. Available online: http://noiselab.ucsd.edu/ECE228/Murphy_Machine_Learning.pdf (accessed on 12 November 2022).
Fathi, S.; Srinivasan, R.; Fenner, A.; Fathi, S. Machine learning applications in urban building energy performance forecasting: A systematic review. Renew. Sustain. Energy Rev. 2020, 133, 110287.
Chen, Z.; Deng, L.; Luo, Y.; Li, D.; Marcato, J.; Gonçalves, W.N.; Nurunnabi, A.; Li, J.; Wang, C.; Li, D. Road extraction in remote sensing data: A survey. Int. J. Appl. Earth Obs. Geoinf. 2022, 112, 102833.
Poullis, C.; You, S. Delineation and geometric modeling of road networks. ISPRS J. Photogramm. Remote Sens. 2010, 65, 165–181.
Wegner, J.D.; Montoya-Zegarra, J.A.; Schindler, K. Road networks as collections of minimum cost paths. ISPRS J. Photogramm. Remote Sens. 2015, 108, 128–137.
Lu, P.; Du, K.; Yu, W.; Wang, R.; Deng, Y.; Balz, T. A new region growing-based method for road network extraction and its application on different resolution SAR images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 4772–4783.
Perciano, T.; Tupin, F.; Hirata, R.; Cesar, R.M. A two-level Markov random field for road network extraction and its application with optical, SAR and multitemporal data. Int. J. Remote Sens. 2016, 37, 3584–3610.
Khesali, E.; Zoej Valadan, M.J.; Mokhtarzade, M.; Dehghani, M. Semi automatic road extraction by fusion of high resolution optical and radar images. J. Indian Soc. Remote Sens. 2016, 44, 21–29.
Jochem, W.C.; Bird, T.J.; Tatem, A.J. Identifying residential neighbourhood types from settlement points in a machine learning approach. Comput. Environ. Urban Syst. 2018, 69, 104–113.
Ma, J.; Cheng, J.C.; Jiang, F.; Chen, W.; Zhang, J. Analyzing driving factors of land values in urban scale based on big data and non-linear machine learning techniques. Land Use Policy 2020, 94, 104537.
Novack, T.; Esch, T.; Kux, H.; Stilla, U. Machine learning comparison between worldview-2 and quickbird-2-simulated imagery regarding object-based urban land cover classification. Remote Sens. 2011, 3, 2263–2282.
Shafizadeh-Moghadam, H.; Asghari, A.; Tayyebi, A.; Taleai, M. Coupling machine learning, tree-based and statistical models with cellular automata to simulate urban growth. Comput. Environ. Urban Syst. 2017, 64, 297–308.
Koumetio Tekouabou, S.C.; Diop, E.B.; Azmi, R.; Jaligot, R.; Chenal, J. Reviewing the application of machine learning methods to model urban form indicators in planning decision support systems: Potential, issues and challenges. J. King Saud Univ. Inf. Sci. 2022, 22, 5943–5967.
Hernandez Ruiz, I.E.; Shi, W.A. Random forests classification method for urban land-use mapping integrating spatial metrics and texture analysis. Int. J. Remote Sens. 2018, 39, 1175–1198.
