Google Earth Engine (GEE) is a cloud computing platform that was launched by Google in 2010. Since then, GEE has demonstrated its capacity of preprocessing and mining of geographic big data. GEE enables cloud computation and is an effective tool for carrying out the analysis of global geospatial big data.
Global environmental change is one of the most critical challenges facing attempts to achieve sustainable development. The concept of Earth system science was first proposed by the Earth System Sciences Committee of NASA (National Aeronautics and Space Administration) in the 1990s [1]. This interdisciplinary science emerged to address global environmental change in a way that was systematic and integrated. Yu and Gong [2] identified three challenges facing traditional geographic information science (GIS): (1) the 3D global representation and visualization of geospatial data from local to global scales; (2) the preprocessing and mining of geographic big data; and (3) spatial-temporal techniques for research into geographic processes. There is an urgent need to make significant technological advances to solve global environmental problems [2]. In connection with this, the use of Google Earth Engine (GEE) to analyze environmental problems at global scales is becoming common.
Google Earth Engine (GEE) is a cloud computing platform that was launched by Google in 2010 [3]. Since then, GEE has demonstrated its capacity to address the second challenge listed at the beginning of this section. GEE enables cloud computation and is an effective tool for carrying out the analysis of global geospatial big data. Other cloud platforms that can be used for processing geospatial big data include Amazon Web Services (AWS, released in 2006) and Microsoft Azure (released in 2010) [4]. Compared with other cloud platforms, GEE supports more types of geospatial data (for example, Sentinel and early Landsat data) and provide services free to all users, which is especially important in less developed countries. GEE is currently the most popular cloud computing platform in Earth system science and was extensively used to process data related to a variety of fields concerned with environmental change, including agriculture [5][6][7][8][9], water [10][11][12][13][14], land cover/land use [15][16][17][18], disasters [19][20][21][22][23], climate change [24][25], soil [26][27][28][29], wetland [30][31][32][33][34][35], forest [36][37], and urbanization [38][39][40][41] as well as other fields [4][42]. GEE provides users with publicly downloadable Earth observation data at the petabyte scale, advanced algorithms for analyzing geographic big data, and an interactive programming environment. GEE also hosts long time-series of Earth observation records and plays a vital role in environmental monitoring and analysis [3].
We used“Google Earth Engine” as keywords to retrieve relevant articles and review articles from the Web of Science (WoS) (https://www.webofscience.com/ ) core collection, including the SCIE (Science Citation Index Expanded) and SSCI (Social Sciences Citation Index), dated up to January 2021. After screening, 565 articles related to “Google Earth Engine” were obtained.
Most of the analysis is based on CiteSpace, a powerful bibliometric analysis software [43]. CiteSpace can analyze document co-citations, keyword co-occurrences, and cooperative maps, thus enabling the exploration of knowledge base, structural frameworks, and research frontiers in research that is based on GEE. These types of analyses can provide a basis for the subsequent application and development of GEE.
GEE was launched in 2010; however, it was difficult to make use of GEE in scientific research until 2015. As a result, there were few relevant publications or citations made during the period 2010–2015. After 2015, the number of publications and citations increased exponentially, due to the significantly improved interface, data cube, and programming environment.
The journals that published papers on GEE were mainly thematic. A total of 565 papers referencing GEE were published in 121 journals, and the top ten journals accounted for 62% of the total (Figure 1). These ten were Remote Sensing (194), Remote Sensing of Environment (65), the ISPRS Journal of Photogrammetry and Remote Sensing (18), the IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (16), the International Journal of Applied Earth Observation and Geoinformation (12), the International Journal of Remote Sensing (10), the ISPRS International Journal of Geo-Information (10), Water (10), PloS One (9) and Sustainability (8). This means that the journals Remote Sensing and Remote Sensing of Environment accounted for 46% of the total. Of the 121 journals, 68 published only one paper on GEE. Publications on GEE were mostly about remote sensing, geo-information, and the environment.
Figure 1. Journals in which papers related to GEE were published
The number of research disciplines covered by the GEE publications was far fewer. The 565 papers on GEE covered only 35 of the WoS disciplines and were concentrated in fewer disciplines than the GE papers. The research areas with more than 100 publications were environmental sciences and ecology (365), remote sensing (363), and imaging science and photographic technology (129). These three research areas alone accounted for 70% of the total (Figure 2).
Figure 2. Research disciplines in which GEE was applied
According to the analysis of keywords in the literature on GEE (Figure 3), it was found that “Remote sensing”, “Regions”, and “Landsat Satellite” appeared most frequently—203, 197, and 189 times, respectively. GEE has been used to collect various types of open-source geographic information, including remote sensing data, data on land-surface conditions, and other relevant derived products that are freely accessible to users. However, there are several limitations on the use of datasets in GEE: e.g., the data sets stored in GEE are not comprehensive (such as complex SAR is not included), and there is a limit to the amount of user-uploaded data that can be stored. From Figure 3, it can be seen that the most frequently occurring keywords include “Water”, “High Resolution”, “Management”, “Random Forest”, “Algorithm”, “Cloud Platform”, and “Model”, which appear 174, 165, 154, 134, 129, 126 and 124 times, respectively. The machine-learning algorithms in GEE allow for convenient data processing and information extraction. However, there are still some types of image-processing algorithms that are not supported by GEE, such as Gaussian and Laplacian filters, edge-detection methods, and frequency domain algorithms [3]. Due to the advantages that GEE has in processing geographic big data, the use of GEE makes it simpler and more efficient to obtain annual regional geographical information [4].
Figure 3. Keywords related to GEE
Papers describing the application of GEE began to appear in 2015 (Figure 4), and, since then, many keywords have appeared, including “GEE”, “Landsat”, “Land Cover Change”, “Classification”, “Cover”, “Image Classification”, “Area”, “Forest”, “Map”, “Cloud Computing”, “Time Series”, “Remote Sensing”, “MODIS”, “Vegetation”, “China”, “Algorithm”, “Climate”, “Random Forest”, “Sentinel”, and “Dynamics”.
Figure 4. Co-occurrence keywords were used in relation to GEE. The size of the node represents the frequency of the occurrence of the keyword, the connecting lines indicate the co-occurrence relationships for the keyword, and a purple outer circle indicates that the node is a key node (betweenness centrality > 0.1).
Table 1 lists the cited papers that were found to have the strongest citation bursts in GEE articles. GEE is a cloud computing platform that has the capacity to provide and process geographic big data and provides an excellent platform for carrying out global land cover research [44][45][46]. Landsat data are a vital source of satellite imagery that is available on GEE. Roy, et al. [47] evaluated the capabilities of Landsat 8 data and identified new opportunities for scientific research and applications using these data. This paper also laid a solid foundation for the application of Landsat 8 remote sensing data in resource management and global change studies using GEE. An important characteristic of GEE is the availability of geographic big data. Wulder, et al. [48] explained the role of free data in promoting new scientific research, applications, and international collaboration on Earth observation. GEE’s integration of a range of accessible geographic data (including Landsat [47], Sentinel [49], and MODIS [50] data) also promotes progress in Earth system science. Studies on various image processing and geophysical dataset acquisition methods have become the main driving force behind the promotion of the application of GEE [51][52][53][54].
Table 1. GEE papers with the strongest citation bursts
Article |
Author |
Year |
Strength |
Begin |
End |
High-Resolution Global Maps of 21st-Century Forest Cover Change [45] |
Hansen, MC |
2013 |
24.13 |
2015 |
2018 |
Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data [44] |
Gong, P |
2013 |
6.46 |
2016 |
2018 |
Good practices for estimating area and assessing accuracy of land change [46] |
Olofsson, P |
2014 |
5.36 |
2016 |
2019 |
Landsat-8: Science and product vision for terrestrial global change research [47] |
Roy, DP |
2014 |
5.06 |
2016 |
2019 |
Opening the archive: How free data has enabled the science and monitoring promise of Landsat [48] |
Wulder, MA |
2012 |
4.39 |
2016 |
2017 |
Object-based cloud and cloud shadow detection in Landsat imagery [51] |
Zhu, Z |
2012 |
4.39 |
2016 |
2017 |
Development of gridded surface meteorological data for ecological applications and modelling [52] |
Abatzoglou, JT |
2013 |
3.7 |
2015 |
2018 |
Automated Water Extraction Index: A new technique for surface water mapping using Landsat imagery [53] |
Feyisa, GL |
2014 |
3.65 |
2018 |
2019 |
Sentinel-2: ESA’s Optical High-Resolution Mission for GMES Operational Services [49] |
Drusch, M |
2012 |
3.09 |
2015 |
2017 |
Multitemporal settlement and population mapping from Landsat using Google Earth Engine [54] |
Patel, NN |
2015 |
2.75 |
2015 |
2017 |
At the institutional level, Chinese scientific institutions have dominated the application of GEE. The top ten institutions were the Chinese Academy of Sciences (102), the University of Chinese Academy of Sciences (53), the United States Geological Survey (23), Tsinghua University (20), Google Incorporated (18), the National Aeronautics and Space Administration (17), Wuhan University (17), the University of Oklahoma (15), the United States Forest Service (13), and the China University of Geosciences (12). The number of published papers from Chinese scientific institutions was far greater than that from institutions in other countries.
At the national level, the number of publications was more concentrated. The top five countries published 526 GEE papers (including duplicates), which accounted for 57.9% of the total number of published papers. The top five countries, in this case, were the USA (229), China (203), the United Kingdom (43), Australia (31), and Canada (31). In summary, the United States and China have dominated the application of GEE, and have published a considerable number of related papers, as compared to other countries. According to Nature Index (https://www.natureindex.com/ ), the United States and China have the largest number of scientific institutions (2637 and 1485, respectively) and published articles (29,207 and 19,084, respectively in 2020) in high-quality peer-reviewed journals. These top-ranking countries produce high-quality research output and participate in high-quality collaboration, which supports our results. Although the United States is not as prominent in terms of the number of published papers at the institutional level, it has many high-level scientific institutions and is important in terms of the number of published papers at the national level.
Cloud computing has emerged as a popular research topic in recent years. It was widely applied to geospatial big data, and a large number of remote sensing cloud computing platforms were launched. AWS is a comprehensive and widely adopted cloud platform that was developed earlier than GEE and provides a large number of application program interfaces (APIs) [3][55]. Microsoft Azure is a platform contemporary with GEE that provides machine-learning services [3][4]. AWS and Microsoft Azure are both pay-as-you-go platforms [3]. NASA Earth Exchange (NEX) works in a virtual environment to ensure maximum ease-of-use and reproducibility [56]. GEE provides a larger number of geospatial data sets than all of these platforms. Although the API of GEE includes only JavaScript and Python, it is able to meet the needs of most users [55]. Gomes et al. [57] made detailed comparisons of other cloud computing platforms, such as Sentinel Hub (SH), Open Data Cube (ODC), and OpenEO. They believed that GEE is an easy-to-use and mature remote sensing cloud computing platform, and that the data abstraction, physical infrastructure abstraction, processing scalability, and storage scalability provided by GEE stand out as advantages over existing platforms. ODC is currently the best available solution for geospatial big data and has significant advantages over GEE in terms of open governance, infrastructure replicability, data access interoperability, and extendibility. Due to the ease-of-use and maturity of GEE, it was used more widely in Earth system science than other platforms (with the number of relevant articles retrieved from WoS being 14 for AWS, 3 for Microsoft Azure, 34 for NASA Earth Exchange, 5 for SH, and 3 for ODC).
GEE is a advanced platform that has stronger cloud computing power (Table 2). Although GEE is not a virtual Earth, the technologies available in GEE can support the development and improvement of the virtual Earth. The main part of GEE (the Earth Engine (EE) Code Editor) is a web-based interactive development environment (IDE) for the Earth Engine JavaScript API, which develops complex geospatial workflows. At present, GEE supports a web APP (EE Explorer), which has limited functionality. In the future, it may be possible to develop an APP that is convenient for all users by, for example, optimizing the visualization of GEE and adding more data interfaces that can accept different types of geospatial data (such as geospatial data collected in real-time). GEE runs calculations as scripts, which is not convenient for users carrying out complex procedures or who require transitional data sets. It would be an improvement if users could control more of the data processing and if there was an interface of modules that could directly combine different tools. It is also important to increase the necessary data and algorithms mentioned in Table 2.
Table 2. Merits and limitations of GEE
Merits |
|
Limitations |
|
GEE has a comprehensive range of functions. It also has the capability to analyze geospatial big data, provides a greater number of geospatial data sets, and supports various image-processing algorithms. Still, the visualization capability of GEE is not an advantage. GEE’s modeling capabilities (simulation of complex geographical/ecological processes, such as urban expansion modeling and vegetation dynamic modeling) are limited. It also lacks complex SAR and high-resolution geospatial data sets and the ability to perform optimization. Complex algorithms are difficult to implement, and optimization algorithms are implemented locally. At present, the development of GEE is still in its infancy. Nonetheless, it is undeniable that GEE has the potential to become an effective tool for use in Earth system science. Significant improvements in the supporting technology are needed, however. The addition of more computing resources, more open-source data sets, more image-processing algorithms, and the contributions of more Earth system scientists will allow GEE to become an effective tool that combines monitoring, modeling, analysis, assessment, and management decisions that can be applied to solve global environmental problems.