Global Isotopic Hydrograph Separation: Comparison
Please note this is a comparison between Version 2 by Camila Xu and Version 1 by Yunlong Yu.

IHS is a novel research field that has shown a great shift in separating runoff into pre-event and event water throughout its production lifetime.

  • isotope hydrology
  • text mining
  • bibliometric indicator
  • web of science

1. Introduction

Similar to all scientific fields, the study of IHS has changed over time. Interests in certain themes have emerged and disappeared, and the breadth and depth of research content have expanded [10]. The most significant advances were made around the five well-known assumptions implicit in IHS models [9]. The five underlying assumptions are listed as follows: (1) significant isotopic differences occurred between the pre-event (old) and event (new) water; (2) the new water retains a constant isotopic signature in space and time, or any variations can be accounted for; (3) the isotopic signature of the old water is constant in space and time, or any variations can be accounted for; (4) contributions of water from the vadose zone must be negligible, or the isotopic content of soil water must be similar to that of groundwater; and (5) the contributions from surface storage to streamflow are negligible. Many IHS studies have concentrated on the validity and effects of these assumptions in recent decades [2,11,12,13,14,15,16,17]. Tracing IHS literature by quantitative approaches is interesting and vital for enhancing the understanding of runoff generation and relevant hydrological processes.

In 1969, Hubert et al. [1] published their pioneering paper on hydrograph separation using stable isotope tracers. Since then, isotope hydrograph separation (IHS) has gradually become the principal method for determining the relative contributions of different sources of runoff or streamflow (i.e., event or pre-event water) [2,3,4,5]. Buttle [6] summarized the hydrological processes, including groundwater ridging, lateral throughflow, preferential flow, saturation overland flow, kinematic waves, and output from surface storage, which significantly improved the knowledge of the mechanisms of runoff generation. In 2013, Klaus and McDonnell [7] published a qualitative and comprehensive review of IHS and concluded that, despite certain limitations, water stable isotopes were still the most effective technology for understanding runoff generation processes and mechanisms. Moreover, several IHS reviews have focused on streamflow generation and associated processes and methods in special environmental settings, which has accelerated the accumulation of IHS studies [8,9]. Since 2013, there has been a rapid increase in scientific output in the field, which provides an unprecedented opportunity to explore the dynamics of IHS research based on data from a large body of published scientific work. However, to our knowledge, there has not been a quantitative assessment of the publication data in this field, and there are limited documents on text mining and relevant data science methods applied in hydrology research. Utilizing a quantitative and systemic method, we can analyze the characteristics and performances of documents to gain both macro- and micro-insights into the history and development of the field.

Text mining is the process of automatically extracting high-quality information from unstructured or structured texts with diverse formats and types using linguistic and statistical techniques [18,19]. Thematic analysis is an important subject of textual data mining. Studies have found that temporal changes in themes, usually represented by keywords for a given research domain, can help to uncover the evolution of a topic and its trend [20,21,22]. For example, Yao et al. [23] analyzed the frequency changes in author keywords related to nitrogen in eutrophic lakes or reservoirs at five-year intervals and successfully determined the current and future trends in nitrogen field research. Chen et al. [24] analyzed research trends in management science and engineering in China based on co-keyword analysis and concluded that the foci were game theory, supply chain management, complex networks, data mining, optimization, risk management, and data envelopment analysis. In addition, theme variations based on keyword analysis can also help new researchers to situate themselves and their topics within the field and within changing research interests. Similarly, a systemic analysis of the thematic changes in IHS research is indispensable to understanding how this field evolved over time and predicting future research trends.

Similar to all scientific fields, the study of IHS has changed over time. Interests in certain themes have emerged and disappeared, and the breadth and depth of research content have expanded [10]. The most significant advances were made around the five well-known assumptions implicit in IHS models [9]. The five underlying assumptions are listed as follows: (1) significant isotopic differences occurred between the pre-event (old) and event (new) water; (2) the new water retains a constant isotopic signature in space and time, or any variations can be accounted for; (3) the isotopic signature of the old water is constant in space and time, or any variations can be accounted for; (4) contributions of water from the vadose zone must be negligible, or the isotopic content of soil water must be similar to that of groundwater; and (5) the contributions from surface storage to streamflow are negligible. Many IHS studies have concentrated on the validity and effects of these assumptions in recent decades [2,11,12,13,14,15,16,17]. Tracing IHS literature by quantitative approaches is interesting and vital for enhancing the understanding of runoff generation and relevant hydrological processes.

Bibliometric analysis is a popular, powerful, and systematic way to analyze the performance of scientific production using mathematical and statistical methods [25,26]. The results of bibliometric analyses can provide objective views of scientific productions in a given research field and provide effective support for the subjective perceptions of researchers. Many research fields have applied this methodology to assess and predict scientific productivity, development, and future trends [27,28]. Padilla et al. [29], for example, conducted a bibliometric analysis of global nitrate leaching publications and found an overriding interest in recent decades on the theme of soil nitrogen loss in agroecosystems. Moreover, many different bibliometric indicators can be used to evaluate the literature characteristics as well as thematic influences, such as the number of documents, number of citations, and rank [30]. A quantitative analysis of bibliometric indicators can reveal the most cited papers, hot issues, and other valuable information in a given research field, which can help scientists to quickly develop an understanding of the research situation in their field [31,32]. Therefore, an analysis of the performance of themes with bibliometric indicators is of vital importance in understanding the global state of IHS.

Text mining is the process of automatically extracting high-quality information from unstructured or structured texts with diverse formats and types using linguistic and statistical techniques [18,19]. Thematic analysis is an important subject of textual data mining. Studies have found that temporal changes in themes, usually represented by keywords for a given research domain, can help to uncover the evolution of a topic and its trend [20,21,22]. For example, Yao et al. [23] analyzed the frequency changes in author keywords related to nitrogen in eutrophic lakes or reservoirs at five-year intervals and successfully determined the current and future trends in nitrogen field research. Chen et al. [24] analyzed research trends in management science and engineering in China based on co-keyword analysis and concluded that the foci were game theory, supply chain management, complex networks, data mining, optimization, risk management, and data envelopment analysis. In addition, theme variations based on keyword analysis can also help new researchers to situate themselves and their topics within the field and within changing research interests. Similarly, a systemic analysis of the thematic changes in IHS research is indispensable to understanding how this field evolved over time and predicting future research trends.

Based on the techniques of text mining and bibliometric analysis, this study analyzed the evolving history and future trends of IHS research from a global perspective by using the Web of Science (WoS) database. The aims of this study were to (1) quantitatively summarize the characteristics of yearly publication and explore the development nature of IHS; (2) mine and assess IHS themes in different subperiods and reveal how these themes enhance our understanding of hydrological processes related to IHS, and (3) discuss the upcoming research trends and provide a certain guide for future IHS research.

2. Materials and Methods

We extracted various performance indicators from 392 downloaded IHS recodes to conduct a bibliometric analysis that included frequency calculation, co-word analysis, and science mapping. Science mapping is a spatial representation of how disciplines, domains, specialities, and individual research units relate to one another [35]. Frequency calculations are widely used in bibliometric analyses to investigate the influence of research units, such as publications, authors, and countries/regions, by counting the number of documents and total citations (TC) or global citation scores (GCS), local citation scores (LCS), and themes based on counting words of interest [28]. LCS is the number of citations of one document by other documents in the collected dataset, while TC or GCS is the number of citations of one document by the documents in the WoS core collection [23]. More information on the relevant bibliometric indicators in this study is shown in

Bibliometric analysis is a popular, powerful, and systematic way to analyze the performance of scientific production using mathematical and statistical methods [25,26]. The results of bibliometric analyses can provide objective views of scientific productions in a given research field and provide effective support for the subjective perceptions of researchers. Many research fields have applied this methodology to assess and predict scientific productivity, development, and future trends [27,28]. Padilla et al. [29], for example, conducted a bibliometric analysis of global nitrate leaching publications and found an overriding interest in recent decades on the theme of soil nitrogen loss in agroecosystems. Moreover, many different bibliometric indicators can be used to evaluate the literature characteristics as well as thematic influences, such as the number of documents, number of citations, and rank [30]. A quantitative analysis of bibliometric indicators can reveal the most cited papers, hot issues, and other valuable information in a given research field, which can help scientists to quickly develop an understanding of the research situation in their field [31,32]. Therefore, an analysis of the performance of themes with bibliometric indicators is of vital importance in understanding the global state of IHS.

2. Publication and Keyword Performance

The primary information and statistics regarding the analyzed IHS collection are reported in

Table 1 . In this study, the primary bibliometric index of assessing the influence of research activities was LCS, followed by the number of publications, then others.

The main concepts of a specific field could be discovered using co-word analysis. Moreover, co-word analysis is a powerful method to discover and describe the interactions between different research topics [38]. The methodological foundation of co-word analysis is the idea that the co-occurrence of keywords describes the contents of the documents in a file [42]. An equivalence index and clustering were applied to measure the keyword co-occurrence matrix [42,43] ( Tabl2

. The 392 IHS documents published in 87 sources were written by 1138 authors from 453 institutions in 53 countries. The numbers of TC and average citations per document were 12,413 and 31.67, respectively. The annual growth rate was, on average, 11.3%. Figure 1

). In this study, co-word analysis, including the co-occurrence and clustering of keywords, was used to detect themes through each subperiod. It was possible that there was more than one theme in one paper.

Then, two indicators, Callon’s centrality and Callon’s density, can be used to measure the performance of the themes in each subperiod [42] (

2 shows the yearly output of IHS documents published from 1986 to 2019. The output presented obvious periodical characteristics in terms of the mean value of yearly publications. The whole period could be divided into three subperiod based on the significant changes in average publication output. Before 2000 (subperiod I), the literature output grew slowly, with only 50 documents in 13 years and an average annual output of only 3.6. From 2000 to 2014 (subperiod II), the annual volume of documents increased, with an average production of 11.7 documents per year. The most influential literatures were published in subperiods I and II (

Table 1 ). Callon’s centrality is an indicator of the importance of a theme across a full set of publications, while Callon’s density is an indicator of the theme’s development. Then, a strategic diagram was applied to show the themes with different Callon’s centrality ( x -axis) and density ( y -axis) [37] (

3). During 2015 and 2019 (subperiod III), there was a significant upward trend in the number of cumulative publications, increasing from 225 in 2014 to 392 in 2019. The mean output of 33.4 in this subperiod was much higher than that in the other two subperiods. No important literature was published in subperiod III, but this may be due to delayed citing. The number of cumulative documents grew from 1986, following a quadratic function in the form of y = 157,000 − 158x + 0.04x2 (adjusted R2 = 0.988, p < 0.001), indicating that there is a generally increasing trend in IHS publications.

Figure 1 ). The themes with high density and high centrality were named motor themes (important and developed topics that show strong links with other themes in other quadrants), which were located in quadrant I; those with low density and high centrality were developed and isolated themes, which were located in quadrant II; those with low density and low centrality were emerging or declining themes, which were located in quadrant III; and those with low density and high centrality were basic and transversal themes (focusing on general issues that were transversal to the different research areas of a domain), which were located in quadrant IV. Then, a strategic diagram was constructed to assess the role of each theme in each subperiod qualitatively. Severe bibliometric indicators were used to quantify the impact of themes in different subperiods. Finally, a Sankey diagram was applied to present how different themes were connected and developed over previous decades. Thematic analysis and visualization, supplemented by the interpretation of important documents, help us clearly capture the details of IHS development and enhance our understanding of hydrological processes in streamflow generation.

All analyses were performed using R 4.0.2 software (R Core Team, 2020). The bibliometric and thematic analysis was mainly carried out using an open-source R package named the bibliometrix package built by Aria and Cuccurullo [44].

3. Results

To further explore the topics of IHS, the co-word matrix and clustering methods were applied to determine the themes in different subperiods. The themes of different subperiods using strategic diagrams are presented in Figur2.

The publication output of IHS between 1986 and 2019. The transverse dashed lines present the mean values of the number of documents per year in each subperiod.
Table 4

a–c. Moreover, their evolution paths from subperiod I to subperiod III are clearly shown using a Sankey diagram in Figure 4 d.

2. Principal information about the 1986–2019 IHS collection.
DescriptionResults
Total documents392
10.1177/030913339401800102Timespan1986:2019
Annual growth rate
Countries
53

Table 5 shows the top 10-most frequent keywords of each theme. The bibliometric indicators in

3. Twenty most influential papers based on LCS.
PaperDigital Object Identifier (DOI)LCSGCSLCS Per YearGCS Per Year
1182974.411.0
Klaus J, 2013, J Hydrol (IF: 5.722) [7]11.28%
10.1016/j.jhydrol.2013.09.006Total citations (TC)12,413
Average citations per document31.67
Buttle JM, 1994, Prog Phys Geog (IF: 3.580) [6]9222111.527.6
Hooper RP, 1986, Water Resour Res (IF: 5.240) [45]10.1029/WR022i010p01444882522.57.2
Wels C, 1991, J Hydrol [46]10.1016/0022-1694(91)90181-G601392.04.6
Laudon H, 2002, Water Resour Res [47]10.1029/2002WR00151051802.74.2Sources87
Brown VA, 1999, J Hydrol [3]10.1016/S0022-1694(98)00247-9461872.18.5Authors
Taylor S, 2001, Water Resour Res [1138
13]10.1029/2000WR900341441292.26.5Institutions453
Ogunkoya OO, 1993, J Hydrol [48]10.1016/0022-1694(93)90005-T43931.53.3
Kong YL, 2012, J Hydrol [49]10.1016/j.jhydrol.2012.02.02941784.68.7
Hinton MJ, 1994, Water Resour Res [50]10.1029/93WR03246391101.44.1
Laudon H, 1997, J Hydrol [51]10.1016/S0022-1694(97)00030-938851.63.5
Ladouche B, 2001, J Hydrol [52]10.1016/S0022-1694(00)00391-7381361.96.8
Taylor S, 2002, Hydrol Process (IF: 3.565) [53]10.1002/hyp.123238692.03.6
Shanley JB, 2002, Hydrol Process [54]10.1002/hyp.31235941.84.9
Lyon SW, 2009, Hydrol Process [55]10.1002/hyp.732635562.94.7
Liu YH, 2008, J Hydrol [56]10.1016/j.jhydrol.2008.02.01733622.54.8
Mcdonnell JJ, 1991, Water Resour Res [4]10.1029/91WR02025311261.04.2
Bazemore DE, 1994, J Hydrol [57]10.1016/0022-1694(94)90004-3311371.15.1
Unnikrishna PV, 2002, J Hydrol [58]10.1016/S0022-1694(01)00596-028661.53.5
Weiler M, 2003, Water Resour Res [59]10.1029/2003WR002331281461.68.1
IF: Impact Factor of 2020 Journal Citation Reports.
Two papers published before 1991 were excluded from the textual analysis because there were no keywords in these documents.

Table 6 were used to measure the influence of each theme in different subperiods.

IHS publications in subperiod I were defined by six themes (

4 presents the number of publications and keywords per period. The number of keywords used per year was 17.6, 42.1, and 125.8 in subperiods I, II, and III, respectively, showing a noticeable expansion in research topics over time. There were 158 keywords in all of subperiod I, including ‘runoff’ (19), ‘storm’ (19), ‘groundwater’ (17), ‘catchment’ (16), and ‘flow’ (14), located in the center of the word cloud of subperiod I (

Figure 4 a). ‘ Capillary fringe’, ‘forest’, and ‘acidification’ (frequency: 52, Tabl3

a). There was a significant increase to 632 keywords in subperiod II. ‘Catchment’ (99) rose in rank from the fourth to first position (in the center of the word cloud of subperiod II), and ‘soil’ rose from the thirteenth (6) to third position (51), which was close to the ‘catchment’ (Figure 5

) were the motor themes of IHS, indicating that publications containing these keywords were more important in the research domain. Moreover, ‘catchment’ that linked with ‘runoff-generation’ and ‘deuterium’ showed a motor theme, which showed high centrality and high density and thus represented the processes and mechanisms of streamflow generation at the catchment scale. ‘Storm’/‘runoff’/‘groundwater’ (frequency: 132) appeared as a general theme with the highest centrality, the greatest number of publications (38), and the highest LCS (62, Table 6 ). ‘Stable-isotope’/‘recharge’/‘tropical’ (publications: 5; LCS: 0) in quadrant II were isolated themes regarding tropical climates. One specialized theme, ‘oxygen-stable-isotope’/‘nitrate’/‘temperate’ (publications: 8; LCS: 1), had low density and the lowest centrality. These themes had little influence on IHS research, with few publications and low LCS. However, the final theme for subperiod I linked with ‘model’/‘tracer’/‘soil’ attained certain attention, with 15 articles and 17 LCSs.

In subperiod II (

3b). There were 629 keywords in subperiod III, including ‘stable-isotope’ (99), ‘catchment’ (90), ‘groundwater’ (61), ‘runoff’ (60), ‘precipitation’ (59), ‘river’ (55), and ‘basin’ (51) (in the center of the word cloud of subperiod III (

Figure 4 b), eight principal topics emerged. The theme ‘runoff’, ‘soil’, and ‘storm’ (frequency: 298, Table 5 ) developed mainly from ‘model’/‘tracer’/‘soil’ and ‘storm’/‘runoff’/‘groundwater’ in subperiod I and shifted to quadrant I as a motor theme with a significant increase in the number of publications (117) and LCS (264) ( Table 6 ). ‘Stable-isotope’, which was inherited from the previous ‘stable-isotope’ and ‘oxygen-stable-isotope’, moved to quadrant IV as a basic theme, with the largest centrality and lowest density and was especially linked with ‘groundwater’ and ‘precipitation’. This theme had the second-largest LCS (401), indicating that it was cited widely by other themes. ‘Catchment’ and ‘runoff-generation’ had changed to a basic theme in quadrant IV with the largest publication (156) and LCS (425), which came from five themes in subperiod I. A transversal theme in this subperiod includes ‘chemistry’, ‘mountain’, and ‘nitrogen’ (publication: 78; LCS: 123). A new theme named ‘hillslope’/‘streamflow’/‘preferential flow’ was located in quadrant III as an emerging theme with a certain number of publications (53) and LCS (41), which consisted of few terms in subperiod I. Some studies focused on ‘variation’/‘end-member mixing analysis’/‘China’ (publications: 22) with high density scattered in quadrant II as a marginal and new theme together with ‘natural-tracer’ (publications: 5). ‘Model’ shifted to quadrant II with relatively high centrality and approached quadrant I, and its focus changed from ‘soil’ to ‘snowmelt’ in IHS. In general, many new contents sprang up in this subperiod.

Subperiod III was characterized by seven clusters, all of which developed from the themes of subperiod II (

3c)). Some keywords, such as ‘variation’ and ‘spatial’, merely appeared in subperiods II and III, and the sum frequency of the two terms increased from subperiod II to subperiod III.

Figure 4 c,d). Three themes, ‘stable-isotope’/‘precipitation’/‘river’, ‘catchment’/‘runoff’/‘runoff-generation’, and ‘groundwater’/‘flow’/‘recharge’, consolidated their roles as basic themes with high centrality and low density. The number of publications and LCS of these themes were much greater than those of other themes in this subperiod (

3. Word cloud of the most widely used IHS keywords in each subperiod. The size is proportional to the frequency of each keyword. The larger the word size is, the more important it is. For highlighting the important keywords, the keywords with high frequencies and large sizes locate in the center of the word could. (a) Subperiod I, (b) Subperiod II, (c) Subperiod III.

Table 6 ). More publications (68) focused on ‘variation’ in regions and/or countries other than China, with increasing centrality compared with subperiod II, and linked to ‘chemistry’ and ‘distribution’. In addition, a new, similar theme was ‘source’/‘uncertainty’/‘mass balance’ with publications (45) and LCS (45), which combined the partial information of six themes from subperiod II. ‘Transport’/‘event’/‘sediment’ in quadrant II, almost from ‘catchment’/‘tracer’/‘runoff-generation’ in the previous subperiod, appeared as peripheral themes with 25 publications and 4 LCSs. The final theme was ‘nitrogen’/‘three component’/‘shallow’ (publications: 18; LCS: 1), which could be considered a marginal theme mainly from the theme ‘chemistry’/‘mountain’/‘nitrogen’ in subperiod II.

4. Conclusions

IHS is a novel research field that has shown a great shift in separating runoff into pre-event and event water throughout its production lifetime. Although IHS has developed over several decades, many questions and challenges remain in the generation of runoff. This study used text mining and bibliometric analysis to present a quantitative and systematic review of IHS documents published from 1986 to 2019, thus tracing the evolution of the field. The number of IHS documents has increased rapidly in recent years, but the total sum of documents remains relatively small. The most influential documents in the field were published in subperiod I and subperiod II. Significant improvements can still be made. The topic analysis indicated that the contributions and mechanisms of precipitation, groundwater, and soil water to runoff were the core and base topics in the IHS field. Most IHS studies were carried out in watersheds covered with forest, glaciers, and snow. How to clearly characterize the process of runoff generation, the effects of spatial-temporal heterogeneity of precipitation and groundwater, the assessment of the uncertainty of the IHS result, the contribution of soil water in arid and semi-arid areas, and the verification and application of the new IHS methods are future trends in IHS research.

The study had some limitations caused by the method. First, documents published before 1986 were not included in our dataset. Second, it is possible that some IHS documents are not included in the WoS core database or that we did not retrieve some papers. A possible solution to overcome these limitations is the use of more sources and the retrieval of a larger number of IHS search terms. In general, however, this study has provided novel insights into the research characteristics and evolution of topics in the field of IHS. Moreover, the methods described in this study have great value for all research fields and thus have broad applicability.

4. Principal information in each subperiod.
SubperiodPublicationPercentage of Total Publications (%)KeywordsPercentage of Total Keywords (%)Keywords Per Year
1991–19994411.515811.117.6
2000–201417345.163244.542.1
2015–201916743.562944.3125.8