Journal topic
Hydrol. Earth Syst. Sci., 22, 6087–6108, 2018
https://doi.org/10.5194/hess-22-6087-2018
Hydrol. Earth Syst. Sci., 22, 6087–6108, 2018
https://doi.org/10.5194/hess-22-6087-2018

Cutting-edge case studies 28 Nov 2018

Cutting-edge case studies | 28 Nov 2018

# Understanding the water cycle over the upper Tarim Basin: retrospecting the estimated discharge bias to atmospheric variables and model structure

Understanding the water cycle over the upper Tarim Basin: retrospecting the estimated discharge bias to atmospheric variables and model structure
Xudong Zhou1,2, Jan Polcher2, Tao Yang1, Yukiko Hirabayashi3, and Trung Nguyen-Quang2 Xudong Zhou et al.
• 1State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, Hohai University, 210098, Nanjing, China
• 2Laboratoire de Météorologie Dynamique du CNRS, IPSL, École Polytechnique, 91128, Paris, France
• 3Department of Civil Engineering, Shibaura Institute of Technology, Tokyo, Japan

Correspondence: Xudong Zhou (xudong.zhou@lmd.polytechnique.fr)

Abstract

The bias in atmospheric variables and that in model computation are two major causes of failures in discharge estimation. Attributing the bias in discharge estimation becomes difficult if the forcing bias cannot be evaluated and excluded in advance in places lacking qualified meteorological observations, especially in cold and mountainous areas (e.g., the upper Tarim Basin). In this study, we proposed an Organizing Carbon and Hydrology In Dynamic EcosystEms (ORCHIDEE)-Budyko framework which helps identify the bias range from the two sources (i.e., forcing and model structure) with a set of analytical approaches. The latest version of the land surface model ORCHIDEE was used to provide reliable discharge simulations based on the most improved forcing inputs. The Budyko approach was then introduced to attribute the discharge bias to two sources with prescribed assumptions. Results show that, as the forcing biases, the water inputs (rainfall, snowfall or glacier melt) are very likely underestimated for the Tarim headwater catchments (−43.2 % to 21.0 %). Meanwhile, the potential evapotranspiration is unrealistically high over the upper Yarkand and the upper Hotan River (1240.4 and 1153.7 mm yr−1, respectively). Determined by the model structure, the bias in actual evapotranspiration is possible but not the only contributor to the discharge underestimation (overestimated by up to 105.8 % for the upper Aksu River). Based on a simple scaling approach, we estimated the water consumption by human intervention ranging from 213.50×108 to 300.58×108 m3 yr−1 at the Alar gauge station, which is another bias source in the current version of ORCHIDEE. This study succeeded in retrospecting the bias from the discharge estimation to multiple bias sources of the atmospheric variables and the model structure. The framework provides a unique method for evaluating the regional water cycle and its biases with our current knowledge of observational uncertainties.

1 Introduction

A failure of discharge estimation can easily happen to a researcher especially when exploring a new region. It is often attributed to model inapplicability to the region, and tuning the model parameters is a common way to eliminate the discharge bias . However, a hidden assumption is often ignored that the atmospheric variables (or named here forcing) are essentially correct, while it may fail in some regions . Without knowing the bias in forcing, the calibration becomes meaningless if the model parameters are tuned to values that are far from their physical meaning . Thus, an important step before applying a model to a new region is to understand where the bias sources lie and their relative relations .

In situ measurements are considered most reliable sources for atmospheric variables and thus can be used to evaluate or further correct the variables used to drive hydrological models . However, larger uncertainties are still found in mountainous and arid areas due to the poor representativity of in situ observations . For instance, the precipitation (P) over the mountainous area is mostly underestimated due to rare observations at high altitude and orographic effects . A total of 20.2 % of the precipitation is underestimated over global orographically affected regions according to . Arid areas receive less water input but have larger relative uncertainties in precipitation , which are crucial to regional runoff (R) generation. Meanwhile, the energy flux over arid regions varies significantly. Thus the potential evapotranspiration (PET) and the actual evapotranspiration (ET) are quite uncertain over those areas ; in the meanwhile, the PET and ET are variables unable to directly measure for a basin. Investigation of their errors and relations based on the model simulation becomes necessary.

Figure 1The major rivers and the glacier distribution in the Tarim Basin. The upper Yarkand River catchment is defined by hydrological gauges 1 (JK) and 2 (KQ), the upper Hotan River catchment is defined by hydrological gauges 3 (TGZLK) and 4 (WLWT), and the upper Aksu River catchment is defined by hydrological gauges 5 (SLGLK) and 6 (XHL). The upper Tarim Basin is defined by hydrological gauge 7 (Alar).

Model efficiency needs to be verified firstly. The model performance is generally evaluated on the agreement of a single variable, and discharge is the most commonly used as it is the result of all the water–energy processes. It reveals the accuracy of the whole system while also accumulating all the errors from the forcing and the model. Therefore, a multivariable analysis based on the relation between variables is needed for overall evaluation . These relations represent typical climatic and regional characteristics; i.e., the aridity index (PET P) reveals the energy and water input over a specific region , and the ratio of evapotranspiration to precipitation (ET P) is relevant to the land cover conditions . The Budyko hypothesis is a widely accepted empirical relation between ET P and PET P (Budyko1974). The shape of the optimal Budyko curve reflects local climatic and underlying characteristics . Hence with a Budyko curve derived from land surface model simulations, biases of the water–energy components (P, ET or PET) can be assessed. For example, quantified the precipitation bias in orographically affected areas using the Budyko hypothesis, although their work attributed all the bias in discharge simulation to the forcing with an incorrect assumption that their model was perfect.

Most of the hydrological models, with either a lumped or a distributed concept, are dependent on the calibration. Because of the assumption that water input (P) is correct and a very crude description of energy processes, the ET is the variables which can be adjusted to meet the water mass balance. Most of the bias is therefore assumed to derive from the ET. It may mislead the relations between PET P and ET P, which represents the climatic and regional characteristics . Land surface models (LSMs) are almost independent of the calibration process, with most of the model parameters obtained from multiple maps (e.g., land cover, land use, soil textures). Based on their substantial physically based modules, the LSMs have been widely used to estimate most of the components in the continental water cycle . Although the LSMs do not necessarily provide good estimates of discharge , they prevent all the bias being revised by calibration. In return, the modeled discharge bias can reveal the biases related to model input or model structure, which have not attracted enough attention.

The Tarim Basin – located in the northwest of China, central Asia; surrounded by high mountains; and punctuated by oases in the center of deserts – is a region integrating the mountainous, arid and cold characteristics in different parts . The precipitation is mainly distributed over the upper mountainous area around the boundaries, and the snowmelt and glacier melt are major contributors to the local water resources . However, the meteorologic observations on the water input components are sparse, and the gauges are not representative because the surface conditions are heterogeneous especially in the mountainous area . In the lower oases, intensive irrigation is developed, which is hugely reliant on the discharge flowing from the headwater catchments , causing considerable changes in the natural river discharge downstream of the area . In reality, human intervention is very difficult to model as it is policy related and because of the lack of an efficient dataset. The anthropogenic effects on the water cycle, accompanying the climatic and topographic characteristics, make the Tarim one of the most challenging places to apply land surface models.

There are three major steps in this study, Firstly, we generated a best-possible forcing dataset for the Tarim domain which reduces as far as possible the biases using refined data sets. The refined forcing then drove an improved land surface model (Organizing Carbon and Hydrology In Dynamic EcosystEms: ORCHIDEE) to obtain the improved discharge estimations. Secondly, the estimated discharge was compared with in situ discharge observations (Sect. 4.1), and the evidence of their bias analyzed (Sect. 4.2). In the third step, the possible bias sources from the forcing and model structure were qualified with the Budyko hypothesis (Sect. 4.3), and their possibilities are discussed in Sect. 4.4. The model bias due to ignorance of human intervention is estimated based on the bias analysis over the headwater catchments in Sect. 4.5.

2 Study area and hydrometeorological characteristics

The Tarim Basin locates in the northwest of China, surrounded by the Kunlun Mountains in the south, the Tienshan Mountain in the north and the Pamirs Plateau in the west (Fig. 1). Its U-shaped terrain blocks the westerly atmospheric water vapor transport that leads to relatively low precipitation inside the basin . As simulated by , 63 % of the water vapor enters Tarim through the eastern passway, but it only happens in summer, contributing around 54 % of the total annual precipitation, leading to a strong seasonality in precipitation . Combined with the glacier melt during warm summer, 70 % of the annual discharge is concentrated in the period from June to October . The high seasonality implies a high risk of water deficit in dry seasons and endangers the ecosystem along the rivers , which requires human regulation to allow for efficient agriculture.

Despite the high interannual variability, the precipitation is heterogeneously distributed due to orographic effects . It ranges from 200 to 500 mm yr−1 in the mountainous area, while there is less than 50 mm in the central Tarim . The mountainous glacier melt/snowmelt occurs in the same place where precipitation is generated. The mountainous area contributes almost all the river runoff of the Tarim Basin, while the plains of Tarim contribute little to the water resource of the main Tarim . In the upper Tarim, only three water systems (the Yarkand, the Hotan and the Aksu rivers; see Fig. 1) have natural hydraulic connections to the mainstream – the Tarim . The water originating from the mountains flows through the oases where people live and allows intensive agriculture . As a consequence, a large proportion of the water is extracted for human utilization in the oases, so that only 8.7 %, 43.4 % and 30.6 % of the discharge from headwater regions of the Yarkand, Hotan and Aksu rivers can finally reach Tarim mainstream, respectively (Zhou et al.2000). The Kaxgar River is another major tributary of the Tarim, but it has already dried up before water could reach the mainstream due to natural evaporation/leakage and human intervention. Of all the water consumption, agriculture irrigation accounts for more than 95 % in the Tarim Basin ; hence the dominant human influence in the Tarim Basin is considered to be the irrigation influence.

There are 11 665 glaciers with a total area of 19 878 km2 and a volume of 2313 km3 distributed over the Tarim . The glacier melt is a critical contributor to the local water resource. According to and , the estimated glacier melt accounts for around 40 % of the total river runoff for the whole Tarim. However, due to the climate change, a large number of the glaciers were in retreat during the last 40 years (1960s–2001). In the upper Tarim, the Yarkand River has suffered the most significant glacier area changes (−205 km2) with a relative proportion of −6.1 %. The most significant retreat rate (−7.9 %) in glacier area occurs on the Pamirs Plateau in the west . All the changes in glaciers will result in the alteration in the river discharge and also the human interactions.

3 Data and models

## 3.1 Data and simulation description

### 3.1.1 River discharge observations

River discharge is a very reliable and integrated observation of the continental water cycle which is always used as a validation variable . Over the headwater catchments, there are 13 hydrological gauges recorded in the Hydrological Yearbooks of China issued by the Ministry of Water Resources, though only six gauges are selected with consideration of their locations and data completeness. Two gauges (1, 2) are in the upper Yarkand River, two (3, 4) are in the upper Hotan River, and two (5, 6) are in the upper Aksu River, while no gauge is found on the Kaxgar River. For the gauges in headwater catchments, the discharge is considered free of human intervention, i.e., irrigation or reservoir regulation, so to a large extent they represent the natural environment . This facilitates model validation. Moreover, on the mainstream of Tarim, one gauge (7, Alar) was selected at the junction of three upstream rivers (Fig. 1). Different from the headwater gauges, the river discharge at Alar has been significantly altered by human consumption after flowing through the irrigation area . Hence, the observations are no longer natural values but can be used to quantify the influence resulting from human activities.

### 3.1.2 Near-surface atmospheric conditions

Near-surface atmospheric conditions are crucial to hydrological responses . However, both the model simulation and gridded forcing generated from observations are proven to have large uncertainties where observations are sparse and heterogeneity is strong , i.e., in the arid and mountainous area, so that, in practice, several forcing datasets are always used in parallel to generate an ensemble of climate conditions which hopefully tracks the uncertainties . Alternatively, when possible, regional datasets which contain more information than global datasets are used to move the forcing closer to true values . In this study, several sets of estimated forcing inputs based on WATCH (Water and Global Change; ; ) are developed and then used to drive the land surface model (Fig. 2). The best simulations among them will be used to analyze the accompanying bias with its driving forcing.

A pair of reference forcing datasets are WFD (WATCH Forcing Data, 1958–2001; ) and WFDEI (WATCH Forcing Data methodology applied to ERA-Interim data, 1979–2012; ). They use the same methodologies but have slight differences in the basic data, processing and formatting . In brief, WFDEI is an evolution of WFD where the underlying re-analysis is now ERA-Interim but using the same bias correction methodology. This product has been proven to be superior to WFD. CRU (Climate Research Unit) monthly total precipitation observations were used to bias-correct the precipitation in WFD and WFDEI datasets. However, the WFD uses a previous version CRU TS 2.10 before the CRU TS 3.1 used in WFDEI was released . The two CRU datasets differ in the time period (CRU TS 2.10: 1901–2002; CRU TS 3.1: 1979–2009; ), in the stations used and in the methods employed; see for details. Hence, there are still some differences between the two which could further affect the hydrological responses. The two datasets are named WFD-CRU and WFDEI-CRU for later description, and the time step for all the forcing variables is 3 h.

The CRU datasets were constructed by monthly observations at meteorological stations across the world's continents. The observations were then interpolated to 0.5 longitude–latitude grid cells. Though the CRU compares favorably to some other gridded datasets, it has significant deviations over regions and time periods with sparser observational data . Moreover, because only the monthly total precipitation was used to correct WFDEI and WFD, it cannot improve the temporal variabilities at smaller time steps (i.e., daily or sub-daily). However, the precipitation variations in a short period would result in different hydrological responses even in the condition of the total monthly amount remaining the same .

Hence, in addition, the WFDEI-CRU dataset was further corrected by the gridded daily precipitation data from the China Meteorological Administration (named CMA). The CMA precipitation product compiles 2416 national meteorological monitoring stations over China, using the climatological optimal interpolation method to generate the gridded 0.5 precipitation field from 1951 to 2016 (). PRISM (Parameter-elevation Regression on Independent Sloped Methods; ) was used to lessen the orographic effects . The density of meteorological stations used in the CMA is much higher than that used for CRU. For instance, there are only six gauges over the Tarim Basin in the CRU database, while 39 gauges are recorded in the CMA system so that it can to a certain degree improve the data applicability where precipitation is spatially inhomogeneous compared to CRU datasets. Given that the CMA data provide the daily information, it also improves the temporal variations of precipitation rather than using the total monthly value in CRU datasets. The corrected atmospheric input dataset is hereafter referred to as WFDEI-CMA.

### 3.1.3 Glacier melt dataset

As mentioned in Sect. 2, glacier melt is a vital water input to the Tarim. However, glacier runoff measurement is so difficult for such large regions that model-based estimates of the glacier melt are necessary. In general, a glacier module is not coupled in LSMs . Hence, rather than building a separate glacier module, we use an independent daily glacier melt dataset obtained from the glacier model called HYOGA2 (meaning glacier in Japanese), which has been proven reliable over the globe . HYOGA2 is a temperature-index-based model utilizing an extensive global-scale glacier inventory and has several improvements compared to its first version (HYOGA) in model parameter simulation as well as the temporal extent. More details can be found in the original papers . The glacier melt is added to the rainfall series of WFDEI-CMA as an additional water flux to the system. The melt water hence participates instantaneously in the water cycle without delays such as stores in ice, glacier pack or groundwater recharge beneath the ice being considered. This method was chosen for its simplicity and the lack of knowledge on the details at the transition between the glacier and the soil. Daily values are uniformly distributed over the eight time steps per day of WFDEI. By adding the glacier melt, the fourth new forcing dataset is generated as WFDEI-CCG.

## 3.2 ORCHIDEE land surface model

The land surface model ORCHIDEE (Organizing Carbon and Hydrology In Dynamic EcosystEms) was developed by the Laboratoire de Météorologie Dynamique (IPSL-LMD) . After more than 20 years of development, ORCHIDEE has been validated from the global scale to typical regional cases, e.g., tropical rainforest area (Guimberteau et al.2012), semiarid regions (D'Orgeval et al.2008) and middle-latitude regions (Tallaksen and Stahl2014). Within ORCHIDEE, only SECHIBA (Schematisation des Echanges Hydriques I'Interface entre la Biosphere et I'atmosphere), which represents the energy and water fluxes between land surfaces and the atmosphere, is used in this study. The hydrological module in SECHIBA is based on developments by and . Thirteen types of vegetation are defined , and dynamic leaf area index is computed to generate the interception and transpiration. The vertical soil water movement is represented by diffusion-type equations resolved on a fine vertical discretization (11 levels) and partitioning between infiltration and surface runoff through a time-splitting procedure . The routing is conducted based on a linear reservoir concept through redefined routing units which are different from the atmospheric grids . The ORCHIDEE version used in this study is available at https://forge.ipsl.jussieu.fr/orchidee/ ().

### 3.2.1 Evapotranspiration simulation

On top of precipitation, evapotranspiration and potential evapotranspiration are two important fluxes, and their errors are key to the water cycle modeling. In ORCHIDEE, the evapotranspiration is calculated with energy balance and resistance concepts. The potential evapotranspiration is defined as “the amount of evapotranspiration that would occur if enough water was available at the surface”, as explained in . The PET is computed as the sum of the potential soil evaporation and the potential transpiration from vegetation. For soil evaporation, the diffusive equations are taken with the ratio of the humidity gradient, the aerodynamic resistance and the air density. The virtual surface temperature is used instead of the actual one to compute the saturate humidity, while the virtual surface temperature is calculated through an unstressed surface energy balance. The method has been proven superior to other diffusive methods in the reference paper . The potential transpiration is driven by the potential evaporation between the evaporating surface and the overlying air but is limited by vegetation resistances. The maximal water loss under stress-free conditions is the potential transpiration . The actual evapotranspiration is a function of the potential evaporation but is modeled by a series of resistances (canopy and aerodynamics) of the surface layer. The details of the methods in simulating PET and ET can be found in , and .

Figure 2The flowchart of the five experiments designed for driving the ORCHIDEE in this study. WFD and WFDEI are two basic forcing datasets. The underlined terms with numbers are the five experiments, while the grey arrows represent the development of the forcing compared to their previous ones.

### 3.2.2 Snow and soil freezing scheme

There is on key improvement which has been implemented in the current version of ORCHIDEE, that is, the snow and soil freezing scheme. Snow and soil freezing are two crucial water processes in cold regions; snow covers nearly half of land area , and the frozen soils occupy 55 % to 60 % of the land surface of the Northern Hemisphere in winter . Snow plays an important role in both the energy and water flux as the snow cover is first an insulation which prevents the heat loss from the soil. It also increases the thermal inertia of the surface by adding a new phase change and acts as a moisture reservoir which stores winter precipitation that is released in spring or early summer. In the old ORCHIDEE version, a constant density and very simple heat capacity are applied for the snow. The snowmelt directly feeds the runoff without refreezing, and the snow layer is mixed with the first soil layer so that they are equal in temperature. While in the new snow scheme, the snow layer is defined and separated from the soil layers. The snowpack is represented in three layers which adequately resolve the snow thermal gradients between the top and base of the snow cover. The energy balance and the temperature of the snow body become more realistic. Refreezing of the snowmelt is allowed, which makes the energy changes more reliable. The snow properties are more detailed than before, i.e., the density, albedo and roughness. All the improvements have been validated over France and northern Eurasia and already implemented in the current ORCHIDEE .

Soil freezing impedes water infiltration and drainage, thus leading to changes in hydrological responses . At small scales, the soil freezing alters the soil structure and therefore its water capacity, which has a consequence on the water flux between soil and atmosphere, as well as the water availability for plants . On the other hand, the frozen soil changes the latent heat exchange, which delays the soil temperature signal . Soil thermal characteristics are also improved due to the different thermal properties of ice and water. In the old soil thermal equations, thermal advection and phase change of the water are not considered when resolving the latent heat exchanges . The mechanical effects of soil freezing are therefore ignored. In the new soil freezing scheme, the apparent soil heat capacity can be increased by considering the ice content of the soil layer during a freezing-temperature window between 0 and −2C. A temperature correction is applied if any soil layer is entirely frozen or thawed. Moreover, the soil heat conductivity is changeable according to the ice content in the soil, which affects the thermal propagation in the vertical soil column. Finally, the hydraulic conductivity is reduced as a function of the ice content. Liquid water is not allowed to cross the frozen soil layers. Thus infiltration and drainage are forced to stop. Full descriptions of the new freezing soil equations and the parameterizations setting can be found in .

### 3.2.3 Human intervention

Irrigation is included in the current version of ORCHIDEE. The irrigation requirement is estimated as the deficit of the available water of the corresponding grid to the potential evapotranspiration. The irrigation extracts water from the local grid first and then its neighboring grids if necessary . This solution is acceptable for most of the humid regions at a 0.5 resolution since the rivers are very likely within 100 km. However, for dry regions, the Tarim for example, the irrigation area is concentrated, with controlled irrigation infrastructures. The nearby rivers are far from the irrigation area, and the irrigated water is not taken directly from the rivers but transported from upstream by channels. Due to the shortcoming of the scheme and the lack of knowledge on the local irrigation, we turned off the irrigation in ORCHIDEE. In the Tarim Basin, the irrigation accounts for more than 95 % of the consumed water , so that the difference between the simulated discharge and observations can be attributed to neglecting irrigation in the model.

In conclusion, as shown in Fig. 2, four different forcing inputs are prepared to drive the ORCHIDEE simulations, two basic forcing datasets (WFD-CRU and WFDEI-CRU), one after correction by CMA (WFDEI-CMA) and then one after adding glacier melt (WFDEI-CCG). Among them, the input water amounts are different, while WFDEI-CMA and WFDEI-CCG have the same other forcing variables as WFDEI-CRU. Additionally, the experiment with the newly developed snow and soil freezing scheme is named WFDEI-CCG-SF based on the forcing dataset WFDEI-CCG. Because the WFD covers the overlapping period 1901–2002, the WFDEI covers the period 1979–2014, CMA covers years 1951–2016 and the glacier melt dataset covers 1958–2001, all the simulations and analysis in this paper are over the overlap period 1979–2001. The monthly discharge measurements for those chosen hydrological gauges over 1979–2001 are then compiled. Spatial resolution for all the forcing inputs remains 0.5 (about 50 km at the Equator), and the time step is 3 h.

## 3.3 Main biases over the headwater catchments

In the water cycle, all the water entering a specific river basin will become ET back to the atmosphere and discharge (R) flowing out of the basin as long-term water storage changes are negligible. The underestimation of discharge is thus attributable either to the underestimation of water inputs or to overestimation of evapotranspiration. The possible biases from water inputs and the ET estimation are discussed in this section.

### 3.3.1 Bias in precipitation

The Tarim Basin is one of the areas where significant deviations exist among different modeled precipitation estimates and observation-derived datasets because of the sparse observations and orographic effects . Meanwhile, the precipitation bias might not be well addressed in the CMA system because the number of meteorological gauges is still too limited in the Tarim region to build a reliable interpolation climate field . Although have tried to use other gauges outside China, the density of the gauge distribution is extremely low around the boundaries of the Tarim Basin where most of the precipitation is generated. Furthermore, due to the orographic effects, the precipitation over the mountainous area is larger and with more significant heterogeneity than that in the plains , while the center of the precipitation events is hard to observe for the nearby gauges. has pointed out that the orography could cause 41.6 % underestimation of the precipitation over the northwestern North American mountainous ranges, and the deviations are larger at higher altitude.

### 3.3.2 Bias in rainfall and snowfall repartition

The differences between WFDEI-CRU and WFDEI-CMA are not only in the total amount of precipitation but also in the proportion of rainfall and snowfall (Table 3). Compared to the rainfall, the snowfall is more difficult to observe and affected by a large uncertainty; hence in the CMA dataset, only the rainfall was recorded and then used to scale the CRU dataset but keeping the relative proportion of liquid and solid precipitation provided by WFDEI-CRU. The energy needs for phrases change are considerably different for the liquid from the solid water. suggested snow will lead to more runoff than rain in similar conditions based on the observations over the US and China. The impact of the precipitation type on the evapotranspiration rate is affected by many factors and hard to measure.

Pike (1964)Fu (1981)

Table 1Different formulas of Budyko curves. Note that the aridity index is expressed as ∅= PET P.

### 3.3.3 Bias in glacier melt

HYOGA2 is a state-of-the-art global glacier model but has not been calibrated over the Tarim Basin. The general bias for global estimation is around −5.0 % compared to the available global glacier mass balance measurements . The estimated annual glacier melt amount is 81.0×108 m3 yr−1 for the whole Tarim Basin (Table 3), significantly lower than previous estimations (: 133.4×108 m3 yr−1; : 144.16×108 m3 yr−1). On the one hand, the difference in the forcing which drives the glacier melting model is probably one of the causes of the deviation. On the other hand, all the glacier melt is evenly distributed in a whole grid. It leads to a higher infiltration ratio and thus feeds more evaporation . This also artificially forces part of the glacier melt to flow out of the grid not belonging to the right basin. However, it is unable to eliminate this problem with the current gridded concepts. Finer spatial resolution in glacier dataset and model simulation is needed to lessen the impacts of discretization.

### 3.3.4 Bias in potential evapotranspiration estimation

As described in Sect. 3.2.1, the PET estimation is independent of underlying conditions (e.g., topography, vegetation) because enough water is provided. It is therefore determined only by forcing conditions, especially the humidity gradient and aerodynamic conditions (e.g., radiation flux, wind). Temperature also plays a role in its estimation. Thus, the bias in PET is mainly propagated from various forcing variables.

### 3.3.5 Bias in actual evapotranspiration estimation

Overestimation (underestimation) of the actual ET will also result in the discharge underestimation (overestimation). Many processes can cause ET errors either by the biases in PET or the stress functions which limit the potential evaporation. The vegetation fraction, vegetation type, surface slope and soil properties are all the uncertain sources affecting the final ET estimation.

### 3.3.6 Bias sources category

With the main biases listed as above, we consider bias in any processes that changes P or PET as bias from forcing and bias in any processes that directly changes ET as bias caused by model structure. Although the shifts in forcing variables will change the ET estimation – for example, the P restricts the available water for ET – this shift still belongs to the forcing category since the relation is indirect. The biases which directly affect ET include biases in infiltration, soil water movement, snow processes, vegetation representation and many other model processes. And all these are considered as biases caused by model structures.

## 3.4 Budyko hypothesis

Budyko hypothesis is an empirical expression for the coupling of the water and energy balances at the surface. It uses the relation between the water and energy balance equation to partition P into ET and R. The Budyko curve is the analytical solution to the Budyko hypothesis, expressed as the evapotranspiration rate (ET P) being a function of the aridity index (PET P) (Budyko1974). Many forms of Budyko curves have been developed and can be categorized into a non-parameter group and parameter group depending on whether there is an adjustable parameter describing the Budyko shape (Table 1).

Table 2The definition of climate types by precipitation and aridity index ().

The forms without parameters (formulas 1 to 4) are universal for most of the basins, while they are unable to capture the various landscape characteristics across regions . Regarding the effects of landscape characteristics, adjustable parameters and corresponding formulas were introduced as formula (5) to (8). Although they have different analytical expressions, the shape of these curves is quite similar and their parameters are highly correlated . Hence, from the formulas with parameters, Fu's equation (formula 6) is chosen in this study as it is more often used in the China region.

The ranges of the aridity index (PET P) correspond to the regional precipitation feeds and climate types (Table 2). For example, the precipitation for a semiarid region ranges from 400 to 800 mm yr−1, and the regional aridity index mostly ranges from 2 to 5. Moreover, the Budyko curve is a reflection of the landscape characteristics which can influence the water movement through different hydrological cycles (Dingman2015) and thus changes the ET rate. Many surface conditions are related to the Budyko parameter setting. (1) Vegetation types and vegetation cover affect the ET rate. Transpiration accounts for about 42 % (25 %–64 % depends on different models) of global ET . Regions that have a larger fraction of vegetation cover or are covered by vegetation with bigger leaves and deeper roots tend to have a larger transpiration rate as well as ET rate; i.e., the forested catchment tends to show a higher evaporation ratio than the grass-covered catchments . (2) Properties of soil determine infiltration rates and the amount of evapotranspirable water. Steeper slopes are more likely to shed surface water as runoff . Limit of infiltration ability also matters as intense precipitation rates or freezing soils tend to force the water into the surface runoff (the solid frozen soil limits the percolation of infiltrated water; ). (3) The ability to transmit or retain infiltrated water of soil also determines the evapotranspiration rate; the soil with larger water conductivity is likely to release more subsurface water rather than evaporation . (4) The soil depth determines the ability to store infiltrated water. Rocky mountains or regions with thin soil would produce more runoff and less ET .

## 3.5 Bias assessment with the ORCHIDEE-Budyko framework

The Budyko formulation relates the ET to PET and P. Both the biases from the water flux (P) and energy flux (PET) will propagate to ET. The shape parameter of the Budyko curve is obtained by fitting the PET P and ET P relation; it is thus a reflection of the model if we use the simulated PET and ET fluxes (i.e., ORCHIDEE simulations in this study, red dots and red curve in Fig. 3). However, because of the existing bias in all the three variables, the relations of the three components may have been shifted to an unrealistic state (point A in Fig. 3). Therefore, the changes in P, PET or ET which can shift the system back to a reliable state are considered as the possible bias. The difference between the unrealistic state with their corrected values provides the estimation of how the forcing or the model would need to be changed for the model to produce the realistic discharge values. To separate the individual effect of the single water–energy component on the hydrologic cycle, three independent assumptions are made as follows, and the illustration can be found in Fig. 3.

The red dots in Fig. 3 represent the states of the PET P and ET P according to ORCHIDEE estimations and forcing inputs for each year. Point A represents the representative state, which is the average of the dots' locations. It reflects the current model and is probably in an unrealistic state because the modeled discharge (PET) may be with bias compared to the observations.

Figure 3The illustration of the ORCHIDEE-Budyko framework. Point A represents the average state among the modeled annual values (with the land surface model ORCHIDEE), and the red curve is the simulated Budyko curve following the modeled state. Points B, C and D represent the representative state with the P, PET and ET shifted, respectively, with different assumptions to meet the discharge observations. A shifted Budyko curve (blue) is obtained crossing the point D, which indicates a new state of the model structure. The new points of B and C still stay on the original Budyko curve, indicating that the model structure remains the same and the changes only relate to forcing variables. The shaded area is the area among the three shifted states.

Assumption 1. Only the water input (P) is uncertain. Because the model structure remains unchanged, the relation between ET P and PET P still follows the original Budyko line regardless of how P changes. The PET is assumed to be independent of P, while the ET is modified as a result of P changes. To meet the deviation between simulated discharge (RS) and the observed discharge (RO), the representative point (long-term “corrected” annual ET P against long-term “corrected” annual PET P) should be shifted along the Budyko curve from current point “A” to the new point “B”, where the difference between the “true” precipitation (P) and the “true” evapotranspiration (ET) equals the observed discharge (${P}^{\prime }-{\mathrm{ET}}^{\prime }=$ RO). The possible maximum bias in P is calculated as

$\begin{array}{}\text{(1)}& \mathrm{bias}\left(P\right)=\left(P-{P}^{\prime }\right)/{P}^{\prime }×\mathrm{100}\phantom{\rule{0.125em}{0ex}}\mathit{%}.\end{array}$

Assumption 2. Only the PET is uncertain. The P remains the same, while ET changes because of the changes in PET. Under these conditions, the model structure still remains unchanged, and so does the Budyko curve. Then the representative point should be shifted along the Budyko curve to point “C” to decrease the ET ratio to meet the discharge observation (ET${}^{\prime }=P$-RO). The PET is changed to a “true” PET, and the possible maximum bias in PET is calculated as

$\begin{array}{}\text{(2)}& \mathrm{bias}\left(\mathrm{PET}\right)=\left(\mathrm{PET}-{\mathrm{PET}}^{\prime }\right)/{\mathrm{PET}}^{\prime }×\mathrm{100}\phantom{\rule{0.125em}{0ex}}\mathit{%}.\end{array}$

Assumption 3. Only the ET is uncertain. P and PET, which are mainly linked to the forcing, remain the same, while the ET, which is significantly affected by the model structure, is assumed biased. It is essentially relevant to the model processes rather than the forcing dataset. To compensate the discharge bias, the ET should be decreased to point “D” where the ET equals precipitation minus observed discharge (ET'$=P-$RO). The possible maximum bias in ET is calculated as

$\begin{array}{}\text{(3)}& \mathrm{bias}\left(\mathrm{ET}\right)=\left(\mathrm{ET}-{\mathrm{ET}}^{\prime }\right)/{\mathrm{ET}}^{\prime }×\mathrm{100}\phantom{\rule{0.125em}{0ex}}\mathit{%}.\end{array}$

With the target ET, a new Budyko curve can be drawn for new relations between P, PET and the new ET (the blue lines in Fig. 3). However, all the assumptions are proposed in conditions of only one variable being uncertain, but in reality any of the three variables can be biased at the same time. The final probable “corrected” state may be located in the shaded area identified by the three states (Fig. 3).

Figure 4The interannual cycle and intra-annual series of the precipitation (including rainfall, snowfall and glacier) in difference simulations for the upper Tarim Basin.

4 Results and discussion

## 4.1 Forcing and discharge comparison

### 4.1.1 Forcing input comparison among experiments

We specify P as the sum of all the water inputs into the system, which include the atmospheric water flux (in its liquid and solid phases) and glacier melt. The precipitation for the three headwater catchments and the upper Tarim is listed in Table 3 for each simulation. The interannual variations and the intra-annual cycle of total precipitation over upper Tarim Basin are plotted for different forcing in Fig. 4.

Table 3The five simulations in this study and basic diagnostics of the water inputs over three headwater catchments and the upper Tarim Basin. Units: 108 m3 yr−1.

The annual cycle of the two basic forcing datasets WFD-CRU and WFDEI-CRU are similar, while the precipitation in WFDEI-CRU is slightly larger in monthly values (Fig. 4a, red and blue lines). The deviation is the result of their expanding differences after 1990 (Fig. 4b). The precipitation difference is mainly due to the fact that two different versions of CRU (CRU TS 2.10 and CRU TS 3.1) are used for precipitation correction in WFD and WFDEI (Weedon et al.2014a). The precipitation difference between the two CRU datasets is relatively small, while the CMA dataset increases the precipitation to a large extent, by 37.3 %, compared to CRU for the upper Tarim Basin (Table 3). The changes mostly occur during summer when the peak of CMA precipitation is more than twice as large as precipitation in CRU. The shape of the annual cycle changes greatly as the timing of the peak is shifted from April to July. However, the precipitation amount in winter (DJF) decreases in CMA, to which the decrease in snowfall is the major contributor. The changes in rainfall and snowfall are similar in all three headwater catchments (Table 3) and the upper Tarim Basin.

Figure 5The temporal variations of the glacier melt, the proportion of glacier melt in the water input and the average summer temperature over the upper Tarim Basin. They are in good correlation and have a consistent increasing trend after 1990.

Figure 6The interannual cycle and intra-annual series of the discharge simulation for three headwater catchments and the upper Tarim Basin: (a, b) Yarkand River, (c, d) Hotan River, (e, f) Aksu River and (g, h) the upper Tarim Basin. Observed discharge for each subbasin was aggregated by the measurements at separated discharge gauges and shown as the black solid line. The simulated discharges at the corresponding grids were extracted from each experiment and plotted as the color lines with markers. The dotted line and dot-dash line in the interannual cycle plots represent the observed discharge in different periods.

Adding the glacier melt leads to negligible changes during winter and spring but large increases in the total water inputs in summer (JJA) when the temperature is higher. The estimated glacier melt is 9.1 %, 25.8 % and 6.1 % to the total water inputs for the upper Yarkand River, upper Hotan River and upper Aksu River. It significantly increases after 1990 (Fig. 5; the trend is $+\mathrm{3.8}×{\mathrm{10}}^{\mathrm{8}}$ m3 yr2, p=0.024), being consistent with its ratio to the total water input (r=0.918, p<0.001). The trend is mainly caused by climate warming as the glacier melt is highly correlated with the summer temperature (r=0.852, p<0.001). The increasing trend has also been documented in glacier runoff observations .

Table 4The water–energy components for the headwater catchments in WFDEI-CCG-SF simulation. Units: mm yr−1.

### 4.1.2 Assessment of the discharge estimations with observations

Evaluating the bias in precipitation over meteorological rain gauges is not convincing as most gauges are located at lower altitude, which makes it difficult to capture regional patterns as intensive precipitation occurs over higher mountains. Instead, the discharge measurement can serve as a better reference since it integrates the net water flux over the entire basin. Therefore, driven by the forcing, ORCHIDEE was used to simulate the river discharge and for comparison to in situ observations (Fig. 6). The corresponding assessment using criteria for the three headwater catchments are plotted in Fig. 7. For the three headwater catchments where most of the discharge of the Tarim Basin is generated, the discharge is significantly underestimated, with the underestimation ratio reaching 90 % (Figs. 6b, d, f, 7a) for CRU datasets. Discharge increases after the precipitation is corrected by the CMA dataset, with the absolute bias decreasing to around 80 %. Adding glacier melt also increases the discharge but by a relatively small amount. Changes in the model structure (new snow and soil freezing scheme) further decrease the bias, especially for the Aksu River. The final biases of the discharge for the three subbasins are −71.1 %, −47.8 % and −49.4 %. The gradual improvements and corresponding magnitude changes are visible in the annual discharge variability in Fig. 6b, d and f.

Table 5The annual average values for different water–energy components (P, ET, PET; units: mm yr−1) and their relations (PET, PET P and ET P) for the three upstream subbasins. The scenarios correspond to the diagnostics of the current model (A) and three bias assumptions listed above from B to D. The bold values are the main factors changed within the three basic water–energy components. The changing ratio indicates the ratio of the changing value to the original value (unit: %), while the bias range implies the bias of the values in the current variables compared to the values they should be (unit: %).

Table 6Summary of the possible causes of the underestimation in discharge and the corresponding arguments. Three levels of the possibility are presented: yes: with direct argument; likely: with indirect argument; and no: with negative argument.

Figure 7The discharge diagnostics of different experiments for three catchments (round – Yarkand; square – Hotan; and triangle – Aksu). (a) represents the absolute bias in percentage, (b) represents the correlation of the interannual cycle and (c) represents the ratio of the trend in estimated discharge to that in observed discharge for the period 1990–2001. S0 to S4 correspond to the experiments WFD-CRU, WFDEI-CRU, WFDEI-CMA, WFDEI-CCG and WFDEI-CCG-SF in ascending order.

Figure 8The water input components (rainfall, snowfall, glacier melt), evapotranspiration and discharge for three headwater catchments (from top to bottom: a, Yarkand; c, Hota; and e, Aksu) in the WFDEI-CCG-SF simulation. In the left panels, the amounts of different variables are plotted as bars, while the average mean of the observed discharge (RO) is plotted as the red line. RS denotes the simulated discharge by ORCHIDEE. In the right panels, the annual cycle of the water inputs (blue line), evapotranspiration (green shadow), estimated discharge (runoff plus drainage, solid black line), observed discharge (dashed black line) and changes in terrestrial water storage (TWS, slashed area) are plotted as b, d and f. The green slashed area represents decreasing in TWS, and the white slashed area represents increasing in TWS.

Figure 9Budyko relation for three headwater catchments (a, Yarkand; b, Hotan; c Aksu). The red points represent the values for each year; the P is obtained from the forcing, while ET and PET are obtained from the model. Point A represents the long-term average Budyko relation, and red lines are the optimal fitted Budyko line. The water input P, the potential evapotranspiration PET or the actual evapotranspiration ET can be modified to meet the observed discharge, which correspondingly shifts the representative points from A to B, C or D. B and C stay in the original Budyko curve, while a new optimal fitted Budyko curve through point D can be built after the changes in ET. The shaded area is the most likely area when not only a single variable is changing.

Besides the increase in the annual mean discharge, the amplitude of the interannual cycle of the discharge is also improved by the progressive changes. The estimated discharge peaks have been shifted from April in CRU simulations to the summer (July or August) by CMA correction and adding glaciers (Fig. 6a, c, e). Correspondingly, the correlation of the annual variability between the estimated and observed discharge has increased above 0.9 for all the three subbasins with the WFDEI-CCG forcing (Fig. 7b). However, contrary to the upper Yarkand River and the upper Hotan River, the introduction of the new snow and soil freezing scheme decreases the discharge correlation for the upper Aksu River from 0.91 to 0.42. An early discharge peak exists in May, while not enough runoff is generated in the summer period (Fig. 6e). Although the correlation decreases, it does not mean the model/simulation deteriorates because correlation only evaluates the similarity of temporal variation but ignores the fact that the discharge amount has been better estimated (Figs. 6e and 7a).

By extracting the observed discharge in the first and last 5 years from the whole period, we can notice there is an obvious shift of the discharge peak from August to July in the three headwater catchments (Fig. 6a, c, e). The regional precipitation changes largely cause the shift, but the increasing temperature also allows the snow/glacier to melt at a higher rate in the most recent period. Furthermore, increasing trends are detected after the 1990s (Fig. 6b, d, f), as the increasing trend is 1.43×108 m3 yr2 (or 0.77×108 m3 yr2, 1.78×108 m3 yr2) for the upper Yarkand (or Hotan, Aksu) River. The increasing trends are consistent with the glacier melt, glacier proportion in water input and summer temperature in the same period (Fig. 5).

The trends in the estimated discharge are also calculated and compared with the observed trends, expressed as the ratio of the trend in estimation to that in observations (Fig. 7c). For CRU simulations, no increasing trend is detected since the ratio is less than 0. The CMA correction increases the ratio for all three subbasins to around 0.3, which means the precipitation accounts for only around 30 % of the discharge increase. Adding the glacier melt increases the ratio from 0.35 to 0.54 for the upper Hotan River and from 0.31 to 0.76 for the upper Aksu River; the improvement of the glacier melting is comparable to the CMA correction. However, no apparent changes are detected for the upper Yarkand River. By comparing the criteria between WFDEI-CMA and WFDEI-CCG, we find that, although adding the glacier melt does not change much the absolute amount of discharge or the correlation, the increased trend in discharge has been considerably improved. The increasing glacier melt is, therefore, one of the contributors to the discharge trend in the Tarim. The modification of the snow and soil freezing scheme increases the trend ratio in the upper Aksu River up to 0.72, while slightly decreasing it for the two other catchments.

In brief, the gradual refinement of the forcing datasets (from WFD-CRU to WFDEI-CRU to WFDEI-CMA to WFDEI-CCG) is effective for improving the model performance using different criteria (bias, correlation, proportion to the trend) to compare the observed discharge. The three criteria are independent as they stand for the averages, the variation and the trend, which can capture the various aspects of the model agreement to the observations. The responses are similar for different catchments, but at different magnitude at different stages. The correction of the CMA dataset is the most significant improvement to all the criteria. The role of glaciers melting is critical for the trend analysis. The modification in the snow and soil freezing scheme increases the total discharge amount but could lead to adverse responses in the correlation and trend simulation. However, the impact of the modification of the model structure is not larger than changes resulting from the forcing biases. From the previous analysis, we can conclude that the simulations of WFDEI-CCG and WFDEI-CCG-SF are comparable in the correlation and trend analysis, while WFDEI-CCG-SF is better regarding the water quantity. Therefore, the further study on the bias is all based on the WFDEI-CCG-SF simulation.

## 4.2 Evidence of the bias in estimated discharge for the headwater catchments

Although the simulations with WFDEI-CCG-SF are better than other experiments, there are still biases compared to the observations (Fig. 7a). In this section, we aim to find evidence of the biases. The annual mean water balance components (rainfall, snowfall, glacier melt, estimated ET and discharge RS) of the three upper catchments are plotted as bars with their relations quantified by comparison to the discharge observations (RO, red lines in Fig. 8a, c, e) in WFDEI-CCG-SF. Their annual cycles are also plotted as Fig. 8b, d and f over the three headwater catchments. Over a long enough period, the changes in terrestrial water storage are assumed negligible compared to the water fluxes, so that the water input into the system either returns to the atmosphere through ET or flows out of the basin as RS.

From the left panels of the Fig. 8, we have a visual impression of the relative amount of different water inputs and their contribution to the ET or discharge. Note that the sum of the ET and RS is not exactly equal to P because in ORCHIDEE the discharge is represented at the outflow of the grid and not at the confluence point of the analyzed catchment with other tributaries. The largest bias is 8 % for the upper Yarkand River (Fig. 8a, ET P+ RS$/P=\mathrm{0.92}$), while it matches exactly for the upper Aksu River. The bias can be added to the current RS if necessary.

Among the three headwater catchments, the upper Hotan River has the best discharge simulation compared to the observations (RS  RO =0.52). The annual cycle of the water also matches well as all the P, ET and discharge RO or RS have the synchronous peaks in summer (Fig. 8d). There are also deviations between P and E, which represent the net water inputs to the system and the estimated discharge (the shaded area with blue lines in Fig. 8d). The deviation implies the regional water storage changes; in summer the soil moisture increases to store the abundant water inputs, which are later released in autumn and winter when the drainage rate is larger than infiltration. The water storage decreases as a result by then. It is the natural adjustment to the strong seasonality in water inputs.

As the neighboring catchment of the upper Hotan River, the upper Yarkand River has similar phases of estimated flux ratios (ET  PET, ET P, RS P) and interannual variations. However, the estimated discharge rate is smaller (RS  RO =0.29) than that of the upper Hotan River. Underestimation in water inputs in summer and autumn is possibly the reason as there is no obvious water storage gaining in the summer period and the ratio of observed discharge to regional precipitation is unrealistically high (>0.9, Fig. 8b).

The upper Aksu River has different characteristics from the other two regions since it lies in the northern part of the Tarim. It has a larger snowfall proportion in the precipitation. Meanwhile, it has the largest ratio of estimated ET  PET, the largest runoff generation ratio (RS$/P=\mathrm{0.35}$) and the least discharge simulation error (RS  RO = 0.51). However, there is certainly large bias in the regional precipitation as the discharge has exceeded the precipitation input in the summer period (July and August). The estimated annual cycle of discharge diverges from the observations (Fig. 8f), as its peak advances by 2 months and the discharge estimation significantly exceeds the observations in spring (MAM). The runoff generation ratio in the summer period is also unrealistically low.

In summary, the biases in discharge estimation exist in terms of the total amount and the annual cycle. Precipitation is one of the largest bias sources, which makes the bias analysis in models more difficult.

## 4.3 Bias range and possibility analysis for the headwater catchments

Although either P or PET or ET can cause final underestimation of discharge over the three headwater catchments, quantifying the bias in each flux is challenging and impractical due to the lack of direct measurements and the strong heterogeneity over the mountainous area. To separate the individual bias, we use the Budyko hypothesis by assuming only one variable is uncertain, while the other two are assumed to have negligible errors, with which we “correct” the model simulation to meet the discharge observation and obtain the possible bias range. Then we evaluate the possibility of rejecting the assumption to find the most likely bias source by checking the status of the water system (i.e., amount of the water–energy components and their relations) in indirect ways. The water–energy components used in the Budyko analysis are all ORCHIDEE outputs of the most satisfactory simulation, WFDEI-CCG-SF. The corresponding characteristics of the water–energy components for three headwater catchments are listed in Table 4.

### 4.3.1 Bias ranges estimated by the ORCHIDEE-Budyko framework

The ORCHIDEE-estimated evapotranspiration rate (ET P) against the estimated aridity index (PET P) over each subbasin in each year is scattered as red points in Fig. 9. Point A represents the Budyko relation between long-term average annual ET P and the long-term average annual PET P. According to the categories introduced by , all three catchments belong to semiarid climate zones by the definition of the annual average precipitation (Table 2). Hence the aridity index is supposed to range from 2 to 5. Regarding the high elevation and cold temperature, the PET rate is likely to be smaller than the representative climate of this aridity index. Thus the aridity index for the three catchments is supposed to be lower than expected. It is realistic for the upper Hotan River and the upper Aksu River as their aridity is 3.63 and 1.73, respectively. While the aridity index for the upper Yarkand River is 5.02, which can be categorized as a semiarid or arid region, this is not very likely since the upper Yarkand River is providing water resources for the irrigated area over the lower Yarkand oases .

However, because there is still a bias in the ORCHIDEE discharge estimations with the observations, the current state A is not correct. Based on the assumptions introduced in Sect. 3.5, the three possible “corrected” states by shifting the P, PET and ET are shown in Fig. 9 for the three headwater catchments. Taking the upper Yarkand River as an example (Fig. 9a), if we consider only P to be biased (assumption 1), the P has to be shifted from 247.3 to 435.4 mm yr−1. PET remains the same, and the ET changes accordingly, but the relation between ET P and PET P still follows the Budyko curve. The state is shifted from point A to point B with the P change ratio as 76.1 %. Reversely, the possible bias of P is −43.2 % ((247.3–435.4)$/\mathrm{435.4}×\mathrm{100}$ %). Similarly, to change the PET in order to shift the state to correct (point C, assumption 2), the PET has to be shifted from 1240.4 to 225.0 mm yr−1. The possible bias in PET is 451.2 %. To change the ET in order to shift the state to correct (point D, assumption 3), the ET has to be shifted from 188.3 to 106.9 mm yr−1, and the possible bias in ET is 76.1 %. The route is the same for the other two catchments, and their results are listed in Table 5.

The previous analysis is based on the assumptions that the P and PET are independent and only a single variable is uncertain, which might be invalid in reality. However, the three assumptions provide the bias boundaries of each variable, and the final system reproducing observed RO is likely to be located within the shaded area shown in Fig. 9. Taking the Hotan River as an example, to meet the discharge observation, the final changes in P will be 0 %–26.6 %, the decrease in PET will be 0 %–46.7 % and the decrease in ET will range 0 %–16.1 %. While in turn, we also conclude that the P is underestimated by 0 %–21.0 %, the PET is overestimated by 87.5 % at most and the ET is overestimated by 19.1 % at most. If we know the bias for any single variable, the feasible ranges will be narrower than at present.

### 4.3.2 Ranking the bias possibility

Although the Budyko approach provides us with possible ranges for the bias of each variable, it is still difficult to determine the bias source without proper bias measurements for each of the forcing variables. We alternatively compare the regional diagnostics with nearby regions or regions with similar climatic and regional characteristics which have qualified observations in an indirect way. In doing so, we can generally rank the occurrence possibility of an uncertain variable.

From multiple-model analysis based on CMIP5 general circulation models (GCMs), the estimated PET over the western boundary of the Tarim Basin is about the same as over the Tibetan Plateau . However, because of the unpredictable biases in GCMs, the absolute values are not highly reliable in their simulation. Nevertheless, the equivalent relation provides us with the ranges of PET over the Tarim headwater catchments by referring to the observations over the Tibetan Plateau, where the topography changes are relatively small and the observations are more abundant. According to , who used site observations from 101 stations over the Tibetan Plateau, the annual average PET over the plateau ranges from 580 to 720 mm yr−1. Hence the PET over the upper Yarkand River and the upper Hotan River is probably overestimated (1240.4 and 1153.7 mm yr−1). Therefore, only changes in P or ET are not satisfactory because the PET is unchanged. For the upper Yarkand River, the PET is not the only error source, because, to match the discharge deviation by only decreasing PET, the PET should be decreased to 225 mm yr−1, which exceeds the referenced PET range. Moreover, because Yarkand and Hotan are neighboring regions which have similar climates, the PET should be a similar amount in each (615.2 mm yr−1 if only PET is uncertain in the upper Hotan River, while 225 mm yr−1 in the Yarkand River). The estimated PET over the Aksu River is realistic since the PET is 631.8 mm yr−1 for the current scenario, but it would decrease to 174.1 mm yr−1 if only the PET changed, which is too low. Besides the absolute value of PET, the ratio PET P also shifts when PET changes, which means the climatic types can be changed. Only decreasing the PET in the upper Yarkand and the upper Aksu River would cause significant decreases in the aridity index (from 5.02 to 0.91 for the upper Yarkand, from 1.73 to 0.48 for the upper Aksu River), which are not realistic for these regions.

ET computation is sensitive to the climatic conditions and the surface conditions; hence the absolute value of ET significantly varies in time and space, and its bias very difficult to quantify. The evapotranspiration ratio to precipitation (ET P) is typical for specific climatic types or regions with similar land cover types . estimated the evapotranspiration ratio to precipitation using a remote-sensing approach over regions of Canada. They concluded that the ratio ET P is 32 % for barren land and 18 % for snow/ice land. In general, most of the catchment area of the three headwater catchments consists of barren and snow/ice land. Because of its lower latitude, the ET ratio could be higher but still below the rate for cropland (67 %). Therefore, only changes in P are not very likely for Yarkand and Hotan (ET P is 0.68 for Yarkand and 0.70 for Hotan after the correction). Higher P for the upper Aksu River is likely to maintain a realistic ET P ratio.

The biases of the three variables (P, ET, PET) have relatively weak dependence because they are governed by different processes. P and PET are quite independent because they relate to different forcing variables. Although the ET amount is linked to two other variables, the ET bias is weakly dependent, and it also comes more from the surface conditions and model biases. The chances of biases arising from each variable are about the same in theory. However, based on the analysis of the model output and the assumed bias-corrected scenarios, there are some priorities for the bias sources over each subbasin. The possibility of biases and the supporting arguments are listed in Table 6 for each headwater catchment. For instance, for the upper Yarkand subbasin, an increase in P, especially the glacier melt, is necessary because of the lower glacier melt ratio compared to the Hotan basin (Sect. 5.1.2.3) and the small trend in model discharge compared to the discharge observation (Sect. 4.2). However, only increasing the precipitation is not sufficient because the current PET is very likely too high compared to the surrounding regions (PET = 1240.4 mm yr−1, while PET ranges 580–720 mm yr−1 over the Tibetan Plateau). Meanwhile, only decreasing the PET without changing other variables would cause a very low PET rate (225 mm yr−1) and low aridity index (PET$/P=$ 0.91), which is not realistic for this region. Modification in ET is possible but not adequate due to the overestimated PET. Hence, this error analysis reveals that increasing precipitation over the upper Yarkand subbasin is quite necessary, the overestimation of PET is very likely and modification in ET estimation is possible but not fundamentally necessary. For the upper Hotan River, the most likely biases come from the overestimation of PET, while the two other variables are possible sources. An increase in precipitation and changes in temporal variability are necessary for the upper Aksu subbasin, but they are not the only causes, as either the PET or ET, or both, is overestimated.

## 4.4 Human intervention in the lower oases

The current ORCHIDEE version does not yet take into account the intensified evapotranspiration caused by human activities especially through irrigation, which is a major process in the hydrological cycle transferring water to the atmosphere. As a consequence, in the lower oases the simulated discharge at the Alar station is significantly larger than the observations (Fig. 6g, h; 146.59×108 m3 yr−1 in WFDEI-CCG-SF simulation to 43.34×108 m3 yr−1 in observation). However, because the biases of the upstream discharge will propagate to the Alar station, the catchment of which includes the three basins discussed above, the currently estimated discharge is thus underestimated compared to the potential river flow at Alar, which is the natural river flow without human intervention.

We use two simple approaches to estimate this underestimation. In the first one, according to the work of , all the water increment of the Alar gauge station is caused by the water changes from the three headwater catchments. Hence the underestimation of the discharge to the potential river flow at Alar equals the underestimation of discharge from those three catchments. The increment at Alar should be 110.25×108 m3 yr−1 (observed 192.15×108 m3 yr−1 from the three headwater catchments minus simulated 81.9×108 m3 yr−1), so that the potential river flow at Alar should be 256.84×108 m3 yr−1 ($=\mathrm{146.59}+\mathrm{110.25}$), and the influence of human activities on the increase of ET can be estimated as 213.50×108 m3 yr−1 ($=\mathrm{256.84}-\mathrm{43.34}$), 83.1 % of discharge.

A second simple scaling approach to obtaining the potential river flow at Alar is that we assume the model bias for the whole upper Tarim Basin is constant over space. The scaling factor is 2.35 ($=\mathrm{192.15}/\mathrm{81.9}$); hence the potential discharge at Alar should be 343.92×108 m3 yr−1 (=146.59×2.35). And the influence of human activities on the increase of ET is estimated as 300.58×108 m3 yr−1($=\mathrm{343.92}-\mathrm{43.34}$), 87.4 % of the discharge. The overestimation over the discharge observation is the amount caused by additional human intervention, especially the irrigation-caused evapotranspiration.

To validate the proposed values, we collected the irrigation area over the Tarim Basin using the FAO Global Map of Irrigation Areas , according to which the total irrigated area for the upper Tarim Basin is 13 548.5 km2. In addition, provides the gross irrigation quota as 1.77×106 m3 km−2; hence the total irrigation water consumption will be 239.80×108 m3 yr−1. Hence the results of the two approaches assessing human net abstraction are −11 % and 25 % (213.50×108 m3 yr−1 and 300.58×108 m3 yr−1, respectively) in relation to the consumption data, which are acceptable figures because of the unknown biases in irrigation area as well as the gross irrigation quota. The proportion of the consumed water (83.1 %–87.4 %) is higher than the estimation (74.7 %) in 1995, which could be explained by the intra-annual variation of inflow and abstraction.

5 Conclusions

In this work, we proposed an ORCHIDEE-Budyko framework which is used to attribute the modeled discharge bias to different sources as the forcing and model structure. Bias in the precipitation (P) and any processes related to the potential evapotranspiration (PET) is considered as bias from forcing and bias in any processes affecting the actual evapotranspiration (ET) estimation is considered as bias from model structures. The discharge simulation was provided by the land surface model ORCHIDEE with latest developments in its modules and driven by the most improved forcing inputs (WFDEI-CCG-SF). However, underestimation in the discharge still exists over the three Tarim headwater catchments, where the biases of P, PET and ET are analyzed with a Budyko analytical approach. With a set of assumptions, we isolated the biases in three variables, and their possibilities were assessed with information from nearby and hydroclimatically similar regions. Results show that precipitation (here considered as the sum of rainfall, snowfall and glacier melt) underestimation is highly likely for the upper Yarkand River and the upper Aksu River, while the overestimation of PET is likely to affect the upper Yarkand River and the upper Hotan River. The overestimation in ET is possible but not likely the only cause for the discharge underestimation for all headwater catchments. In the lower oases, humans consume 83.1 %–87.4 % of the discharge for irrigation, which is also a bias source in the current version of model. Thus, inclusion of detailed human modules is needed for any large-scale model.

In this attempt to analyze the performance of a complex land surface model over the Tarim Basin, large biases are found in the discharge estimation. Our finding that the bias is most likely caused by the forcing variables rather than the model is probably the reason for the failures of other models in specific regions as well. Our work provides more information about the Tarim Basin's water cycle and guidance for future studies that the bias in forcing variables should firstly be assessed and reduced in order to perform pertinent analysis of the regional water cycle. Land surface models are a recommended tool for water cycle analysis because of their independence of calibration and good ability to simulate most variables of the water cycle and their interplay, which facilitates the identification of bias sources. This kind of application along with the improvements of forcing data is also important for predicting water resources in the Tarim as well as other high-altitude basins in central Asia in a changing climate.

Code and data availability
Code and data availability.
Author contributions
Author contributions.

JP initialized the ideas presented in this paper with XZ and TY. XZ prepared the simulations, the figures and the manuscript. TY and YH participated in the data preparation. TNQ participated in the model development. All authors contributed to the discussion and revising the paper.

Competing interests
Competing interests.

The authors declare that they have no conflict of interest.

Acknowledgements
Acknowledgements.

This study was supported by the National Natural Science Foundation of China (grant nos. 41561134016 and 51879068); the CHINA-TREND-STREAM French national project (ANR grant no. ANR-15-CE01-00L1-0L); the National Key Research and Development Program (2018YFC0407900); the Global Environmental Research Fund (S-14) of the Ministry of the Environment, Japan; and the China Scholarship Council (CSC, 201506710042). The work was supported by computing resource of the IPSL ClimServ cluster at École Polytechnique, France. We also thank the editor and the anonymous reviewers for their contribution to the improvement of this paper.

Edited by: Zhongbo Yu
Reviewed by: three anonymous referees

References

Adam, J. C., Clark, E. A., Lettenmaier, D. P., and Wood, E. F.: Correction of global precipitation products for orographic effects, J. Climate, 19, 15–38, https://doi.org/10.1175/JCLI3604.1, 2006. a, b, c, d, e, f

Alkama, R., Kageyama, M., and Ramstein, G.: Relative contributions of climate change, stomatal closure, and leaf area index changes to 20th and 21st century runoff change: A modelling approach using the Organizing Carbon and Hydrology in Dynamic Ecosystems (ORCHIDEE) land surface model, J. Geophys. Res.-Atmos., 115, D17112, https://doi.org/10.1029/2009JD013408, 2010. a

Barella-Ortiz, A., Polcher, J., Tuzet, A., and Laval, K.: Potential evaporation estimation through an unstressed surface-energy balance and its sensitivity to climate change, Hydrol. Earth Syst. Sci., 17, 4625–4639, https://doi.org/10.5194/hess-17-4625-2013, 2013. a, b, c

Barry, R. G. and Chorley, R. J.: Atmosphere, Weather and Climate, CUP Archive, https://doi.org/10.4324/9780203871027, 2009. a

Berger, K. P. and Entekhabi, D.: Basin hydrologic response relations to distributed physiographic descriptors and climate, J. Hydrol., 247, 169–182, https://doi.org/10.1016/S0022-1694(01)00383-3, 2001. a, b

Berghuijs, W. R., Woods, R. A., and Hrachowitz, M.: A precipitation shift from snow towards rain leads to a decrease in streamflow-supplement, Nat. Clim. Change, 4, 583–586, https://doi.org/10.1038/NCLIMATE2246, 2014. a

Boike, J., Roth, K., and Overduin, P.: Thermal and hydrologic dynamics of the active layer at a continuous permafrost site (Taymyr Peninsula , Siberia), Water Resour. Res., 34, 355–363, https://doi.org/10.1029/97WR03498, 1998. a

Budyko, M. I.: Climate and Life, Academic Press, New York, https://doi.org/10.1016/0033-5894(67)90014-2, 1974. a, b, c

Carmona, A. M., Sivapalan, M., Yaeger, M. A., and Poveda, G.: Regional patterns of interannual variability of catchment water balances across the continental U.S.: A Budyko framework, Water Resour. Res., 50, 9177–9193, https://doi.org/10.1002/2014WR016013, 2014. a

Chen, S., Liu, Y., and Axel, T.: Climatic change on the Tibetan Plateau: Potential evapotranspiration trends from 1961–2000, Climatic Change, 76, 291–319, https://doi.org/10.1007/s10584-006-9080-z, 2006. a

Chen, Y., Li, W., Xu, C., and Hao, X.: Effects of climate change on water resources in Tarim River Basin, Northwest China, J. Environ. Sci., 19, 488–493, https://doi.org/10.1016/S1001-0742(07)60082-5, 2007. a

Choudhury, B.: Evaluation of an empirical equation for annual evaporation using field observations and results from a biophysical model, J. Hydrol., 216, 99–110, 1999. a

Cui, T., Yang, T., Xu, C. Y., Shao, Q., Wang, X., and Li, Z.: Assessment of the impact of climate change on flow regime at multiple temporal scales and potential ecological implications in an alpine river, Stoch. Env. Res. Risk A., 32, 1849–1866, https://doi.org/10.1007/s00477-017-1475-z, 2018. a

Daly, C., Halbleib, M., Smith, J. I., Gibson, W. P., Doggett, M. K., Taylor, G. H., and Pasteris, P. P.: Physiographically sensitive mapping of climatological temperature and precipitation across the conterminous United States, Int. J. Climatol., 28, 2031–2064, https://doi.org/10.1002/joc.1688, 2008. a, b

de Rosnay, P. and Polcher, J.: Modelling root water uptake in a complex land surface scheme coupled to a GCM, Hydrol. Earth Syst. Sci., 2, 239–255, https://doi.org/10.5194/hess-2-239-1998, 1998. a

de Rosnay, P., Polcher, J., Bruen, M., and Laval, K.: Impact of a physically-based soil water flow and soil-plant interaction representation for modeling large-scale land surface processes, J. Geophys. Res., 107, ACL 3-1–ACL 3-19, https://doi.org/10.1029/2001JD000634, 2002. a

de Rosnay, P., Polcher, J., Laval, K., and Sabre, M.: Integrated parameterization of irrigation in the land surface model ORCHIDEE. Validation over Indian Peninsula, Geophys. Res. Lett., 30, 1–4, https://doi.org/10.1029/2003GL018024, 2003. a

Dingman, S. L.: Physical hydrology, 3rd ed., Waveland Press, Long Grove, IL, 2015. a, b

Döll, P., Fiedler, K., and Zhang, J.: Global-scale analysis of river flow alterations due to water withdrawals and reservoirs, Hydrol. Earth Syst. Sci., 13, 2413–2432, https://doi.org/10.5194/hess-13-2413-2009, 2009. a

D'Orgeval, T.: Impact du changement climatique sur le cycle de l'eau en Afrique de l'Ouest: modélisation et incertitudes, PhD thesis, Paris, 6, 2006. a, b

D'Orgeval, T. and Polcher, J.: Impacts of precipitation events and land-use changes on West African river discharges during the years 1951–2000, Clim. Dynam., 31, 249–262, https://doi.org/10.1007/s00382-007-0350-x, 2008. a, b, c

d'Orgeval, T., Polcher, J., and de Rosnay, P.: Sensitivity of the West African hydrological cycle in ORCHIDEE to infiltration processes, Hydrol. Earth Syst. Sci., 12, 1387–1401, https://doi.org/10.5194/hess-12-1387-2008, 2008. a, b

Ducoudré, N. I., Laval, K., and Perrier, A.: SECHIBA, a New Set of Parameterizations of the Hydrologic Exchanges at the Land-Atmosphere Interface within the LMD Atmospheric General Circulation Model, 1993. a

Federer, C. A., Vörösmarty, C., and Fekete, B.: Intercomparison of methods for calculating potential evaporation in regional and global water balance models, Water Resour. Res., 32, 2315–2321, https://doi.org/10.1029/96WR00801, 1996. a

Fekete, B. M., Vörösmarty, C. J., Roads, J. O., and Willmott, C. J.: Uncertainties in precipitation and their impacts on runoff estimates, J. Climate, 17, 294–304, https://doi.org/10.1175/1520-0442(2004)017<0294:UIPATI>2.0.CO;2, 2004. a, b, c, d

Fraedrich, K., Jansen, H., Kirk, E., Luksch, U., and Lunkeit, F.: The Planet Simulator: Towards a user friendly model, Meteorol. Z., 14, 299–304, https://doi.org/10.1127/0941-2948/2005/0043, 2005. a

Fu, B.: More on the calculation of average total evaporation, Sci. Atmos. Sin, 5, 23–31, 1981 (in Chinese). a

Gao, X., Ye, B. S., Zhang, S. Q., Qiao, C. J., and Zhang, X. W.: Glacier runoff variation and its influence on river runoff during 1961-2006 in the Tarim River Basin, China, Sci. China Earth Sci., 53, 880–891, https://doi.org/10.1007/s11430-010-0073-4, 2010. a, b

Gerrits, A. M. J., Savenije, H. H. G., Veling, E. J. M., and Pfister, L.: Analytical derivation of the Budyko curve based on rainfall characteristics and a simple evaporation model, Water Resour. Res., 45, 1–15, https://doi.org/10.1029/2008WR007308, 2009. a

Giorgi, F. and Francisco, R.: Evaluating uncertainties in the predicition of regional climate change, Geophys. Res. Lett., 27, 1295–1298, https://doi.org/10.1029/1999GL011016, 2000. a

Gouttevin, I., Krinner, G., Ciais, P., Polcher, J., and Legout, C.: Multi-scale validation of a new soil freezing scheme for a land-surface model with physically-based hydrology, The Cryosphere, 6, 407–430, https://doi.org/10.5194/tc-6-407-2012, 2012. a, b, c

Guimberteau, M., Drapeau, G., Ronchail, J., Sultan, B., Polcher, J., Martinez, J.-M., Prigent, C., Guyot, J.-L., Cochonneau, G., Espinoza, J. C., Filizola, N., Fraizy, P., Lavado, W., De Oliveira, E., Pombosa, R., Noriega, L., and Vauchel, P.: Discharge simulation in the sub-basins of the Amazon using ORCHIDEE forced by new datasets, Hydrol. Earth Syst. Sci., 16, 911–935, https://doi.org/10.5194/hess-16-911-2012, 2012. a, b, c, d, e, f

Harding, R., Best, M., Blyth, E., Hagemann, S., Kabat, P., Tallaksen, L. M., Warnaars, T., Wiberg, D., Weedon, G. P., Lanen, H. V., Ludwig, F., and Haddeland, I.: WATCH: Current Knowledge of the Terrestrial Global Water Cycle, J. Hydrometeorol., 12, 1149–1156, https://doi.org/10.1175/JHM-D-11-024.1, 2011. a

Harris, I., Jones, P. D., Osborn, T. J., and Lister, D. H.: Updated high-resolution grids of monthly climatic observations – the CRU TS3.10 Dataset, Int. J. Climatol., 34, 623–642, https://doi.org/10.1002/joc.3711, 2014. a, b, c, d, e

Hernández, M. R. and Francés, F.: On the Influence of Error Model in the Good Performance of the Hydrological Model for the Right Reasons, International Conference on Hydroinformatics, 17 August 2014, New York City, USA, 2014. a

Hirabayashi, Y., Döll, P., and Kanae, S.: Global-scale modeling of glacier mass balances for water resources assessments: Glacier mass changes between 1948 and 2006, J. Hydrol., 390, 245–256, https://doi.org/10.1016/j.jhydrol.2010.07.001, 2010. a

Hirabayashi, Y., Zang, Y., and Watanabe, S.: Projection of glacier mass changes under a high-emission climate scenario using the global glacier model HYOGA2, Hydrol. Res. Lett., 7, 6–11, https://doi.org/10.3178/HRL.7.6, 2013. a, b, c

Huang, C. S., Yang, T., and Yeh, H. D.: Review of analytical models to stream depletion induced by pumping: Guide to model selection, J. Hydrol., 561, 277–285, https://doi.org/10.1016/j.jhydrol.2018.04.015, 2018. a

Ines, A. V. M. and Hansen, J. W.: Bias correction of daily GCM rainfall for crop simulation studies, Agr. Forest Meteorol., 138, 44–53, https://doi.org/10.1016/j.agrformet.2006.03.009, 2006. a

Jones, P. D. and Harris, I. C.: CRU TS 3.10: Climatic Research Unit (CRU) Time-Series (TS) Version 3.10 of High Resolution Gridded Data of Month-by-month Variation in Climate (Jan. 1901–Dec. 2009), NCAS British Atmospheric Data Centre, available at: http://catalogue.ceda.ac.uk/uuid/ac3e6be017970639a9278e64d3fd5508 (last access: 22 November 2018), 2013. a, b

Kavetski, D., Kuczera, G., and Franks, S. W.: Bayesian analysis of input uncertainty in hydrological modeling: 2. Application, Water Resour. Res., 42, 1–10, https://doi.org/10.1029/2005WR004376, 2006. a

Knutti, R., Furrer, R., Tebaldi, C., Cermak, J., and Meehl, G. A.: Challenges in combining projections from multiple climate models, J. Climate, 23, 2739–2758, https://doi.org/10.1175/2009JCLI3361.1, 2010. a, b

Liu, J., Chen, J. M., and Cihlar, J.: Mapping evapotranspiration based on remote sensing: An application to Canada's landmass, Water Resour. Res., 39, 1189, https://doi.org/10.1029/2002WR001680,, 2003. a, b, c

Liu, S., Ding, Y., Shangguan, D., Zhang, Y., Li, J., Han, H., Wang, J., and Xie, C.: Glacier retreat as a result of climate warming and increased precipitation in the Tarim river basin, northwest China, Ann. Glaciol., 43, 91–96, https://doi.org/10.3189/172756406781812168, 2006. a

Liu, Z., Xu, Z., Huang, J., Charles, S. P., and Fu, G.: Impacts of climate change on hydrological processes in the headwater catchment of the Tarim River basin, China, Hydrol. Process., 24, 196–208, https://doi.org/10.1002/hyp.7493, 2010. a

Mamitimin, Y., Feike, T., Seifert, I., and Doluschitz, R.: Irrigation in the Tarim Basin, China: farmers' response to changes in water pricing practices, Environ. Earth Sci., 73, 559–569, https://doi.org/10.1007/s12665-014-3245-2, 2014. a, b, c, d

Mezentsev, V. S.: More on the calculation of average total evaporation, Meteorol. Gidrol, 5, 24–26, 1955. a

Nguyen-Quang, T., Polcher, J., Ducharne, A., Arsouze, T., Zhou, X., Schneider, A., and Fita, L.: ORCHIDEE_gmd-2018-57, https://doi.org/10.14768/06337394-73A9-407C-9997-0E380DAC5593, 2018. a, b

Ol'dekop, E. M.: On evaporation from the surface of river basins: Transactions on Meteorological Observations, Tech. rep., Univ. of Tartu, Tartu, Estonia, Lur-evskogo, 1911 (in Russian). a

Pike, J. G.: The estimation of annual run-off from meteorological data in a tropical climate, J. Hydrol., 2, 116–123, 1964. a

Pitman, A. J.: The evolution of, and revolution in, land surface schemes designed for climate models, Int. J. Climatol., 23, 479–510, https://doi.org/10.1002/joc.893, 2003. a

Pitman, A. J., Slater, A. G., Desborough, C. E., and Zhao, M.: Uncertainty in the simulation of runoff due to the parameterization of frozen soil moisture using the Global Soil Wetness Project methodology, J. Geophys. Res.-Atmos., 104, 16879–16888, https://doi.org/10.1029/1999JD900261, 1999. a

Ponce, V., Pandey, R., and Ercan, S.: Characterization of drought across climatic spectrum, J. Hydrol. Eng., 5, 222–224, https://doi.org/10.1061/(Asce)1084-0699(2000)5:2(222), 2000. a, b, c

Potter, N. J., Zhang, L., Milly, P. C. D., McMahon, T. A., and Jakeman, A. J.: Effects of rainfall seasonality and soil moisture capacity on mean annual water balance for Australian catchments, Water Resour. Res., 41, 1–11, https://doi.org/10.1029/2004WR003697, 2005. a, b, c

Pritchard, H. D.: Asia's glaciers are a regionally important buffer against drought, Nature, 545, 169–174, https://doi.org/10.1038/nature22062, 2017. a

Qin, Y., Kavetski, D., and Kuczera, G.: A Robust Gauss-Newton algorithm for the optimization of hydrological models: From Standard Gauss-Newton to Robust Gauss-Newton, Water Resour. Res., https://doi.org/10.1029/2017WR022488, accepted, 2018a. a

Qin, Y., Kavetski, D., and Kuczera, G.: A Robust Gauss-Newton algorithm for the optimization of hydrological models: Benchmarking against industry-standard algorithms, Water Resour. Res. https://doi.org/10.1029/2017WR022489, accepted, 2018b. a

Refsgaard, J. C.: Parameterisation, calibration and validation of distributed hydrological models, J. Hydrol., 198, 69–97, https://doi.org/10.1016/S0022-1694(96)03329-X, 1997. a

Ren, W., Yang, T., Shi, P., yu Xu, C., Zhang, K., Zhou, X., Shao, Q., and Ciais, P.: A probabilistic method for streamflow projection and associated uncertainty analysis in a data sparse alpine region, Global Planet. Change, 165, 100–113, https://doi.org/10.1016/j.gloplacha.2018.03.011, 2018. a

Renard, B., Kavetski, D., Kuczera, G., Thyer, M., and Franks, S. W.: Understanding predictive uncertainty in hydrologic modeling: The challenge of identifying input and structural errors, Water Resour. Res., 46, 1–22, https://doi.org/10.1029/2009WR008328, 2010. a, b

Scheff, J. and Frierson, D. M. W.: Terrestrial aridity and its response to greenhouse warming across CMIP5 climate models, J. Climate, 28, 5583–5600, https://doi.org/10.1175/JCLI-D-14-00480.1, 2015. a

Schreiber, P.: Über die Beziehungen zwischen dem Niederschlag und der Wasserführung der Flüsse in Mitteleuropa, Z. Meteorol, 21, 441–452, 1904. a

Shangguan, D., Liu, S., Ding, Y., Ding, L., Xu, J., and Jing, L.: Glacier changes during the last forty years in the Tarim Interior River basin, northwest China, Prog. Nat. Sci., 19, 727–732, https://doi.org/10.1016/j.pnsc.2008.11.002, 2009. a, b, c

Shen, Y., Feng, M., Zhang, H., and Gao, F.: Interpolation methods of China daily precipitation data, J. Appl. Meteorol. Sci., 21, 279–286, 2010. a, b, c, d

Shi, P., Yang, T., Zhang, K., Tang, Q., Yu, Z., and Zhou, X.: Large-scale climate patterns and precipitation in an arid endorheic region: linkage and underlying mechanism, Environ. Res. Lett., 11, 044006, https://doi.org/10.1088/1748-9326/11/4/044006, 2016. a

Shi, P., Yang, T., Xu, C. Y., Yong, B., Shao, Q., Li, Z., Wang, X., Zhou, X., and Li, S.: How do the multiple large-scale climate oscillations trigger extreme precipitation?, Global Planet. Change, 157, 48–58, https://doi.org/10.1016/j.gloplacha.2017.08.014, 2017. a

Siebert, S., Döll, P., Feick, S., Frenken, K., and Hoogeveen, J.: Global Map of Irrigation Areas version 5, March 2013, https://doi.org/10.13140/2.1.2660.6728, 2013. a

Tallaksen, L. M. and Stahl, K.: Spatial and temporal patterns of large-scale droughts in Europe: Model dispersion and performance, Geophys. Res. Lett., 41, 429–434, https://doi.org/10.1002/2013GL058573, 2014. a

Tao, H., Gemmer, M., Bai, Y., Su, B., and Mao, W.: Trends of streamflow in the Tarim River Basin during the past 50years: Human impact or climate change?, J. Hydrol., 400, 1–9, https://doi.org/10.1016/j.jhydrol.2011.01.016, 2011. a, b, c, d

Tebaldi, C. and Knutti, R.: The use of the multi-model ensemble in probabilistic climate projections., Philos. T. R. Soc. A, 365, 2053–2075, https://doi.org/10.1098/rsta.2007.2076, 2007. a

Trenberth, K. E., Fasullo, J., Smith, L., Qian, T., and Dai, A.: Estimates of the Global Water Budget and Its Annual Cycle Using Observational and Model Data, J. Hydrometeorol. – Spec. Sect., 8, 758–769, https://doi.org/10.1175/JHM600.1, 2006. a

Wang, T., Ottlé, C., Boone, A., Ciais, P., Brun, E., Morin, S., Krinner, G., Piao, S., and Peng, S.: Evaluation of an improved intermediate complexity snow scheme in the ORCHIDEE land surface model, J. Geophys. Res.-Atmos., 118, 6064–6079, https://doi.org/10.1002/jgrd.50395, 2013. a, b

Wang, X., Yang, T., Wortmann, M., Shi, P., Hattermann, F., Lobanova, A., and Aich, V.: Analysis of multi-dimensional hydrological alterations under climate change for four major river basins in different climate zones, Climatic Change, 141, 483–498, https://doi.org/10.1007/s10584-016-1843-6, 2017. a

Wang, X., Yang, T., Yong, B., Krysanova, V., Shi, P., Li, Z., and Zhou, X.: Impacts of climate change on flow regime and sequential threats to riverine ecosystem in the source region of the Yellow River, Environ. Earth Sci., 77, https://doi.org/10.1007/s12665-018-7628-7, 2018. a

Weedon, G., Gomes, S., Viterbo, P., Österle, H., Adam, J., Bellouin, N., Boucher, O., and Best, M.: The WATCH Forcing Data 1958–2001: A meteorological forcing dataset for land surface- and hydrological-models, WATCH Tech. Rep. 22, 41, available at: http://www.waterandclimatechange.eu/about/watch-forcing-data-20th-century (last access: 22 November 2018), 2010. a, b

Weedon, G., Gomes, S., Viterbo, P., Shuttleworth, W., Blyth, E., Österle, H., Adam, J., Bellouin, N., Boucher, O., and Best, M.: Creation of the WATCH Forcing Data and Its Use to Assess Global and Regional Reference Crop Evaporation over Land during the Twentieth Century, J. Hydrometeorol., 12, 823–848, https://doi.org/10.1175/2011JHM1369.1, 2011 a

Weedon, G. P., Balsamo, G., Bellouin, N., Gomes, S., Best, M. J., and Viterbo, P.: The WFDEI meteorological forcing data set: WATCH Forcing data methodology applied to ERA-Interim reanalysis data, Water Resour. Res., 50, 7505–7514, https://doi.org/10.1002/2014WR015638, 2014a. a, b, c, d, e

Weedon, G. P., Balsamo, G., Bellouin, N., Gomes, S., Best, M. J., and Viterbo, P.: The WFDEI meteorological forcing data set: WATCH Forcing Data methodology applied to ERA-Interim reanalysis data, available at: http://www.eu-watch.org/gfx_content/documents/README-WFDEI (v2016).pdf (last access: 22 November 2018), 2014b. a, b

Weiß, M. and Menzel, L.: A global comparison of four potential evapotranspiration equations and their relevance to stream flow modelling in semi-arid environments, Adv. Geosci., 18, 15–23, https://doi.org/10.5194/adgeo-18-15-2008, 2008. a

Westerberg, I. K., Guerrero, J.-L., Younger, P. M., Beven, K. J., Seibert, J., Halldin, S., Freer, J. E., and Xu, C.-Y.: Calibration of hydrological models using flow-duration curves, Hydrol. Earth Syst. Sci., 15, 2205–2227, https://doi.org/10.5194/hess-15-2205-2011, 2011. a

Woo, M. K. and Marsh, P.: Snow, frozen soils and permafrost hydrology in Canada, 1999–2002, Hydrol. Process., 19, 215–229, https://doi.org/10.1002/hyp.5772, 2005. a

Wu, Y.-P., Shen, Y.-P., and Larry Li, B.: Possible physical mechanism of water vapor transport over Tarim River Basin, Ecol. Complex., 9, 63–70, https://doi.org/10.1016/j.ecocom.2011.12.002, 2012. a, b, c, d

Xie, P., Chen, M., Yang, S., Yatagai, A., Hayasaka, T., Fukushima, Y., and Liu, C.: A Gauge-Based Analysis of Daily Precipitation over East Asia, J. Hydrometeorol., 8, 607–626, https://doi.org/10.1175/JHM583.1, 2007. a

Yang, D., Sun, F., Liu, Z., Cong, Z., Ni, G., and Lei, Z.: Analyzing spatial and temporal variability of annual water-energy balance in nonhumid regions of China using the Budyko hypothesis, Water Resour. Res., 43, 1–12, https://doi.org/10.1029/2006WR005224, 2007. a, b, c, d, e

Yang, H., Yang, D., Lei, Z., and Sun, F.: New analytical derivation of the mean annual water-energy balance equation, Water Resour. Res., 44, 1–9, https://doi.org/10.1029/2007WR006135, 2008. a, b, c, d, e

Yang, T., Wang, X., Yu, Z., Krysanova, V., Chen, X., Schwartz, F. W., and Sudicky, E. A.: Climate change and probabilistic scenario of streamflow extremes in an alpine region, J. Geophys. Res.-Atmos., 8535–8551, https://doi.org/10.1002/2014JD021824, 2014. a

Yang, T., Wang, C., Chen, Y., Chen, X., and Yu, Z.: Climate change and water storage variability over an arid endorheic region, J. Hydrol., 529, 330–339, https://doi.org/10.1016/j.jhydrol.2015.07.051, 2015a. a

Yang, T., Zhou, X., Yu, Z., Krysanova, V., and Wang, B.: Drought projection based on a hybrid drought index using Artificial Neural Networks, Hydrol. Process., 29, 2635–2648, https://doi.org/10.1002/hyp.10394, 2015b. a, b, c

Yang, T., Cui, T., Xu, C. Y., Ciais, P., and Shi, P.: Development of a new IHA method for impact assessment of climate change on flow regime, Global Planet. Change, 156, 68–79, https://doi.org/10.1016/j.gloplacha.2017.07.006, 2017. a

Yang, Z.: Glacier Water Resources in China (in Chinese), Lanzhou, Gansu Science and Technology Press, 1991. a

Yu, Z.: Assessing the response of subgrid hydrologic processes to atmospheric forcing with a hydrologic model system, Global Planet. Change, 25, 1–17, https://doi.org/10.1016/S0921-8181(00)00018-7, 2000. a

Yu, Z., Lakhtakia, M. N., Yarnal, B., White, R. A., Miller, D. A., Frakes, B., Barron, E. J., Duffy, C., and Schwartz, F. W.: Simulating the river-basin response to atmospheric forcing by linking a mesoscale meteorological model and hydrologic model system, J. Hydrol., 218, 72–91, https://doi.org/10.1016/S0022-1694(99)00022-0, 1999.  a

Yu, Z., Pollard, D., and Cheng, L.: On continental-scale hydrologic simulations with a coupled hydrologic model, J. Hydrol., 331, 110–124, https://doi.org/10.1016/j.jhydrol.2006.05.021, 2006. a

Yu, Z., Lu, Q., Zhu, J., Yang, C., Ju, Q., Yang, T., Chen, X., and Sudicky, E. A.: Spatial and Temporal Scale Effect in Simulating Hydrologic Processes in a Watershed, J. Hydrol. Eng., 19, 99–107, https://doi.org/10.1061/(ASCE)HE.1943-5584.0000762, 2014. a

Zhang, L., Dawes, W. R., and Walker, G. R.: Response of mean annual evapotranspiration to vegetation changes at catchment scale, Water Resour. Res., 37, 701–708, https://doi.org/10.1029/2000WR900325, 2001. a

Zhang, L., Hickel, K., Dawes, W., Chiew, F., Western, A., and Briggs, P.: A rational function approach for estimating mean annual evapotranspiration, Water Resour. Res., 40, W025021–W02502114, https://doi.org/10.1029/2003WR002710, 2004. a, b

Zhang, T., Barry, R., Knowles, K., Ling, F., and Armstrong R.: Distribution of seasonally and perennially frozen ground in the Northern Hemisphere, Proceedings of the 8th International Conference on Permafrost, Zürich, Switzerland, 21–25 July, 2003. a

Zhao, Y., Zhu, J., Xu, Y., and Liu, N.: Monthly gridded surface precipitation at 0.5 in China (V2.0), National Meteorological Information Center, China, available at: http://data.cma.cn/data/detail/dataCode/SURF_CLI_CHN_PRE_MON_GRID_0.5/, last access: 22 November 2018. a, b

Zhou, H., Song, Y., Hu, S., Hongwei, Z., Yudong, S., and Shunjun, H.: Irrigated agriculture and sustainable water management strategies in the Tarim Basin, in: Proc. Work. “New Approaches to water” Manag. Cent. Asia, Aleppo, Syria, 6–11 November, 2000. a, b, c, d, e, f, g

Zomer, R. J., Trabucco, A., Straaten, O. V., and Bossio, D. A.: Trees and Water: Smallholder Agroforestry on Irrigated Lands in Northern India, Colombo, Sri Lanka, International Water Management Institute, 2007. a, b

Zomer, R. J., Trabucco, A., Bossio, D. A., and Verchot, L. V.: Climate change mitigation: A spatial analysis of global land suitability for clean development mechanism afforestation and reforestation, Agr. Ecosyst. Environ., 126, 67–80, https://doi.org/10.1016/j.agee.2008.01.014, 2008. a, b