Dynamical downscaling of future projections of global climate model outputs can provide useful information about plausible and possible changes to water resource availability, for which there is increasing demand in regional water resource planning processes. By explicitly modelling climate processes within and across global climate model grid cells for a region, dynamical downscaling can provide higher-resolution hydroclimate projections and independent (from historical time series), physically plausible future rainfall time series for hydrological modelling applications. However, since rainfall is not typically constrained to observations by these methods, there is often a need for bias correction before use in hydrological modelling. Many bias-correction methods (such as scaling, empirical and distributional mapping) have been proposed in the literature, but methods that treat daily amounts only (and not sequencing) can result in residual biases in certain rainfall characteristics, which flow through to biases and problems with subsequently modelled runoff. We apply quantile–quantile mapping to rainfall dynamically downscaled by the NSW and ACT Regional Climate Modelling (NARCliM) Project in the state of Victoria, Australia, and examine the effect of this on (i) biases both before and after bias correction in different rainfall metrics, (ii) change signals in metrics in comparison to the bias and (iii) the effect of bias correction on wet–wet and dry–dry transition probabilities. After bias correction, persistence of wet states is under-correlated (i.e. more random than observations), and this results in a significant bias (underestimation) of runoff using hydrological models calibrated on historical data. A novel representation of quantile–quantile mapping is developed based on lag-one transition probabilities of dry and wet states, and we use this to explain residual biases in transition probabilities. Representing quantile–quantile mapping in this way demonstrates that any quantile mapping bias-correction method is unable to correct the underestimation of autocorrelation of rainfall sequencing, which suggests that new methods are needed to properly bias-correct dynamical downscaling rainfall outputs.
There is a growing and ongoing need for information about plausible and possible changes to water resource availability in the future due to climate change. End users of hydroclimate projections are interested in more spatially detailed information, information on water metrics for low- and high-flow events, and better-predicted interdecadal metrics (Potter et al., 2018). Dynamical downscaling (such as that provided by the NARCliM Project for south-eastern Australia; see Sect. 2.1) has potential to provide this type of information; however, there remain challenges associated with the use of these data. In this paper, we examine the suitability of NARCliM projections for providing hydroclimate projections for south-eastern Australia. Specifically, we look at the extent of biases in rainfall, which necessitate daily bias correction, and the effect of quantile–quantile mapping (QQM) bias correction on rainfall sequencing metrics that are important for runoff generation. Subsequent research in a related paper (Charles et al., 2020) focuses attention on the effect of these biases on runoff.
Of particular interest are possible changes to rainfall characteristics that could affect runoff and streamflow. Information on future changes to rainfall are typically derived from ensembles of global climate models (GCMs); however, the spatial resolution of these models is too coarse to provide information at the scale needed for hydrological impact modelling (i.e. catchments or gauges). Downscaling is the process by which finer-scale spatial detail is produced from the GCM change information, which is at too coarse a resolution to be usable (Maraun et al., 2010). Many water resource studies use “empirical scaling”, where historical rainfall observations are scaled (perhaps annually or seasonally) for direct use, or “statistical downscaling”, in which a direct statistical relationship is developed between rainfall and other atmospheric predictors. These methods are relatively simple to use, and results from empirical scaling typically lie in the middle of the range of results from other downscaling methods (Chiew et al., 2010; Potter et al., 2018). However, as empirical scaling methods rely on the historical record of rainfall, future changes in rainfall sequencing (e.g. changes to multi-day accumulations and wet/dry transitions) and consequent effects on runoff cannot be properly modelled. Dynamical downscaling, in which a regional climate model (RCM) of finer spatial resolution than the host GCM is used, generates rainfall sequences independent from historical observations. However, challenges remain with using dynamical downscaling output since rainfall (and other climate variables) is not explicitly constrained by observations (see, e.g., Piani et al., 2010; Chen et al., 2011; Teutschbein and Seibert, 2013). As such, dynamical downscaling outputs typically need to be bias-corrected for direct use in hydrological models. In particular, a common feature of dynamical downscaling is the tendency to underpredict the occurrence of zero- and low-rainfall days, which is sometimes known as the drizzle effect (e.g. Maraun, 2013). RCM output has been bias-corrected for applications in Australia in Tasmania (Bennett et al., 2014) and at the central coast of New South Wales (Lockart et al., 2016), but both these studies found residual biases in multi-day rainfall events, dry-spell durations and autocorrelation of rainfall occurrences (see also Themeßl et al., 2012).
Water resources in Victoria are shared by urban users, irrigators, industry and the environment. Long-term water strategies and shorter-term sustainable water strategies are required for Victoria's water regions by the Victorian Water Act. A key aspect of these water planning processes is accounting for scenarios of climate change as determined by the available science. Cool-season rainfall in Victoria since 2000 has averaged 15 % less than the long-term average during the 20th century (Hope et al., 2017). This reduction has been linked to the observed expansion of Hadley cell circulation (Post et al., 2014). The median scenario of climate change for Victoria typically has reduced rainfall and runoff later in the century, with slightly larger percentage declines in the western parts of the state (Post et al., 2012; Potter et al., 2016). Providing better information to improve water planning processes includes developing finer-spatial-resolution projections and different metrics of daily rainfall amounts and occurrences.
Most bias-correction methods alter daily amounts with the application of distributional mappings. The temporal structure of the occurrences is most often unaltered. Further, bias correction can and does affect the magnitude of change signals (Hagemann et al., 2011; Gutjahr and Heinemann, 2013; Dosio, 2016). The underlying assumption of bias correction is that the RCM output faithfully represents the climate processes responsible for rainfall, although the amounts themselves may not be accurate. Water resource projection modelling is concerned with future changes, and so an argument could be made that, although the rainfall amounts are biased for hindcast (historical) simulations, they will presumably be equally biased for future simulations. In this way, changes can be inferred from comparing biased historical and future rainfall and runoff. However, the sensitivity of runoff to rainfall means that biased rainfall can have large effects on the change signal of runoff (Teng et al., 2015). Furthermore, hydrological models are calibrated to historical rainfall and runoff sequences, and since the distribution of runoff is usually highly skewed, using biased rainfall sequences can distort the distribution of runoff, thus creating large biases in high and low runoff amounts. These biases in runoff make inferences on the changes to runoff characteristics highly uncertain when biased rainfall inputs are used.
Bias correction identifies a relationship or mapping between observed
historical rainfall and modelled historical rainfall (in this case hindcast
RCM rainfall). This mapping when applied to hindcast RCM rainfall results in
a distribution of rainfall identical (or very similar, depending on the
methods) to the historical observations. This mapping can then be applied to
future RCM rainfall, resulting in unbiased future rainfall sequences. Of
course, applying the relationship into the future assumes the bias in RCM
rainfall does not change into the future or for different (wetter or drier)
climate periods. Bias-correction methods (see Schmidli et al., 2006; Boé
et al., 2007; Lenderink et al., 2007; Christensen et al., 2008; Piani et
al., 2010; Themeßl et al., 2011; Teng et al., 2015) fall into three main
categories:
scaling or change-factor methods, non-parametric (empirical) quantile–quantile mapping (QQM), parametric (distributional) QQM.
Scaling methods simply consider the change in mean and apply a constant
factor to correct bias in RCM rainfall. Quantile–quantile mapping matches
each quantile (or a selection of quantiles) of the two distributions. This
can be done using the empirical cumulative density or fitting a distribution
to both observed and hindcast RCM daily rainfall amounts.
Teng et al. (2015) demonstrated that representing daily rainfall distributions with double-gamma distributions (see also, e.g., Yang et al., 2010) was largely identical to empirical QQM, implying that distributional and empirical approaches give similar results so long as the distribution is sufficiently flexible. Arguably, the choice between non-parametric (empirical) or parametric (distributional) mapping is a representation of the bias–variance trade-off problem. Empirical mapping will reduce bias to zero, but at the cost of increasing the variance of predictions, since the mapping will be very sensitive to individual amounts. Distributional mapping fits the data across the entire rainfall distribution but can result in the hindcast RCM rainfall not being mapped exactly to the historical distribution. For this study we apply empirical quantile–quantile mapping for each season across integral percentiles as described below. Overall there is a small and relatively unimportant difference between different methods for QQM, at least in the Australian context studied by Teng et al. (2015).
Whereas quantile–quantile mapping can effectively reduce historical error in daily rainfall amounts to zero, albeit with some of the caveats already mentioned, the bias-corrected rainfall time series could still harbour biases and unrealistic characteristics that will result in runoff biases after being routed through a rainfall–runoff model. Specifically, QQM bias correction cannot remove biases in rainfall sequencing and multi-day accumulations that might not be readily apparent when considering only the daily distribution of rainfall amounts (e.g. Addor and Seibert, 2014; Li et al., 2016; Ines and Hansen, 2006). However, Terink et al. (2010) and Rajczak et al. (2016) contend that bias correction produces reasonable transition probabilities and spell durations.
Unfortunately, it is not easy to tell exactly which characteristics of rainfall drive runoff generation, and in general the sensitivity will depend on catchment physical characteristics, storm type and intensity, as well as antecedent moisture and groundwater stores (Goodrich and Woolhiser, 1991; Bell and Moore, 2000; Beven, 2001). Spectral and multifractal approaches (e.g. Milly and Wetherald, 2002; Matsoukas et al., 2000; Tessier et al., 1996) show that rainfall variability at shorter timescales is by and large incorporated into soil moisture buffers, thus dampening runoff variability at these timescales. However, over timescales of several days and greater, variability in runoff matches variability in rainfall more and more closely. As such, it is evident that large, intense rainfall events (measured perhaps by the upper tail of the rainfall distribution), more seasonal rainfall regimes (Wolock and McCabe, 1999), relatively larger variability of rainfall (Potter and Chiew, 2011) and large multi-day accumulations of rainfall are most important for runoff generation (Addor and Seibert, 2014), particularly for high-flow events (Jaun et al., 2008), and we focus on these kinds of rainfall metrics in this study.
The main aim of the paper is to investigate the effect of bias correction on rainfall characteristics relevant to runoff generation. Given that QQM bias-correction approaches are specifically tailored for correcting daily amounts while retaining the existing sequencing of occurrences, we seek to understand the effect of bias correction on rainfall sequencing and transition probabilities. Specifically, we investigate whether key rainfall metrics contain biases in dynamically downscaled GCM hindcasts relative to observations. We then examine if bias correction acts to either enhance or moderate any such biases, and whether bias correction affects change signals (i.e. downscaled GCM future relative to downscaled GCM historical). Section 2 describes the data and methods used in this study, Sect. 3 presents results, and Sects. 4 and 5 contain a discussion and conclusions.
The NSW and ACT Regional Climate Modelling Project (NARCliM;
Overall, there is reasonable confidence in NARCliM projections generally, for both rainfall and temperatures (Evans et al., 2012; Olson et al., 2016; Ji et al., 2016), particularly at daily scale for rainfall (Gilmore et al., 2016), although NARCliM has a quantitative cold and wet bias generally (Ji et al., 2016). The underlying performance of the RCM component of WRF has been tested extensively (Evans et al., 2012; Andrys et al., 2016; Ekström, 2016; Olson et al., 2016; Ji et al., 2016; Gilmore et al., 2016). The three WRF physics configurations are labelled R1, R2 and R3 (see Evans et al., 2014, their Table 1); detailed information is provided elsewhere (Evans et al., 2012; Ekström, 2016; Gilmore et al., 2016; Ji et al., 2016). The full NARCliM model ensemble thus consists of 12 members (four selected GCMs, each downscaled by the three RCM configurations).
Ji et al. (2016) evaluated model output against AWAP (see below) rainfall, demonstrating good representation of precipitation processes at a wide range of timescales, but conclude that bias correction is still needed before applying the data. For hydrological applications, the R2 configuration is recommended (Olson et al., 2016). Although we consider the entire modelling ensemble in this paper, in a related paper (Charles et al., 2020) we use only that specific RCM physics scheme for modelling runoff. We discuss the physical credibility of the NARCliM ensemble in greater detail in Sect. 4 below.
Daily accumulated precipitation from the NARCliM 12-member WRF-downscaled
ensemble (referred to from here onwards as “NARCliM rainfall”) is produced
at approximately 10 km
The historical (baseline) period for NARCliM projections is 1990–2009, and
we relate NARCliM historical rainfall features to observations and NCEP/NCAR
Reanalysis over the same period in this paper. Realisations of future
rainfall follow the A2 scenario of the Special Report on Emissions Scenarios
(SRES) (Nakićenović et al., 2000) at 2060–2079. Change signals
presented later are thus averages over 2060–2079 compared to 1990–2009.
Observed rainfall data are obtained from the Australian Water Availability
Project (AWAP; Jones et al., 2009). This is a
Quantile–quantile mapping bias correction works by estimating the cumulative
density function for observed and modelled historical observed (here AWAP)
daily rainfall amounts:
Schematic of QQM bias correction.
Whereas QQM bias correction can correct the daily distribution exactly,
daily bias correction is not set up to correct sequences and accumulations
(Addor and Seibert, 2014). To this end, we consider not only how bias
correction affects daily metrics of rainfall but also the sequencing of wet
days that produce runoff. One way of measuring this is through transition
probabilities of wet and dry sequences. To this end, we consider a simple
two-state Markov chain rainfall occurrence model. Here, the probability of a
wet or dry day depends on whether the previous day was wet or dry. A “dry”
day can be defined as either zero rainfall or rainfall below a given
threshold (such as 1 mm). Define the wet-to-wet transition probability (i.e.
the probability of a wet day following a wet day) as
Modelled rainfall is compared to observed (AWAP) rainfall in Fig. 2. ECHAM5-R1 was chosen as a representative GCM–RCM ensemble member here as it had the median historical regional rainfall across Victoria from the NARCliM model ensemble. The spatial rainfall fields downscaled from both the reanalysis and GCM both show reasonable agreement with the spatial pattern of the observed mean annual rainfall, with relatively larger rainfall across the mountain ranges to the east of the state and across the southern coast, and less rainfall to the more arid north-west region. However, both rainfall fields are evidently positively biased with around 100–200 mm of excess rainfall consistently across most of the state (Fig. 2f, g), and relatively more for the far eastern part of Victoria for the reanalysis data and across the mountain ranges for the GCM data, interspersed with small patches of negative bias (observations larger than downscaled data). Averaged across Victoria, the mean absolute bias is approximately 26 % for the downscaled reanalysis and 43 % for the downscaled GCM data. In comparison, the absolute change signal averages 3 %, rising to around 10 % in the eastern part of Victoria (Fig. 2h).
Regional mean annual rainfall (mm yr
Consistent with Fig. 2, the raw NARCliM rainfall is mostly wetter (positive bias) across Victoria (Fig. 3a), except for a tendency towards underprediction in the south-east coast for some models. The quantile–quantile bias-correction method is formulated to correct historical quantiles of rainfall exactly (when applied to the same time period used for calibration) so that bias-corrected mean annual rainfall, as well as any quantiles, is approximately equal (Fig. 3b). However, the method used here does not correct high rainfall quantiles (e.g. P99 and above) exactly, due to the interpolation between quantiles as described in Sect. 2.3 (Fig. 4b). This residual bias appears to be randomly distributed spatially and results in bias-corrected mean annual rainfall not being exactly corrected. However, this effect is generally less than 5 % of the mean annual rainfall. This effect can be removed entirely by using empirical density functions rather than interpolated values, but with the effect of increasing prediction uncertainty.
Percentage bias in WRF-downscaled mean annual rainfall from the
GCM/RCMs indicated
Percentage bias in WRF-downscaled 99th percentile of rainfall
(P99) from the GCM/RCMs indicated
Relative bias (modelled compared to AWAP, as percentages) of
Figure 5 shows the distribution of bias for different rainfall metrics before bias correction and the residual bias after QQM bias correction. The bias in raw NARCliM ranges from 5 to 50 % for all percentiles (Fig. 5a), increasing as the percentile increases (i.e. more relative bias for higher rainfall amounts). As with Fig. 3, bias at all percentiles is effectively reduced to zero after bias correction, although the residual bias is relatively larger at larger percentiles (higher rainfall), similarly to Fig. 4b. NARCliM rainfall is overestimated at all seasons and months before bias correction (Fig. 5b), although winter rainfall is relatively less biased than summer rainfall. Bias correction reduces bias to zero annually and seasonally, since QQM is applied to each season separately. Since the intra-seasonal relative monthly rainfall amounts are not exactly equal to the observed amounts, seasonal bias correction occasionally overcorrects bias, particularly in February, April, May and June, with the bias-corrected rainfall in these months being less than observed whereas they were overpredicted before bias correction; this could be reduced to zero using monthly correction factors, although at the potential cost of overfitting. Overall though, the absolute relative bias is reduced and closer to zero in all months compared to the raw RCM monthly amounts.
Biases in spatial average 3 d rainfall accumulation percentiles.
Here a 3 d moving-average filter was applied to each hindcast time series
and equivalent quantiles taken at increasing probability values (
Figure 5c shows relative bias in rainfall-sequencing-related metrics. Autocorrelation of rainfall amounts is underpredicted before bias correction, and the magnitude of this bias actually increases after bias correction. Whereas QQM reduces bias in dry–dry transition probabilities (i.e. the probability of dry sequences persisting), bias in wet–wet transition probabilities increases after bias correction so that, similarly to autocorrelation, the probability of wet spells persisting is considerably underpredicted after bias correction. We examine this in more detail in Sect. 3.3 using the transition probability framework developed in Sect. 2.4. Since dry spells are directly related to dry–dry transition probabilities, the bias in mean and maximum dry spells is well corrected, whereas maximum 3 d rainfall accumulation and wet-spell occurrences all have negative bias after QQM bias correction. Different percentiles of 3 d rainfall accumulation (calculated as percentiles of a 3 d moving sum of rainfall time series) have different residual bias (Fig. 6). Three-day accumulation percentiles below the 80th are all slightly overestimated after bias correction, but above the 80th percentile a large residual underestimation is present. This underestimation reduces at around the 99th percentile but is moderated somewhat for the 3 d maximum (i.e. 100th percentile). As noted in Sect. 2.1, WRF models were selected according to their skill in reproducing selected 2-week periods of heavy rainfall. This provides a potential explanation for the smaller bias in 3 d maxima relative to 3 d 99 % rainfall.
Ensemble (downscaled GCM hindcasts from the NARCliM 12-model
ensemble) median modelled runoff over Victoria:
It is likely that underpredicting wet-spell occurrences and persistence
(Figs. 5c and 6) will result in runoff from the bias-corrected rainfall
being underpredicted too. To explore this, runoff was modelled from bias-corrected rainfall and observed potential evapotranspiration data using GR4J
(see Charles et al., 2020, for more details). Figure 7 plots the percentage
difference in bias-corrected ensemble-median mean annual runoff for each
The results in this study demonstrate that dry–dry transition probabilities for NARCliM have low residual bias (possibly due to the emphasis in QQM on preserving zero-rain occurrences) but that wet–wet transition probabilities have more bias (i.e. they are closer to zero; see Fig. 5c) after QQM bias correction. These residual biases in wet–wet transitions result in the persistence of wet spells being underestimated even though the volumetric amount of rainfall is, by design of QQM bias correction, equal to observed rainfall at any grid point.
Dry–dry transition probabilities (1 mm threshold).
After bias correction, dry–dry transition probabilities for a 1 mm threshold are reduced but still have a small negative bias (Fig. 5c). Both the bias-corrected WRF-downscaled reanalysis and bias-corrected WRF-downscaled GCM results (Fig. 8) show a spatial pattern very similar to observations with higher dry–dry transition probabilities to the north-west of the state and at similar places along the southern coastline. However, the reanalysis and more so the GCM result have a lower dry–dry transition probability across almost all of the region (Fig. 8). As such, dry spells from the bias-corrected model output are likely to be shorter in duration and less common than those from the observed rainfall (although bias correction does reduce the bias in dry spells somewhat compared to the bias in the raw data as seen in Fig. 5c).
Wet–wet transition probabilities (1 mm threshold).
Whereas the dry–dry transition probabilities were largest in the north-west, drier, part of Victoria, the wet–wet transition probabilities are largest over the high-runoff-producing region (Fig. 9), which corresponds to the high-relief, high-altitude part of the state. As with the dry–dry transition probabilities, both bias-corrected downscaled reanalysis and bias-corrected downscaled GCMs reproduce the spatial pattern of wet–wet transition probabilities, but there is considerable residual bias in these probabilities across the entire region. The residual bias in WRF-downscaled GCM transition probabilities is over 10 % over most of Victoria. This residual bias results in underestimation of wet-spell occurrences and durations and multi-day accumulations of rainfall (Fig. 5c). The bias in wet–wet transition probabilities is more problematic for modelling runoff than the bias in dry–dry transition probabilities, not only because it is of larger magnitude but also because runoff is sensitive to multi-day wet spells, and the larger wet–wet probabilities occur in high-runoff-producing areas, which we would like to model correctly for regional water availability projections.
Figure 10 shows rainfall percentiles for a sample grid cell overlaid on the
transition-probability space developed in Sect. 2.4. Other grid cells and
GCMs show very similar responses (as can be seen in the low spread of
results for
An alternative perspective on quantile–quantile mapping: daily
rainfall amounts and associated probabilities plotted in
Here we examine change signals in rainfall metrics (i.e. percentage difference in RCM future relative to RCM historical averages), specifically looking at whether bias correction alters the change signals. Figure 11a shows the change signals in different rainfall percentiles. For the raw data, there is a small decrease in low to moderate rainfall amounts less than the non-zero 40th percentile and a future increase in non-zero percentiles above 50 %. The nature of QQM bias correction means that raw and bias-corrected equal percentiles cannot be meaningfully compared. Nevertheless, a similar pattern is found with the bias-corrected data, namely that larger rainfall amounts have larger relative changes than smaller rainfall amounts.
Change signal (percentage difference of RCM future relative to
RCM historical) in
Figure 11b shows change signals in mean annual, seasonal and monthly average
rainfall. The magnitude of the median change signal in mean annual rainfall
is around
Although the residual bias in rainfall sequencing metrics is not eliminated, and in some cases (e.g. wet–wet transition probabilities) is actually increased, after bias correction (Fig. 5c), Fig. 11c shows that the change signals in rainfall sequencing metrics is largely unaffected by bias correction. With the exception of the maximum accumulation of rainfall over 3 d rainfall accumulation, the distributions of sequencing change signals are largely identical before and after bias correction. The difference in change signal for maximum 3 d rainfall accumulation is presumably related to the relatively larger increase in large rainfall events (e.g. P99 in Fig. 11a).
Individual percentiles and seasonal totals are, by design, effectively reduced to zero using QQM. This elimination of bias is due to the common calibration and validation period (1990–2009) used for bias correction. For applications, different calibration and validation periods should be used to accurately estimate the bias from QQM. Nevertheless, some interpolation and extrapolation occur in the approach used here, so there is some random residual bias in higher percentiles (i.e. large rainfall amounts). This residual bias can be eliminated altogether by using the exact empirical density functions, but at the cost of increased predictive uncertainty. Using empirical densities also raises problems with extrapolation past historical amounts. Although Teng et al. (2015) demonstrated that distributional and empirical approaches give similar results if the distribution is sufficiently flexible, the choice of method should be considered carefully in relation to the end products required.
We demonstrate in Sect. 3.4 that change signals (future mean relative to historical) in rainfall metrics can be considerably smaller than the bias (modelled historical relative to observed historical). On one hand, this seems problematic since biases in processes can be considered so large that the changes are insignificant. On the other hand, there is no particular legitimacy for this viewpoint, certainly not in a statistical sense. The magnitude of bias does not provide any sort of confidence level in changes to rainfall metrics. However, given such relatively large biases, it is reasonable to assume that there are some errors in the way particular climate processes are modelled, through either the host GCM or the RCM. It would be desirable to understand the reasons and climatic process responsible for biases and assess whether these processes are unrealistic, in addition to whether these biases render the changes physically implausible.
We acknowledge that the underlying performance of GCMs in accurately
simulating climate dynamics of the region under consideration is extremely
important. In the context of the study area in the current paper,
south-eastern Australia, the selection of GCMs representing important
climate processes has been an ongoing research strand both for the CMIP3
climate model ensemble (Smith and Chandler, 2010; Kirono and Kent, 2011;
Kent et al., 2013; CSIRO, 2012; Evans and Ji, 2012a; McMahon et al., 2015),
which is used for the current NARCliM dataset, and the CMIP5 ensemble
(see CSIRO and Bureau of Meteorology, 2015; Hope et al., 2016). As mentioned
in the Introduction, the RCM component of NARCliM, WRF, has been tested
extensively, including the general cold and wet bias over south-eastern
Australia. It is suggested that the wet bias (Fig. 3a) is related to subgrid
cloud cover representations (see, e.g., García-Díez et al., 2015;
Di Virgilio et al., 2019), and correction of this is the subject of current,
ongoing research (Di Virgilio et al., 2019). Current work (NARCliM1.5) is
continuing to develop the modelling framework at a higher spatial resolution
(5 km
The use of models that produce plausible climate dynamics is of course desirable; however, in practice it is not necessarily always possible. Apart from the fact that the “best” models identified by the above references differ according to the criteria used, using a dynamical downscaling ensemble for hydrological applications is an opportunistic endeavour, relying largely on existing data products, which have been prepared with many applications in mind, not just hydrological applications. As such, it is not always practical to choose the GCMs and RCMs that best represent climate dynamics important for hydrological applications; many studies have also contended that accuracy in representing historical conditions is no guarantee that future changes are correctly modelled (e.g. Knutti et al., 2010; Racherla et al., 2012).
As mentioned in the Introduction. Evans and McCabe (2010) examined the RCM component (WRF) of NARCliM, concluding that the El Niño–Southern Oscillation, the chief climate process modulating interannual variability of rainfall (Power et al., 1999), was well modelled over south-eastern Australia. Evans and McCabe (2010) also concluded that the severity and duration of recent prolonged droughts over south-eastern Australia were also captured, although the spatial pattern was not characterised exactly. The sub-tropical ridge, which determines the seasonal positioning of storm tracks over southern Australia, was less well represented by WRF (Andrys et al., 2016). Based on the results cited here, we have confidence in the modelling setup of NARCliM to represent atmospheric circulation for southern Australia reasonably well, although we acknowledge that bias correction of NARCliM for end-user applications should consider model skill in atmospheric circulation.
In general, bias correction does not tend to alter the change signals in rainfall metrics (with the exception of 3 d accumulation and low rainfall percentiles). Nevertheless, small differences in rainfall metrics can result in large differences in runoff metrics and other water availability measures (e.g. low flows and high flows). Large and even average runoff amounts can be very sensitive to 3 d rainfall accumulation, which we saw can be altered through daily bias correction. It is recommended that the effects of bias correction are included in any uncertainty analysis undertaken.
Section 3.4 shows that bias correction can affect change signals (future relative to historical) of different hydroclimatic metrics (see also Hagemann et al., 2011; Gutjahr and Heinemann, 2013; Dosio, 2016). Under the assumption that bias is time invariant, Gobiet et al. (2015) argue that bias correction improves the accuracy of climate change signals. Cannon et al. (2015), however, argue that trend-preserving methods should be used (see also Li et al., 2010; Wang and Chen, 2014). Maraun (2016) and Maraun et al. (2017) summarise the debate surrounding the use of trend-preservation methods and conclude that the decision should be informed by the credibility or otherwise of the GCMs in representing the processes driving the changes. This debate further highlights the need for informed selection and screening of GCMs at the start of the modelling process. However, we argue that there is value in reporting both pre- and post-bias-correction future changes in light of the difficulties involved in model selection and assessment, particularly in the case of pre-existing and computationally expensive projections such as a dynamically downscaled ensemble such as NARCliM.
The simple QQM method used here does not consider spatial correlation
between rainfall gauges or grid cells at all. Maintaining spatial
correlations is clearly important for runoff generation, and neglecting this
can lead to “inflation”. Inflation refers to a phenomenon in bias correction
(Maraun, 2013) or statistical downscaling (von Storch, 1999) where an
unmeasured predictand variable is estimated using the predicted values from
a statistical model. Since models contain error, the variance of a
time series of predicted values is expected to be less than the variance of
the true time series of the variable. In the present context of bias-correcting rainfall from RCMs, Maraun (2013) demonstrates that bias
correction reduces subgrid spatial heterogeneity compared to actual
precipitation and that this is particularly problematic when GCM or RCM
resolution is much greater than that of observations. In this case, the
spatial correlation between gauges is increased. As a result, large rainfall
amounts become overestimated and small amounts underestimated. Preserving the
correct spatial correlation between gauges and grid points is an important
issue, and the issue of unintended spatial effects of (temporal) bias
correction is compounded by applying bias correction independently to each
grid cell, as we have done here – although this tends to reduce subgrid
spatial correlation (see Hnilica et al., 2017). Maraun (2013) recommends
aggregating catchment rainfall prior to bias correction to reduce the issue
of inflation, and Charles et al. (2020) examine this in more detail in
relation to catchment runoff production. Variance inflation due to differing
grid cell sizes (Maraun, 2013) is less an issue for the current study, as
NARCliM grid cell size is comparable to that of the gridded rainfall
observations, and the next generation of dynamically downscaled climate
projections (Downes et al., 2019) is to be provided at
Although in this study rainfall alone is bias-corrected, temperature or a suitable representation of potential evapotranspiration is needed for runoff applications. Methods exist for correcting rainfall and temperature simultaneously (e.g. Hoffmann and Rath, 2012; Piani and Haerter, 2012; Mehrotra et al., 2018). However, potential evapotranspiration has a second-order effect on runoff compared to rainfall (Chiew, 2006; Potter et al., 2011), and bias correction was shown to not significantly affect the inherited relationships between rainfall and temperature (Wilcke et al., 2013). Certainly, the host GCM and RCM should correctly represent relationships between atmospheric variables in the study region, further highlighting the need for climate model assessment in construction of the model ensemble.
Another important consideration is the relevant metrics to be considered by end users. Bias correction by season, for example, can alter change signals annually (Haerter et al., 2011), and care must be taken as to which metrics are of interest and which are the most appropriate bias-correction methods to apply in order to properly account for the metrics of interest. Certainly, caution must be applied when considering rainfall and runoff metrics that were not considered when applying bias correction to projections. Low-flow metrics are particularly problematic (Potter et al., 2018), where different downscaling and bias-correction methods can give very different answers.
Although daily bias-correction methods as outlined in this paper tend to result in residual bias in multi-day metrics, generally change signals in transition probabilities are very similar before and after bias correction. This information could thus potentially be extracted from RCMs to drive local weather generation or stochastic methods to provide future rainfall projections that can be suitable for local hydrological projections. Maintaining interannual and multi-decadal correlations and spatial correlations between rainfall gauges remains a challenge for stochastic methods, however.
The relative magnitude of change signals (future RCM to historical RCM) of the different rainfall metrics examined here is typically less than the magnitude of the bias. Mean annual rainfall change is an order of magnitude smaller than the bias in mean annual rainfall, but seasonal changes are closer to half of the bias in seasonal averages. Although this might call into question the validity of the change signal, one approach is to assume that the magnitudes of the changes are responsive to changing greenhouse gas emissions, insofar as the changing atmospheric processes are realistically modelled by the RCM. Indeed, this is the basic premise behind empirical scaling, i.e. that the change is the authentic signature of the climate modelling especially since the RCMs are not explicitly tuned to observed rainfall.
Individual percentiles and seasonal totals are, by design, effectively reduced to zero using QQM. Monthly totals retain some residual bias because of compensating biases within each season due to small errors in rainfall seasonality by the RCMs. Metrics associated with rainfall sequencing (e.g. serial correlation, wet–wet and dry–dry state transition probabilities and quantiles of 3 d accumulation) all have significant residual bias, particularly so for wet–wet state transition probabilities in which the magnitude of bias in raw RCM historical runs is amplified after bias correction. This residual bias leads to a considerable underestimation of mean annual runoff after rainfall is routed through a hydrological model because runoff is very sensitive to multi-day accumulations of rainfall and sequencing of wet spells in particular.
An analysis of the lag-one transition probabilities (i.e. wet state to wet state and dry state to dry state) showed that NARCliM rainfall had transitions to different states that are more random (i.e. more independent) compared to observed rainfall. QQM bias correction is unable to correct these transition probabilities as QQM retains the transition probabilities for any particular quantile. Since persistence of wet spells is critical for runoff generation, a different approach to bias correction is needed to successfully use NARCliM for runoff projections that can correct rainfall sequencing to better represent the observed correlation structure in wet and dry occurrences.
Change signals in annual, seasonal and monthly average rainfall and rainfall sequencing metrics are largely preserved after bias correction, with the exception of maximum 3 d rainfall accumulation. Rainfall sequencing metrics (such as state transition probabilities and daily rainfall autocorrelation) are largely unchanged by bias correction, which suggests the possibility of using this information to drive either weather-generation models or stochastic/resampling-based bias-correction methods to produce hydrologically realistic rainfall sequences for hydroclimate projection applications.
Observed (AWAP) rainfall data are available from the Bureau of Meteorology (
NJP wrote the initial manuscript. NJP, SPC, GF and HZ carried out computations and analysis. NJP, SPC and FHSC revised the manuscript. LZ, FHSC and SPC contributed to experiment design and theoretical guidance.
The authors declare that they have no conflict of interest.
The authors thank the Victorian Department of Environment, Land,
Water and Planning (DELWP) and the Victorian Water and Climate
Initiative. The authors also express thanks to the NARCliM Project (
This research has been supported by the Victorian Department of Environment, Land, Water and Planning (grant no. R-10860).
This paper was edited by Nadav Peleg and reviewed by three anonymous referees.