Journal topic
Hydrol. Earth Syst. Sci., 24, 2207–2233, 2020
https://doi.org/10.5194/hess-24-2207-2020
Hydrol. Earth Syst. Sci., 24, 2207–2233, 2020
https://doi.org/10.5194/hess-24-2207-2020

Research article 06 May 2020

Research article | 06 May 2020

# Assimilation of wide-swath altimetry water elevation anomalies to correct large-scale river routing model parameters

Assimilation of wide-swath altimetry water elevation anomalies to correct large-scale river routing model parameters
Charlotte Marie Emery1,2,a, Sylvain Biancamaria1, Aaron Boone3, Sophie Ricci4, Mélanie C. Rochoux4, Vanessa Pedinotti5, and Cédric H. David2 Charlotte Marie Emery et al.
• 1LEGOS, 16 Avenue Edouard Belin, 31400 Toulouse, France
• 2Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA, USA
• 3CNRM-GAME, Meteo-France, 42 Avenue Gaspard Coriolis, 31000 Toulouse, France
• 4CECI, Université de Toulouse, CERFACS, CNRS, 42 Avenue Gaspard Coriolis, 31057 Toulouse CEDEX 1, France
• 5Magellium, 1 Rue Ariane, 31520 Ramonville-Saint-Agne, France
• anow at: CS-Group, Space Business Unit, 31500 Toulouse, France

Correspondence: Charlotte Marie Emery (charlotte.emery@jpl.nasa.com)

Abstract

Land surface models combined with river routing models are widely used to study the continental part of the water cycle. They give global estimates of water flows and storages, but they are not without non-negligible uncertainties, among which inexact input parameters play a significant part. The incoming Surface Water and Ocean Topography (SWOT) satellite mission, with a launch scheduled for 2021 and with a required lifetime of at least 3 years, will be dedicated to the measuring of water surface elevations, widths and surface slopes of rivers wider than 100 m, at a global scale. SWOT will provide a significant number of new observations for river hydrology and maybe combined, through data assimilation, with global-scale models in order to correct their input parameters and reduce their associated uncertainty. Comparing simulated water depths with measured water surface elevations remains however a challenge and can introduce within the system large bias. A promising alternative for assimilating water surface elevations consists of assimilating water surface elevation anomalies which do not depend on a reference surface. The objective of this study is to present a data assimilation platform based on the asynchronous ensemble Kalman filter (AEnKF) that can assimilate synthetic SWOT observations of water depths and water elevation anomalies to correct the input parameters of a large-scale hydrologic model over a 21 d time window. The study is applied to the ISBA-CTRIP model over the Amazon basin and focuses on correcting the spatial distribution of the river Manning coefficients. The data assimilation algorithm, tested through a set of observing system simulation experiments (OSSEs), is able to retrieve the true value of the Manning coefficients within one assimilation cycle much of the time (basin-averaged Manning coefficient root mean square error, RMSEn, is reduced from 33 % to [1 %–10 %] after one assimilation cycle) and shows promising perspectives with assimilating water anomalies (basin-averaged Manning coefficient RMSEn is reduced from 33 % to [1 %–2 %] when assimilating water surface elevation anomalies over 1 year), which allows us to overcome the issue of unknown bathymetry.

1 Introduction

Global hydrological models (GHMs) are extensively exploited to study the continental component of the global water cycle . Such models have been extensively developed over the past 2 decades in order to quantify freshwater flows and storage changes over continental surfaces (Bierkens2015). They are based on the coupling of a land surface model (LSM) with a river routing model (RRM). As an example, the ISBA-CTRIP hydrologic model results from the coupling of the ISBA LSM and the TRIP RRM . LSMs simulate the energy and water balance at the soil–atmosphere–vegetation interface, while RRMs emulate the lateral transfer of freshwater toward the continent–ocean interface. The current study focuses on the river component of the terrestrial water cycle simulated by the RRM.

GHMs give a global view of the state of the water flow and storage at model spatial and temporal resolutions. Nonetheless, they suffer from multiple sources of uncertainties which are related to the model structure, the external forcing and the input parameters . Model structure uncertainties initially arose from a lack of knowledge of the hydrologic processes or from simplifying assumptions made to limit simulation computational cost. Still, with the increase in computational power, models are more and more complex : they run at finer spatial resolution, they include new physical processes and they use an increasing number of fully distributed forcing and parameter datasets . This has led to an increase in the number of model input parameters and potentially inflates the model uncertainty in those parameters. Input parameters express the spatial and/or temporal properties of the system. The spatial scale of parameters measurable on the field may differ from the model scale, while other conceptual parameters are not directly observable and measurable on the field and are inferred using a geomorphological empirical formula and/or indirect methods such as calibration .

Another way in which to study the terrestrial water cycle is to use direct observations of the system. Most parts of the terrestrial water cycle are currently observed and measured from in situ or remote techniques . For the observations of rivers, in situ techniques measure river water elevations at gauge stations. In situ measurements are commonly very accurate and also frequent (i.e., sub-daily), but their main limitation is their spatially sparse sampling and their decreasing number over recent decades at a global scale . Coincidentally, remotely sensed data provided by satellite missions have increased quite significantly since the 90s and deliver effective river observations. The most common instrument operating to assess river water levels remains the nadir altimeter. Nadir altimetry gives localized water elevation measurements along the satellite ground track. Initially, altimeters were designed to monitor ocean topography, but their application has broadened to the observation of lakes , floodplains and, later on, rivers . However, their main limitation remains their limited spatial and temporal samplings: generally several days between two consecutive measurements at a limited number of locations. Besides, over continental surfaces, the signal is not always retrievable. Current river observations therefore provide a more accurate view of the river system than models, but they are quite limited by their sparse availability in space and time.

The incoming Surface Water and Ocean Topography (SWOT) mission, jointly developed by NASA, CNES, CSA and UKSA and scheduled for launch in 2021, will be dedicated to the observation of continental free surface water with a better spatial and temporal coverage than the current nadir missions (such as EnviSat, the JASON series or also Sentinel-3A/B). SWOT's main payload, called KaRIn, for Ka-band Radar INterferometer , will observe surfaces under two swaths of 50 km each separated by a nadir gap of 20 km and will have a near-global coverage. For hydrology, SWOT will observe rivers wider than 100 m as well as lakes and wetlands larger than 250 × 250 m2 within the latitudes 78 S and 78 N and with a revisit time of 21 d. SWOT will provide two-dimensional images of water surface elevations with a vertical accuracy of 10 cm when averaged over 1 km2 of water area. Along with water surface elevation measurements in rivers, SWOT will also provide observations of river width, surface slope and estimates of discharge based on SWOT observations. SWOT will provide a significant amount of new data for surface hydrology. It will give an ensemble of constraints that will allow a better depiction of surface water in hydrological models. These new data could be combined or integrated into global-scale hydrological models in order to correct them and improve their performances and forecasting capabilities.

Data assimilation techniques are a set of mathematical methods which combine a physical model and related external measurements, taking their relative uncertainties into account. Data assimilation aims at improving the model's ability to forecast and/or emulate the physical system's evolution. For this purpose, data assimilation methods are built to correct either the model's outputs (state estimation) or the model's input parameters (parameter estimation or PE), and sometimes both simultaneously. Data assimilation for state estimation has been widely applied in meteorology and oceanography and is more and more developed for large-scale terrestrial hydrology . Data assimilation for PE in hydrology was initially developed as a dynamic alternative to model calibration . In most models, parameters are assumed to be constant in time, whereas, in reality, they may vary seasonally or under evolving climate and/or anthropogenic conditions. Sequential data assimilation can therefore help track model parameter variations in time . PE is also used to retrieve conceptual parameters of hydrologic models such as friction coefficients or residence times of quick- and slow-flow reservoirs and partition of runoff excess , which can not be directly measured.

Before launch, in the preparatory phase, observing system simulation experiments (OSSEs) can be performed in order to assess the benefits of assimilating SWOT data into a hydrological model and to evaluate the most adapted methodologies to assimilate these data into models. Several studies assimilating synthetic and/or simplified SWOT-like data have been published so as to evaluate the correction of river model state, namely river depth , river storages and river discharges , at various scales. But also, several studies focused on the possibility of using SWOT data to retrieve critical river parameters such as river bathymetry and/or riverbed roughness/friction coefficient . Indeed, SWOT is a scientific mission with a 3-year nominal lifetime. Therefore, SWOT observations will help to better calibrate hydrological models and to improve their performances even over time periods beyond its lifetime. Moreover, other studies using real remote-sensing data have also been published and give insight into the challenges related to the assimilation of space-borne products, such as , and , which assimilate nadir radar altimetry data.

In the present study, a data assimilation framework is used to correct input parameters of the large-scale ISBA-CTRIP model. More specifically, synthetic SWOT observations of water surface depths and anomalies are assimilated in order to correct the spatially distributed riverbed friction coefficients (or Manning coefficients). As SWOT will not directly measure water depths (it provides water elevation and the bathymetry is required to derive water depth), the purpose of this study is to evaluate the possibility of assimilating water elevation anomalies to correct the model's parameters and to assess how the assimilation performances are impacted, compared to the direct assimilation of water depths. Assimilating water elevation anomalies is done to overcome a potential lack of bathymetry data.

This study is presented as a complementary study to that of , which is dedicated to the state estimation (river storage and discharge) of the same ISBA-CTRIP model, using real satellite-based discharge products. The choice of the roughness coefficient as a control variable was made following the results from the ISBA-CTRIP sensitivity analysis in . In this preliminary study, the sensitivity of the simulated water depths and also anomalies to several river input parameters (such as riverbed width, depth, slope and also friction coefficient) was evaluated. The results showed that the highest sensitivity was in the Manning coefficient.

This study is, furthermore, also built on the conclusions from the work of . In our study, an ensemble Kalman filter (EnKF) is used (instead of the extended Kalman filter in ) to better account for the nonlinearities of the system and to better estimate the model errors. Also, chose to update the Manning coefficient distribution at the grid cell scale, and the question of equifinality arose in their results. For the current study, it was decided to update the Manning coefficient distribution not at the grid-cell resolution, but at a coarser zonal resolution, by applying multiplying correcting factors uniformly over each zone, identical to the one used in . Finally, used an assimilation window of 2 d. This configuration resulted in updated Manning coefficient time series displaying “unrealistic jumps” with a frequency of about 20 d associated with the orbit repeat cycle (longer than the 2 d window). To avoid this phenomenon, the present study uses an assimilation window of 21 d corresponding to the current SWOT orbit repeat cycle.

Section 2 will first give a description of the ISBA-CTRIP model used for this study. Section 3 will present the particular data assimilation method developed for this study and, finally, after presenting the assimilation strategy in Sect. 4, Sects. 5 and 6 will give the data assimilation results.

2 Model

## 2.1 The ISBA-CTRIP large-scale hydrological model

The ISBA model is a LSM defined at global scale on a 0.5× 0.5 regular mesh grid that establishes the energy and water budget over continental surfaces. This study operates the ISBA-3L version based on a three-layer soil . The budget equations are solved separately on each grid cell. Still, larger-scale spatial patterns in the radiative and precipitation forcing, the soil composition and the vegetation cover ensure spatial correlations between those cells (for more details, see ). In particular, ISBA gives a diagnostic of the surface runoff (QISBA,sur) and the gravitational drainage (QISBA,sub, i.e., water percolating to the deep layers of the soil) later used as forcing inputs for the RRM denoted CTRIP.

The CTRIP model , is defined on the same mesh grid as ISBA and follows a river network to laterally transfer water from one cell to another, down to the interface with the ocean . The study is based on the CTRIP version from with three reservoirs, as illustrated in Fig. 1a. The water mass (kg) stored in a groundwater reservoir G and a floodplain reservoir F interacts with the water mass in the surface reservoir S representing the river. Only the surface reservoir S is related to the river network and fills with the surface runoff QISBA,sur, the outflow from upstream cells and the delayed drainage QISBA,sub by means of the groundwater reservoir. Occasionally, when the amount of water in the river exceeds a given threshold (defined by the water level in the reservoir), the river spills into the floodplains.

Figure 1(a) The ISBA-CTRIP system for a given grid cell. ISBA surface runoff (QISBA,sur) flows into the river/surface reservoir S, and ISBA gravitational drainage (QISBA,sub) feeds groundwater reservoir G. The surface water is transferred from one cell to another following the TRIP river routing network. (b) Hydro-geomorphological areas of the Amazon basin from with the gauge of Óbidos located by the white circle at the entry of zone 3.

## 2.2 CTRIP parameters

Within a 0.5× 0.5 cell, the surface reservoir is a unique river channel that may gather multiple real river branches. Its rectangular cross section is described by its slope s (–), its width W (m), its bankful depth Hc (m), its length L (m) and finally a Manning or friction coefficient N ($\mathrm{s}\phantom{\rule{0.125em}{0ex}}{\mathrm{m}}^{-\mathrm{1}/\mathrm{3}}$) that assesses the reach resistance at the bottom of the river.

Each cell's elevation is deduced from the STN-30p digital elevation model (http://daac.ornl.gov/ISLSCP_II/islscpii.shtml, last access: 20 April 2020). These elevations are then compared to determine the riverbed slope s. Global empirical geomorphologic relationships are used to define the river width W and bankful depth Hc. The arc length between grid cell centers, inflated by a meandering factor μ, results in the river reach length L. More details on these parameters can be found in and .

The Manning coefficient N is generally more complicated to estimate. Following , it should take values between 0.025 and 0.03 for natural streams and values between 0.075 and 0.1 for smaller and mountainous tributaries and also floodplains. Global studies can apply either a constant or a spatially distributed Manning coefficient. However, it is ordinarily accepted that this parameter should vary in space and even in time across the river catchment. Consequently, CTRIP uses a spatially distributed Manning coefficient based on a simple linear relationship between the relative stream size in the current cell, denoted SO, and the size at the river mouth and the source cells, so that

$\begin{array}{}\text{(1)}& N={N}_{\text{min}}+\left({N}_{\text{max}}-{N}_{\text{min}}\right)\frac{{\text{SO}}_{\text{max}}-\text{SO}}{{\text{SO}}_{\text{max}}-{\text{SO}}_{\text{min}}}.\end{array}$

SO is the stream size relative measure at the current cell, SOmax (whose value depends on the network depth) the same measure at the river mouth and SOmin=1 the measure at source cells (namely cells without any upstream cells, according to the river network). The Manning coefficient is then set to be constant in time while its spatial values decrease towards the river outlet (following the river network), with values between Nmin=0.04 and Nmax=0.06.

All these parameters are eventually essential to estimate the spatially and time-varying average cross-sectional flow velocity in the surface reservoir v(t) following the Manning formula (Manning1891):

$\begin{array}{}\text{(2)}& v\left(t\right)=\frac{{s}^{\frac{\mathrm{1}}{\mathrm{2}}}}{N}{\left(\frac{W{h}_{S}\left(t\right)}{W+\mathrm{2}{h}_{S}\left(t\right)}\right)}^{\frac{\mathrm{2}}{\mathrm{3}}},\end{array}$

where hS is the river water depth estimated from the river storage S by

$\begin{array}{}\text{(3)}& {h}_{S}=\frac{S}{\mathit{\rho }WL}\end{array}$

and ρ is the water density. The flow velocity is ultimately used to estimate the discharge leaving the CTRIP cell:

$\begin{array}{}\text{(4)}& {Q}_{\text{out}}^{S}\left(t\right)=\frac{v\left(t\right)}{L}S\left(t\right).\end{array}$

As the definitions of most of these parameters are based on empirical relationships, we have to be aware that they inevitably have substantial uncertainties.

## 2.3 CTRIP implementation over the Amazon basin

In this study, we present an OSSE test case over the Amazon River basin, whose hydrology is carefully described in and . This choice was motivated by the present work following and complementing studies over the same domain .

For ISBA-CTRIP, the Amazon basin is composed of a total number of 2028 cells. Based on the basin geomorphology and hydrology , the basin has been split into nine spatial regions. These zones, illustrated in Fig. 1b, were initially introduced in and will be re-exploited here within the application of data assimilation. For a detailed description of the zones, the reader can refer to .

## 2.4 ISBA-CTRIP forcing

For the present study, ISBA-CTRIP needs external atmospheric forcing in order to run. Similarly to , such data are provided by Global Soil Wetness Projet 3 (GSWP3, http://hydro.iis.u-tokyo.ac.jp/GSWP3, last access: 20 April 2020) at a 3-hourly time resolution.

3 Method: synthetic parameter estimation on ISBA-CTRIP

## 3.1 OSSE framework

In OSSEs, we introduce beforehand a reference configuration for the model input parameters that we will consider thereafter to be the truth. From those true parameters, we directly deduce the true run from a ISBA-CTRIP model integration. The synthetic observations used for data assimilation are obtained by perturbing the true observables (variables that are used as observations) using an error model that is representative of the real observation errors. The control variables (the model variables to be corrected with data assimilation) first guess is obtained by directly perturbing the true control variables. Control error also has to be chosen to be representative of the real modeling errors. OSSEs are prerequisite tests to ensure that the implementation of the EnKF algorithm is correct and adapted to the hydrologic problem under consideration (temporal/spatial length scales, sources of uncertainty, observation operator, etc.).

## 3.2 Data assimilation variables

### 3.2.1 Observation variables and their errors

The observation vector, denoted ${\mathbit{y}}_{k}^{\mathrm{o}}$ at the assimilation cycle k, is composed of the ny available observations at cycle k:

$\begin{array}{}\text{(5)}& {\mathbit{y}}_{k}^{\mathrm{o}}=\left[\begin{array}{cccc}{y}_{k,\mathrm{1}}^{\mathrm{o}},& {y}_{k,\mathrm{2}}^{\mathrm{o}},& \mathrm{\dots },& {y}_{k,{n}_{y}}^{\mathrm{o}}\end{array}\right],\end{array}$

where ${y}_{k,j}^{\mathrm{o}}$, j=1 … ny is the jth observation among the ny at cycle k.

In the present study, the observed variables are water depths issued from a simplified SWOT simulator. Note that this simulator will produce water depths, while the real SWOT satellite will provide water elevations. As in and , this SWOT simulator replicates SWOT spatio-temporal coverage. At a given date, the simulator selects the ISBA-CTRIP cells contained (at least 50 % of their area) in the SWOT ground tracks. Figure 2 shows some selected ISBA-CTRIP cells under the real swaths over the Amazon basin. The true run is used as a basis to get the true water depths ${\mathbit{y}}_{k}^{t}$. Then, in order to generate the observation vector ${\mathbit{y}}_{k}^{\mathrm{o}}$ from the extracted true water depths, each of them is randomly perturbed by adding a white noise characterized by a standard deviation σo so that

$\begin{array}{}\text{(6)}& \forall \phantom{\rule{0.33em}{0ex}}j=\mathrm{1}\phantom{\rule{0.25em}{0ex}}\mathrm{\dots }\phantom{\rule{0.25em}{0ex}}{n}_{y},\phantom{\rule{0.33em}{0ex}}{y}_{k,j}^{\mathrm{o}}={y}_{k,j}^{t}+{\mathit{ϵ}}_{j}^{\mathrm{o}},\phantom{\rule{0.33em}{0ex}}{\mathit{ϵ}}_{j}^{\mathrm{o}}\simeq \mathcal{N}\left(\mathrm{0},{\mathit{\sigma }}^{\mathrm{o}}\right).\end{array}$

Using water depth observations is a strong simplification of the real SWOT product. Therefore, in order to take into account that SWOT will provide water elevations and not directly water depths, this study will look at the assimilation of both water depths and water anomalies. The method for generating these anomalies will be further detailed in Sect. 4.2.

Figure 2SWOT swaths at ISBA-CTRIP resolution over the Amazon basin.

The observation error is the addition of the measurement error and the representativeness error. The first is associated with inherent instrumental errors when processes are observed and the second represents the error introduced when the observed and simulated variables are not exactly the same (in nature or scale). Following the SWOT uncertainty requirements , SWOT-like water surface elevation measurements have a vertical accuracy of 10 cm (when averaged over a water area of 1 km2). This uncertainty accounts for measurement errors due to the remotely sensed acquisition such as instrumental thermal noise, speckle, troposphere and ionosphere effects. Moreover, we omit error correlations along the swath so that observation errors follow a white noise model. Accounting for spatially correlated observation errors is an active research area in the field of data assimilation , which is beyond the scope of demonstrating the feasibility of assimilating SWOT-type data. In the framework of OSSEs, observed and simulated water depths have the same scale as the ISBA-CTRIP model which is used to generate both. In the following, we assume therefore that there is no representativeness error related to the scale in the system. However, it is worth acknowledging that we should expect higher errors in water depths, compared to water elevations, as we do not know the bathymetry. Assimilation of water depths is performed as a benchmark against which assimilation of water anomalies will be compared. Ultimately, σo is chosen as being equal to 10 cm for all observed variables (i.e., both water depths and water elevation anomalies).

### 3.2.2 Control, observation space and their errors

The control vector is denoted by ${\mathbit{x}}_{k}\in {\mathbb{R}}^{{n}_{x}}$. It includes the nx uncertain variables to be estimated through the kth data assimilation cycle. The choice of the control variables determines the observation operator k:

$\begin{array}{}\text{(7)}& {\mathbit{y}}_{k}={\mathcal{H}}_{k}\left({\mathbit{x}}_{k}\right),\end{array}$

where yk are the simulated observables; in other words, k maps the control variables onto the observation space. They are then compared to the measured observations ${\mathbit{y}}_{k}^{\mathrm{o}}$ during the data assimilation experiment. This difference is referred to as the innovation vector.

Following the conclusions from , we determined that assimilating water-depth-like observations would be efficient for the correction of the distribution of the river Manning coefficients. These coefficients are spatially distributed at the grid-cell scale. However, from , equifinality issues were raised through the correcting of the distribution at this scale. They also affected its upstream-to-downstream spatial distribution. We chose to correct it therefore by applying multiplying factors defined at a coarser scale, namely at the scale of the nine hydro-geomorphological areas defined in Sect. 2.3 and illustrated in Fig. 1b. Within the same area, the Manning coefficient values are all identically modified by being multiplied by the same factor.

The control vector is composed therefore of the nx multiplying factors Nmult,i, i=1 … nx, applied to the correction of the spatial distribution of the river Manning coefficient:

$\begin{array}{}\text{(8)}& {\mathbit{x}}_{k}={\left[{N}_{\text{mult},\mathrm{1}}\phantom{\rule{0.25em}{0ex}}\mathrm{\dots }\phantom{\rule{0.25em}{0ex}}{N}_{\text{mult},{n}_{x}}\right]}^{T},\end{array}$

giving nx=9.

The observation operator k maps the control variables (Manning coefficient dimensionless multiplying factors) into the observables (river water depths in meters) as follows:

1. first, apply the multiplying factors (xk) to the Manning coefficient distribution;

2. then, apply the ISBA-CTRIP model ${\mathcal{M}}_{\left[k-\mathrm{1},k\right]}$ over the assimilation window $\left[k-\mathrm{1},k\right]$ to determine the model states that correspond to the perturbed Manning coefficient distribution;

3. afterwards, turn the CTRIP surface water storage into equivalent water depths following Eq. (3) (we denote by 𝒵k the diagnostic operator turning the surface storage variable into the water depth variable);

4. finally, select the simulated water depths under the SWOT swath mask (we denote by 𝒮k this operator).

The observation operator is therefore the composition of three operators:

$\begin{array}{}\text{(9)}& {\mathbit{y}}_{k}=\left({\mathcal{S}}_{k}\circ {\mathcal{Z}}_{k}\circ {\mathcal{M}}_{\left[k-\mathrm{1},k\right]}\right)\left({\mathbit{x}}_{k}\right)={\mathcal{H}}_{k}\left({\mathbit{x}}_{k}\right).\end{array}$

Such a nonlinear observation operator k is difficult to formulate explicitly, which is why we use an EnKF algorithm to estimate the Kalman gain in a statistical way.

## 3.3 The EnKF general formulation

In the EnKF framework, the model ${\mathcal{M}}_{\left[k-\mathrm{1},k\right]}$ and observation k operators are generally not linear. The main assumption for the EnKF is to use stochastic ensembles to represent first- and second-order moments (namely the means and the covariances) of the control variable errors (Evensen1994, 2003). Indeed, it is assumed that the distribution of the ensemble is similar to that of the error of the control vector, and it is also assumed that the probability density function (PDF) of the error is Gaussian and thus well described by its first and second moments. The background control variables ${\mathbit{x}}_{k}^{\mathrm{b}}$ (the first guess) are therefore represented by an ensemble of ne members:

$\begin{array}{}\text{(10)}& {\mathbf{X}}_{e,k}^{\mathrm{b}}=\left[\begin{array}{cccc}{\mathbit{x}}_{k}^{b,\left[\mathrm{1}\right]}& {\mathbit{x}}_{k}^{b,\left[\mathrm{2}\right]}& \mathrm{\dots }& {\mathbit{x}}_{k}^{b,\left[{n}_{\mathrm{e}}\right]}\end{array}\right].\end{array}$

To avoid the collapsing of the ensemble, the observation vector in Eq. (5) is randomized by adding a supplementary white noise with the same observation error standard deviation σo so that

$\begin{array}{ll}\forall \phantom{\rule{0.33em}{0ex}}j& =\mathrm{1}\phantom{\rule{0.25em}{0ex}}\mathrm{\dots }\phantom{\rule{0.25em}{0ex}}{n}_{y},\phantom{\rule{0.33em}{0ex}}\forall \phantom{\rule{0.33em}{0ex}}l=\mathrm{1}\phantom{\rule{0.25em}{0ex}}\mathrm{\dots }\phantom{\rule{0.25em}{0ex}}{n}_{\mathrm{e}},\phantom{\rule{0.33em}{0ex}}{y}_{k,j}^{\mathrm{o},\left[l\right]}\\ \text{(11)}& & ={y}_{k,j}^{\mathrm{o}}+{\mathit{ϵ}}_{j}^{\mathrm{o},\left[l\right]},\phantom{\rule{0.33em}{0ex}}{\mathit{ϵ}}_{j}^{\mathrm{o}}\simeq \mathcal{N}\left(\mathrm{0},{\mathit{\sigma }}^{\mathrm{o}}\right).\end{array}$

An observation ensemble is generated:

$\begin{array}{}\text{(12)}& {\mathbf{Y}}_{e,k}^{\mathrm{o}}=\left[\begin{array}{cccc}{\mathbit{y}}_{k}^{\mathrm{o},\left[\mathrm{1}\right]}& {\mathbit{y}}_{k}^{\mathrm{o},\left[\mathrm{2}\right]}& \mathrm{\dots }& {\mathbit{y}}_{k}^{\mathrm{o},\left[{n}_{\mathrm{e}}\right]}\end{array}\right].\end{array}$

Note that alternatives exist to the observation randomization chosen here. However, for the present study, we choose to use a full stochastic filter.

Finally, the EnKF analysis step is applied to each member of the ensemble so that

$\begin{array}{}\text{(13)}& \forall \phantom{\rule{0.33em}{0ex}}l=\mathrm{1}\phantom{\rule{0.25em}{0ex}}\mathrm{\dots }\phantom{\rule{0.25em}{0ex}}{n}_{\mathrm{e}},\phantom{\rule{0.33em}{0ex}}{\mathbit{x}}_{k}^{a,\left[l\right]}={\mathbit{x}}_{k}^{b,\left[l\right]}+{\mathbf{K}}_{e,k}\left({\mathbit{y}}_{k}^{\mathrm{o},\left[l\right]}-{\mathcal{H}}_{k}\left({\mathbit{x}}_{k}^{b,\left[l\right]}\right)\right),\end{array}$

where Ke,k is the Kalman gain. It is built from the control and observation error covariance matrices P and R and the linearized observation operator H so that (see Appendix A for more details)

$\begin{array}{}\text{(14)}& {\mathbf{K}}_{e,k}=\left[{\mathbf{PH}}^{T}{\right]}_{e,k}{\left(\left[{\mathbf{HPH}}^{T}{\right]}_{e,k}+{\mathbf{R}}_{k}\right)}^{-\mathrm{1}}.\end{array}$

Figure 3 summarizes the general OSSE framework used for the present study. The figure reads from top to bottom and from left to right. An assimilation cycle $\left[k-\mathrm{1},\phantom{\rule{0.33em}{0ex}}k\right]$ includes a forecast step where an ensemble of ISBA-CTRIP simulations is integrated, each member having a different spatially distributed Manning coefficient; an analysis step where the ensemble of Manning coefficients is corrected using synthetic observations through the Kalman filter update in Eq. (13); and a cycling step where the ISBA-CTRIP model is re-run with these analysis estimates to obtain updated model states.

Figure 3Data assimilation framework over the assimilation cycle $\left[k-\mathrm{1},\phantom{\rule{0.33em}{0ex}}k\right]$ including (1) a forecast step to integrate the ensemble of ISBA-CTRIP simulations, each member having a different spatially distributed Manning coefficient, (2) and an analysis step to correct this ensemble of Manning coefficients using synthetic observations through the Kalman filter equation and re-run the ISBA-CTRIP model with these analysis estimates to obtain the updated model states.

## 3.4 SWOT-based data assimilation special feature

### 3.4.1 Choice of the assimilation window

We use a 21 d assimilation window corresponding to a SWOT orbit revisit period. During one assimilation window, every pixel under the observation mask is therefore observed at least once. However, this implies that new observations are available at times which differ from the update time. Such a case has already been addressed in several studies. The ensemble Kalman smoother (EnKS) for example, introduced by , is a direct extension of the EnKF. It consists of generating an update of the control variables taking into consideration the present and all past observations when a new observation is available. The EnKS is actually a sequential version of the ensemble smoother . The latter takes into consideration all past and future observations, but turned out to be less effective than the EnKF and the EnKS . Alternatively, developed the 4D-EnKF (4D as in the 4D-VAR variational assimilation methods; ), which also assimilates observations available at different time steps. In the 4D-EnKF, all model observations are expressed as a linear combination of the model observations at analysis time, and the problem is transformed into a classic EnKF problem. Similarly, also presented an asynchronous version of the local ensemble transform Kalman filter . In the framework of the present study, we apply an asynchronous ensemble Kalman filter (AEnKF) as described by and . The principle is to increase the dimension of the state in order to consider observations at past and analysis times. This increases the dimension of the matrices which contain covariances between observations available at different times. To our knowledge, the AEnKF has not been used for parameter estimation, as and described the method for state estimation experiments.

### 3.4.2 The asynchronous EnKF

To start with, k represents the assimilation cycle index, but it needs to be distinguished from the day index (within the assimilation cycle), which is the time unit for the observations. We will then denote by k(i), i=1 … 21 the ith day in the current assimilation cycle.

On the ith day of the kth assimilation cycle, the ny,k(i) observations are gathered in the vector ${\mathbit{y}}_{k\left(i\right)}^{\mathrm{o}}$. The overall observation vector at cycle k, ${\mathbit{y}}_{k}^{\mathrm{o}}$, then concatenates the 21 daily observation vectors ${\mathbit{y}}_{k\left(i\right)}^{\mathrm{o}}$ so that

$\begin{array}{}\text{(15)}& {\mathbit{y}}_{k}^{\mathrm{o}}={\left[{\mathbit{y}}_{k\left(\mathrm{1}\right)}^{\mathrm{o},T}\phantom{\rule{0.25em}{0ex}}\mathrm{\dots }\phantom{\rule{0.25em}{0ex}}{\mathbit{y}}_{k\left(\mathrm{21}\right)}^{\mathrm{o},T}\right]}^{T}\in {\mathbb{R}}^{{n}_{y,k}},\phantom{\rule{0.33em}{0ex}}{n}_{y}=\sum _{i=\mathrm{1}}^{\mathrm{21}}{n}_{y,k\left(i\right)}.\end{array}$

Similarly to the observation vector, the overall observation operator at the kth cycle, k, is the concatenation of the daily observation operators k(i), defined from Eq. (9), but by considering the operator integrating the model between k(0) ($=\left(k-\mathrm{1}\right)\left(\mathrm{21}\right)$) and k(i) as well as the diagnostic and selection operators, a time step k(i), 𝒵k(i) and 𝒮k(i).

The observation error covariance matrix Rk is the concatenation of the daily observation error covariance matrices:

$\begin{array}{ll}{\mathbf{R}}_{k}=& \phantom{\rule{0.25em}{0ex}}\left(\begin{array}{cccc}{\mathbf{R}}_{k\left(\mathrm{1}\right)}& & & \\ & {\mathbf{R}}_{k\left(\mathrm{2}\right)}& \mathrm{0}& \\ & \mathrm{0}& \mathrm{\ddots }& \\ & & & {\mathbf{R}}_{k\left(\mathrm{21}\right)}\end{array}\right)\\ \text{(16)}& & \text{with}\phantom{\rule{0.33em}{0ex}}{\mathbf{R}}_{k\left(i\right)}=\left({\mathit{\sigma }}^{\mathrm{o}}{\right)}^{\mathrm{2}}{\mathbf{I}}_{{n}_{y,k\left(i\right)}},\end{array}$

where ${\mathbf{I}}_{{n}_{y,k\left(i\right)}}$ is the identity matrix of size ${n}_{y,k\left(i\right)}×{n}_{y,k\left(i\right)}$. It turns out that

$\begin{array}{}\text{(17)}& {\mathbf{R}}_{k}=\left({\mathit{\sigma }}^{\mathrm{o}}{\right)}^{\mathrm{2}}{\mathbf{I}}_{{n}_{y}}.\end{array}$

Following the same equations as the EnKF, the AEnKF generates, for each member l=1 … ne of the control ensemble, an analysis control vector ${\mathbit{x}}_{k}^{a,\left[l\right]}$.

### 3.4.3 Generation of the ensemble

To generate the background control ensemble, we solely stochastically perturb the variables within the control vector. Note that it amounts to the assumption that all other features of the forward model, e.g., the atmospheric forcings, the LSM structure and therefore the surface and sub-surface runoff, are perfect. While this applies for OSSEs, such features are never perfect in real-case experiments. This assumption is further discussed in Sect. 7.

The ensemble of background control vectors ${\mathbf{X}}_{e,k}^{\mathrm{b}}$, of size nx×ne, is generated so that ${\mathbit{x}}_{k}^{b,\left[l\right]}$, l=1 … ne follows a Gaussian law of mean $\stackrel{\mathrm{‾}}{{\mathbit{x}}_{k}^{\mathrm{b}}}$ and covariance matrix ${\mathbf{P}}_{e,k}^{\mathrm{b}}$. For the first assimilation cycle, the control variables' mean value $\stackrel{\mathrm{‾}}{{\mathbit{x}}_{\mathrm{1}}^{\mathrm{b}}}$ is arbitrarily chosen as the openloop run input parameter (the openloop or free run is the model run without assimilation) and the background error covariance matrix ${\mathbf{P}}_{e,\mathrm{1}}^{\mathrm{b}}$ is a diagonal matrix defined as

${\mathbf{P}}_{e,\mathrm{1}}^{\mathrm{b}}=\left(\left({\mathit{\sigma }}^{\mathrm{b}}{\right)}^{\mathrm{2}}{\mathbb{I}}_{{n}_{x}}\right),$

with ${\mathbb{I}}_{{n}_{x}}$ the identity matrix of size nx×nx and σb the vector that gathers the initial control variable error standard deviation.

Once the analysis ensemble ${\mathbf{X}}_{e,k}^{a}$ is determined, the next step is to propagate the correction in time. In a PE framework, it is necessary to re-run the ensemble model runs during the current assimilation window with the analysis parameters as inputs. Then, the contribution of the updated parameters is propagated through the model, up to the end of the current assimilation window, and put into the model initial condition for the next assimilation cycle.

For the next assimilation cycles, the background mean estimate is set equal to the analysis mean estimate from the previous cycle:

$\stackrel{\mathrm{‾}}{{\mathbit{x}}_{k}^{\mathrm{b}}}=\stackrel{\mathrm{‾}}{{\mathbit{x}}_{k-\mathrm{1}}^{a}}.$

There are different ways of defining ${\mathbf{P}}_{e,k}^{\mathrm{b}}$. One could choose to stochastically estimate ${\mathbf{P}}_{e,k-\mathrm{1}}^{a}$ from the analysis ensemble at the previous cycle and use it as ${\mathbf{P}}_{e,k}^{\mathrm{b}}$. Contrary to state estimation experiments where the analysis error covariance matrix is propagated in time using the model along with the control variables, parameter estimation experiments use it directly as the background error covariance matrix as there is no dynamical model for the Manning coefficient. The issue with this approach is that the analysis ensemble variance can be strongly reduced and provide too small an ensemble spread to have efficient AEnKF updates in time. To ensure that enough uncertainty is maintained in the ensemble, one can maintain the initial background error covariance matrix through all cycles or impose a minimal value for the variance elements (see Sect. 5.4).

The background error cross-covariance matrix [PHT]e,k and covariance matrix [HPHT]e,k are directly built from the definition suggested by , and ; see Appendix A for more details. The matrices are of sizes ${n}_{x}×{n}_{y,k}$ and ${n}_{y,k}×{n}_{y,k}$, respectively. The elements in the error cross-covariance matrices result directly from the characterization of the background ensemble; i.e., the parameter uncertainties accounted for the generation of the control matrix ${\mathbf{X}}_{e,k}^{\mathrm{b}}$ and $\mathcal{H}\left({\mathbf{X}}_{e,k}^{\mathrm{b}}\right)$.

4 Assimilation strategy

In the incoming experiments, the true control variables xt are

$\begin{array}{}\text{(18)}& {\mathbit{x}}^{t}=\left[\mathrm{1.65}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.85}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.85}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.95}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.90}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.95}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.90}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{1.30}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{1.40}\right]\end{array}$

and the a priori values at the first assimilation cycle $\stackrel{\mathrm{‾}}{{\mathbit{x}}_{\mathrm{1}}^{\mathrm{b}}}$ are

$\begin{array}{}\text{(19)}& \stackrel{\mathrm{‾}}{{\mathbit{x}}_{\mathrm{1}}^{\mathrm{b}}}=\left[\mathrm{1.50}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.50}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.50}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.50}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.50}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.50}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{0.50}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{1.50}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\phantom{\rule{0.125em}{0ex}}\mathrm{1.50}\right].\end{array}$

We increase the Manning value in mountainous zones (zone 1 in the Andes and zones 8 and 9 over the shields) and lower the Manning value over the other zones with the lowest values (in zones 2 and 3) corresponding to the main stem. Both true and background values were chosen accordingly.

## 4.1 Sensitivity tests

During one EnKF assimilation cycle, the analysis potentially depends on the following parameters: model spinup, time period (high/low flow), size of the ensemble, and control error. Note that the observation error also has an impact on the analysis, but its value is already fixed for all subsequent experiments (see Sect. 5).

A first set of experiments (either model runs or data assimilation runs) will serve as sensitivity tests for the data assimilation platform with respect to the above features. During these sensitivity tests, the different features are tested individually. Table 1 details the range of variations for each tested feature.

Table 1Tested data assimilation parameters in the sensitivity tests.

## 4.2 Assimilation tests

Following the sensitivity tests, a set of three data assimilation experiments will be run and is presented in Table 2. The data assimilation experiments are divided into two categories: the first one uses water depths as observations, and the second considers water depth anomalies. All experiments are run across a year, corresponding to 17 assimilation cycles of 21 d.

Table 2List of data assimilation experiments. All experiments are run over approximately 1 year (17 cycles of 21 d) starting on 1 January 2008. The ensemble size is ne=25, the observation error standard deviation is σo=0.1 m and the initial control variable error standard deviation is σb=0.3.

The first experiment, denoted PE1, is configured from the aforementioned sensitivity test outcome. The parameters defining the experiment (spinup, starting date, ensemble size, control error) will be those which provide the best results in the sensitivity tests in Table 2. The reference level between the observed and simulated water depths is also the same. In other words, there is no bias in the observation. This first idealized experiment serves as a proof-of-concept as the observation type matches exactly the type of the simulated variables. Consequently, with this experiment, we expect to retrieve the true value of the control variables and hence the correct water depths and discharges.

The next step is to head towards more realistic experiments by including new sources of uncertainties in the data assimilation system and seeing how to address them. In this context, two additional experiments denoted PE2 and PE3 will be carried out. As an example of new uncertainties, SWOT will in fact observe water elevations (water surface elevation as referenced to a geoid or an ellipsoid), whilst CTRIP produces water depths (water surface elevation as referenced to the bottom of the river bed). To perform data assimilation, one needs to convert CTRIP water depths (${h}_{S}^{\text{CTRIP}}$) into CTRIP water elevations (${H}_{\text{alti}}^{\text{CTRIP}}$) or inversely for SWOT. It is highly plausible that this operation induces a bias between the modeled and observed water elevations. A simplified example of such a situation is illustrated in Fig. 4. In this case, SWOT catches the right water elevation dynamic (as ${h}_{S}^{\text{SWOT}}$ and ${h}_{S}^{\text{CTRIP}}$ are equal), but the direct assimilation of SWOT water elevation ${H}_{\text{alti}}^{\text{SWOT}}$ will induce a bias as the elevations of the river bed (${H}_{\text{bed}}^{\text{SWOT}}$ and ${H}_{\text{bed}}^{\text{CTRIP}}$) are different between CTRIP and SWOT.

Figure 4Illustration of a bias case between the water elevation as observed by SWOT (${H}_{\text{alti}}^{\text{SWOT}}$) and the one simulated by CTRIP (${H}_{\text{alti}}^{\text{TRIP}}$) because of a bias between model and true river beds.

A solution for the handling of this issue is to assimilate water depth anomalies instead of water depths. The next data assimilation experiments, denoted PE2 and PE3, will therefore test the feasibility of assimilating anomalies. In these experiments, the water depth anomalies are generated by subtracting a time-averaged reference water depth from the current water depth. For all runs (true, openloop or analysis), this time-averaged reference water depth is computed as the mean (true, openloop or analysis) water depth over the year before the start of the assimilation window. It is therefore different for each member of the ensemble. Firstly, in experiment PE2, there will still be no bias between the observed and simulated river bathymetry to observe how the assimilation of anomalies performs. Similarly to PE1, we expect this experiment to be able to retrieve the true control and state variables. Finally, the last experiment, PE3, which introduces a constant relative bias between CTRIP and SWOT, will be carried out. For this experiment, we anticipate that the assimilation will still be able to retrieve the model state variables. The use of anomalies as observations should limit the impact of the inserted bias. We do not exclude the possibility however that it may be slightly echoed in the control variables.

5 Assimilation sensitivity tests

## 5.1 Model spinup sensitivity tests

The objective of the spinup sensitivity tests is to evaluate the minimum spinup period required by the model before applying data assimilation. For this purpose, the model is run several times across 2 years, from 19 December 2006 to 22 December 2008, corresponding to 35 windows of 21 d (735 d).

A first simulation is run using the true Manning spatial distribution (see Eq. 18) over the 2-year time period. We then run 18 additional simulations over the same period with a varying length of the spinup period (see Table 1). Initially, the simulation setup corresponds to the openloop configuration with the openloop Manning spatial distribution (see Eq. 19). At a given time during the first year, the Manning spatial distribution is instantaneously changed to the true distribution (see Eq. 18) and the model is run until the end of the 2 years with the true Manning spatial distribution. Table B1 summarizes, for each run, the date when the Manning coefficients are changed. The spinup period (expressed as a number of windows of 21 d) corresponds to the period between when the Manning distribution is changed and the start of the second year, i.e., 1 January 2008.

To evaluate the spinup impact, the relative difference between the reference run and the test runs is evaluated over the second year of simulation (from 1 January to 22 December 2008) and averaged over every window of 21 d. Figure 5 presents the results for the spinup sensitivity test. Each test run (on the x axis) is identified by its corresponding spinup period length (expressed as the number of windows of 21 d; see Table B1). We then count (on the y axis) the number of 21 d windows during which the relative error between the test run and the reference run is higher than a given threshold. We assume that the spinup period is long enough when this number is equal to 0. This number is evaluated from the basin-averaged relative difference and from the relative difference at the downstream station of Óbidos, both in terms of water depth and discharge. Note that in Fig. 5, when the number of spinup windows of 21 d is equal to 4 on the x axis, the change from the openloop Manning distribution to the true one is imposed on 9 October 2007. Similarly, when this number is equal to 10, the change is imposed on 5 June 2007. Note also that two thresholds are considered, 0.01 and 0.001. Basin-averaged results are not sensitive to this threshold. There are some differences at Óbidos; however, we retain the basin-averaged results to evaluate the required model spinup period.

Figure 5Results for the spinup sensitivity test. Each test run is represented along the x axis and referenced by its number of spinup windows. The y axis displays the number of windows during which the relative difference between the true run and the openloop run in which the Manning spatial distribution is modified is above the chosen threshold. These statistics are obtained for the discharge (a, c) and the water depth (b, d) and evaluated over the entire basin (a, b) and at the downstream station of Óbidos (c, d). Note that the vertical dashed line corresponds to the minimum model spinup period retained in this study.

From all of the results we conclude that a minimum spinup period of four windows of 21 cycles, i.e., 84 d, is required. This period corresponds to the basin concentration time or, in other terms, the required time for the river network to totally empty. In the following sensitivity tests, the model runs start on 9 October 2007, to be consistent with these results.

## 5.2 Data assimilation starting date sensitivity tests

To evaluate the impact of the starting date, a set of 17 one-cycle long data assimilation experiments is carried out over the second running year. All experiments have the same general configuration, except for the initial date starting from 1 January 2008 and shifted by 21 d until 22 December 2008. This means that the last experiment starts on 2 December 2008. The performance of each experiment is evaluated by simply evaluating the spatial average difference between the analysis and the true Manning coefficients. Results presented in Fig. 6a and b indicate that there are no significant differences between the data assimilation experiments (for all 17 experiments, the error of the updated Manning coefficients with respect to the true value of the coefficients is below 5 %).

Figure 6Top: relative error (to the truth) and (bottom) dispersion of the analysis control ensemble (averaged over all control variables) for the sensitivity tests to (a, b) the data assimilation starting date, (c, d) the ensemble size ne, and (e, f) the background error standard deviation σb. For each test, a set of one-cycle long data assimilation experiments is run.

Note that the results in Fig. 6a–b show a slight increase in the errors for the experiment starting at the end of the year, presently from 9 September to 2 December 2008. This period corresponds to the low-flow season in the Amazon hydrological cycle. Concerning the sensitivity analysis results , the water depths showed a very low sensitivity to the Manning coefficient during the low-flow season. As a consequence of data assimilation, the EnKF is less effective in low-flow seasons in the correction of the Manning coefficient. The analysis relative error (in Fig. 6a) and the analysis ensemble dispersion (in Fig. 6b) are therefore higher in the low-flow season.

For all of the following sensitivity tests, we are only considering therefore one-assimilation-cycle experiments, which will start on 1 January 2008.

## 5.3 Ensemble size sensitivity tests

The next sensitivity test is dedicated to the ensemble size ne, a critical parameter of any EnKF algorithm. This parameter has to be high enough so as to accurately estimate the Kalman gain matrix but low enough to limit the computational cost (the higher ne, the more model runs are required over each data assimilation window to obtain the analysis estimate of the Manning coefficients).

We consider different ensemble sizes through a one-assimilation-cycle experiment; ne varies between 10 and 200. Figure 6c compares the analysis Manning coefficient relative error for each ensemble size ne as in Fig. 6a. Results show that the analysis relative error decreases when the ensemble size ne increases. For an ensemble size ne equal to 20, the analysis error is below 5 %. Also, for an ensemble size ne higher than 50, the analysis relative error has converged to a constant value, while the analysis ensemble dispersion shown in Fig. 6d stabilizes.

These results indicate that the ensemble size for future data assimilation experiments should be at least equal to 20; we consider ne=25 in the present study to limit computational time.

## 5.4 Model error standard deviation sensitivity tests

In this study, we only consider parameter estimation, implying that the background error covariance matrix is associated with the parameter space. We assume that the errors in the Manning coefficients are independent, so that the background error covariance matrix is initially specified as a diagonal matrix, where all diagonal elements correspond to the error variances in the spatially varying Manning coefficients and are equal to the same variance (σb)2, where σb is the background error standard deviation. Similarly to previous sensitivity tests, we study here the sensitivity of the data assimilation results to the value of σb through a one-assimilation-cycle experiment. Figure 6c shows, in logarithmic scale on the x axis, the relative error of the updated Manning coefficient with respect to the true coefficient. The analysis error curve shows a decreasing behavior until σb is on the order of 0.4. It then increases again.

Note that the actual Manning coefficient error before data assimilation is equal to 0.33 (see the blue curve in Fig. 6c showing the openloop Manning coefficient error). Consistently, the best data assimilation results are obtained when σb provides a good approximation of the real error standard deviation. Note also that when σb becomes too small, data assimilation is less effective. The EnKF algorithm is known to be under-dispersive. Therefore, for future data assimilation experiments, when updating the error covariance matrix from one cycle to another, we will need to make sure that the ensemble dispersion is high enough to cover possible model behavior over the forecast time window by imposing a minimum value for the error variance. Given the sensitivity test results, the minimum value for σb is set to 0.005. Thus, in the following data assimilation experiments, we use the analysis error covariance matrix as the background error covariance matrix for the next assimilation cycle while applying the minimum threshold value to the matrix diagonal terms.

6 Data assimilation results

We now present the results from the data assimilation experiments described in Table 2 and in Sect. 4.2. Recall that these experiments aim at correcting a set of nine multiplying factors applied to the Manning coefficient distribution and constant over nine hydro-geomorphological zones dividing the Amazon basin.

## 6.1 Assimilation of water depths (PE1)

Figure 7 gives, for each zone, the time evolution of the mean analysis control variable (red) with its dispersion (even though it is very narrow) compared to the truth (black) and the first guess (blue). Similarly, Fig. 8 shows, for each zone, the time evolution of the analysis water depth (red) compared to the truth (black) and the openloop (blue). To generate one plot per zone, we use, for each time step, the ensemble of water depths over all cells in the zone and estimate the median value, the first decile and the ninth decile. Furthermore, zone-averaged normalized RMSEn statistics are given in Tables D1 and D2 in Appendix D.

Figure 7Control variable assimilation results for the PE1 experiment: evolution of the ensemble-averaged analysis control variable (red line) for each zone (one zone per subplot) with respect to the assimilation cycle and compared to the corresponding true value (black line) and the openloop value (blue line).

Figure 8Water depth assimilation results for the PE1 experiment: daily evolution of the ensemble-averaged analysis water depth (red lines) compared to the true water depths (black lines) and the openloop water depths (blue line). For each zone (one per subplot), the median (full line), the first decile (dotted line) and the ninth decile (dashed line) of water depth ensembles over all grid cells in the zone are represented.

In general, the PE1 experiment gives very good results as the analysis mean for each zone retrieves the true value with a very low dispersion. However, the data assimilation algorithm features spatially dependent behavior; see Fig. 7.

• Firstly, the control variable for zones 1, 2, 3, 4, 5 and 9 converges instantaneously (in only one assimilation cycle) toward the true values and remains at these true values for all following cycles.

• A similar behavior can be observed for zones 6, 7 and 8 from the first cycle to around the ninth cycle. For the remaining cycles, we notice an increase in the mean analysis estimate, along with the ensemble dispersion, until around the thirteenth cycle, and afterwards a decrease back to the true value.

These observations can be explained with the global sensitivity analysis results for water depths in and using Fig. 8.

• Firstly, zones 1, 2 and 3 correspond to the river main stem, whilst zones 4, 5 and 9 correspond to the main left-bank tributaries, namely the Caquetá/Japurá River (zone 4) and the Negro River (zone 5). In these zones, as the Manning coefficient is directly corrected in the first assimilation cycle, the analysis water depths (red) overlap the true water depth (black). Furthermore, Fig. 8 for these zones shows that the openloop (blue) and true (black) water depths have a very similar variability in time but differ by a constant bias. The global sensitivity analysis results in these zones showed a constant first-order sensitivity in time to the Manning coefficient all year long. This first-order sensitivity means that the contribution of the Manning coefficient to the water depth is linear. Correcting the Manning coefficient in these zones equates therefore to correcting the bias between the openloop and the true water depths.

• Subsequently, zones 6, 7 and 8 correspond to right-bank tributaries, namely the Juruá and Purus rivers (zone 6), the Madeira River (zone 7) and the Tapajós and Xingu rivers (zone 8). These right-bank tributary zones are characterized by a strong seasonal cycle (see Fig. 8, zones 6–8). By comparing the corresponding plots in Figs. 7 and 8, we notice that the period when the analysis control variable spreads from the truth corresponds to the low-flow season in these zones. According to the global sensitivity analysis results, water depths in these zones are less sensitive to the Manning coefficient in low-flow conditions. Additionally, there is very little water in the zones during this period and, consequently, the background control ensemble is not spread out enough for the EnKF to be efficient. Meanwhile, the EnKF still sees that the observations are higher than the model predictions (as seen with the positive innovations in these zones shown in Fig. E1). In order to increase the simulated water depth, the EnKF therefore corrects the Manning coefficient so that its value rises (a higher Manning coefficient means a slower flow velocity and then a higher simulated water depth). Finally, once the low-flow season ends, the analysis Manning coefficient converges back to the truth (see the last assimilation cycles).

## 6.2 Assimilation of water anomalies (PE2 and PE3)

The assimilation of anomalies has been tested over two experiments denoted PE2 and PE3 (see Table 2). Note that the observation error standard deviation σo remains equal to 10 cm as, with these experiments, we only aim at testing the feasibility of assimilating water depth anomalies. In the PE2 experiment, there is no difference of bathymetry between the simulated and observed water anomalies, whilst the river bankful depth is different in the PE3 experiment. Figure 9 gives, for each zone, the time evolution of the mean analysis control variable for PE2 (orange) and PE3 (purple) with their dispersion compared to the truth (black) and the first guess (blue). Again, zone-averaged normalized RMSEn statistics for these experiments are given in Tables D1 and D2 in Appendix D.

Figure 9Control variable assimilation results for the PE2 and PE3 experiments: evolution of the ensemble-averaged analysis control variable for the PE2 experiment (orange line) and the PE3 experiment (purple line) for each zone (one zone per subplot) with respect to the assimilation cycle and compared to the corresponding true value (black line) and the openloop value (blue line).

The general configuration of experiments PE1 and PE2 is the same. The only difference between the two experiments is the nature of the observations: water depths for PE1 and water anomalies for PE2. Like experiment PE1, experiment PE2 (the orange line in Fig. 9) gives very good results. All control variables converge toward the true values more or less rapidly. The control variable for zones 4 to 8 instantaneously (in only one assimilation cycle) converges toward the true value, while the convergence is slower for the remaining zones as around five assimilation cycles are needed to retrieve the true value. This slower convergence for these zones can be explained by the fact that the magnitude of the observed water anomalies is generally smaller compared to the water depths assimilated in PE1. The ratio between the observation error and the observations themselves is also then smaller, resulting in a smaller EnKF gain. The control variable correcting increment is smaller for the anomalies therefore than for the water depth, and more cycles are needed to converge.

As for the PE3 experiment results – the purple line in Fig. 9 – the assimilation still gives good results, but not as good as in previous experiments. The control variables still instantaneously converge toward the truth in zones 4, 5 and 6. Concerning the other zones, there is no clear convergence towards the true value. Instead, the analysis control variables either get closer to but remain distinct from the true value (zones 2 and 3) or temporarily deviate from the truth during the experiment (zones 1, 7, 8 and 9). Still, despite the control variables not clearly converging towards the truth, the simulated water depths using the analysis control variables, presented in Fig. 10, display a very low deviation from the truth, confirming the general good performance of the data assimilation procedure.

Figure 10Water depth assimilation results for the PE3 experiment: daily evolution of the ensemble-averaged analysis water depth (red lines) compared to the true water depths (black lines) and the openloop water depths (blue line). For each zone (one per subplot), the median (full line), the first decile (dotted line) and the ninth decile (dashed line) of water depth ensembles over all grid cells in the zone are represented.

Comparing the control variables and water depth time variations, it appears that the control variables are deviating from the truth mainly when water depths are decreasing, in between the high-flow and low-flow seasons. During this period, the model goes from a state where floods occur to a state where there is no flood, particularly in zones 2–3 and 7–8 with a clear seasonal cycle. On the other hand, no flood event was spotted in zones 4, 5 and 6, where the best results were obtained. In ISBA-CTRIP, the activation/deactivation of the flooding scheme is triggered by the simulated water depth exceeding/becoming lower than the river bankful depth. However, in experiment PE3, this river bankful depth differs between the model and the observation because we artificially inserted a bias between the simulated and observed water depths. More specifically, the river bankful depth is lower in the model than in the observations. Therefore, the control variables deviating from the true value when the water depth is decreasing are indicative of the simulated water depths presenting floods when there is no flood in the observations. The activation of the flood scheme changes the dynamics of the water depth in the river. As part of the water in the river is spilled into the floodplains, water-level variations in the river are slower. The flooded model needs then a stronger variation of the Manning coefficient in order to catch the non-flooded observed water level. Ultimately, the stronger variations of the estimated Manning coefficient allow the retrieval of the true water depths.

7 Discussions

The results presented here are preliminary investigations into the assimilation of a SWOT water surface elevations product into a large-scale hydrological model. This study focused on the correction of a critical river parameter, here the river Manning coefficient.

For all the simulations, the Manning coefficient distribution is set to be constant in time. For each grid cell, one value of the Manning coefficient is used for the entire simulation. However, in reality, it is commonly accepted that this parameter could vary in time, depending on the seasonal cycle or also some extreme hydrological event such as large flooding events, which can even modify the bathymetry itself. The results showed that, for this OSSE, the data assimilation is able to converge quite quickly towards the true value. For example, for the left-bank tributary zones, namely zones 4 and 5, in every experiment, the associated control variable converges toward the true value in only one assimilation cycle. In a real-case experiment, we could expect to retrieve the temporal variations of the Manning coefficient from one assimilation cycle to another. The good performances of the assimilation platform are mainly related to the fact that, in the ISBA-CTRIP model, the water depth diagnostic variables are sensitive to the Manning coefficient . Simulated water depths are not then that sensitive to the Manning coefficient (e.g., in the right-bank tributary zones during the low-flow season), and the data assimilation performances slightly degrade. These results are specific to the ISBA-CTRIP model. To apply the same method to another model and even another region, one needs to first study the sensitivity of the (other) model to the (other) study region.

Secondly, the study investigates the potential of assimilating water surface anomalies instead of direct water surface elevations. The use of water surface anomalies is driven by the need to avoid potential bias between the control and the observed variables. Indeed, a bias will likely be introduced from a discrepancy between the elevation of the river bed in the model and in the observations with respect to a reference surface such as a geoid or an ellipsoid (see Fig. 4). Under the assumption that the water variations are the same between the model and the observations, the use of anomalies as observed variables should prevent this bias from affecting the results.

Another likely bathymetry error corresponds to errors in the river bankful depth, the river width and, more generally, representativeness errors due to the use of a simplified bathymetry. This type of error was artificially introduced by perturbing the model bankful depth in PE3. Specifically to ISBA-CTRIP, the river bankful depth controls when the model floods, which has a direct impact on the water depth dynamics. Background anomalies and observed anomalies may therefore present different dynamics where either the observed variables flood while the model variables do not or inversely. Experiment PE3 illustrated the effect of this bias on the variations of Manning coefficients. Instead of being maintained at their true value, their value slightly varied around the true values to account for the difference in dynamics between the model and the observations. However, one could expect even more variations in the updated control variables around the true value to increase if more and different errors in the bathymetry exist (which will likely happen with more realistic experiments).

Furthermore, real-case experiments may suffer from another type of bias originating from errors in the atmospheric forcing and in the surface and sub-surface runoff provided by the LSM (i.e., ISBA). Both control the amount of water entering the river system. A basic idea to attenuate this issue would be to consider their uncertainties when generating the background ensemble. This approach may become limited when the errors in the forcing are very large. It may also lead to unrealistic, even non-physical, updated Manning coefficient values. Besides, when correcting the model's parameters, we only re-distribute the water volume within the basin, whilst such types of errors could actually require addition/withdrawal of water to/from the system. The potential solution would then be to include such forcing or LSM variables in the control vector or to update variables closer to the observations, including CTRIP's state variables such as the water storage. This would change the current framework to a dual state-parameter estimation approach.

Noting this, there may be an additional advantage in assimilating water anomalies instead of the direct water depths. Comparing the Kalman gain between the PE1 experiment (which assimilated direct water depths) and the PE2 and PE3 experiments (which assimilated water anomalies), the gain magnitude for the water anomalies is lower than the water depth gain magnitude. This is to be expected as the Kalman gain is stochastically estimated from an ensemble of model runs and the magnitude of the simulated water anomalies is lower than the simulated water depth magnitude. The consequence of this lower gain is that the correction applied to the control variable is also lower. If the convergence towards the true value takes more than one assimilation cycle, the divergence from it in the presence of bias is also diminished.

Beyond the bias issues, real-data assimilation configuration will raise the question of the unknown true parameter value, if it exists. Firstly, there will be no true estimates of the control variables with which to compare the assimilated simulations. The assimilation will be evaluated against the observed variables directly. Then, with the real data, model structure error will be introduced. To our knowledge, the model structure error is still a challenging error to estimate, and most data assimilation studies assume no model structure error. However, when using an ensemble-based model, a possibility for dealing with such structure error is to enrich the background ensemble by considering more uncertainties from variables that are not necessarily in the control vector (including errors in the forcing or parameters from both the LSM and the RRM). The capacity of such ensembles to tackle model structure errors can be tested using synthetic observations based on a different hydrological model.

Additionally, the real SWOT data will have a finer resolution than the synthetic SWOT data currently used. Still, the coarser-resolution observations are found to provide information to constrain the model and to improve the value of the spatially varying Manning coefficients. Then, when moving to real-data assimilation experiments, we can consider averaging the fine-scale SWOT product over a coarse grid cell corresponding to an ISBA-CTRIP cell so that the resolutions of the observations and the model match.

Ultimately, heading towards more realistic experiments also implies more realistic representations of the observation errors. But more complex errors should be expected for the real SWOT product. Some correlated errors along the swath should be expected due to the instrument but also due to the motion of the satellite and delays due to propagation of the electromagnetic waves in the ionosphere and atmosphere. Nevertheless, as part of the mission science requirements, the sum of all errors should not exceed 10 cm when the measured data are averaged over 1 km2. There should also be additional errors affecting the observations that can be described as “detectability errors”, such as “dark water”, “layover” and “false positive”. “Dark water” pixels will result in missing data and will not be included in the assimilation, and “layover” pixels will have a higher vertical error due to surrounding vegetation and topography, but should also be flagged . Eventually, “false positive” pixels (i.e., pixels classified as water, whereas they correspond to land) will be the most complicated to anticipate. With these additional errors taken into account in the assimilation framework, one could expect a slower convergence of the control variables. Note that these aspects of the measurement errors are related to water surface elevation products.

8 Conclusions

This study presents a series of OSSEs that assimilates SWOT-like synthetic observations of water depths and anomalies into the ISBA-CTRIP large-scale hydrological model in order to correct the spatially distributed Manning coefficient. The study is applied over the Amazon River basin. Prior to the actual data assimilation experiments, a series of sensitivity tests was conducted to study the sensitivity of the data assimilation performance to the different features of the EnKF, in particular the size of the ensemble. Then, three full-year data assimilation experiments were run based on the outcomes of the sensitivity tests. For all three experiments, the assimilation was able to track back the true value of the Manning coefficient distribution.

The sensitivity tests successively studied the sensitivity of the data assimilation platform to model the spinup period, the experiment starting date through the hydrological year, the size of the ensemble for the EnKF and the initial control variable standard deviation. These tests showed first of all that a spinup of four windows of 21 d is sufficient for the transitional period due to a sudden change in the Manning coefficient distribution in the model. The second sensitivity test then demonstrated that the data assimilation performance is not clearly sensitive to the period of the hydrological year when the experiment is done. The next sensitivity test informed us that an ensemble of 25 members was enough to obtain good EnKF performances. Finally, the last sensitivity test studied the effect of the control variable error standard deviation, and the best performances were obtained for a prior standard deviation between 0.05 and 0.75, which corresponds to the order of magnitude of the actual error between the true and openloop control variables.

Using these results, we run three data assimilation experiments over approximately 1 year (the year 2008). The first experiment (PE1) assimilated direct pseudo-observations of water depths. Results showed the capability of the data assimilation algorithm to converge very quickly toward the true value, generally in only one assimilation cycle. Still, during the low-flow season, the assimilation was less effective in the zones with a clear seasonal cycle. This was explained by the fact that during this period, water depths are less sensitive to the Manning coefficient.

The other two experiments (PE2 and PE3) introduced and tested the assimilation of water surface anomalies. The anomalies were obtained by subtracting a yearly-averaged water depth from the current water depth in both the model and the observations. The first water anomaly assimilation experiment (PE2) provided very good results, with all the control variables also converging towards their associated true values. However, the convergence was slightly slower than during the assimilation of the water depth (between one and five assimilation cycles). This is explained by a lower Kalman gain when updating the Manning coefficient.

The last experiment also assimilated water anomalies (PE3). For this particular experiment however a bias was artificially introduced in the river bathymetry. For this experiment, the assimilation was still able to get closer to the true value, but, for some zones like the mainstream zones, there was no convergence as the control variables kept varying around the true value. This phenomenon was explained by the detection of floods in the model but not in the observations. Still, the statistics of the Manning coefficient distribution and the simulated water depths after assimilation remain as improved compared to the openloop simulations. Ultimately, these two experiments demonstrated the feasibility of assimilating water surface anomalies to correct the Manning coefficient.

These experiments offer several perspectives. They mainly consist of approaching more realistic data assimilation experiments which take into account more sources of uncertainties between the model and the observations, such as correlated observation errors or uncertainties in the forcing and the LSM surface and sub-surface runoff. To test the platform's limitations regarding the DEM/bathymetry bias issue, one can use simulated water surface elevations referenced to a geoid instead of water depths from the model or even assimilate water depths from another model where the bathymetry differs. As most applications generally require a good estimate of the river flow and river water volume, another lead of an investigation could maintain the SWOT-based OSSE framework but correct the simulated water storage and/or discharge, either as a single state estimation framework or as a dual state parameter estimation framework (similarly to dual discharge bathymetry inference methods developed by , and , for some hydraulic models). Moreover, along with observations of water surface elevations, SWOT will also provide two-dimensional maps of river widths and surface slopes. One can also study the possibility of assimilating such products to constrain other parameters such as the bankful depth that controls the model flooding scheme.

Appendix A: Definition of error covariance matrices

The background error cross-covariance matrices [PHT]e,k and [HPHT]e,k are defined based on , , and so that

$\begin{array}{ll}\left[{\mathbf{PH}}^{T}{\right]}_{e,k}=& \phantom{\rule{0.25em}{0ex}}\left({n}_{\mathrm{e}}-\mathrm{1}{\right)}^{-\mathrm{1}}\left({\mathbf{X}}_{e,k}^{\mathrm{b}}-\stackrel{\mathrm{‾}}{{\mathbf{X}}_{•,k}^{\mathrm{b}}}{.1}_{{n}_{\mathrm{e}}}^{T}\right)\\ \text{(A1)}& & \cdot {\left(\mathcal{H}\left({\mathbf{X}}_{e,k}^{\mathrm{b}}\right)-\stackrel{\mathrm{‾}}{\mathcal{H}\left({\mathbf{X}}_{•,k}^{\mathrm{b}}\right)}\cdot {\mathbf{1}}_{{n}_{\mathrm{e}}}^{T}\right)}^{T}\end{array}$

and

$\begin{array}{ll}\left[{\mathbf{HPH}}^{T}{\right]}_{e,k}=& \phantom{\rule{0.25em}{0ex}}\left({n}_{\mathrm{e}}-\mathrm{1}{\right)}^{-\mathrm{1}}\left(\mathcal{H}\left({\mathbf{X}}_{e,k}^{\mathrm{b}}\right)-\stackrel{\mathrm{‾}}{\mathcal{H}\left({\mathbf{X}}_{•,k}^{\mathrm{b}}\right)}{.1}_{{n}_{\mathrm{e}}}^{T}\right)\\ \text{(A2)}& & \cdot {\left(\mathcal{H}\left({\mathbf{X}}_{e,k}^{\mathrm{b}}\right)-\stackrel{\mathrm{‾}}{\mathcal{H}\left({\mathbf{X}}_{•,k}^{\mathrm{b}}\right)}\cdot {\mathbf{1}}_{{n}_{\mathrm{e}}}^{T}\right)}^{T}.\end{array}$

In those definitions, ${\mathbf{X}}_{e,k}^{\mathrm{b}}$ is the control matrix storing the ne control vectors ${\mathbit{x}}_{k}^{b,\left[l\right]},\phantom{\rule{0.33em}{0ex}}l=\mathrm{1}\phantom{\rule{0.25em}{0ex}}\mathrm{\dots }\phantom{\rule{0.25em}{0ex}}{n}_{\mathrm{e}}$ from the background ensemble such that

${\mathbf{X}}_{e,k}^{\mathrm{b}}=\left[{\mathbit{x}}_{k}^{b,\left[\mathrm{1}\right]}\phantom{\rule{0.25em}{0ex}}\mathrm{\dots }\phantom{\rule{0.25em}{0ex}}{\mathbit{x}}_{k}^{b,\left[{N}_{e}\right]}\right].$

Next, $\mathcal{H}\left({\mathbf{X}}_{e,k}^{\mathrm{b}}\right)$ represents the same control matrix but mapped into the observation space:

$\mathcal{H}\left({\mathbf{X}}_{e,k}^{\mathrm{b}}\right)=\left[\mathcal{H}\left({\mathbit{x}}_{k}^{b,\left[\mathrm{1}\right]}\right)\phantom{\rule{0.25em}{0ex}}\mathrm{\dots }\phantom{\rule{0.25em}{0ex}}\mathcal{H}\left({\mathbit{x}}_{k}^{b,\left[{n}_{\mathrm{e}}\right]}\right)\right].$

Also, $\stackrel{\mathrm{‾}}{{\mathbf{X}}_{•,k}^{\mathrm{b}}}$ and $\stackrel{\mathrm{‾}}{\mathcal{H}\left({\mathbf{X}}_{•,k}^{\mathrm{b}}\right)}$ are the corresponding ensemble expectations such that

$\stackrel{\mathrm{‾}}{{\mathbf{X}}_{•,k}^{\mathrm{b}}}=\frac{\mathrm{1}}{{n}_{\mathrm{e}}}\sum _{l=\mathrm{1}}^{{n}_{\mathrm{e}}}{\mathbit{x}}_{k}^{b,\left[l\right]}\phantom{\rule{1em}{0ex}}\stackrel{\mathrm{‾}}{\mathcal{H}\left({\mathbf{X}}_{•,k}^{\mathrm{b}}\right)}=\frac{\mathrm{1}}{{n}_{\mathrm{e}}}\sum _{l=\mathrm{1}}^{{n}_{\mathrm{e}}}\mathcal{H}\left({\mathbit{x}}_{k}^{b,\left[l\right]}\right).$

These vector dimensions are nx and ny,k, respectively. Finally, ${\mathbf{1}}_{{n}_{\mathrm{e}}}$ is a vector of size ne containing only 1s.

Appendix B: Spinup sensitivity test additional tables

Table B1 summarizes, for each run, the date when the Manning coefficients are changed. The spinup period (expressed as a number of windows of 21 d) corresponds to the period between when the Manning distribution is changed and the start of the second year, i.e., 1 January 2008.

Table B1Spinup sensitivity test setup: each run consists of an approximately 2-year long ISBA-CTRIP run starting on 16 December 2006 (column 2) and ending on 22 December 2008. After a given number of 21 d windows during the first year (column 3), the Manning distribution is changed to replicate an assimilation update step, while the reference run (row 2) used the same Manning for the entire run. The period between the instant when the Manning distribution is changed and the beginning of the second year of simulation corresponds to the spinup period (column 4). The simulated water depth and discharge during the second year of the run are then compared to the reference run in order to evaluate the impact of the spinup.

Appendix C: Sensitivity test results per zone

Figure C1 displays the sensitivity test results (as in Fig. 6) but for each zone separately.

Figure C1Top: relative error (to the truth) and (bottom) dispersion of the analysis control ensemble for each zone for the sensitivity tests to (a, b) the data assimilation starting date, (c, d) the ensemble size ne, and (e, f) the background error standard deviation σb. For each test, a set of one-cycle long data assimilation experiments is run. Top only: the relative errors in zone 1 (dark blue line), zone 2 (orange line), zone 3 (yellow line), zone 4 (purple line), zone 5 (green line), zone 6 (light blue line), zone 7 (burgundy red line), zone 8 (pink line) and zone 9 (gray line) are compared to the basin-averaged openloop relative error (black line).

Appendix D: Assimilation performances at the zone scales

At each grid cell of the study domain, we estimated the normalized root mean square error (RMSEn) before and after assimilation by comparing the openloop and mean analysis simulations, respectively, to the true simulation, for both the simulated water depth and discharge:

$\begin{array}{}\text{(D1)}& {\text{RMSEn}}_{i}=\frac{\sqrt{\frac{\mathrm{1}}{N}{\sum }_{n=\mathrm{1}}^{N}\left({V}_{n,i}^{\ast }-{V}_{n,i}^{t}{\right)}^{\mathrm{2}}}}{\stackrel{\mathrm{‾}}{{V}_{.,i}^{t}}},\end{array}$

where the state variable V is either the discharge or the water depth, n is the time index, i is the grid-cell index, the t superscript represents the “truth” and the superscript represents either the openloop or analysis ensemble average.

Tables D1 and D2 give these statistics for all experiments averaged over each control zone. Table D1 shows the water depth zone-averaged RMSEn and Table D2 shows the discharge zone-averaged RMSEn.

Table D1Zone-averaged RMSEn for the openloop water depths (row 2) and the ensemble-averaged analysis water depths (rows 3–5) compared to the true water depths.

Table D2Zone-averaged RMSEn for the openloop discharges (row 2) and the ensemble-averaged analysis discharges (rows 3–5) compared to the true discharges.

Appendix E: Assimilation results: additional figures

Figure E1 displays the evolution along the assimilation cycles of the averaged innovations. The sign of the innovation will drive the direction of the correction brought by the assimilation.

• A positive innovation means that the observations are higher than the model. Physically, the simulated flow is too fast and the water leaves the river reservoir too quickly. This means that the river Manning coefficient needs to be increased to slow the flow.

• A negative innovation means that the observations are lower than the model. Physically, the simulated flow is too slow and the water remains in the river reservoir. This means that the river Manning coefficient needs to be increased to accelerate the flow.

Figure E1Evolution of the EnKF innovations (“$\left({\mathbit{y}}_{k}^{\mathrm{o},\left[l\right]}-{\mathcal{H}}_{k}\left({\mathbit{x}}_{k}^{b,\left[l\right]}\right)\right)$” term in Eq. 13) with respect to the assimilation cycle for PE1 (red line), PE2 (orange line) and PE3 (purple line). For each zone (one zone per subplot), the displayed innovation is the average of all the innovations in the corresponding zones.

Code and data availability
Code and data availability.

The CTRIP code is open source and is available as a part of the surface modeling platform called SURFEX, which can be downloaded at http://www.cnrm-game-meteo.fr/surfex/ . SURFEX is updated approximately every 3 to 6 months and the CTRIP version presented in this paper is from SURFEX version 7.3. If more frequent updates are needed, please follow the procedure informing you of how to obtain an SVN or Git account in order to access real-time modifications of the code (see the instructions in the previous link). The ISBA-CTRIP model is coupled to the DA codes via the OpenPalm coupler available at http://www.cerfacs.fr/globc/PALM_WEB/ . To get the DA routines coupled to ISBA-CTRIP with OpenPalm, please directly contact Charlotte Marie Emery (charlotte.emery@jpl.nasa.gov) or Sylvain Biancamaria (sylvain.biancamaria@legos.obs-mip.fr). To obtain the GSWP3 forcings, please refer to the following DOI: https://doi.org/10.20783/DIAS.501 (Kim2017).

Author contributions
Author contributions.

CME designed and carried out the experiments under the supervision of SB and AB, as part of her PhD project. SB provided the SWOT-based observation mask used to generate the synthetic observations and AB provided support for the use of the ISBA-CTRIP hydrologic model. CME prepared the manuscript with contributions from all the co-authors.

Competing interests
Competing interests.

The authors declare that they have no conflict of interest.

Acknowledgements
Acknowledgements.

This work was supported by the CNES through a grant from the Terre-Océan-Surfaces Continentales-Atmosphère (TOSCA) committee assigned to the project entitled “Towards an improved understanding of the global hydrological cycle using SWOT measurements”. Charlotte Marie Emery received doctoral research support from a CNES/région Midi-Pyrénées grant. Charlotte Marie Emery and Cédric H. David received support from the Jet Propulsion Laboratory, California Institute of Technology, under a contract with NASA, including grants from the SWOT Science Team and the Terrestrial Hydrology Program.

Financial support
Financial support.

This research has been supported by the Région Midi Pyrénées, the CNES, and CNES TOSCA, a grant from the SWOT Science Team: NASA ROSES 2015 SWOT Science Team (NNH15ZDA001N-SWOT): Integration of SWOT Measurements into global terrestrial hydrologic models (15-SWOT15-0014), and a grant from the NASA ROSES Terrestrial Hydrology Program: NASA ROSES 2017 Terrestrial Hydrology (NNH17ZDA001N-THP): Filling the Space/Time Gaps Between Surface Water Retrievals (17-THP17-0012).

Review statement
Review statement.

This paper was edited by Markus Hrachowitz and reviewed by Paul Bates, Claire I. Michailovsky, and Hessel Winsemius.

References

Andreadis, K. M. and Schumann, G. J. P.: Estimating the impact of satellite observations on the predictability of large-scale hydraulic models, Adv. Water Res., 73, 44–54, https://doi.org/10.1016/j.advwatres.2014.06.006, 2014. a

Andreadis, K. M., Clark, E. A., Lettenmaier, D. P., and Alsdorf, D. E.: Prospects for river discharge and depth estimation through assimilation of swath-altimetry into a raster-based hydrodynamics model, Geophys. Res. Lett., 34, L10403, https://doi.org/10.1029/2007GL029721, 2007. a

Beighley, R. E., Eggert, K. G., Dunne, T., He, Y., Gummadi, V., and Verdin, K. L.: Simulating hydrologic and hydraulic processed throughout the Amazon basin, Hydrol. Process., 23, 1221–1235, https://doi.org/10.1002/hyp.7252, 2009. a

Beven, K. and Freer, J.: Equifinality, data assimilation, and uncertainty estimation in mechanistic modelling of complex environmental systems using GLUE methodology, J. Hydrol., 249, 11–29, https://doi.org/10.1016/S0022-1694(01)00421-8, 2001. a

Beven, K. J.: Down to basics: runoff processes and the modelling of processes, in: Rainfall-Runoff Modelling, John Wiley and Sons, West Sussex, UK, chap. 1, 1–22, 2012. a

Biancamaria, S., Bates, P., Boone, A., and Mognard, N.: Large-scale coupled hydrologic and hydraulic modelling of teh Ob river in Siberia, J. Hydrol., 379, 136–150, https://doi.org/10.1016/j.jhydrol.2009.09.054, 2009. a

Biancamaria, S., Durant, M., Andreadis, K. M., Bates, P. D., Boone, A., Mognard, N. M., Rodriguez, E., Alsdorf, D. E., Lettenmaier, D. P., and Clark, E. A.: Assimilation of virtual wide swath altimetry to improve Arctic river modeling, Remote Sens. Environ., 115, 373–381, https://doi.org/10.1016/j.rse.2010.09.008, 2011. a, b

Biancamaria, S., Lettenmaier, D. P., and Pavelsky, T. M.: The SWOT mission and its capabilities for land hydrology, Surv. Geophys., 37, 307–337, https://doi.org/10.1007/s10712-015-9346-y, 2016. a

Bierkens, M. F. P.: Global hydrology 2015: State, trends, and directions, Water Resour. Res., 51, 4923–4947, https://doi.org/10.1002/2015WR017173, 2015. a

Birkett, C. M., Mertes, L. A. K., Dunne, T., Costa, M. H., and Jasinski, M. J.: Surface water dynamics in the Amazon basin: Application of satellite radar altimetry, J. Geophys. Res., 107, L10403, https://doi.org/10.1029/2001JD000609, 2002. a

Bishop, C. H., Etherton, B. J., and Majumbar, S. J.: Adaptative sampling with the ensemble transform Kalman filter. Part I: Theoretical aspects, Mon. Weather Rev., 129, 420–436, https://doi.org/10.1175/1520-0493(2001)129<0420:ASWTET>2.0.CO;2, 2001. a

Boone, A., Calvet, J.-C., and Noilhan, J.: Inclusion of a Third Soil Layer in a Land Surface Scheme Using the Force-Restore Method, J. Hydrometeorol., 38, 1611–1630, https://doi.org/10.1175/1520-0450(1999)038<1611:IOATSL>2.0.CO;2, 1999. a

Brisset, P., Monnier, J., Garambois, P.-A., and Roux, H.: On the assimilation of altimetry data in 1D Saint-Venant river models, Adv. Water Res., 119, 41–59, https://doi.org/10.1016/J.advwatres.2018.06.004, 2018. a

Buis, S., Piacentini, A., and Declat, D.: PALM: a computational framework for assembling high-performance computing applications, Concurrency Computat.: Pract. Exper., 18, 247–262, 2006 (data available at: http://www.cerfacs.fr/globc/PALM_WEB/, last access: 20 April 2020). a

Burgers, G., Leeuwen, P. J. V., and Evensen, G.: Analysis Scheme in the Ensemble Kalman Filter, Mon. Weather Rev., 126, 1719–1724, https://doi.org/10.1175/1520-0493(1998)126<1719:ASITEK>2.0.CO;2, 1998. a

Clark, M. P., Rupp, D. E., Woods, R. A., Zheng, X., Ibbitt, R. P., Slater, A. G., Schmidt, J., and Uddstrom, M. J.: Hydrological data assimilation with the ensemble Kalman filter: Use of streamflow observations to update states in a distributed hydrological model, Adv. Water Res., 31, 1309–1324, https://doi.org/10.1016/j.advwatres.2008.06.005, 2008. a

Cretaux, J.-F., Calmant, S., Romanoski, V., Shabunin, A., Lyard, F., Berge-Nguyen, M., Cazenave, A., Hernandez, F., and Perosanz, F.: An absolute calibration site for radar altimeters in the continental domain: Lake Issykkul in Central Asia, J. Geodesy, 83, 723–735, https://doi.org/10.1007/s00190-008-0289-7, 2009. a

Decharme, B., Alkama, R., Douville, H., Becker, M., and Cazenave, A.: Global Evaluation of the ISBA-TRIP Continental Hydrological System. Part II: Uncertainties in River Routing Simulation Related to Flow Velocity and Groundwater Storage, J. Hydrometeorol., 11, 601–617, https://doi.org/10.1175/2010JHM1212.1, 2010. a

Decharme, B., Alkama, R., Papa, F., Faroux, S., Douville, H., and Prigent, C.: Global off-line evaluation of the ISBA-TRIP flood model, Clim. Dynam., 38, 1389–1412, https://doi.org/10.1007/s00382-011-1054-9, 2012 (data available at: http://www.cnrm-game-meteo.fr/surfex/, last access: 20 April 2020). a, b, c, d, e, f

Decharme, B., Delire, C., Minvielle, M., Colin, J., Vergnes, J.‐P., Alias, A., Saint‐Martin, D., Séférian, R., Sénési, S., and Voldoire, A.: Recent changes in the ISBA-CTRIP land surface system for use in CNRM-CM6 climate model and global off-line hydrological applications, J. Adv. Model. Earth Sy., 11, 1207–1252, https://doi.org/10.1029/2018MS001545, 2019. a, b, c

Deng, C., Liu, P., Guo, S., Li, Z., and Wang, D.: Identification of hydrological model parameter variation using ensemble Kalman filter, Hydrol. Earth Syst. Sci., 20, 4949–4961, https://doi.org/10.5194/hess-20-4949-2016, 2016. a

Doll, P., Douville, H., Güntner, A., Schmied, H. M., and Wada, Y.: Modelling Freshwater Resources at the Global Scale: Challenges and Propects, Surv. Geophys., 37, 195–221, https://doi.org/10.1007/s10712-015-9343-1, 2015. a

Durand, M., Andreadis, K., Alsdorf, D., Lettenmaier, D., Moller, D., and Wilson, M.: Estimation of bathymetric depth and slope from data assimilation of swath altimetry into a hydrodynamic model, Geophys. Res. Lett., 35, L20401, https://doi.org/10.1029/2008GL034150, 2008. a, b, c

Emery, C. M., Biancamaria, S., Boone, A., Garambois, P.-A., Ricci, S., Rochoux, M. C., and Decharme, B.: Temporal variance-based sensitivity analysis of the river routing component of the large scale hydrological model ISBA-TRIP: Application on the Amazon Basin, J. Hydrometeorol., 17, 3007–3027, https://doi.org/10.1175/JHM-D-16-0050.1, 2016. a, b, c, d, e, f, g, h, i, j, k

Emery, C. M., Paris, A., Biancamaria, S., Boone, A., Calmant, S., Garambois, P.-A., and Santos da Silva, J.: Large-scale hydrological model river storage and discharge correction using a satellite altimetry-based discharge product, Hydrol. Earth Syst. Sci., 22, 2135–2162, https://doi.org/10.5194/hess-22-2135-2018, 2018. a, b, c, d

Esteban Fernandez, D.: SWOT Project, Mission performance and error budget, Tech. rep., Jet Propulsion Laboratory, 2017. a

Evensen, G.: Sequential data assimilation with a nonlinear quasi-geostropic model using Monte Carlo methods to forecast error statistics, J. Geophys. Res., 99, 10143–10162, https://doi.org/10.1029/94JC00572, 1994. a

Evensen, G.: Advanced data assimilation for strongly nonlinear dynamics, Mon. Weather Rev., 125, 1342–1354, https://doi.org/10.1175/1520-0493(1997)125<1342:ADAFSN>2.0.CO;2, 1997. a

Evensen, G.: The Ensemble Kalman Filter: theoretical formulation and practical implementation, Ocean Dynam., 53, 343–367, https://doi.org/10.1007/s10236-003-0036-9, 2003. a

Evensen, G.: Sampling strategies and square root analysis schemes for the EnKF, Ocean Dynam., 54, 539–560, https://doi.org/10.1007/s10236-004-0099-2, 2004. a, b

Evensen, G. and Leeuwen, P. V.: An ensemble kalman smoother for nonlinear dynamics, Mon. Weather Rev., 128, 1852–1867, https://doi.org/10.1175/1520-0493(2000)128<1852:AEKSFN>2.0.CO;2, 2000. a, b

Fjørtoft, R., Gaudin, J.-M., Pourthie, N., Lalaurie, J.-C., Mallet, A., Nouvel, J.-F., Martinot-Lagarde, J., Oriot, H., Borderies, P., Ruiz, C., and Daniel, S.: KaRIn on SWOT: Characteristics of near-nadir Ka-band interferometric SAR imagery, IEEE T. Geosci. Remote, 52, 2172–2185, https://doi.org/10.1109/TGRS.2013.2258402, 2014. a

Guillet, O., Weaver, A., Vasseur, X., Michel, M., Gratton, S., and Gürol, S.: Modelling spatially correlated observation errors in variational data assimilation using a diffusion operator on an unstructured mesh, Q. J. Roy. Meteor. Soc., 145, 1947–1967, https://doi.org/10.1002/qj.3537, 2018. a

Gupta, H. V., Sorooshian, S., and Yapo, P. O.: Toward improved calibration of hydrological models: multiple and noncommensurable measures of information, Water Resour. Res., 34, 751–763, https://doi.org/10.1029/97WR03495, 1998. a

Hafliger, V., Martin, E., Boone, A., Ricci, S., and Biancamaria, S.: Assimilation of synthetic SWOT river depths in a regional hydrometeorological model, Water, 11, 78, https://doi.org/10.3390/w11010078, 2019. a, b

Hunt, B., Kalnay, E., Kostelich, E. J., Ott, E., Patil, D. T., Sauer, T., Szunyogh, I., Yorke, J. A., and Zimin, A. V.: Four-dimensional ensemble Kalman filtering, Tellus, 56, 273–277, https://doi.org/10.1111/j.1600-0870.2004.00066.x, 2004. a

Hunt, B. R., Kostelich, E. J., and Szunyogh, I.: Efficient data assimilation for spatiotemporal chaos: A local ensemble transform Kalman filter, Physica D, 230, 112–126, https://doi.org/10.1016/j.physd.2006.11.008, 2007. a

International Association of Hydrological Sciences Ad Hoc Group on Global Water Sets, Vörösmarty, C., Askew, A., Grabs, W., Barry, R. G., Birkett, C., Döll, P., Goodison, B., Hall, A., Jenne, R., Kitaev, L., Landwehr, J., Keeler, M., Leavesley, G., Schaake, J., Strzepek, K., Sundarvel, S. S., Takeuchi, K., and Webster, F.: Global water data: a newly endangered species, EOS T. Am. Geophys. Un., 82, 54–58, https://doi.org/10.1029/01EO00031, 2001. a

Kim, H.: Global Soil Wetness Project Phase 3 Atmospheric Boundary Conditions (Experiment 1), Data set, Data Integration and Analysis System, https://doi.org/10.20783/DIAS.501, 2017. a

Kurtz, W., Hendricks-Frassen, H.-J., and Vereecken, H.: Identification of time-variant river bed properties with Ensemble Kalman Filter, Water Resour. Res., 48, W10534, https://doi.org/10.1029/2011WR011743, 2012. a

Leeuwen, P. V. and Evensen, G.: Data assimilation and inverse methods in terms of a probabilistic formulation, Mon. Weather Rev., 124, 2898–2913, https://doi.org/10.1175/1520-0493(1996)124<2898:DAAIMI>2.0.CO;2, 1996. a

Liu, Y. and Gupta, H. V.: Uncertainty in hydrological modeling: Towards an integrated data assimilation framework, Water Resour. Res., 43, W07401, https://doi.org/10.1029/2006WR005756, 2007. a, b

Liu, Y., Weerts, A. H., Clark, M., Hendricks Franssen, H.-J., Kumar, S., Moradkhani, H., Seo, D.-J., Schwanenberg, D., Smith, P., van Dijk, A. I. J. M., van Velzen, N., He, M., Lee, H., Noh, S. J., Rakovec, O., and Restrepo, P.: Advancing data assimilation in operational hydrologic forecasting: progresses, challenges, and emerging opportunities, Hydrol. Earth Syst. Sci., 16, 3863–3887, https://doi.org/10.5194/hess-16-3863-2012, 2012. a

Maidment, D. R.: Handbook of Hydrology, McGraw Hill Professional, 1993. a

Manning, R.: On the flow of water in open channels and pipes, Institution of Civil Engineers of Ireland, 20, 161–207, 1891. a

Meade, R., Rayol, J., Conceicão, S. D., and Natividade, J.: Backwater Effects in the Amazon River Basin of Brazil, Environ. Geol. Water S., 18, 105–114, https://doi.org/10.1007/BF01704664, 1991. a

Melsen, L., Teuling, A., Torfs, P., Zappa, M., Mizukami, N., Clark, M., and Uijlenhoet, R.: Representation of spatial and temporal variability in large-domain hydrological models: case study for a mesoscale pre-Alpine basin, Hydrol. Earth Syst. Sci., 20, 2207–2226, https://doi.org/10.5194/hess-20-2207-2016, 2016. a, b

Mersel, M. K., Smith, L. C., Andreadis, K. M., and Durand, M. T.: Estimation of river depth from remotely-sensed hydraulic relationship, Water Resour. Res., 49, 3165–3179, https://doi.org/10.1002/wrcr.20176, 2013. a

Michailovsky, C. I. and Bauer-Gottwein, P.: Operational reservoir inflow forecasting with radar altimetry: the Zambezi case study, Hydrol. Earth Syst. Sci., 18, 997–1007, https://doi.org/10.5194/hess-18-997-2014, 2014. a

Michailovsky, C. I., Milzow, C., and Bauer-Gottwein, P.: Assimilation of radar altimetry to a routing model of the Brahmaputra river, Water Resour. Res., 49, 4807–4816, https://doi.org/10.1002/wrcr.20345, 2013. a, b

Molinier, M., Guyot, J.-L., Orstom, B., Guimarães, V., de Oliveira, E., and Dnaee, B.: Hydrologie du bassin de l'Amazone, in: Grands Bassins Fluviaux Périatlantiques, PEGI-INSA-CNRS-ORSTOM, Paris, available at: http://horizon.documentation.ird.fr/exl-doc/pleins_textes/pleins_textes_7/carton01/40102.pdf (last access: 4 May 2020), 335–345, 1993. a

Montzka, C., Moradkhani, H., Weihermüller, L., Hendricks-Franssen, H.-J., Canty, M., and Vereecken, H.: Hydraulic parameter estimation by remotely-sensed top soil moisture observations with the particle filter, J. Hydrol., 399, 410–421, https://doi.org/10.1016/j.jhydrol.2011.01.020, 2011. a

Moradkhani, H., Hsu, K.-L., Gupta, H., and Sorooshian, S.: Uncertainty assessment of hydrologic model states and parameters: Sequential data assimilation using particle filter, Water Resour. Res., 41, W05012, https://doi.org/10.1029/2004WR003604, 2005. a, b, c

Munier, S., Polebistki, A., Brown, C., Belaud, G., and Lettenmaier, D. P.: SWOT data assimilation for operational reservoir management on the upper Niger river basin, Water Resour. Res., 51, 554–575, https://doi.org/10.1002/2014WR016157, 2015. a

Noilhan, J. and Planton, S.: A simple parameterization of land surface processes for meteorological models, Mon. Weather Rev., 117, 536–549, https://doi.org/10.1175/1520-0493(1989)117<0536:ASPOLS>2.0.CO;2, 1989. a, b

Oki, T. and Sud, Y. C.: Design of Total Integrating Pathways (TRIP)—A Global River Channel Network, Earth Interact., 2, 1–36, https://doi.org/10.1175/1087-3562(1998)002<0001:DOTRIP>2.3.CO;2, 1998. a, b, c

Ott, E., Hunt, B. R., Szunyogh, I., Kostelich, A. V. Z. A. J., Corazza, M., Kalnay, E., Patil, D. J., and Yorke, J. A.: A local ensemble Kalman filter for atmospheric data assimilation, Tellus A, 56, 415–428, https://doi.org/10.1111/j.1600-0870.2004.00076.x, 2004. a

Oubanas, H., Gejadze, I., Malaterre, P.-O., Durand, M., Wei, R., Frasson, R. P. M., and Domeneghetti, A.: Discharge estimation in ungauged basins through variational data assimilation: the potential of the SWOT mission, Water Resour. Res., 54, 2405–2423, https://doi.org/10.1002/2017WR021735, 2018. a, b, c

Paiva, R. C. D., Buarque, D. C., Collischonn, W., Bonnet, M.-P., Frappart, F., Calmant, S., and Mendes, C. A. B.: Large scale hydrological and hydrodynamic modeling of the Amazon River basin, Water Resour. Res., 49, 1226–1243, https://doi.org/10.1002/wrcr.20067, 2013. a

Panzeri, M., Riva, M., Guadagnini, A., and Neuman, S. P.: Data assimilation and parameter estimation via ensemble kalman filter coupled with stochastic moment equations of transient groudwater flow, Water Resour. Res., 49, 1334–1344, https://doi.org/10.1002/wrcr.20113, 2013. a

Pathiraja, S., Marshall, L., Sharma, A., and Moradkhani, H.: Detecting non-stationar hydrologic model parameters in a paired catchment system using data assimilation, Adv. Water Res., 94, 103–119, https://doi.org/10.1016/j.advwatres.2016.04.021, 2016. a, b

Pedinotti, V., Boone, A., Ricci, S., Biancamaria, S., and Mognard, N.: Assimilation of satellite data to optimize large-scale hydrological model parameters: a case study for the SWOT mission, Hydrol. Earth Syst. Sci., 18, 4485–4507, https://doi.org/10.5194/hess-18-4485-2014, 2014. a, b, c, d, e, f, g, h

Rakovec, O., Weerts, A. H., Sumihar, J., and Uijlenhoet, R.: Operational aspects of asynchronous filtering for flood forecasting, Hydrol. Earth Syst. Sci., 19, 2911–2924, https://doi.org/10.5194/hess-19-2911-2015, 2015. a, b

Renard, B., Kavetski, D., Kuczera, G., Thyer, M., and Franks, S. W.: Understanding predictive uncertainty in hydrologic modeling: the challenge of identifying input and structural errors, Water Resour. Res., 46, W05521, https://doi.org/10.1029/2009WR008328, 2010. a

Rodell, M., Beaudoing, H. K., L'Ecuyer, T. S., Olson, W. S., Famiglietti, J. S., Houser, P. R., Adler, R., Bosilovich, M. G., Clayson, C. A., Chambers, D., Clark, E., Fetzer, E. J., Gao, X., Gu, G., Hilburn, K., Huffman, G. H., Lettenmaier, D. P., Liu, W. T., Roberton, F. R., Schlosser, C. A., Sheffield, J., and Wood, E. F.: The observed state of the water cycle in the early twenty-first century, J. Climate, 28, 8289–8318, https://doi.org/10.1175/JCLI-D-14-00555.1, 2015. a

Ruiz, J. J., Pulido, M., and Miyoshi, T.: Estimating model parameters with ensemble-based data assimilation, J. Meteorol. Soc. Jpn., 91, 79–99, https://doi.org/10.2151/jmsj.2013-201, 2013. a

Sakov, P., Evensen, G., and Bertino, L.: Asynchronous data assimilation with the EnKF, Tellus, 62, 24–29, https://doi.org/10.1111/j.1600-0870.2009.00417.x, 2010. a, b

Sanoo, A. K., Pan, M., Troy, T. J., Vinukollu, R. K., Sheffield, J., and Wood, E. F.: Reconciling the global terrestrial water budget using satellite remote sensing, Remote Sens. Environ., 115, 1850–1865, https://doi.org/10.1016/j.rse.2011.03.009, 2011. a

Shi, Y., Davis, K. J., Zhang, F., Duffy, C. J., and Yu, X.: Parameter estimation of physically-based land surface model hydrologic model using an ensemble Kalman filter: a multivariate real-data experiment, Adv. Water Res., 83, 421–427, https://doi.org/10.1016/j.advwatres.2015.06.009, 2015. a

Silva, J. S. D., Calmant, S., Seyler, F., Filho, O. C. R., Cochonneau, G., and Mansur, W. J.: Water levels in the Amazon basin derived from the ERS 2 and ENVISAT radar altimetry missions, Remote Sens. Environ., 114, 2160–2181, https://doi.org/10.1016/j.rse.2010.04.020, 2010. a

Sood, A. and Smakhtin, V.: Global hydrological models: a review, Hydrolog. Sci. J., 60, 549–565, https://doi.org/10.1080/02626667.2014.950580, 2015.  a

Talagrand, O. and Courtier, P.: Variational assimilation of meteorological observations with the adjoint vorticity equation, Part 1: Theory, Q. J. Roy. Meteor. Soc., 113, 1311–1328, https://doi.org/10.1002/qj.49711347812, 1987. a

Vinukollu, R. K., Meynadier, R., Sheffield, J., and Wood, E. F.: Multi-model, multi-sensor esti- mates of global evapotranspiration: climatology, uncertainties and trents, Hydrol. Process., 25, 3993–4010, https://doi.org/10.1002/hyp.8393, 2011. a

Vrugt, J. A., ter Braak, C. J. F., Diks, C. G. H., and Shoups, G.: Hydrological data assimilation using particle Markov chain Monte Carlo simulation: Theory, concepts and applications, Adv. Water Res., 51, 457–478, https://doi.org/10.1016/j.advwatres.2012.04.002, 2012. a

Wisser, D., Fekete, B. M., Vörösmarty, C. J., and Schumann, A. H.: Reconstructing 20th century global hydrography: a contribution to the Global Terrestrial Network- Hydrology (GTN-H), Hydrol. Earth Syst. Sci., 14, 1–24, https://doi.org/10.5194/hess-14-1-2010, 2010. a

Yoon, Y., Durand, M., Merry, C. J., Clark, E. A., Andreadis, K. M., and Alsdorf, D. E.: Estimating river bathymetry from data assimilation of synthetic SWOT measurements, J. Hydrol., 464-465, 363–375, https://doi.org/10.1016/j.jhydrol.2012.07.028, 2012. a