1. Introduction

Langland and Baker (2004; LB04) wrote a fundamental paper, showing how to answer the question: “Did the use of any subset of the observations make the forecast better or worse?” without having to carry out analyses and forecasts with and without assimilating those observations, as required in conventional Observing System Experiments (OSEs). Gelaro and Zhu (2009; GZ09) derived a formulation equivalent to LB04. These estimations of forecast sensitivity to observations are based on adjoint sensitivity and have proven to be a powerful monitoring tool, adopted operationally at the Naval Research Laboratory (NRL) and the National Aeronautics and Space Administration Global Modeling and Assimilation Office (NASA/GMAO).

Liu and Kalnay (2008; LK08) proposed an ensemble Kalman filter (EnKF) formulation equivalent to LB04 and GZ09, without requiring the adjoint of either the forecast model or the data assimilation scheme. Li et al. (2010; LLK10) pointed out a minor error in the original LK08 formulation and noted that the cost function measuring the impact of the observations at the initial time on the forecast could be computed directly rather than through the gradient of the cost function. LK08 and LLK10 tested the ensemble sensitivity formulation within the Local Ensemble Transform Kalman Filter (LETKF; Hunt et al., 2007) coupled with the Lorenz and Emanuel (1998) 40-variable model and found results comparable to those obtained using the adjoint sensitivity, and Kunii et al. (2012) successfully applied this methodology to real observations for tropical cyclone prediction.

Here we present a formulation of the ensemble forecast sensitivity to observations based on the direct computation of the cost function, without computing its gradient. Although it is essentially equivalent to the original LK08/LLK10 formulation, it is simpler and computationally more efficient since it uses available EnKF products. Unlike the original LK08 formulation, it can be applied to the Ensemble Square Root Filter (EnSRF; Whitaker and Hamill, 2002), in which the analysis weights of the background ensemble members cannot be explicitly computed.

In addition, we find that the use of localization, necessary in EnKF when the number of ensemble members is much smaller than the number of degrees of freedom (DOF) of the model, reduces the accuracy of the ensemble forecast sensitivity. We show that two approaches to evolve the observation localization during the forecast improve the results.

Section 2 presents the new formulation and compares it with the original one. Section 3 describes the experimental design. Results from the two ensemble formulations and from the adjoint sensitivity are compared in Section 4 using the Lorenz and Emanuel (1998) model. Section 5 summarizes the results and relative advantages and discusses the impact of localization on the EnKF forecast sensitivity.

2. Adjoint and ensemble formulations

Let $\overline{\mathbf{x}}^f_{t|0}$ represent a forecast started from the analysis at time 0 and verifying at time t. The overbars are used to indicate ensemble mean and are relevant only for the ensemble sensitivity formulation, so that they can be ignored in the adjoint sensitivity formulation. The perceived forecast error at the verification time t from a forecast started at time 0, and verified against the analysis valid at time t, is given by $\mathbf{e}_{t|0} = \overline{\mathbf{x}}^f_{t|0} - \overline{\mathbf{x}}^a_t$. The corresponding error from the forecast started at time t = −6 h is given by $\mathbf{e}_{t|-6} = \overline{\mathbf{x}}^f_{t|-6} - \overline{\mathbf{x}}^a_t$ (see schematic Fig. 1). Six hours is a typical data assimilation window for numerical weather prediction (NWP), but it could be different depending on the system of interest, for example, 1 h for mesoscale NWP or 1 week for global ocean data assimilation. As indicated in Fig. 1, the difference between the forecast errors $\mathbf{e}_{t|0}$ and $\mathbf{e}_{t|-6}$ at verification time t is only due to the observations $\mathbf{y}_0$ assimilated at time 0, which change the background $\overline{\mathbf{x}}^b_0$ by the analysis increment

$$\overline{\mathbf{x}}^a_0 - \overline{\mathbf{x}}^b_0 = \mathbf{K}\,\delta\mathbf{y}_0 = \mathbf{K}\left[\mathbf{y}_0 - H\!\left(\overline{\mathbf{x}}^b_0\right)\right] \qquad (1)$$

Fig. 1.   

Schematic of the perceived forecast errors $\mathbf{e}_{t|0}$ and $\mathbf{e}_{t|-6}$, verified against the analysis at the verification time t, from two forecasts started from the analysis at t = 0 h and from the analysis at t = −6 h. Since the forecast started at t = −6 h serves as first guess for the analysis at t = 0 h, the only difference between the two forecasts is the assimilation of the observations $\mathbf{y}_0$ at t = 0 h. Adapted from Langland and Baker (2004).

Here $\mathbf{K}$ is the gain matrix that defines the data assimilation algorithm, $\delta\mathbf{y}_0 = \mathbf{y}_0 - H\!\left(\overline{\mathbf{x}}^b_0\right)$ is the observational increment with respect to the first guess, and $H$ is the non-linear observation operator. We indicate with a superscript b the 6-h forecast started from the analysis at time −6 h and used as background at time 0, i.e. $\overline{\mathbf{x}}^b_0 = \overline{\mathbf{x}}^f_{0|-6}$.

LB04 introduced a cost function to measure the impact of the observations at time 0 on the forecast at time t as the difference between the squares of the forecast errors with and without assimilating the observations y0:

$$\Delta e^2 = \mathbf{e}^{\mathrm{T}}_{t|0}\,\mathbf{C}\,\mathbf{e}_{t|0} - \mathbf{e}^{\mathrm{T}}_{t|-6}\,\mathbf{C}\,\mathbf{e}_{t|-6} \qquad (2)$$

where the matrix of weights C defines the squared norm to be used (dry total energy in the case of LB04 and GZ09).

Here we follow the suggestion of LLK10 and compute the sensitivity to observations directly from the cost function eq. (2) rather than from a Taylor series approximation, and we assume that the forecast length is short enough to allow the use of the tangent linear model $\mathbf{M}$, so that $\overline{\mathbf{x}}^f_{t|0} - \overline{\mathbf{x}}^f_{t|-6} \approx \mathbf{M}\left(\overline{\mathbf{x}}^a_0 - \overline{\mathbf{x}}^b_0\right) = \mathbf{M}\mathbf{K}\,\delta\mathbf{y}_0$:

$$\Delta e^2 = \left(\mathbf{e}_{t|0} - \mathbf{e}_{t|-6}\right)^{\mathrm{T}}\mathbf{C}\left(\mathbf{e}_{t|0} + \mathbf{e}_{t|-6}\right) \approx \left(\mathbf{M}\mathbf{K}\,\delta\mathbf{y}_0\right)^{\mathrm{T}}\mathbf{C}\left(\mathbf{e}_{t|0} + \mathbf{e}_{t|-6}\right) \qquad (3)$$

LB04 and GZ09 solve eq. (3) by using the adjoint approach (cf. eq. (7) in LB04 and eq. (7) in GZ09):

$$\Delta e^2 \approx \delta\mathbf{y}^{\mathrm{T}}_0\,\mathbf{K}^{\mathrm{T}}\mathbf{M}^{\mathrm{T}}\mathbf{C}\left(\mathbf{e}_{t|0} + \mathbf{e}_{t|-6}\right) \qquad (4)$$

thus requiring the adjoint of both the model ($\mathbf{M}^{\mathrm{T}}$) and the data assimilation scheme ($\mathbf{K}^{\mathrm{T}}$).
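To make the structure of eq. (4) concrete, a minimal sketch is given below. It assumes that the adjoint model $\mathbf{M}^{\mathrm{T}}$ is available as a callable and that $\mathbf{K}^{\mathrm{T}}$ is available as a matrix; the names (total_impact_eq4, adjoint_model, KT) are ours and not from LB04 or GZ09.

```python
import numpy as np

def total_impact_eq4(dy, KT, adjoint_model, C, e_t0, e_tm6):
    """
    Adjoint estimate of the forecast error reduction, eq. (4) (a sketch).

    dy            : (p,) observational increments y0 - H(xb_0)
    KT            : (p, n) adjoint (transpose) of the gain matrix K
    adjoint_model : callable applying M^T, i.e. mapping a gradient at the
                    verification time t back to the initial time 0
    C             : (n, n) norm matrix (e.g. dry total energy)
    e_t0, e_tm6   : (n,) forecast errors of the forecasts started at 0 and -6 h
    """
    g_t = C @ (e_t0 + e_tm6)       # gradient of the cost function at time t
    g_0 = adjoint_model(g_t)       # brought back to the initial time by M^T
    return float(dy @ (KT @ g_0))  # projected onto observation space by K^T
```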

In their corrected ensemble formulation, LLK10 used the LETKF formulation (Hunt et al., 2007) to rewrite eq. (3) as

$$\Delta e^2 \approx \delta\mathbf{y}^{\mathrm{T}}_0\,\mathbf{R}^{-1}\mathbf{Y}^b_0\,\widetilde{\mathbf{P}}^a_0\,\mathbf{X}^{f\,\mathrm{T}}_{t|-6}\,\mathbf{C}\left(\mathbf{e}_{t|0} + \mathbf{e}_{t|-6}\right) \qquad (5)$$

[cf. eq. (9b) in LLK10]. Here $\widetilde{\mathbf{P}}^a_0 = \left[(K-1)\mathbf{I} + \mathbf{Y}^{b\,\mathrm{T}}_0\mathbf{R}^{-1}\mathbf{Y}^b_0\right]^{-1}$ is an intermediate matrix used in the LETKF, $\mathbf{Y}^b_0$ is the matrix of background ensemble perturbations in observation space, K is the number of ensemble members, and $\mathbf{X}^f_{t|-6}$ is a matrix whose columns are the forecast ensemble perturbations with respect to the mean, started from the analysis at time −6 h and valid at time t. Equation (5) requires estimating the Kalman gain matrix at each grid point and is therefore computationally burdensome.

In the new formulation of the forecast sensitivity in a deterministic EnKF framework, we start from eq. (3) and use the Kalman gain matrix $\mathbf{K} = \mathbf{P}^a\mathbf{H}^{\mathrm{T}}\mathbf{R}^{-1} \approx \frac{1}{K-1}\mathbf{X}^a_0\mathbf{Y}^{a\,\mathrm{T}}_0\mathbf{R}^{-1}$ and the matrix $\mathbf{Y}^a_0 \approx \mathbf{H}\mathbf{X}^a_0$ composed of analysis ensemble perturbations in observation space. Here $\mathbf{P}^a = \frac{1}{K-1}\mathbf{X}^a_0\mathbf{X}^{a\,\mathrm{T}}_0$ is the analysis ensemble error covariance. We can then write $\mathbf{M}\mathbf{K}\,\delta\mathbf{y}_0 \approx \frac{1}{K-1}\mathbf{X}^f_{t|0}\mathbf{Y}^{a\,\mathrm{T}}_0\mathbf{R}^{-1}\delta\mathbf{y}_0$, so that the cost function eq. (3) becomes

$$\Delta e^2 \approx \frac{1}{K-1}\,\delta\mathbf{y}^{\mathrm{T}}_0\,\mathbf{R}^{-1}\mathbf{Y}^a_0\,\mathbf{X}^{f\,\mathrm{T}}_{t|0}\,\mathbf{C}\left(\mathbf{e}_{t|0} + \mathbf{e}_{t|-6}\right) \qquad (6)$$

Each column k of the forecast perturbation matrix $\mathbf{X}^f_{t|0}$ can be computed with the full non-linear model, $\mathbf{x}^f_{t|0,k} = M\!\left(\mathbf{x}^a_{0,k}\right) - \overline{\mathbf{x}}^f_{t|0}$, i.e. by evolving each analysis ensemble member and subtracting the ensemble mean forecast. The new formulation eq. (6) uses analysis ensemble products rather than the Kalman gain and is more efficient than the original LK08/LLK10 formulation eq. (5). Unlike eq. (5), the new formulation eq. (6) does not introduce an approximation of $\mathbf{K}$ through the intermediate LETKF matrix $\widetilde{\mathbf{P}}^a_0$, and it uses the evolved analysis perturbations $\mathbf{X}^f_{t|0}$ rather than the evolved background perturbations $\mathbf{X}^f_{t|-6}$ used in eq. (5). As a result, although the two formulations are essentially equivalent, eq. (6) should be slightly more accurate than eq. (5). In addition, this formulation can be applied to EnKF methods other than the LETKF.
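As an illustration of how eq. (6) can be evaluated from standard EnKF products, a minimal sketch is shown below, assuming a diagonal observation error covariance $\mathbf{R}$ and a diagonal norm matrix $\mathbf{C}$; the array names (dy, Ya, xf_ens_t0, ...) are ours and not taken from any published code.

```python
import numpy as np

def total_impact_eq6(dy, r_var, Ya, xf_ens_t0, c_diag, e_t0, e_tm6):
    """
    Ensemble estimate of the forecast error reduction, eq. (6) (a sketch).

    dy        : (p,)   observational increments y0 - H(xb_0)
    r_var     : (p,)   observation error variances (diagonal R assumed)
    Ya        : (p, K) analysis ensemble perturbations in observation space
    xf_ens_t0 : (K, n) non-linear ensemble forecasts started from the analysis
                       members at time 0 and valid at time t
    c_diag    : (n,)   diagonal of the norm matrix C
    e_t0      : (n,)   error of the mean forecast started at time 0
    e_tm6     : (n,)   error of the mean forecast started at time -6 h
    """
    K = xf_ens_t0.shape[0]
    # columns of X^f_{t|0}: each member forecast minus the ensemble mean forecast
    Xf = (xf_ens_t0 - xf_ens_t0.mean(axis=0)).T          # (n, K)
    # (1/(K-1)) delta_y^T R^{-1} Y^a: a vector in ensemble space
    w = (dy / r_var) @ Ya / (K - 1)
    # project onto the forecast perturbations and apply the C-weighted inner product
    return float((Xf @ w) @ (c_diag * (e_t0 + e_tm6)))
```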

3. Experimental design

We use the Lorenz and Emanuel (1998) 40-variable model for the experiments. This model is governed by the equation:

$$\frac{dx_j}{dt} = \left(x_{j+1} - x_{j-2}\right)x_{j-1} - x_j + F \qquad (7)$$

where j is the index of the variables, 1 ≤ j ≤ 40, with periodic boundary conditions. As in LK08, we allow for model errors by using F = 8.0 for the truth run and F = 7.6 for the data assimilation cycle and the forecasts. The time integration is performed with the fourth-order Runge–Kutta scheme with a time step of 0.01 in non-dimensional time. Data assimilation with the LETKF is performed every 0.05 time units, estimated by Lorenz and Emanuel (1998) to be equivalent to 6 h when F = 8.0. Observations are created at every grid point at each analysis time by adding independent Gaussian random errors with a standard deviation (SD) of 0.2 to the truth run. As in the experiment of LK08, observations at the 11th grid point have larger random errors (SD of 0.8), but we still assign an observation error SD of 0.2 to these observations in the assimilation process. We run 10 sets of 14,600 data assimilation cycles with different truth runs. In each set, we verify over the last 14,360 cycles.
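For readers who wish to reproduce the setup, a minimal sketch of the Lorenz and Emanuel (1998) model and its fourth-order Runge–Kutta integration could look as follows; function names and the spin-up choices are illustrative only and not taken from the original code.

```python
import numpy as np

def lorenz96_tendency(x, forcing):
    """Tendency dx_j/dt = (x_{j+1} - x_{j-2}) x_{j-1} - x_j + F, periodic in j (eq. 7)."""
    return (np.roll(x, -1) - np.roll(x, 2)) * np.roll(x, 1) - x + forcing

def rk4_step(x, forcing, dt=0.01):
    """One fourth-order Runge-Kutta step with the non-dimensional time step 0.01."""
    k1 = lorenz96_tendency(x, forcing)
    k2 = lorenz96_tendency(x + 0.5 * dt * k1, forcing)
    k3 = lorenz96_tendency(x + 0.5 * dt * k2, forcing)
    k4 = lorenz96_tendency(x + dt * k3, forcing)
    return x + dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0

def forecast(x, forcing, nsteps):
    """Integrate nsteps RK4 steps (5 steps = 0.05 time units, i.e. one '6-h' window)."""
    for _ in range(nsteps):
        x = rk4_step(x, forcing)
    return x

# Example: truth run with F = 8.0; the assimilating model would use F = 7.6.
rng = np.random.default_rng(0)
x_truth = 8.0 + 0.01 * rng.standard_normal(40)   # small perturbation of the fixed point
x_truth = forecast(x_truth, 8.0, 1000)           # spin-up onto the attractor
obs = x_truth + 0.2 * rng.standard_normal(40)    # observations with SD 0.2 at every point
```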

We performed observation impact estimations using three methods: LB04/GZ09 (eq. 4, using the adjoint formulation, denoted 4-ADJ), LK08/LLK10 (eq. 5, denoted 5-OLD) and the new ensemble formulation (eq. 6, denoted 6-NEW). For the adjoint formulation, the gradient of the cost function is constructed at the forecast time and brought back to the initial time by the adjoint model, as in LB04/GZ09. In the Runge–Kutta time scheme, the non-linear trajectories are used at every time step and substep of the time integration for the adjoint operation. The adjoint code is validated by the test described in the Appendix. The same background covariance (a full matrix with the same covariance localization as in the EnKF) is constructed from the EnKF perturbations and used in the computation of $\mathbf{K}^{\mathrm{T}}$.

Since the EnKF requires space localization when the number of ensemble members is much smaller than the number of DOF of the model, comparisons are made both with and without localization. For the experiments with localization, we applied a Gaussian localization function with an e-folding scale of grid points. The same localization is also applied to the observation impact estimation (Kunii et al., 2012). For the LK08/LLK10 method (5-OLD), the impact of observation l on grid point j is computed as

$$\left(\Delta e^2\right)_{l,j} \approx \left[\delta\mathbf{y}_0\right]_l\left[\mathbf{R}^{-1}\mathbf{Y}^b_0\,\widetilde{\mathbf{P}}^a_0\,\mathbf{X}^{f\,\mathrm{T}}_{t|-6}\right]_{l,j}\left[\mathbf{C}\left(\mathbf{e}_{t|0} + \mathbf{e}_{t|-6}\right)\right]_j \qquad (8)$$

The same localization as in the LETKF analysis is applied in the computation of $\widetilde{\mathbf{P}}^a_0$, so that it does not explicitly appear in eq. (8). For eq. (6) (the NEW formulation), this becomes

$$\left(\Delta e^2\right)_{l,j} \approx \frac{\rho_j}{K-1}\left[\delta\mathbf{y}_0\right]_l\left[\mathbf{R}^{-1}\mathbf{Y}^a_0\,\mathbf{X}^{f\,\mathrm{T}}_{t|0}\right]_{l,j}\left[\mathbf{C}\left(\mathbf{e}_{t|0} + \mathbf{e}_{t|-6}\right)\right]_j \qquad (9)$$

where $\rho_j$ is the localization function evaluated at grid point j (centred on observation l). Both eq. (8) and eq. (9) can simply be summed over the grid points to obtain the impact of each observation. The localization function is fixed during the analysis and forecast times except as noted in Section 4.3. Multiplicative inflation (Anderson, ) is applied with a tuned parameter of 1.152. The observation impact estimations are computed at forecast times equivalent to 6 h and 0.5, 1, 2, 3, 5 and 7 d.
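Building on the sketch of eq. (6) above, eq. (9) simply inserts the localization weights before summing over the grid points. In the sketch below, rho is an illustrative (p, n) array of weights, one row per observation, and the remaining arguments are as before.

```python
import numpy as np

def impact_per_obs_eq9(dy, r_var, Ya, Xf, c_diag, e_t0, e_tm6, rho):
    """
    Localized per-observation impact, eq. (9) (a sketch).
    Xf is the (n, K) forecast perturbation matrix built as in the eq. (6) sketch;
    rho[l, j] is the localization weight of observation l at grid point j.
    Summing the result over observations gives a localized estimate of eq. (6).
    """
    K = Ya.shape[1]
    sens = (Ya / r_var[:, None]) @ Xf.T / (K - 1)   # [R^-1 Y^a X^f^T] / (K-1), shape (p, n)
    impact_lj = dy[:, None] * rho * sens * (c_diag * (e_t0 + e_tm6))[None, :]
    return impact_lj.sum(axis=1)                    # impact of each observation l
```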

4. Results

In Section 4.1, experiments are made with an ensemble of 40 members (the same as the number of DOF of the model) so that no spatial localization is needed. In Section 4.2, we run the ensemble with only 10 members, more representative of operational applications for which the number of ensemble members is much smaller than the model DOF, so that localization is required. Since the results deteriorate with the use of localization, in Section 4.3 we introduce two methods that ameliorate this problem in longer forecasts, and compare the percentage of observations that improve the forecasts in each of the methods with that obtained with a much more expensive OSE. In Section 4.4, we compare how well the different methods can detect “bad” observations with much higher error variance than specified in the assimilation system.

4.1. Analysis with 40 ensemble members without localization

Table 1 shows the average forecast error reduction of each experiment. For completeness, our results include estimates of the forecast error reduction compared with the actual values obtained by verifying against (1) the truth run and (2) the analysis cycles, since the truth is not known in real forecast applications. Because the number of ensemble members is as large as the number of DOF of the model, spatial localization is not required and, therefore, not applied here. The verification results against the analysis have very similar means and SDs to the verifications against the truth. The adjoint formulation (eq. 4) and the new ensemble formulation (eq. 6) both estimate the average impacts well throughout the forecast range, with eq. (6) being the most accurate at longer forecast ranges but with a much higher SD. The old ensemble formulation (eq. 5) gives similar, sometimes slightly better, observation impact estimates up to the 3-d forecast but tends to underestimate the average impact for longer forecast times.

Figure 2 shows a skill score (SS) based on the time-mean root mean square error (RMSE) of the different estimations of the total forecast error reduction verified against the true error reduction (angle brackets denote the time mean over the verification cycles):

$$\mathrm{SS} = 1 - \frac{\left\langle\left(\Delta e^2_{\mathrm{est}} - \Delta e^2_{\mathrm{true}}\right)^2\right\rangle^{1/2}}{\left\langle\left(\Delta e^2_{\mathrm{true}}\right)^2\right\rangle^{1/2}}$$

Fig. 2.   

Skill score of the estimation of the total forecast error reduction verified against the true forecast error reduction. Black, red and blue lines show the results of eq. (4) (4-ADJ), eq. (5) (5-OLD) and eq. (6) (6-NEW), respectively. The vertical error bars show the standard deviation of the 10 experiments. No localization was used in the LETKF.

so that if the estimated reduction is always zero, SS = 0, and if the estimated reduction always agrees with the true reduction (eq. 2), SS = 1. Note that in Figs. 2–4 the skill peaks at 2–3 d because the forecast error is estimated against the analysis (affected by observation errors) but verified against the truth. It is only after a day or two that the forecast errors become large compared with the analysis errors. The performance of the adjoint and ensemble estimations is essentially identical for the first 2 d of the forecast, but the estimation by eq. (4) (4-ADJ) is somewhat better at the 3- and 5-d forecasts, with the new ensemble formulation (6-NEW) better than the old one (5-OLD). The adjoint formulation (4-ADJ) becomes increasingly inaccurate and noisy beyond 6 d because, as shown in the Appendix, the linearization assumption made in the adjoint model no longer holds.
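A direct transcription of this skill score, assuming that the estimated and true error reductions are available as time series over the verification cycles, is sketched below (function and variable names are ours).

```python
import numpy as np

def skill_score(est, true):
    """
    Skill score of an estimated forecast error reduction time series against
    the true reduction: SS = 0 if the estimate is always zero, SS = 1 if it
    always equals the true reduction.
    """
    rmse_est = np.sqrt(np.mean((est - true) ** 2))
    rmse_ref = np.sqrt(np.mean(true ** 2))   # RMSE of the trivial "always zero" estimate
    return 1.0 - rmse_est / rmse_ref
```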

Fig. 3.   

Same as Fig. 2 but with only 10 members and using localization in the analysis. No localization was applied for the ensemble-based observation impact computations.

Fig. 4.   

As per Fig. 3 but now with localization used in the observation impact computations. Red and blue lines show the results of equations (5-OLD) and (6-NEW) with a fixed localization (Fxloc) function as used in the LETKF analysis. Dotted green and light blue lines show the results of equation (6-NEW) with localization functions moving with non-linear evolution (NL-loc) and constant speed equal to the group velocity (CL-loc).

4.2. Analysis with 10 ensemble members and fixed localization

Figure 3 and Table 2 are the same as Fig. 2 and Table 1 but with only 10 ensemble members and now introducing localization in the analysis. As a benchmark, no localization was applied to the ensemble-based observation impact computations, shown in Fig. 3 as (5-OLD) Noloc and (6-NEW) Noloc. Compared with the results of 40 ensemble members without localization, the accuracies of both ensemble sensitivity methods in terms of RMSE are strongly degraded relative to the adjoint sensitivity, especially for (6-NEW) Noloc. This suggests that, with far fewer ensemble members than DOF of the model, spatial localization is also required in the computation of the observation impacts. Fig. 4 is similar to Fig. 3, but now with fixed spatial localization (as in eqs. 8 and 9), denoted (5-OLD) Fxloc and (6-NEW) Fxloc, respectively. The RMSEs of both ensemble-based estimations are improved with the fixed localization.

Table 2 includes the average forecast error reduction of these experiments, as well as the new moving localization methods introduced in Section 4.3. Although the RMSE of the estimates is reduced by applying the localization, the average estimate of (6-NEW) with fixed localization (Fxloc) tends to underestimate the impacts especially for the longer forecast times. Results using (5-OLD) have the wrong sign, so clearly the new formulation eq. (6) outperforms eq. (5) in the estimation of the mean when the ensemble size is smaller than the model DOF.

4.3. Analysis with 10 ensemble members and forecast sensitivity with moving localization

The analysis increment due to any observation (i.e. the impact of that observation) evolves through the forecast. The localization function should account for this evolution; otherwise the ensemble sensitivity will miss much of the observation impact at longer forecast times. Unfortunately, the localization used in eqs. (8) and (9) is fixed, so it should become less accurate for longer forecasts.

The problem of how to evolve optimally the localization function in the computation of the forecast sensitivity is not trivial, and solving it is beyond the scope of this paper. Here we show two approximations that could be used in real systems.

The first approximation is to move the localization function (a Gaussian localization function centred on the location of observation l) with the non-linear incremental evolution (denoted NL-loc). The localization function of observation l is added to the ensemble mean analysis field, and the evolution of the increment ($\mathbf{G}_l$) is forecasted as

$$\mathbf{G}_{l,t} = \frac{M\!\left(\overline{\mathbf{x}}^a_0 + \alpha\,\boldsymbol{\rho}_l\right) - M\!\left(\overline{\mathbf{x}}^a_0\right)}{\alpha} \qquad (10)$$

where α is a small positive real number, set here to 0.01, and $\boldsymbol{\rho}_l$ is the localization function of observation l. We take the absolute value of eq. (10) and smooth it with the fourth-order Shapiro () filter in order to remove 2-gridlength waves. We then normalize it by the area integral of the localization function and limit its values to the range between 0 and 1. This value is used as the localization function of observation l for the impact estimations. Fig. 5a shows an example of the evolution of the localization function with the NL-loc method. Figs. 4 and 6 and Table 2 include results of the experiment with this moving localization. Both the RMSE and the average estimation are improved by applying the moving localization.
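A sketch of the NL-loc recipe is given below; it reuses a non-linear forecast function like the one sketched in Section 3, passed in as an argument, and a simple 1-2-1 smoother stands in for the fourth-order Shapiro filter, so the details are illustrative rather than a reproduction of the original code.

```python
import numpy as np

def evolve_localization_nl(xa_mean, rho_l, forecast_fn, alpha=0.01):
    """
    NL-loc sketch: evolve the localization function of observation l with the
    non-linear incremental evolution of eq. (10).

    xa_mean     : (n,) ensemble mean analysis at time 0
    rho_l       : (n,) localization function of observation l at time 0
    forecast_fn : callable integrating a state from time 0 to the forecast time
    """
    # finite-difference evolution of the increment, eq. (10)
    g = (forecast_fn(xa_mean + alpha * rho_l) - forecast_fn(xa_mean)) / alpha
    g = np.abs(g)
    # stand-in smoother for the fourth-order Shapiro filter (removes 2-gridlength waves)
    g = 0.25 * np.roll(g, 1) + 0.5 * g + 0.25 * np.roll(g, -1)
    # normalize by the area integral of the original localization function
    g *= rho_l.sum() / g.sum()
    return np.clip(g, 0.0, 1.0)   # keep the evolved weights between 0 and 1
```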

Fig. 5.   

Example of the evolution of the localization function with two methods. (a) Localization function with the non-linear incremental evolution (NL-loc); and (b) localization function moving with constant group velocity (CL-loc).

Fig. 6.   

Time series of the total forecast error reduction at 5-d forecast from cycle 1000 to 1100. Pink lines show the true forecast error reduction, the black line is the adjoint estimation, the blue line is the ensemble estimation with fixed localization and the dotted green line is the ensemble estimation with non-linear evolution of the localization. A total of 40 observations are assimilated on each analysis time.

In a realistic geophysical data assimilation problem, using eq. (10) would be prohibitive because it requires a forecast for each observation. For this reason, we also tested a much simpler method to evolve the localization function. In this method, the localization centre is translated with a constant speed, +0.6 grid points per day, equal to the climatological group velocity of the dominant wavenumbers 6–8. The group velocity is estimated from the modulation envelope of the space–time correlation using the method of Yoon et al. (). Figure 5b shows the evolution of the localization for the same case as Figure 5a but with the simpler constant linear method (CL-loc). Figure 4 and Table 2 also include the results of CL-loc, which, although simpler, is also clearly better than the fixed localization (Fxloc).
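The CL-loc construction is simpler: the centre of the Gaussian localization function is translated at the constant speed of +0.6 grid points per day. A possible sketch, with a placeholder length scale for the Gaussian, follows.

```python
import numpy as np

def localization_cl(obs_location, forecast_days, n=40, scale=3.0, group_speed=0.6):
    """
    CL-loc sketch: Gaussian localization whose centre moves at the constant
    climatological group velocity (+0.6 grid points per day). 'scale' is a
    placeholder length scale in grid points, not the value used in the paper.
    """
    centre = (obs_location + group_speed * forecast_days) % n
    j = np.arange(n)
    # shortest distance on the periodic domain of n grid points
    dist = np.minimum(np.abs(j - centre), n - np.abs(j - centre))
    return np.exp(-0.5 * (dist / scale) ** 2)
```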

Figure 6 compares time series of the total forecast error reduction at the 5-d forecast estimated by (4-ADJ), by (6-NEW) Fxloc with fixed localization and by (6-NEW) NL-loc with the non-linearly evolved localization, together with the true reduction of error. The ensemble-based estimates generally capture the large peaks (degradations and improvements of the forecast). (6-NEW) Fxloc sometimes misses a peak, but the ensemble-based estimation with moving localization, (6-NEW) NL-loc, captures the time series very well.

Figure 7 shows the fraction of the observations that are estimated to decrease the forecast error in each method. As a reference, we also performed the much more computationally expensive data denial experiments, removing each observation at each analysis time, and derived the actual rate of positive impacts (plotted as OSE). For short-range forecasts, the adjoint tends to underestimate the percentage of observations providing a positive impact, whereas the EnKF-based estimations have larger rates and are in better agreement with the data denial experiments. The fraction of positive impacts of (5-OLD) Fxloc is the largest up to the 2-d forecast but becomes less than one half at the 7-d forecast. The other methods converge to just above 0.5 (the value expected when the forecasts have no more skill and the observation errors are Gaussian with zero bias) for long-range forecasts, with (6-NEW) NL-loc and (6-NEW) CL-loc being closer to the OSE result.

Fig. 7.   

Fraction of the observations that are estimated to decrease the forecast error in each method. Purple solid line shows the result of data denial experiments, denoted as an Observing System Experiment (OSE).

4.4. Detection of bad observations

Finally, we compare the ability of the adjoint- and ensemble-based impact estimates to detect flawed observations, a capability useful for realistic applications. In this subsection, we show the results of the 10-member ensemble experiments.

Figure 8 shows the average impact estimates of each method at each observation point for 5-d forecasts. Estimates are derived from the last 1000 (Fig. 8a) and the last 100 assimilation cycles (Fig. 8b). For a large sample of 1000 cycles, the adjoint and the two ensemble-based methods all clearly detect the erroneous observation at the 11th grid point (Section 3), with very similar results. With only 100 samples, all methods still detect a peak at the 11th grid point, but at other points the estimates are much noisier, especially for the adjoint approach. In practical applications, the number of samples (assimilation cycles) can be of the order of 100–1000. This result suggests that all three methods are able to detect bad observations even for relatively small samples, but that the adjoint approach may be noisier.

Fig. 8.   

Time average of observation impact estimates at each point on 5-d forecast with (a) 1000 and (b) 100 samples, using the adjoint approach eq. (4), the LLK10 formulation eq. (5) and the new ensemble formulation eq. (6), both with fixed localization. The observation at grid point 11 has an error variance of 0.8 rather than the assumed value of 0.2.

5. Summary and conclusions

In this article, we introduce a new formulation of the ensemble forecast sensitivity developed by Liu and Kalnay (2008) with the small correction of Li et al. (2010). The two formulations are compared with the adjoint forecast sensitivity using the same analysis with the Lorenz and Emanuel (1998) 40-variable model. For the first 2 d, the three formulations are essentially identical. From 3 to 5 d, the adjoint formulation is the most accurate, but it becomes the worst at longer time scales, when the linear approximation made in the adjoint formulation breaks down.

We find that the use of localization in the analysis, necessary in the EnKF when the number of ensemble members is much smaller than the model's DOF, has a negative impact on the accuracy of the ensemble sensitivity. This is not surprising, since the impact of an observation during the analysis (i.e. the analysis increment associated with the observation) is transported by the flow during the integration, and this is ignored with a fixed localization. To address this problem, we introduce two approaches that could be adapted to evolve the localization function during the estimation of the forecast sensitivity to the observations. The first method estimates the non-linear evolution of the initial localization and is very expensive. The second one moves the localization with a constant estimate of the group velocity. Both methods succeed in significantly improving the estimations for longer forecasts. When compared with the fraction of observations that reduce the forecast errors obtained by running forecasts with and without each observation (as in an OSE), the adjoint approach tends to underestimate the percentage of observations with positive impact for short-range forecasts, whereas the new ensemble formulation gives fractions closer to the OSE results.

In summary, the adjoint and ensemble forecast impact estimations give similarly accurate results for short-range forecasts, except that the new formulation's estimate of the fraction of observations that reduce forecast errors is closer to that obtained with OSEs. For longer-range forecasts, they both deteriorate, for different reasons. The adjoint sensitivity becomes noisy because the adjoint model is based on the tangent linear model and, therefore, cannot capture the forecast non-linearities that become large with forecast length. The ensemble sensitivity becomes less accurate due to the use of fixed localization, a problem that could be ameliorated with an evolving adaptive localization method (Bishop and Hodyss, ; ). Advantages of the new ensemble formulation are that it is simpler, computationally more efficient, and applicable to other EnKF methods, not just the LETKF. It has been implemented and tested with real forecasts at the National Centers for Environmental Prediction (NCEP) using the EnSRF (Whitaker and Hamill, 2002).

Fig. A1.   

Similarity indices between the non-linear and tangent linear evolution of the perturbations. Standard deviations of the perturbations are 0.2 (red), 0.5 (yellow), 1.0 (green) and 2.0 (blue). Values are averaged over 1000 realizations.