#### Abstract

We introduce a method for quantifying the predictability of the event that the evolution of a deterministic dynamical system enters a specific subset of state space at a given lead time. The main idea is to study the distribution of finite-time growth rates of errors in initial conditions along the attractor of the system. The predictability of an event is measured by comparing error growth rates for initial conditions leading to that event with all possible growth rates. We illustrate the method by studying the predictability of extreme amplitudes of traveling waves in the Lorenz-96 model. Our numerical experiments show that the predictability of extremes is affected by several routes to chaos in a different way. In a scenario involving intermittency due to a periodic attractor disappearing through a saddle-node bifurcation we find that extremes become better predictable as the intensity of the event increases. However, in a similar intermittency scenario involving the disappearance of a 2-torus attractor we find that extremes are just as predictable as nonextremes. Finally, we study a scenario which involves a 3-torus attractor in which case the predictability of extremes depends nonmonotonically on the prediction lead time.

#### 1. Introduction

Classical extreme value statistics is concerned with the asymptotic distribution of large values in time series of random variables. The theory, which is based on the extreme value and generalized Pareto distributions, is well developed for stochastic processes both with and without serial dependence; see the text books [1–7]. A recent development is the application of extreme value statistics in the setting of* deterministic* dynamical systems. The main idea is to evaluate a scalar observable along the evolution of a system and to study under which conditions the same extreme value laws hold as in the case of stochastic processes. Geophysical applications, in which dynamical systems arise as models and observables are physical quantities like wind speed or temperature, form an important motivation for the development of the theory. Very recently, Lucarini et al. [8] published the first text book on extremes in dynamical systems which gives an excellent overview of the latest developments and also provides an extensive source of references.

Statistics only describe the behaviour of extremes over long periods of time. However, for the development of early-warning systems and risk mitigation strategies the short-term predictability of extremes is of great importance. This leads to the following question: how predictable are extremes? Bodai [9] summarizes three different conclusions that can be found in the literature:(1)Extremes are better predictable.(2)Extremes are less predictable.(3)Extremes can be better or less predictable depending on several factors.The first conclusion is supported by the work of Hallerberg et al. [10] who studied the predictability of extreme* increments* in first-order autoregressive process, wind speed recordings, and long-range correlated autoregressive moving averages. In all their examples extremes become better predictable with increasing event size. The results in [11] showed that in i.i.d. stochastic processes large increments are better predictable if the process is Gaussian, whereas large increments become less predictable if the underlying distribution has a power law tail. However, in the follow-up study [12], which is concerned with* threshold crossings* instead of* increments*, it was found again that extremes are always better predictable. The first conclusion is also supported by the work of Franzke [13, 14] in the context of dynamic-stochastic models. Bodai [9] argues that in dynamical systems stronger predictability of extremes may be typical but not universal. The third conclusion is supported by the work of Sterk et al. In [15] it was pointed out that the predictability of extreme values in dynamical systems depends on the observable, the attractor of the system, and the prediction lead time. In [16] it was shown how the tail of the distribution of wind speeds affects their predictability at high thresholds.

The predictability of extremes can be measured in different ways. By treating extreme events as binary events one can measure prediction skill by means of a receiver operator characteristic (ROC) curve which is a graph of the hit rate against the false alarm rate [9–14]. Another possible measure is the extreme dependency score developed by Stephenson et al. [17], which does not tend to zero for vanishingly rare events unlike scores such as the equitable threat score. Alternatively, when predictions are made using a dynamical model, predictability can be measured in terms of the growth rate of errors in the initial condition. The earliest studies on predictability in atmospheric models [18–20] computed the time needed for small errors in the initial condition to double in magnitude. This idea connects with traditional predictability measures for dynamical systems, such as Lyapunov exponents. The latter are asymptotic quantities that are computed for time tending to infinity, which also implies that they are independent of the initial condition [21]. Finite-time Lyapunov exponents and singular values measure the growth rate of errors over a finite time and typically they strongly depend on the initial condition and the prediction lead time. Measures of this type have been developed in celestial mechanics to separate chaotic from regular dynamics [22, 23], and they have been used to measure the growth of errors due to model perturbations [24] and the predictability of extremes [15].

Several papers demonstrated that finite-time error growth rates can show large fluctuations along the attractor of the system [25–31]. Benzi and Carnevale [32] argued that a ratio of the average growth rate to the most probable growth rate much smaller than 1 is an indication of enhanced predictability, which means that some events may be better predictable than others. A natural question is then what kind of dynamics can lead to enhanced predictability? For example, in dynamical systems with intermittency the dynamics switches between two or more different dynamical regimes and each regime can be associated with different predictability characteristics. The work in [33, 34] shows that in intermittent dynamical systems distributions of finite-time Lyapunov exponents are non-Gaussian and asymmetric and have heavy tails. Hence, in intermittent systems one can expect that some events are better predictable than others.

The aim of this paper is to demonstrate that the predictability of extremes depends on the dynamical regime of the model that is used for the predictions. In particular, we show that in weakly chaotic regimes of a dynamical system the predictability of extremes does not have universal properties. The main idea, which is in the spirit of [32], is to study the distribution of finite-time growth rates of errors in initial conditions along the attractor of the system. Comparing error growth rates of initial conditions leading to an event with all possible growth rates then gives a measure of the predictability of the event. We illustrate the method using the Lorenz-96 model [35]. On the one hand this model is simple enough for performing detailed numerical explorations. On the other hand the model has many dynamical features that are shared by a large class of geophysical models. The Lorenz-96 model can be interpreted as a model for traveling waves. The routes to chaos are myriad and different kinds of attractors can be found [36]. The bifurcation scenarios in the Lorenz-96 model can also be found in more complex geophysical models, such as the atmospheric and oceanic models studied in [37, 38]. We will focus in particular on predictability in the vicinity of bifurcations leading to intermittent and quasi-periodic dynamics.

The remainder of this paper is structured as follows. In Section 2 we explain how to quantify the predictability of an event in a general dynamical system. In Section 3 we introduce the Lorenz-96 model which we will use for our numerical experiments. For three values of the dimension of the model we investigate how the predictability of extreme waves in the model depends on intermittent or quasi-periodic nature of the dynamics. Section 4 concludes the paper with a summary and discussion of the results and suggestions for further research.

#### 2. Predictability of Dynamical Systems

This section explains the methodology of quantifying the predictability of an event in a dynamical system. In general, a deterministic dynamical system can be defined as a triple which consists of a state space , a time set , and an evolution operator , such that the following properties are satisfied:(i) is an additive half group: and for all also .(ii)For all and we haveWe also write . Particular examples that are included in this setting are discrete time systems, such as iterated maps, and continuous-time systems, such as flows of differential equations; see [39, 40]. In this paper we assume that the state space is a subset of the Euclidean space, but more generally can be a Riemannian manifold or a function space.

The predictability of a dynamical system is often quantified in terms of the growth rate of errors in the initial condition. Suppose that the initial condition is perturbed in the direction of ; then is the error growth rate over a time interval of length . Harle et al. [28] studied the statistics of these growth rates and their dependence on the parameters and in the setting of 2-dimensional dissipative and conservative maps. The error growth was found to increase exponentially fast with when is small. For larger values of the error growth follows a power law which depends on the magnitude of . In their paper it is suggested that these results are quite general.

In this paper we will make the idealized assumption that the initial perturbation size is infinitesimally small. Under this assumption the error at time is then given bywhere the derivative is taken with respect to the initial condition in the direction of the vector . The worst-case error growth over a time interval of length can be computed by maximizing the following Rayleigh quotient over all nonzero vectors :where denotes the Euclidean norm. A standard result in linear algebra [41] implies that the quotient (3) attains a maximum if and only if is the eigenvector of corresponding to the largest eigenvalue. Equivalently, the maximum is attained precisely when is the right singular vector corresponding to the largest singular value of , which throughout this paper will be denoted by . In this way we obtain a measure of finite-time predictability for a given initial condition .

In many applications it is often important to quantify the predictability of a certain event taking place in the future. We define an event to be a subset of the state space . For a given initial condition we say that the event occurs at time if , or, equivalently, . The predictability of the event can be quantified as follows. Assume that the dynamical system is equipped with an invariant probability measure supported on some attractor (in which case we also assume that ). This means that and for all measurable subsets . Then the distribution function of the time- singular values is given byThe conditional distribution of time- singular values given that the event occurs at time reads aswhere we have used that is an invariant measure so that . The predictability of the event can be quantified by comparing both distributions. For example, if the right endpoint of is much smaller than the right end point of , then the event can be called predictable. In the limit all events become equally predictable.

The advantage of the approach outlined in this section is the fact that it combines measures of predictability and the statistical recurrence properties of the system via its invariant measure. For simple dynamical systems for which the growth of errors can be computed analytically and for which the invariant measure is known the distributions (4) and (5) can be computed analytically. Hence, our approach may be used to derive general statements on the predictability of extremes for simple classes of dynamical systems in a rigorous way. This idea will be pursued in forthcoming work. Also note that the methodology applies to arbitrary events. This in particular includes the case of rare events, but these need not be extreme events in which some observable exceeds a threshold.

#### 3. Results

##### 3.1. The Lorenz-96 Model

In [35] Lorenz introduced a one-dimensional atmospheric model to study fundamental issues regarding the predictability of the atmosphere and weather forecasting. The model can be interpreted as a model for atmospheric waves traveling along a circle of constant latitude. We divide the latitude circle into equal sectors and define for the -th sector a distinct variable . The variables can be interpreted as meteorological quantities, such as pressure or vorticity, where the index of each variable plays the role of longitude. The dynamical equations arewith the periodic “boundary condition” . The dimension and forcing are free parameters. The Lorenz-96 model is often used to test data assimilation methods [42, 43] and subgrid scale parameterizations [44], for studies in statistical mechanics [45, 46], and in the general study of spatiotemporal chaos [47]. In this paper we use the Lorenz-96 model to study the predictability of extreme events in the vicinity of bifurcations.

The point is clearly an equilibrium solution of (6) for all and all . For all this equilibrium becomes unstable through either a supercritical Hopf or a double-Hopf bifurcation for [36]. In both cases a stable periodic attractor is born which has the physical interpretation of a traveling wave. Figure 1 shows the spatiotemporal properties of these waves: the period and the spatial wave number are plotted as a function of . In [36] it was proved analytically that the period tends to a finite limit as , but the wave number increases monotonically with .

The periodic attractor representing the traveling wave can undergo several subsequent bifurcations, such as period doubling bifurcations or Neĭmark-Sacker bifurcations. Further bifurcations lead to strange attractors via a multitude of routes to chaos which depend on the dimension [36]. Hence, the Lorenz-96 exhibits successive bifurcations of traveling waves. The spatiotemporal properties of the resulting waves are “inherited” from the periodic attractor that was born at the Hopf bifurcation. For an example, see Figure 2 for two traveling waves in dimension . A very similar scenario was found in a Galerkin projection of a shallow water model that was used to study the dynamical mechanisms behind atmospheric low-frequency variability [38].

**(a)**

**(b)**

The particular interest of this paper is the predictability of so-called extreme events in which an observable evaluated along an evolution of the system exceeds a threshold. Concrete examples are models for weather and climate in which extremes of physical quantities such as wind speed are of great importance [15, 16, 48]. For the Lorenz-96 model we will study the predictability of events of the formHence, if is an initial condition, then is the event that the amplitude of the traveling wave measured at the first “grid point” exceeds the threshold at time . For these events the distributions (4) and (5) not only depend on the prediction lead time , but also on the event threshold . In this paper we will study how this dependence is influenced by (nearby) bifurcations of the system. Note that due to the circulant symmetry of the Lorenz-96 model (6) the results obtained for the event defined in (7) will not change if the inequality is replaced by for any other value of .

For simple dynamical systems for which the invariant measure is known the distributions (4) and (5) can be computed analytically. However, for the Lorenz-96 model they have to be approximated by their empirical counterparts obtained from numerical simulations. In order to provide a good sampling of the attractor of the Lorenz-96 system as far as both local and global fluctuations are concerned we computed the distributions by means of an orbit on the attractor consisting of points with a time step of . The starting point of the orbit is obtained by a transient integration of time units using a random initial condition.

##### 3.2. Intermittent Periodicity for

Figure 3 shows the bifurcation diagram of the Lorenz-96 model for . The equilibrium becomes unstable at through a supercritical Hopf bifurcation. The periodic attractor remains stable until where it exchanges stability with another periodic attractor. However, at the original periodic attractor gains stability again. Finally, at , it disappears through a saddle-node bifurcation and a chaotic attractor is detected. Figure 4 shows the periodic attractor and the chaotic attractor just before and after the saddle-node bifurcation. The dynamics on the chaotic attractor consists of alternations between nearly periodic and chaotic behaviour. This is the classical type 1 intermittency scenario described by Pomeau and Manneville [49]. Note that, for intermittency to occur, it is not only necessary to have an attractor that disappears through a bifurcation, but also there has to be a global dynamical mechanism that enables recurrent visits to the location of the formerly existing attractor in state space. In the case of the Lorenz-96 system we have identified a nearby heteroclinic cycle between four equilibria that can provide such a mechanism; see [36] for further details.

**(a)**

**(b)**

**(a)**

**(b)**

Figure 5 shows the mean and the left and right endpoints of the distribution (4) for as a function of the parameter . Clearly, the variability of the singular values along the attractor increases very sharply after the saddle-node bifurcation. Note that the largest Lyapunov exponent in the bifurcation diagram of Figure 3 shows a more gradual increase of square root order except for the presence of narrow windows with periodic dynamics. For , which belongs to the chaotic regime after the periodic attractor has disappeared, the right endpoint of (4) shows large fluctuations: peaks can differ in magnitude by a factor of 10 or larger. The left endpoint suddenly decreases after the saddle-node bifurcation, which means that the predictability of some events can potentially be enhanced. In particular, this leads to the question whether the predictability of extremes can be enhanced.

Figure 6 shows graphical representations of the distributions (4) (in black) and (5) (in color) in the form of box plots for the prediction lead times . The support of the unconditional probability distribution (4) becomes larger as the lead time increases. More specifically, consider the right endpoint of the distribution which is defined as the largest singular value of the sample and which is a measure for worst-case predictability. A least squares fit computed over the lead times (not all shown in Figure 6) gives which shows that up to the right end point increases exponentially with the lead time. For larger lead times, however, tends to a constant. The exponential growth of for short lead times has been found earlier in the low-dimensional systems used in the study by Harle et al. [28] who also point out that for finite-size initial errors a power law behaviour will be observed.

**(a)**

**(b)**

Figure 6 also shows that the right endpoint of the conditional distribution (5) grows substantially slower than that of the unconditional distribution (4) for lead times . Figure 7 shows how the conditional distributions change as a function of the threshold for fixed lead times. For the interquartile range shifts towards the left endpoint, whereas the right end point is nearly constant. However, for the right endpoint decreases exponentially fast with the threshold quantile : a linear fit gives that . The main conclusion drawn from these observations is that for the worst-case error growth, as represented by the right endpoint of the distribution (4), increases exponentially with the prediction lead time , but errors for extreme events grow at a much slower rate. For the right endpoint of (4) remains nearly constant, but also in this case the right endpoint for the conditional distribution (5) remains several orders of magnitude smaller.

**(a)**

**(b)**

Figure 8 shows a time series of the Lorenz-96 model. Note that is plotted as a function of with rather than itself. By plotting in the same figure we can study whether initial conditions leading to extremes typically have small or large error growth rates and whether this is related to particular features of the dynamics. The time series clearly shows alternations of periodic and aperiodic dynamics, which is characteristic for type 1 intermittency. During intervals with periodic dynamics the singular values have a magnitude of . However, during interruptions of periodicity (which in Figure 8 are visible near and ) the singular values show very large spikes with typical magnitudes of and . This explains why the box plots in Figure 6 exhibit long tails. Note that in [33, 34] exponential tails have also been found for finite-time Lyapunov exponents near intermittent dynamics of the logistic map. Also note that in the intervals of aperiodic behaviour does not reach extreme values. These observations explain why in Figure 7 the right endpoint of the distribution decreases with the event threshold.

The computations by Pomeau and Manneville [49] suggest that in the type 1 intermittency scenario the maximal Lyapunov exponent grows like where is the parameter value of the saddle-node bifurcation. Such behaviour is indeed visible in Figure 3 with the exception of the presence of narrow windows with periodic dynamics. This suggests that further away from the bifurcation the system becomes more chaotic. A natural question then is how will the predictability of extremes behave further away from the saddle-node bifurcation? Table 1 shows the exponential growth of for for different parameter values . In each case a least squares fit computed over the lead times has been used. The behaviour is rather stable with with the exception of in which case a stable periodic attractor appears amidst the chaotic regime. Figure 9 shows a similar diagram as in Figure 6 but for the parameter value which is further away from the saddle-node bifurcation. These results suggest that the predictability of the event is rather stable across a broad range of parameter values for , except for periodic windows within the chaotic regime.

##### 3.3. Intermittent Quasi-Periodicity for

Figure 10 shows the bifurcation diagram of the Lorenz-96 model for dimension . The equilibrium becomes unstable at through a supercritical Hopf bifurcation. The periodic attractor remains stable until where it bifurcates through a Neĭmark-Sacker bifurcation. The resulting 2-torus attractor remains stable until where it disappears through a quasi-periodic saddle-node bifurcation [40, 50]. Figure 11 shows a Poincaré section of the quasi-periodic attractor before the bifurcation and the chaotic attractor just after the bifurcation. The trace of the formerly existing 2-torus attractor is clearly visible. The dynamics is characterized by alternations between quasi-periodic and chaotic dynamics. This is a form of intermittency but of a different nature than type 2 intermittency described by Pomeau and Manneville [49] since the latter scenario involves the disappearance of a stable periodic orbit instead of a 2-torus attractor.

**(a)**

**(b)**

**(a)**

**(b)**

In this case the box plots in Figure 12 indicate that predictability of the event does not increase with the threshold . The right end points of both distributions (4) and (5) grow approximately like as a function of . Also note that the interquartile range of (5) shifts towards larger values as increases. Table 2 shows that further away from the quasi-periodic saddle-node bifurcation (i.e., for larger ) the right endpoint of both the distributions (4) and (5) grows faster with . For the parameter values in Table 2 the corresponding box plots are qualitatively similar to Figure 12 and therefore they are not shown. These observations imply that initial conditions that lead to the extreme event are typically associated with large error growth rates.

**(a)**

**(b)**

The question is how can the unpredictability of extremes be explained in terms of the intermittent dynamics? Figure 13 shows a time series in which and are plotted as a function of for fixed. The time series clearly shows an episode of quasi-periodic dynamics which is interrupted for . During intervals of quasi-periodic dynamics the singular values have magnitudes ranging between 2 and 5. The 2-torus attractor at has singular values in the same range. During intervals of chaotic dynamics the singular values are typically much larger and have a magnitude ranging between 2 and 12. Also note that attains extreme values in both the quasi-periodic and the chaotic regime. These observations explain why extremes do not become better predictable with increasing threshold.

##### 3.4. Quasi-Periodicity for

Figure 14 shows the bifurcation diagram of the Lorenz-96 model for dimension . The equilibrium becomes unstable at through a supercritical Hopf bifurcation. The periodic attractor remains stable until where it bifurcates through a Neĭmark-Sacker bifurcation. The resulting 2-torus attractor remains stable until and a 3-torus attractor appears. Figure 14 suggests that the 3-torus attractor persists in a small interval of the parameter before it disappears at and a chaotic attractor is detected. It is unknown which bifurcation is involved in the disappearance of the 2-torus attractor; addressing this question is left for future work. The chaotic attractor persists until after which a 2-torus attractor is observed again.

The box plots in Figure 15 for show that, unlike in the cases and , the singular values do not grow exponentially with the lead time . For the box plots are qualitatively similar (not shown). Errors for are typically larger than for . The oscillatory behaviour weakens with increasing . This nonmonotonic behaviour of predictability is at odds with the results in [28] in which exponential or power law growth is conjectured to be typical for chaotic systems. However, nonmonotonic dependence of predictability on lead time has also been observed in wind speed predictions produced by an operational weather forecasting system [16, Figure ]. We expect that in more general systems with strong quasi-periodicity, for instance, related to diurnal or seasonal cycles, error growth rates will not follow an exponential or power law.

**(a)**

**(b)**

#### 4. Conclusion and Discussion

In this paper we quantified the predictability of a specific event in a dynamical system by comparing the growth rates of errors in initial conditions that lead to this event with growth rates for all initial conditions. Numerical experiments with the Lorenz-96 model show that the predictability of large amplitudes of traveling waves is influenced by the dynamical regime of the model. In particular, we have focused on intermittency scenarios in which episodes of regular and chaotic dynamics alternate. We have shown that predictability of extremes increases near a saddle-node bifurcation of a periodic orbit but decreases near a saddle-node bifurcation of a 2-torus attractor. Finally, near the breakdown of a 3-torus attractor we have observed a nonmonotonic dependence of predictability on lead time. The results in this paper show that the predictability of extremes in dynamical systems is not universal and warrant a further in-depth investigation to unravel generic dynamical mechanisms that lead to enhanced predictability of extremes.

We have studied the predictability in the model-driven framework (borrowing the terminology of [9]). The advantage of the approach outlined in this work is that distributions of error growth rates are computed in terms of the invariant measure of the system. In this way distributions of error growth rates for vanishingly rare events can be studied. For simple classes of dynamical systems the methodology can be used to derive rigorous results on predictability and this direction will be pursued in future work. A limitation of our method, however, is the explicit need of a dynamical system and its variational equations (also referred to as a tangent linear model). The latter problem may be remedied by replacing the singular values which maximize (3) by the quotients in (2) using a small, but finite, value of . Harle et al. [28] pointed out that for sufficiently small the growth rates of infinitesimal and finite-size errors behave similarly with prediction lead time.

Our work is written in the spirit of dynamical systems theory, and we have used a measure of predictability which fits into that framework. However, instead of using finite-time growth rates to measure predictability one could also use skill scores or receiver operator characteristic (ROC) curves. Such measures have the advantage that they can also be used in the framework of data-driven predictions in cases where a dynamical model is not available. The numerical experiments performed with the Lorenz-84 model by Bodai [9] suggest that finite-time Lyapunov exponents do not directly correspond to ROC-based measures. This would imply that assessment of predictability also depends on which measure is being used. An interesting question for further research is how can the results of different studies using different predictability measures be reconciled and under which circumstances do different measures for predictability lead to opposite conclusions?

We conclude this paper by remarking that the phenomenon of enhanced predictability of extreme events is not limited to toy models, but it also occurs in real-world applications. Recent work [16], based on output of the operational ensemble prediction system of the UK Met Office, has revealed that wind speed extremes are in general less predictable than nonextremes, but under certain conditions which are related to the distribution of the ensemble members they are better predictable. In addition, observational work shows that large-scale flow patterns, such as the North Atlantic Oscillation, cause temporal clustering of storms [51, 52]. Hence, we foresee that the predictability of extremes will remain an active topic of research in the near future.

#### Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.