Abstract

The need for improved quantitative precipitation forecasts and realistic assessments of the regional impacts of natural climate variability and climate change has generated increased interest in regional (i.e., systems-scale) climate simulation. The Salton Sea Stochastic Simulation Model (S4M) was developed to assist planners and residents of the Salton Sea (SS) transboundary watershed (USA and Mexico) in making sound policy decisions regarding complex water-related issues. In order to develop the S4M with a higher degree of climate forecasting resolution, an in-depth analysis was conducted regarding precipitation and evapotranspiration for the semiarid region of the watershed. Weather station data were compiled for both precipitation and evapotranspiration from 1980 to 2004. Several logistic regression models were developed for determining the relationships among precipitation events, that is, duration and volume, and evapotranspiration levels. These data were then used to develop a stochastic weather generator for S4M. Analyses revealed that the cumulative effects and changes of ±10 percent in SS inflows can have significant effects on sea elevation and salinity. The aforementioned technique maintains the relationships between the historic frequency distributions of both precipitation and evapotranspiration, and not as separate unconnected and constrained variables.

1. Introduction

The need for improved quantitative precipitation forecasts and realistic assessments of the regional impacts of natural climate variability and climate change has generated increased interest in regional (i.e., systems scale) climate simulation. Because climate warming associated with climate change will exacerbate water sustainability problems, heavily-populated, arid regions such as the Southwest USA are likely to experience some of the highest economic expenses and environmental losses [1]. For example, flows in the Colorado River have been projected to reduce from 10% to 50% by midcentury [2]. Other projections indicated that there would be more frequent and longer lasting droughts particularly in the Colorado River Basin in the latter half of the 21st century, suggesting future challenges regarding water supplies throughout southwest [3].

Weather generators can be used to produce synthetic weather sequences (which are time series of random numbers that resemble statistically the observations recorded in nature) and are useful for projecting climate into the future [4]. Most weather generators emphasize precipitation (Prcp) as a primary driving variable of interest and other hydroclimatic variables are either directly or indirectly affected by it [5]; for examples, see weather generator (WGEN) by Richardson and Wright [6], a geospatial-temporal (GiST) weather generator by Baigorria and Jones [7], and/or a two-stage resampling algorithm by Leander and Buishand [8]. Weather generators for simulating hourly Prcp and temperature can also be found, for example, downscaling-disaggregation weather GENerator (DD-WGEN) by Mezghani and Hingray [9] and advanced weather GENerator model (AWE-GEN) by Fatichi et al. [5]. Further, Rajagopalan et al. [10] generated Prcp independently with other climate variables, (e.g., solar radiation, temperature) conditioned on the status of Prcp (i.e., rain or no rain on a given day) using a multivariate nonparametric resampling scheme. The other climate variables can then be generated from either independent statistical distributions fitted separately to each of the variables for each of the two Prcp states (i.e., rain or no rain days) [10].

Stochastic weather models are commonly used because they are easy to calibrate, can be implemented quickly, often pertain to daily time scale [11], and can incorporate data from weather generators easily. Wilks [4] used a first-order, two-state Markov process to simulate daily Prcp, basically based on a conditional probability of a wet day following a dry day and a conditional probability of a wet day following a wet day. An assumption of this type of Markov chain model is that the probability of rainfall on any day depends only on whether it rained on the preceding day; however, the proper Markov order to represent the daily Prcp occurrence process cannot be assumed a priori but can only be determined through observational data analysis or via available results of investigation [12].

Most stochastic models of daily rainfall consist of two parts: a model for the occurrence of dry and wet days and a model for the generation of rainfall volume on wet days [13, 14]. Markov chain models specify each day in a time series as “wet” or “dry,” and develop a relationship between the state of the system at time (e.g., the current day) current day and the states of the preceding days (time ), where the number of preceding days determines the order of the Markov chain, for example, first order or higher orders [13]. Harrold et al. [15] used four different classes of rainfall amount, categorized according to the number of adjacent wet days whereby the model was conditioned on the rainfall amount of the previous day; this approach accommodates contiguous series of rainfall days (i.e., within-series correlations of rainfall amounts) by using the volume on the previous day as a conditioning variable. Many existing rainfall amount models ignore this correlation structure and assume that the volumes of Prcp at time are independent of future volumes (i.e., the volume of Prcp at time is independent of the volume occurring at time ) [15, 16]. However, Buishand [17] and Chapman [13] demonstrated that the distribution of rainfall volumes is different during solitary wet days compared to Prcp occurring over multiple, contiguous days [15]. Srikanthan and Pegram [18] describe a nested multisite daily rainfall stochastic generation model that preserved rainfall characteristics at the daily, monthly, and annual time scales. Srikanthan and Pegram [18] determined that the rainfall volumes on isolated wet days and sequences of wet days did not depend on the duration of the Prcp event as well as they should have.

Evapotranspiration (ETo) changes induced by changing climate conditions are not trivial in hydrologic modeling efforts or water resource management studies [19]. ETo demand is a sensitive parameter that needs to be accounted for; however, often due to limited observational data, it is often implicitly calculated through calibration efforts or mass balance formulation [19].

Although ETo models and procedures for determining ETo exist (e.g., McMahon et al. [20]), ETo is most often calculated based on the results of Prcp and temperature, for example, Beersma and Buishand [21], or in combination with other weather generated variables (e.g., Snyder et al. [22], Wilks [11]). Miller et al. [19] incorporated changing ETo demands with changing temperatures using the variable infiltration and capacity (VIC) model [23] for streamflow projections over the Colorado River headwater basins. More recently, Baigorria [24] simulated daily minimum and maximum temperatures with regard to covariance with rainfall occurrence, that is, max temperatures on rainy days versus nonrainy days. Snyder et al. [22] describe the development of “Simulation of Evapotranspiration of Applied Water” (SIMETAW) application program for helping California plan future water demand for agricultural purposes. SIMETAW uses only monthly averages of total rainfall volume and the number of rain days for simulating daily weather data and requires information for variables associated with a modified version of the Penman-Monteith equation [25].

There is a need to develop more precise methods for simulating Prcp and ETo that account for the natural interdependency of these two processes. This study utilized nonparametric models to determine the statistical relationships between ETo and Prcp, for incorporation of these relationships into hierarchical nested Markov chain models to preserve the interdependent nature of ETo and Prcp fluctuations. We used the Salton Sea (SS) watershed as a case study for our approach for modeling climate futures.

1.1. Study System

The Salton Sea (SS) watershed spans some 8,360 square miles (21,700 km2) in southeastern California and extends from San Bernardino County through Riverside and Imperial counties and into the Mexicali Valley, in Baja California, Mexico [26]. The terminal lake ecosystem of the SS is located in the Colorado Desert (33° 15′ N, 116° W) approximately 35 miles (56 km) north of the US-Mexico border (Figure 1) [27]. The SS is a major hydrologic element of the Lower Colorado River Basin (LCRB) and is considered important to the economic, social, and biological values of the region. However, it is suffering marked degradation as a consequence of human activity and although efforts to rehabilitate the SS ecosystem have been underway for more than a decade, they have had little success.

The present study addresses the need for an integrated systems simulation modeling approach for use in simulating complex water quality and water quantity management policies on both watershed and localized scales. The Salton Sea Stochastic Simulation Model (S4M) [28] is a spatially explicit, stochastic, simulation model, formulated as a difference-equation compartment model with a daily time step using STELLA v. 8.0. software [29] representing water flow (i.e., water volume) and quantity of total dissolved salts (TDS) and phosphorus (P) in the LCRB and SS watershed. Unlike previous models constructed for the SS that relied on datasets consisting mainly of monthly or annual averages, the S4M incorporated a high degree of seasonality and climate forecasting resolution. However, the S4M initially employed a common technique in modeling the uncertainty in future climate patterns by simply projecting a historic climate data sequence into the future (i.e., deterministic versions of the climate driving variables).

In order to further develop the S4M with a higher degree of seasonality and climate forecasting resolution, an in-depth analysis was conducted regarding Prcp and ETo for the semiarid region. ETo is one of the less understood components of the hydrologic cycle [30] but is a major component in terrestrial water balance models [31]. Basically, ETo is the sum of the volume of water used by vegetation (transpired), evaporated from the soil and the intercepted Prcp on vegetation [30, 32]. The difference between evaporation and transpiration is that the latter consists of the vaporization of liquid water contained in plant tissues and the vapor removal to the atmosphere while evaporation occurs at the topsoil if the water is available [30, 33].

Located in one of the most arid regions of North America, maximum temperatures around the Salton Sea may exceed 100°F (38°C) more than 110 days each year, while temperatures seldom drop below freezing. Annual Prcp in the region averages less than 3 inches (7.6 cm), while net evaporation rates from the sea’s surface and ETo exceed 66 inches (175 cm) annually [27, 34]. For comparison, the Sahara Desert, much like the Colorado Desert surrounding the SS, is characterized by bare soils and large amounts of available energy allowing for any rainfall to quickly return to the atmosphere [35]. Scott et al. [35] observed that the effect of a Prcp event in the Sahara desert environment had a negative effect on evaporation, but the effect rapidly decreased within the first day. Notably, the timescale of soil moisture storage determines the timescale of the ETo persistence and thus the timescale of humidity persistence in the near-surface atmosphere.

Several other models were developed to address site-specific alternatives for maintaining the Salton Sea. Two hydrodynamic models, RMA-2 and RMA-10, both formulated for the finite element solution method, were applied to simulate the circulation in the sea. This was done to quantify the effects that diked impoundments would have on the sea’s circulation and to better understand the sea’s circulation via a field monitoring program [36]. Another study conducted a couple of water quality simulations for the −240′ and −245′ southern impoundments with the BATHTUB model [37, 38]. The BATHTUB model uses a series of empirical submodels to predict the annual nutrient budgets and productivity levels in the water body that predicted mean annual water quality in two proposed impoundment configurations [38]. A UC Davis hydrodynamic model of the Salton Sea was used to estimate the effects of changes in sea elevation [39], while Chung et al. [40] developed a linked hydrodynamic and water quality model.

The primary objective of this study was to develop a more precise methodology for generating ETo and Prcp as interdependent stochastic driving variables that can be used in large-scale systems models. A second objective was to implement the stochastic ETo and Prcp driving variables in the S4M and determine the effects, if any, of a ±10 percent change of inflow volumes to the SS, thereby addressing future climate change uncertainty of this critically important watershed.

2. Methods

2.1. Determining Statistical Distributions of Historic Data

Following the method of Naoum and Tsanis [30], daily estimates, from 1980 through 2004, of ETo and Prcp were obtained from two meteorological stations [41, 42] in the Salton Sea watershed (Figure 1). Specifically, weather stations in near proximity to the North end and South end of the SS were used, that is, station IDs: Thermal and Indio (TI) and Brawley and Calipatria (BC), respectively. Data points were averaged to get estimates of central tendencies in ETo and Prcp for the SS following methods established in Voinov et al. [43], Bhuyan et al. [44], and Salton Sea Ecosystem Restoration Program [45]. The compiled, averaged 25-year dataset will be henceforth referred to as the averaged dataset.

Daily values of Prcp and ETo data from the averaged weather station data were grouped by month; then each month was tested against thirty-four probability distributions (Table 1) in order to determine the best statistical fit for each month. Monthly distributions were used to preserve seasonality when modeling future climate scenarios, while not solely restricting future climate scenarios to historical values. EasyFit Version 1.3, from MathWave Technologies [46], was used for curve-fitting analyses. The theoretical distribution that provided the “best” fit for each month was determined based on the Kolmogorov-Smirnov (K-S) and Anderson-Darling (A-D) tests of statistical significance, as well as a visual comparison of the fitted curves to the historic frequency distribution.

2.2. Statistical Models Used to Determine Relationship between ETo and Prcp

We hypothesized that different durations of Prcp events would have different effects on ETo, and that the ETo volume of a Prcp event at time would depend on the volume (or absence thereof) of Prcp at time . Therefore, individual categories representing single versus multiple Prcp events and corresponding volumes were sorted into subset datasets, along with their respective ETo volumes. In order to thoroughly test these hypotheses, 17 different variables (3 continuous, 14 discrete/categorical) were created from the averaged dataset (Table 2). Because the SS is located in an arid region, many of the daily Prcp values are zeros, 90 percent, or more in many cases. As a result of many days without Prcp, the Prcp data exhibited a mixed distribution with a high number of observations having a value of zero and a continuous distribution when there actually were Prcp events. Traditional time series analysis to account for autocorrelation between observations does not handle datasets with such high zero counts, such as those for ETo and Prcp. Therefore, to determine the relationships among Prcp events and between Prcp events and ETo levels, binomial and multinomial logistic regression models were used [47]. Preliminary analyses indicated that the data were not overdispersed. From the 17 variables, we developed 16 different statistical models to test the relationship between ETo and Prcp (Table 3). The dependent variables in the logistic regression models were coded as described in Table 2 and tested with binomial or multinomial logistic regression depending on the number of levels [48]. The normality of the standardized residuals, the assumption of linearity in the logit, and a Hosmer-Lemeshow test [48] were used to determine if a model fits the data. A classification table was created using 0.5 as a cut-off point to determine the predictive power of the model, and likelihood ratio tests were used (at ) to determine model significance [47, 48]. Results from the model selection procedure were used as the new, stochastic climate module for the S4M. Stata Statistical Software v. 9.0. [49] was used for the binomial and multinomial logistic regression statistical analyses.

2.3. Salton Sea Simulation

The S4M was constructed to represent water flow in the LCRB as it enters the SS and Colorado River Delta, where it subsequently flows, albeit intermittently, to the Gulf of California. The S4M specifically accounts for the water volume and water quality in the SS and is formulated as a compartment model based on difference equations with a daily time step using STELLA 8.0 software [29]. ArcGIS v. 9.0. software [50] was utilized for map making, with SS shapefiles obtained from the University of the Redlands Institute [51].

Previously, the strategy used in modeling the uncertainty in future climate patterns in the S4M consisted of using a deterministic version of the driving variables in which the historic pattern and number of weather events were preserved, that is, the past and present as the future. Using the methodology described above, a stochastic version of the driving variables was implemented and evaluated. This new module was evaluated by comparing observed and simulated data as a means to assess the performance of the simulation model similar to Tong and Chen [52].

In addition, a climate sensitivity analysis was conducted to observe the cumulative effects of ±10 percent change of inflow volumes to the SS, if any, on sea elevation and salinity, thereby addressing another aspect of future climate uncertainty. ANOVA and Bonferroni multiple comparisons post hoc tests were performed for SS salinity and elevation variables under both deterministic and stochastic versions of the model using the statistical software SPSS v. 12.0.1. [53].

3. Results and Discussion

A comparison of the averaged weather station historic datasets, that is, TIBC datasets, versus the individual weather station historic datasets was made for both Prcp and ETo. The TIBC monthly datasets preserved seasonality for both Prcp and ETo as illustrated in the figures (Figures 24).

Notably, for the months of February, March, and April the average monthly Prcp volume was lower than the actual weather station historic data. This occurred for two reasons: (1) the disparity in the number of observations between the two weather station datasets and the average resulting in the largest number of observations between the two, and (2) the concomitant decrease in values due to averaging, for example, one weather station having a higher Prcp volume on a given day and the other having a much lower value for the respective day. Instances of missing data in the BC dataset required the replacement of the missing data with TI weather station values, and in such cases, often with a volume of 0 due to the high frequency of days without Prcp events.

The averaging of the weather station data for both Prcp and ETo provided a complete dataset for a 25-year period (1980–2004) for the SS area. Although the pattern of monthly Prcp volumes was preserved when the two separate whether station datasets were averaged, the frequency of Prcp events of a given duration was somewhat inflated. More specifically, the averaging increased the number of consecutive Prcp events when comparing the BC dataset, having a maximum of 7 consecutive events and the TI dataset having a maximum of 9 consecutive events with the TIBC averaged dataset having a maximum of 10 consecutive events. The frequency of smaller events was inflated as well. For example, Prcp events of five or more days in duration were as follows: BC with 7 instances, TI with 13 instances, and the TIBC averaged dataset with 36 instances. Similarly, events of 8 days in duration or more were as follows: BC with 0 instances, TI with 3 instances, and the TIBC averaged dataset with 8 instances. As a result, some months in the TIBC averaged dataset experienced inflated Prcp event durations more than others, for example, January, September, and December. Therefore, the BC weather station dataset statistical relationships, concerning subsequent Prcp events, were implemented and not those based on the averaged dataset. The aforementioned strategy was undertaken as a means to avoid the inflated frequencies of multiple Prcp events being incorporated into the simulation model. The inflated values demonstrate the potential for introduced error in modeling when using averaged data.

3.1. Determining Statistical Distributions of Historic Data

Graphs of the curve fitting results based on the historic frequency distributions for both Prcp and ETo provided a visual aid for determining which theoretic probability distribution gave the “best” fit. The curve providing the “best” fit and associated levels of significance (-value) for the Kolmogorov-Smirnov (K-S) and Anderson-Darling (A-D) test statistics were recorded for each weather station dataset as well as the TIBC averaged dataset. The results of the two test statistics differed at times, that is, listing two different curves as providing the “best” fit. In the aforementioned situation, a final decision pertaining to the “best” fit was made based on a visual assessment of the figures. A summary of the overall curve fitting results can be found in Kjelland [28].

The curve-fitting exercise resulted in similar distributions providing the “best” fit to the data; however, the distribution for the TIBC averaged dataset for days of individual months often varied, more so in the case of ETo than for Prcp. For example, the most common distribution type providing the “best” fit for both the BC and TI ETo datasets was a Wakeby distribution, but the TIBC averaged dataset (average of the two) more closely resembled Wakeby, General Logistic, and General Extreme Value distributions, depending upon the month in question. The most common distribution types providing the “best” fit for both BC and TI Prcp monthly datasets were Gamma and Exponential distributions, but the TIBC averaged dataset more often resembled Gamma and Rayleigh distributions. The aforementioned differences demonstrate some of the compromises as a result of averaging the datasets.

3.2. Statistical Models Used to Determine Relationship between ETo and Prcp

It was hypothesized that Prcp and ETo would generally exhibit a negative relationship in the area surrounding the SS, that is, ETo volumes decreasing with Prcp events. Also, the volume and duration of Prcp events would be important factors to consider when exploring any relationship between Prcp and ETo. The two logistic regression models (Equations (1) and (2)) in Table 3 established that some months were significantly different than others concerning the amounts of Prcp and ETo.

The negative relationship between Prcp and ETo was tested statistically using two multinomial logistic regression models and one binomial logistic (logit) regression model (Equations (3), (4) and (5)), respectively (Table 3). The logit model (Equation (3)) resulted in negative coefficients () for all four categories of “PrcpAmt” and had a Prob of and Pseudo of 0.105. A likelihood-ratio test for independent variables resulted in a of 382.6 and Prob of . Similarly, the logit model (Equation (4)) resulted in negative coefficients (), with an increasing trend, for all five categories of “CatEvent” and had a Prob of and Pseudo of 0.124. Also, a likelihood-ratio test for independent variables resulted in a of 439.9 and Prob of . The logit model (Equation (5)) resulted in a statistically significant () and negative coefficient for the variable ETo and an overall Pseudo of 0.131.

The relationship between the volume of a single Prcp event and subsequent Prcp event volumes and associated changes in ETo was tested statistically using the logit model (Equation (6)) in Table 3. The model resulted in negative coefficients for all four categories of “CatVol” and had a Prob of and Pseudo of 0.123. The categories 1, 2, and 4 of the nominal categorical variable “CatVol” were significant (). Also, a likelihood-ratio test for independent variables resulted in a of 433.236 and Prob of .

The relationship between different Prcp volumes and associated changes in ETo before a Prcp event and the day of a Prcp event was tested statistically using the logit model (Equation (7)) in Table 3. The model resulted in statistically significant () positive coefficients for categories 1, 2, and 3 of “EToFirst” (using category 0 as the base outcome) and had a Prob of and Pseudo of 0.423. Also, a likelihood-ratio test for independent variables resulted in a of 1026.596 and of . Another logit model, (Equation (8)) in Table 3, showed that the categories of 1, 2, and 3 of the dependent nominal categorical variable “EToFirst” were less than the base outcome, as all had negative and significant () coefficients.

The relationship between ETo the day of a Prcp event (or the first day in a series of events) and the day after the event (or series of events) was tested statistically using the logit models (Equations (9)–(13)) in Table 3. The logit model (Equation (9)) resulted in positive coefficients () for categories 1, 2, and 3 of “ETolev2,” for both independent variables and had a Prob of and Pseudo of 0.494. Also, a likelihood-ratio test for independent variables resulted in a of 82.876 for “PrcpAmt” and 317.128 for “CatEvent” and both had a of . Another logit model, (Equation (10)) in Table 3, showed that “ETolev2” categories 1, 2, and 3 all had negative coefficients, although category 1 was not significant, . Two additional variables to measure the effect of month (“Month”) and the respective comparisons of Prcp volumes (“CatVol”) associated with single versus subsequent Prcp events were included in another logit model (Equation (11)) in Table 3. The aforementioned model resulted in some negative coefficients for the independent variables of “CatVol” and “Month” but not “PrcpAmt” and “CatEvent” and had a Prob of and Pseudo of 0.495. The independent variables “CatVol” and “Month” were not significant () for any of the three categories of the dependent nominal categorical variable “ETolev2.” A likelihood-ratio test for independent variables resulted in a of 1.541 for “CatVol” and 0.906 for “Month” and Prob of 0.673 and 0.824, respectively. Models (Equations (12) and (13)) in Table 3, measuring the relationship between Prcp and ETo and the percent change in ETo the day of the event, as compared to after the event, both had statistically significant () and negative coefficients and a Pseudo of 0.061 and 0.024, respectively.

The relationship between ETo the day before a Prcp event (or sequence of Prcp events) was tested statistically using the logit model (Equation (14)) in Table 3. The logit model (Equation (14)) measuring the relationship between Prcp and ETo before the day of the event had statistically significant () and negative coefficients and a Pseudo of 0.038.

The relationship between ETo the day after a Prcp event (or after the last Prcp event in the sequence) was tested statistically using the logit model (Equation (15)) in Table 3. The logit model (Equation ) measuring the percent change in ETo after the Prcp event had a Pseudo of 0.016 with positive coefficients for the independent variables. Category 2 of the dependent nominal categorical variable “Prcpbefafter” was statistically significant () whereas category 1 was not ().

The general relationship between ETo the day of the first Prcp event (or the first day in a series of events) compared to days without Prcp (or days having Prcp events that are not the first day in the series of events) was tested statistically using the logit model (Equation (16)) in Table 3. The model resulted in a statistically significant () negative coefficient and had a Prob of and Pseudo of 0.015.

The Prcp data in this study demonstrated that the majority of the rainfall occurred during the winter months. Also, months with a higher percentage of Prcp events had a larger percentage of days with low (0.21 inches or less) ETo (Figure 2(c)). And like the study by Scott et al. [35], where the effect of a Prcp event in the Sahara desert environment had a negative effect on evaporation, the results herein indicated that the same was true for ETo in the semiarid environment around the SS. However, unlike the results of Scott et al. [35] that showed the effect of a Prcp event on evaporation rapidly decreasing within the first day, the results of the present study elucidate a more complex relationship.

There are two important variables that play a role in the relationship between Prcp and ETo amounts, namely, duration (“CatEvent”) and volume (“PrcpAmt”). The results demonstrate that a longer Prcp event is more likely to suppress ETo volumes and for a longer time period. Likewise, larger Prcp volumes suppress ETo volumes more so than the small Prcp volume events; however, larger Prcp volume events also tend to have a higher proportion of the largest ETo volumes compared to small Prcp events. Therefore, the variable “CatEvent” would seem to be a better predictor of ETo volumes, and the multinomial logistic regression models support this.

The chances of a decrease in ETo the day of a Prcp event compared to the day before the Prcp event were twice as likely (62 percent) than ETo increasing the day of the Prcp event (31 percent). Further, lower Prcp amounts had a slightly higher probability (42 and 43 percent, resp.) for categories 1 and 2 of the ordinal categorical variable “PrcpAmt” versus (35 and 29 percent, resp.) for categories 3 and 4. The probability of ETo more than the day before the Prcp event increased for category 4 of “PrcpAmt” (32 percent) versus (29 percent) for categories 3 and 1, respectively, of the variable “EToFirst.” The overall negative effect of Prcp on ETo levels was supported by Equations (3)–(5) respectively, showing statistically significant () and negative coefficients for all categories of “PrcpAmt,” “CatEvent,” and “RainEvent.” Compared to days without Prcp events, days with Prcp events had significantly lower ETo volumes. Similarly, all categories of the ordinal categorical variable “CatEvent” measuring duration of Prcp events had statistically significant () negative coefficients and showed that the longer the duration of the Prcp event, the larger the decrease in ETo levels, in general. Also, the Pseudo was somewhat larger for the variable “CatEvent” versus “PrcpAmt” (0.124 versus 0.105, resp.) meaning that the duration of the Prcp event was a slightly better predictor of the decrease in ETo volumes. The nominal categorical variable “CatVol” distinguished between a single or subsequent Prcp event taking into account volume and supported these conclusions based on the logit model (Equation (6)).

The logit model (Equation (8)) with the dependent nominal categorical variable “EToFirst” measured whether ETo the day of a Prcp event was less than, equal to, or greater than before the Prcp event. The logit model (Equation (8)) demonstrated that ETo volumes the day of a Prcp event were significantly less than ETo volumes on days without Prcp. Further, the plot of the Prcp and ETo observations by month (Figure 2) revealed that months with a higher percentage of rain days also had a higher percentage of days with ETo ≤0.21 inches. ETo and Prcp exhibited a negative relationship overall, as initially hypothesized.

When observing recovery time using the nominal categorical variable “ETolev2” (Equations (9)–(11)) measuring whether ETo is the same after a Prcp event (or the last event in the series of events) as during a Prcp event (or the last day of Prcp in the series of events), there was a greater likelihood of ETo increasing the day after the Prcp event or series of events versus decreasing (38 versus 21 percent, resp.). The greater likelihood of ETo increasing the day after the Prcp event (or series of events), versus decreasing, was also the case for events lasting more than one day, for example, 13 versus 59 percent (resp.) for two-day events. The recovery time or increase in ETo after the Prcp event was more likely with lower Prcp volumes than for higher Prcp volumes, for example, 50 percent for category 1 versus 29 percent for category 4 of the dependent variable “PrcpAmt.” Also, the logit model (Equation (16)) with the dependent nominal categorical variable “PrcpOn” showed a negative change in ETo on the day of a Prcp event, or first day in a series of events when compared to days without Prcp events. Moreover, the logit model (Equation (12)) with the dependent nominal categorical variable “Prcponafter” showed that compared to days without Prcp, the first day of a Prcp event produced a lower ETo volume (coefficient of −5.842) while days after a Prcp event (or the last day in a series of events) also had a lower ETo volume (coefficient of −4.624). Clearly, the first day of the Prcp event had a larger ETo volume than the day after the Prcp event (or series of events) indicating some recovery from the Prcp event. The recovery time of ETo was tested using another logit model (Equation (13)) using the same dependent variable “Prcponafter” but with the independent variable “EToPerCh,” measuring the percent change in ETo from one day to the next. The logit model (Equation (13)) results showed that when compared to days without Prcp, ETo decreased on the first day of a Prcp event. However, a positive change in ETo the day after the Prcp event (or the last day in a series of events) means ETo increased when compared to days without Prcp events.

An analysis of the frequency of observations for each category of the variable “EToOne” showed that ETo the day after a sequence of Prcp events is generally less than the ETo the day before the sequence of Prcp events. In fact, the ETo volume after the Prcp event is less than the ETo volume the day before the first Prcp events 48 and 50 percent of the time for larger Prcp volume events. Therefore, the frequency that recovery time is within a day of a Prcp event only occurs about 50 percent of the time for larger volume Prcp events.

The logit model (Equation (14)) demonstrated that there is actually less ETo the day before a Prcp event as compared to days without Prcp. Moreover, compared to days without Prcp there is typically less ETo the day after the Prcp event, or last day in a series of events. The other logit model (Equation (15)) reveals a positive increase in ETo before a Prcp event and a positive increase the day after a Prcp event (or series of events) when compared to days without Prcp events. The analysis of single versus subsequent Prcp events result showed the probability of a given Prcp volume for a second consecutive Prcp event based on the Prcp volume of the first Prcp event.

3.3. Salton Sea Stochastic Simulation Model (S4M)

Based on the complex relationships revealed by the logistic regression models, a methodology to generate ETo and Prcp as driving variables in a simulation model other than using the historic data time series (in other words not copying the historic data time series and pasting it to represent the future data time series) was developed for ETo and Prcp data within the SS watershed. However, the technique used in this study maintains the relationships between the historic frequency distributions and interdependency of ETo and Prcp (Figure 2), instead of treating them as separate unconnected variables in the model. Curves for statistical distribution curves were fit to the historical ETo and Prcp data for the occurrence of a precipitation event, the duration of an event, the volume of the event, and the volume of evapotranspiration occurring on that day. The simulation processes the new module following the logic in Figure 3. For any time step for a given month, there are four steps: (1) the probability of a Prcp event occurring is randomly selected from the statistical distribution for that month; (2) if Prcp occurs, then a duration of the event is chosen from the probability distribution of durations for that month (with maximum durations being 7 days); (3) a volume of Prcp is calculated from the probability distributions of volumes for events of that duration for that month; and (4) ETo is generated from the probability distributions associated with Prcp events of that duration for that month (Figure 3). The ability of the S4M to reasonably simulate the fluctuations in Prcp and ETo volumes, duration of Prcp events, all while preserving the relationships between the patterns between them is evident in Figure 4.

3.4. S4M Evaluation

The deterministic version of S4M was evaluated in Kjelland [28]. When the stochastic weather simulator was added, the model simulated historic sea elevations reasonably well with a difference of only 1 foot at its most disparate point, −228 fasl (simulated) and −229 fasl (historic), and maintained patterns of seasonality. Based on the observed versus simulated sea elevation data, the model has an error rate of less than 1%. These sea elevation results are comparable to IIDWD and CH2MHILL [54], indicating that S4M has a reasonable ability to simulate the hydrology of the Salton Sea. Figure 4 shows that Prcp and ETo and patterns are similar when comparing the simulated versus historic data, in both magnitude, that is, volumes, and frequency. With regard to TDS, the model validation simulations resulted in the most disparate underestimation of 2,999 mg/L (44,788 mg/L simulated versus 47,787 mg/L observed) compared to the most disparate overestimation of 2,541 mg/L (42,963 mg/L simulated versus 40,422 mg/L observed). Based on the observed versus simulated sea TDS data, the model has an error rate of about 7% at its most disparate points and can be regarded as reasonable given that subsequent years’ error rates are much less and the sea’s general salinity trend does not change. The model verification results for salinity in a study by IIDWD and CH2MHILL [54] yielded an error rate of approximately 6% at the most disparate point (1980 to 2000), similar to results of the Salton Sea Ecosystem Restoration Program [55].

The S4M results pertaining to the baseline trends in elevation and salinity projections for the year 2024 are similar to the results from other studies. For example, the Colorado River Board of California [56] used sequential cycling of historic conditions as the basis for future inflow conditions and estimated Salton Sea elevations from approximately −231.4 to −233 fasl and a salinity of 67,000 to 79,600 ppm by the year 2020. Cohen and Hyun [27] showed a Salton Sea elevation of −233.6 fasl and a salinity of 60,000 ppm by the year 2018, and an elevation of approximately −245 fasl with salinity between 90,000 and 100,000 ppm by the year 2024. The simulated Salton Sea elevation and salinity results were −234 fasl and 75,000 ppm, respectively, by the year 2030 and assumed reduced future inflow conditions [39]. Moreover, current conditions show that model simulations for salinity are very similar to those of today, about 48,055 ppm (48 g/L) [57] to about 51,000 ppm (approximately 51 g/L) [58].

3.5. Salton Sea Climate Scenarios Using Stochastic Generator

Three future SS climate scenarios were examined: (1) no change in ETo, Prcp, and river flows (Baseline) versus (2) a scenario of a ten percent increase in Prcp and river flows, and (3) a scenario of a 10 percent decrease in Prcp and river flows (most likely scenario according to Seager et al. [59]). In the climate sensitivity analysis using the historic driving variables ETo, Prcp, and river flows, that is, the past projected into the future version of the model, the elevation of the sea at the end of 2024 was −235.38 fasl with a salinity of 59,292 mg/L under baseline conditions versus −233.38 and −237.09 fasl and salinities of 56,533 and 61,903 mg/L for scenarios 2 and 3, respectively (Figure 5, Table 4). In the stochastic version of the model, the elevation of the sea at the end of 2024 was −235.36 fasl with a salinity of 59,025 mg/L for the baseline version of the model versus −233.4 and −237.06 fasl and salinities of 56,335 and 61,598 mg/L for scenarios 2 and 3, respectively (Figure 5, Table 4). For both deterministic and stochastic versions of the model, the ANOVA (Tables 4 and 5) and Bonferroni multiple comparisons post hoc tests results revealed a statistically significant difference () for the sea’s elevations and salinities between the baseline climate scenario versus climate scenarios 2 and 3, as well as the elevations and salinities between climate scenarios 2 and 3 themselves.

4. Conclusions

The results of the logistic regression and curve fitting analyses provided valuable insight into the dynamics between single versus multiple Prcp events and the interaction between ETo and Prcp in the SS watershed, both of which are essential to the development of the aforementioned future climate scenario methodology. Results show that months with a larger percentage of Prcp events had a larger percentage of days with low ETo. Similar to the Sahara desert study by Scott et al. [35], the effect of a Prcp event had a negative effect on ETo in the semiarid environment around the SS but elucidated a more complex relationship. The results demonstrate that the longer the duration of the Prcp event, the larger the decrease in ETo volumes, in general, and for a longer period of time. Likewise, larger Prcp volume events suppress ETo volumes more so than smaller Prcp events. Based on the multinomial logistic regression models, the duration of the Prcp event is a slightly better predictor of ETo volumes than the Prcp volume associated with the Prcp event. Overall, ETo and Prcp exhibited a negative relationship.

The present study quantifies the relationships between ETo and Prcp in a semiarid region and provides a technique for maintaining these relationships in research involving stochastic climate simulation modeling. The overall low Pseudo values of the multinomial regression models lend support for using this strategy. The curve-fitting results for each monthly dataset resulted in similar distributions providing the “best” fit to the data; however, the distributions for the averaged monthly TIBC dataset often varied, more so for ETo than Prcp. Therefore, if more detailed weather patterns and accuracy in resulting fluctuations are important to the research question being addressed, then individual distributions should be incorporated for respective months when modeling future climate scenarios, such as in the SS watershed.

Concerning climate futures, a comparison can be made between the two strategies that were used in modeling the uncertainty in future climate projections: (1) the deterministic version of the driving variables and (2) the stochastic version of the driving variables. The difference between end simulation baseline mean elevations between the two strategies was less than 3 feet, that is, range of baseline minimum and maximum. The climate sensitivity analyses revealed that the cumulative effects and change of ±10 percent in SS inflows over the period of analysis can have significant effects () on sea elevation and salinity, thereby demonstrating the importance of including climate uncertainty in the model.

According to climate model projections by Seager et al. [59], the Colorado River headwaters are expected to have an annual stream flow decline of 10 percent and as much as a 20 percent drop in spring runoff in California and Nevada, as warmer temperatures of 1-2°C also boost evaporation in 2021–2040. Given that the S4M has been constructed, tested, and validated, one can use it to test many different climate scenarios and the implications that climate change may hold for policy making in the region. One can use the climate modeling method constructed and tested herein to test many different climate scenarios and the implications that climate change may hold for policy making in the region, as well as applying the technique to other semiarid regions. In future research, alternative future scenarios can be defined and used to explore the hydrologic and environmental implications of variations in, among other things, climate change and water-related policy decisions.

Conflict of Interests

The authors declare that there is not a conflict of interests regarding the present research.

Acknowledgments

The authors would like to thank Craig Forster, Edith González Afanador, and Ann L. Kenimer for their assistance. T. W. Clumpkin provided a thorough review. Financial support for this study was provided by the Southwest Consortium for Environmental Research and Policy (SCERP) through a cooperative agreement with the US Environmental Protection Agency.