Abstract

We examined models for population growth curves, contrasting integrated versions with various other forms. A sizable number of data sets for birds and mammals were considered, but the main comparisons were based on 27 data sets that could be fit to the generalized logistic curve. Akaike's information criterion was used to rank fits of those data sets to 5 integrated models. We found that the integrated models gave the best fits to the data examined. The difference equations examined gave much poorer fits as judged by AICc and coefficients of variation. We conclude that the integrated models should be used when possible.

1. Introduction

Most recent use of population growth curves has focused on difference equation models (also called “finite population models”). Such models may give a somewhat wider scope for applications and for theory than do the integrated versions of these models. However, the available integrated versions appear to give much better fits to actual growth curve data, raising some questions about the practical utility of the difference equation versions. We thus examine a number of integrated and difference equation models in this paper. The data used are for birds and mammals. Rather different results may apply for insects and some species of fish.

2. Methods

2.1. Models

The first of the integrated models used here is the generalized logistic, reported by Nelder [16] and by Pella and Tomlinson [17] but best known from Ayala et al. [18], and often termed the theta-logistic:

$$N(t) = \left[K^{-z} - \left(K^{-z} - N_0^{-z}\right)e^{-rzt}\right]^{-1/z}. \tag{1}$$

The corresponding difference equation is

$$N(t) = N(t-1) + rN(t-1)\left[1 - \left(\frac{N(t-1)}{K}\right)^{z}\right]. \tag{2}$$

A rearrangement of (2) is designated here as the “Sibly model” [19–22]:

$$\mathrm{pgr} = \frac{N(t)}{N(t-1)} - 1 = r\left[1 - \left(\frac{N(t-1)}{K}\right)^{z}\right]. \tag{3}$$

The Gompertz curve is

$$N(t) = K\exp\left(\log\left(\frac{N_0}{K}\right)e^{-bt}\right). \tag{4}$$

The difference equation in the form used by Dennis et al. [23] is

$$N(t) = N(t-1)\exp\bigl(r + b\log(N(t-1))\bigr). \tag{5}$$

The logistic growth curve is obtained by setting $z = 1$ in (1):

$$N(t) = \frac{K}{1 + ce^{-rt}}, \qquad c = \frac{K}{N_0} - 1. \tag{6}$$

The difference equation is

$$N(t) = N(t-1) + rN(t-1)\left[1 - \frac{N(t-1)}{K}\right]. \tag{7}$$

The exponential model is

$$N(t) = N_0\exp(rt). \tag{8}$$

The difference equation is

$$N(t) = N(t-1) + rN(t-1). \tag{9}$$

The equation of Morris and Doak [24] is

$$\hat{\mu} = \frac{1}{q}\sum_{i=0}^{q-1}\log\left(\frac{N_{i+1}}{N_i}\right) \tag{10}$$

with variance estimate

$$\mathrm{var}(\hat{\mu}) = \frac{1}{q-1}\sum_{i=0}^{q-1}\left(\log\left(\frac{N_{i+1}}{N_i}\right) - \hat{\mu}\right)^{2}. \tag{11}$$

In the notation used here $\mu = r$. A modification of (1), with $z = 2$, is considered here, as it has significant advantages. It is here denoted as a modified logistic:

$$N(t) = \left[K^{-2} - \left(K^{-2} - N_0^{-2}\right)e^{-2rt}\right]^{-1/2}. \tag{12}$$

In the aforementioned models, $K$ is the asymptotic value, $r$ is a rate of increase, $z$ is the parameter controlling the inflection point in a growth curve, $N_0$ represents initial population size, and $b$ and $c$ are functions of $r$ and $K$ in the Gompertz equations. The models have been fit by using nonlinear least-squares [25], as implemented in the R-language [26].
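To make the relationship between the integrated and difference forms concrete, the following is a minimal Python sketch of equations (1), (2), and (6). It is an illustration only, not the fitting code used for this paper (which relied on nonlinear least squares in R); the function names and the parameter values are hypothetical.

```python
import math


def generalized_logistic(t, K, r, z, N0):
    """Integrated generalized (theta-) logistic, equation (1)."""
    inner = K ** (-z) - (K ** (-z) - N0 ** (-z)) * math.exp(-r * z * t)
    return inner ** (-1.0 / z)


def logistic(t, K, r, N0):
    """Ordinary logistic, equation (6), with c = K/N0 - 1."""
    c = K / N0 - 1.0
    return K / (1.0 + c * math.exp(-r * t))


def difference_logistic(steps, K, r, z, N0):
    """Iterate difference equation (2); z = 1 gives equation (7)."""
    series = [N0]
    for _ in range(steps):
        prev = series[-1]
        series.append(prev + r * prev * (1.0 - (prev / K) ** z))
    return series


# Hypothetical parameter values, chosen only for illustration.
K, r, N0 = 1000.0, 0.2, 50.0

# Modified logistic (12) is equation (1) with z = 2.
integrated = [generalized_logistic(t, K, r, 2.0, N0) for t in range(31)]
stepped = difference_logistic(30, K, r, 2.0, N0)
```

Setting `z=1.0` in `generalized_logistic` reproduces `logistic`, and `z=2.0` gives the modified logistic (12). With identical parameter values the two series need not coincide, because (1) is continuous in time while (2) advances in unit steps.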

2.2. Akaike's Information Criterion

AIC is calculated as [27]

$$\mathrm{AIC} = n\log\hat{\sigma}^{2} + 2k, \qquad \text{where } \hat{\sigma}^{2} = \frac{\sum \hat{e}_i^{2}}{n}, \tag{13}$$

the $\hat{e}_i$ are deviations (residuals) from the model fit (assumed to be normally distributed), $n$ is sample size, and $k$ is the number of parameters in the model plus 1. Most of the available data sets have small samples, so Burnham and Anderson [27] recommended using

$$\mathrm{AIC}_c = \mathrm{AIC} + \frac{2k(k+1)}{n-k-1}. \tag{14}$$

Model comparisons are made using

$$\Delta_i = \mathrm{AIC}_c - \mathrm{AIC}_{\min}, \tag{15}$$

where $\mathrm{AIC}_{\min}$ is the minimum value calculated for a set of models.
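The calculations in (13) to (15) can be sketched in a few lines of Python. This is a minimal illustration under the definitions given above, not the code used for the analyses; the function names and the residuals in the usage example are hypothetical.

```python
import math


def aic(residuals, n_params):
    """Equation (13): n * log(sigma^2) + 2k, with k = parameters + 1."""
    n = len(residuals)
    k = n_params + 1
    sigma2 = sum(e * e for e in residuals) / n
    return n * math.log(sigma2) + 2 * k


def aicc(residuals, n_params):
    """Equation (14): small-sample correction (requires n > k + 1)."""
    n = len(residuals)
    k = n_params + 1
    return aic(residuals, n_params) + 2 * k * (k + 1) / (n - k - 1)


def deltas(aicc_by_model):
    """Equation (15): difference of each model's AICc from the minimum."""
    best = min(aicc_by_model.values())
    return {name: value - best for name, value in aicc_by_model.items()}


# Hypothetical residuals from two fits to the same 10 observations.
residuals = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0, 1.0, -1.0, 1.0, -1.0]
scores = {
    "two_parameter": aicc(residuals, 2),
    "three_parameter": aicc(residuals, 3),
}
ranking = deltas(scores)
```

Because the residuals are identical in this toy case, the model with the extra parameter receives the larger AICc, which shows how the correction term in (14) grows with $k$ when $n$ is small.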

2.3. Data Sources

Part of the data used here was obtained in 2006 from the Global Population Dynamics Database (GPDD) maintained by the National Environmental Research Council at http://www.sw.ic.ac.uk/cpb/gpdd.html. ID numbers for data used from the GPDD appear in the tables. Data sets from the GPDD were obtained by searching the entire data set (over 4000 entries) by inspecting plots of each data set and attempting to fit curves. An up-to-date version of the GPDD data was obtained in 2008 and searched for additional examples. Over 100 data sets were examined in detail, but those that could be fit to (1) are the primary data sets used here.

3. Results

3.1. Integrated Models versus Difference Equations

Table 1 contrasts the generalized logistic (1) with the corresponding difference equation (2), and the ordinary logistic (6) with its corresponding difference equation (7). Residual mean squares are used for this comparison because Eberhardt et al. [28] found that they performed better on data similar to those used here. We believe that the “adjustment factor” of Burnham and Anderson [27] (in (14)) may discriminate against models with the larger number of parameters. In nearly all instances, the integrated version is to be preferred, as shown by residual mean squares. The two apparent exceptions in Table 1 have highly variable data.

Table 2 makes the same contrast for the Gompertz equation (4) with the corresponding difference equation (5). Smaller samples are available because the Gompertz did not fit a number of the data sets. That the “Sibly” model gives even more variable results is illustrated in Table 3, where the Sibly model (3) has a larger coefficient of variation in all cases where a comparison could be made, with the exception of the Seneca deer data, which have essentially an exponential trend.

In those cases marked by an asterisk in Table 3, the estimates of the asymptotic value ($K$) are nearly the same for the difference equation (2), the Sibly model (3), and the generalized logistic (1). In all of these cases, the data reach an asymptotic value or closely approximate an asymptote. Coefficients of variation (standard error/$K$) show that the “Sibly” model (3) gives much more variable estimates than do the other two models considered. These are cases where no gaps existed in the count data. The “Sibly” model does not accommodate isolated data points and has to be fit in segments (yielding more variable estimates), whereas the integrated models can be applied directly in such cases.

Table 4 gives estimates of the coefficient ($z$) governing the inflection point in the generalized logistic model, suggesting use of 2.0 in the modified logistic (12). Table 5 contrasts the 5 integrated models using AICc, showing that the modified logistic is generally superior. Clark et al. [29] found, using simulations, that there appeared to be a linear relationship between $r$ and $z$ in the generalized logistic model. Using our data set on birds and mammals, we found a correlation between $r$ and $z$ (−0.51) and a distinctly nonlinear relationship. The data appear in Figure 1.

Table 6 compares the Morris and Doak [24] estimator ((10) and (11)) with exponential models fitted by nonlinear least-squares or by a linear fit with log-transformation. The median values indicate that all 3 methods give much the same values, but the Morris and Doak approach is far more variable, as indicated by coefficients of variation. The coefficients of variation show essentially no difference between the two approaches using exponential models. It should be noted that the Morris and Doak estimator (10) can be applied to virtually any data set, giving nonsensical results in some cases (only exponential-type data were used in Table 6). For example, Morris and Doak [24] used 39 observations on the Yellowstone grizzly bear data, as shown in their Figure 3.6 and Table 3.1. The problem is that the population was initially decreasing and then, as protective measures began to take effect, started to increase in the early 1980s and continued to do so for the remainder of the series. Consequently, the resulting estimate of $r$ (or $\mu$) is of no practical value. Eberhardt and Breiwick [11] give details and cite 7 published papers that largely ignore the change in trend and thus give largely meaningless results.
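The sensitivity of estimator (10) to a change in trend can be illustrated with a short Python sketch. The series below is entirely hypothetical (it is not the Yellowstone grizzly bear data) and the function name is an assumption made for illustration. Because the sum of log ratios in (10) telescopes to $\log(N_q/N_0)/q$, the estimate depends only on the first and last counts.

```python
import math


def morris_doak(counts):
    """Return (mu_hat, variance) following equations (10) and (11)."""
    log_ratios = [math.log(b / a) for a, b in zip(counts, counts[1:])]
    q = len(log_ratios)
    mu_hat = sum(log_ratios) / q
    variance = sum((x - mu_hat) ** 2 for x in log_ratios) / (q - 1)
    return mu_hat, variance


# Hypothetical series: 10 steps of decline at rate -0.05,
# followed by 10 steps of increase at rate +0.05.
declining = [100.0 * math.exp(-0.05 * t) for t in range(11)]
recovering = [declining[-1] * math.exp(0.05 * t) for t in range(1, 11)]
series = declining + recovering

mu_hat, variance = morris_doak(series)

# Same estimator applied to the increasing segment only.
mu_recent, var_recent = morris_doak([declining[-1]] + recovering)
```

For this constructed series `mu_hat` is essentially zero although the population is growing at a rate of 0.05 at the end of the record, whereas `mu_recent` recovers that rate. This mirrors the point above: applied across a change in trend, the estimate describes neither phase.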

3.2. Scope of Models

Clark et al. [29] extracted 99 data sets from the 1198 published by Brook and Bradshaw [20], using a set of criteria that included a minimum number (19) of “transitions” (required because they used difference equations), so all of the data they used had at least 20 observations. This data set contained 42 cases involving birds and mammals, which we screened with linear regression; 20 of these could be fit with the modified logistic (12) or with an exponential model (4 cases). Those that did not have significant slopes were examined with (10) and appeared to be essentially stationary series (correlation = 0), with the exception of a few instances (3) with an erratic pattern. There were just 4 cases in the Clark data that also fell in our sample (Fulmar 6527, Blue Wildebeest 7060, Sandhill crane 9990, and Blue tit 6830).

4. Discussion

The results given here indicate that, for practical purposes, the integrated models should be used for species like those considered here (birds and mammals). The Gompertz model may be preferred for some species of fish and for insects. Our analyses here have largely been restricted to data sets that can be fit by the generalized logistic (1). The modified logistic (12) and the ordinary logistic (6) can be fit to a much wider range of data. In a few cases, the exponential (8) may appear to give a better fit, but these appear to be largely instances where the data are limited to the early stages of population increase.

The recent ecological literature contains a wide range of difference equations (finite population growth models). We have studied 5 of these [28], but there are additional examples [20–23, 30–35]. One reason for the popularity of difference equation models is that they provide a wide range of models, some of which have no integrated analogs. A popular example is the Ricker model [36, 37]:

$$N(t) = N(t-1)\exp\left(r\left[1 - \frac{N(t-1)}{K}\right]\right), \tag{16}$$

where $r$ is the rate of increase and $K$ the asymptotic value, as before. This model was originally developed for cases where $r$ is quite large and thus may be useful for such situations. If $r$ is not large (as in all of the cases examined here), the model reduces to the difference equation version of the ordinary logistic, so that the ordinary logistic or (preferably) the modified logistic can be used. There may thus be some practical reasons to consider difference equations. In many other cases, it appears that the principal goals are the development of ecological theory.
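The reduction of the Ricker model (16) to the logistic difference equation (7) when $r$ is not large can be seen from a first-order expansion of the exponential. The following is a brief sketch, assuming that $x = r\,[1 - N(t-1)/K]$ is small in magnitude so that terms of order $x^2$ can be neglected:

```latex
\begin{aligned}
\exp(x) &\approx 1 + x, \qquad x = r\left[1 - \frac{N(t-1)}{K}\right], \\
N(t) &= N(t-1)\exp(x) \\
     &\approx N(t-1)\left(1 + r\left[1 - \frac{N(t-1)}{K}\right]\right) \\
     &= N(t-1) + rN(t-1)\left[1 - \frac{N(t-1)}{K}\right],
\end{aligned}
```

which is equation (7). The approximation degrades as $r$ grows, which is consistent with the Ricker form being of separate interest mainly when $r$ is large.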