Abstract

The classical population growth models include the Malthus population growth model and the logistic population growth model, each of which has its advantages and disadvantages. To address the disadvantages of the two models, this paper establishes a grey logistic population growth prediction model, based on the modeling mechanism of the grey prediction model and the characteristics of the logistic model, which uses the least-squares method to estimate the maximum population capacity. In accordance with the data characteristics of population growth, the weakening buffer operator is used to establish the weakening buffer operator grey logistic population growth prediction model, which improves its accuracy, thus improving the classic population prediction model. Four actual case datasets are used simultaneously, and the two classical grey prediction models are compared. The results of the six evaluation indicators show that the effects of the new model demonstrate obvious advantages. Finally, the new model is applied to the population forecast of Chongqing, China. The prediction results suggest that the population may reach a peak in 2020 and decline in the future. This finding is consistent with the logistic population growth model.

1. Introduction

Generally, a population that is too large or too small is not conducive to the overall development of a country. In 1979, British economist Thomas Robert Malthus proposed two well-known views of growth. Without hindrance, the population will grow geometrically, while the social resources on which humans depend for survival will grow arithmetically [1]. The Malthus population growth model is one of two classical population growth models; the population growth calculated by this model is infinite, but the balance between the population and social resources is disturbed after a period of unlimited population growth. Humans face various problems caused by the shortage of social resources. DeFries’ et al. [2] studied the problem of station failures driven by urban population growth and agricultural trade in the first two centuries. Bloom and Freeman [3] studied the effects of rapid population growth on the labor supply and employment in developing countries. Abundant data indicate that a population that is too large will have many negative effects on development.

Furthermore, with a decrease in a population, the aging of the population is a significant problem. Currently, the aging problem generally occurs in developed countries and some developing countries. In the absence of response measures, there may be a shortage in the labor force and national defense forces. It is well known that China is a country with a large population, but the problem of aging in China’s population is worsening; thus, the labor force is also shrinking. Zhong [4] studied the impact of population aging on income inequality in developing countries using the situation in rural China as an example, showing that the phenomenon of population aging exerts a serious impact on income inequality.

Therefore, the population not only affects economic development and environmental problems but also is closely related to social stability, labor employment, and the utilization of resources related to sustainable development. In light of the various influencing factors, it is particularly important to use scientific and accurate methods to predict and control the population and provide a reliable scientific basis for the government in making strategic population decisions [5].

In recent years, further progress in population prediction has been achieved; for example, Winiowski et al. [6] used the Bayesian extended Lee-Carter method to predict population growth. Raftery et al. [7] used Bayesian probability to predict the populations of all countries. Shang et al. [8] used the multifunctional level data method to predict the populations of all countries. The population of the United Kingdom has also been predicted. Bryant and Graham [9] used Bayesian demographic accounts to estimate the subnational population. Other papers have also contributed to population prediction [10, 11].

Currently, forecasting methods are generally divided into Bayesian methods [6], blocking growth models [12], and curve fitting [13]. Most methods are based on predicting the future population based on a time series. In addition to predicting population using a time series, the blocking growth model provides a particular description of the population’s development. There are two main classical models: the Malthus population growth model and the logistic population growth model. Notably, the Malthus population growth model is suitable when the population is low and resources are abundant. In addition, in this model, the population grows infinitely. To solve this problem, Dutch biologist Verhulst introduced a constant, that is, Nm (representing the maximum population that natural resources can accommodate), and proposed a logistic population growth model to avoid the problem of infinite population growth in the Malthus model. However, the logistic population growth model only needs three types of data, which can be used to obtain the two parameters of a binary nonlinear equation system. Because finding the analytical solution in a nonlinear equation system is very difficult and the results of the numerical solution may differ when using different methods, the prediction may be distorted. Therefore, this paper combines the differential equation of the logistic model with the modeling mechanism of the grey model equation and obtains the logistic grey prediction model.

Furthermore, the grey model is simple and adaptable and can better address mutation parameter changes because it does not need numerous data points to update predictions. Since its proposal, this prediction model [14] has been widely used in many fields, such as industry, agriculture, transportation, medicine, and military applications [1524]. Researchers have conducted many in-depth and systematic studies based on the grey prediction model with respect to the initial values, background values, modeling mechanisms, model nature, and model combinations [2528], which have promoted the development and perfection of its theoretical system. The grey model is also extended to the GM (1, N), DGM (1, 1), NDGM (1, 1), and CM (1, 1) power models and other new prediction model categories [2933]. In addition, the ability of the grey forecasting model in population forecasting has also been studied [34].

In summary, starting with the classical logistic population growth model and considering the characteristics of the model and modeling mechanism of the grey prediction model, this paper establishes a grey prediction model based on the logistic population growth model which can predict the size of a population and estimate the maximum population capacity of a region through the least-squares method. Furthermore, to improve the accuracy of the logistic population growth grey prediction model (LPGM), a weak buffering operator is introduced, and a grey prediction model based on the weakened buffer operator’s logistic population growth model is proposed. The empirical analysis shows that the established grey prediction model (LGM) calculates the population maximum capacity, that is, Nm, better than the logistic population model. The growth model is more realistic. The empirical analysis of the logistic population grey model with a weakening buffer operator (LPGMWBO) shows that its simulation effect is better than that of the LGM. The simulation accuracy is approximately 4.5%. The simulation results show that the model can predict the expected population in the upcoming years, the trend of the prediction results, and the logistic population growth.

This paper makes the following main innovation points:(1)In accordance with the differential equations of two classical models of population, the Malthus population growth model and the logistic population growth model, the grey prediction model is established through the grey difference information theory, that is, the relationship between the differential equation and the difference equation.(2)The new model uses the modeling mechanism of the grey model, the least-squares method to estimate the maximum population capacity Nm, and the weakening buffer operator to optimize the accuracy of the new model. The population data of four regions in China are used to illustrate the effectiveness of the model. Finally, the paper demonstrates that the new model can effectively predict the population of Chongqing.(3)On the basis of the Malthus model, the new model not only addresses the unrealistic assumption that the total population will continue to increase but also solves the problem of distortion encountered by the results of the logistic model, thereby allowing the new model to predict the population trend effectively.

In the following sections, we use symbols with various meanings, the corresponding definitions of which are listed in Table 1.

Similarly, the equations and their meanings are listed in Table 2.

The other sections in this paper are arranged as follows. In Section 2, the modeling mechanism of the grey prediction model is combined with the nature of the logistic population growth model to establish the LPGM. Then, the LPGM is joined with a weakening buffer operator to establish the LPGMWBO, following which the nature of the models is studied. Section 3 uses the population data of four provinces and cities in China to calculate six evaluation indicators to analyze the validity of the new model. Section 4 describes the empirical analysis using the LPGM to predict the maximum population maximum capacity, Nm, and the LPGMWBO to simulate and predict the urban population in Chongqing, China. Section 5 presents the conclusion.

2. Establishment of the Grey Prediction Model of Logistic Population Growth

The Malthus model and the logistic model are classical models of population growth (see Appendix A for relevant information about the two models). The logistic growth model is as follows:

Equation (1) is a Bernoulli equation whose solution can be obtained as follows:where the constant is the environmental capacity. In this section, the grey prediction model of logistic population growth is established according to the logistic model of population growth, and the corresponding optimization model is established according to the weakening buffer operator.

This section will establish the grey prediction model of logistic population growth based on the population growth logistic model and will establish a corresponding optimization model based on the weakening buffer operator.

2.1. Grey Prediction Model of Logistic Population Growth

According to equation (1), the following equation holds:

Let . Thus, we can observe the following:

Set the number of people during the same time period as follows:

The one-time cumulative sequence is as follows:where . According to formula (4), the differential equation of the population increment at time is as follows:

If the first-order difference is replaced by the left differential in (7), when , we obtain the following:

Thus, the following grey model can be defined.

Definition 1. If sequences and are represented as formulas (5) and (6), respectively, sequence is called the nearest mean generating sequence of .where .

Definition 2. If sequences , , and are as shown in (5), (6), and (9), respectively,is called the logistic grey model and is abbreviated as LGM (1, 1). Furthermore, the differential equation in formula (10) is called the whitening equation of the LPGM (1, 1).
Formula (10) is consistent with the Verhulst model, and its left end is consistent with the whitening equation of the model in formula (4). This equation can be regarded as a whitening equation of a grey model. Furthermore, if the initial time point is t = 1, the population between the time periods [1, t] is expressed as follows:It can be observed that the forms of formula (11) and the 1-AGO sequence in formula (5) are exactly the same. Thus, if the discrete sequence is regarded as the original sequence, is its 1-AGO sequence. Therefore, formula (10) is formally the same as the whitening equation of the Verhulst model and has the same connotation. The differential equation can thus be transformed into a grey model based on the principle of differential information to solve the population growth problem.

Definition 3. Let be the mean generation sequence of accumulative sequence generated by population sequence , and letThen, from formula (10), the least-squares estimate of parameters satisfies the following:The time response equation is as follows:The cumulative reduction value is as follows:

Proof. By inputting data into formula (10), which is , we can obtain the following:The matrix form of the above equation sets is as follows:For , by substituting with , we can obtain the following error sequence:LetThen, the parameter list that minimizes should satisfy the following:Thus,Therefore, we obtain the following:Thus,The following is true for the maximum capacity .

Definition 4. Maximum capacity .

Proof. From formula (2), we can determine that , and can be obtained.

2.2. LPGM Based on a Weakening Buffer Operator

The LPGM is a classical grey Verhulst model. There may be disturbances in the original population data that may lead to inconsistencies between the quantitative prediction results and the intuitive qualitative analysis conclusions. Therefore, the data should first be preprocessed. Because the LPGM is a classical grey Verhulst model, which is a grey power model, the buffer operator is weakened by the following power exponents.

Definition 5. (see[16]). Let be a nonnegative sequence of the system behavior data, and let . Here,where . Then, when is a monotonic growth sequence, a monotonic attenuation sequence or a concussion sequence, that is, , is a weakening buffer operator; is called the power average weakening buffer operator and is denoted as PAWBO.
Parameter has the function of adjusting the effect of the weakening buffer operator. In a monotonic growth sequence, the effect of PAWBO varies in the same direction as . If PAWBO is slowly weakened, a smaller can be used. In a monotonic attenuation sequence, the action of the weakening operator shrinks the data, and the attenuation of the weakening buffer sequence is slower than that of the original data sequence. The effect of PAWBO on monotonic attenuation sequences varies in the opposite direction of . If PAWBO is slowly weakened, a larger can be used.
The LPGMWBO using the LPGM sequence buffer operator is as follows. Set the population sizes during equal periods as follows:Set the sequences with the effect of the buffer operator PAWBO as follows:and here, .
Therefore,and the cumulative sequence of is as follows:where , generating the sequence of the means.where .
Then, the LGM under the buffer operator is as follows:Due to Definition 3.1, we letThen, the least-squares estimation of the parameters in (3.8) satisfies the following:The time response equation is as follows:The cumulative reduction value is as follows:Through defining the LPGMWBO and using Theorems 3.1 and 3.3, the flowchart of the LPGMWBO can be obtained, as shown in Figure 1.

3. Validation of the LPGMWBO

3.1. Numerical Simulation Experiments

To illustrate the effectiveness of the model, this section uses the population data of four provinces in China as reference data. The data come from http://data.stats.gov.cn/easy query.htm?Cn = E0103, including the total population of the four regions of Beijing, Shanghai, Guangdong, and Sichuan. The comparison models are the classic Verhulst model and the NGM (1, 1) model. In addition to the MAPE value commonly used in the grey model, evaluation indexes such as RMSPE, IA, U1, U2, and R are also introduced. The specific meanings and calculation formulas are shown in Table 3. The smaller the values of MAPE, RMSPE, U1, and U2 are, the higher the accuracy of the model is, and vice versa. The values of IA and R are the opposite, such that the higher the value is, the higher the accuracy of the model is. The population data for Beijing and Shanghai are selected for the comparison of the fitting effect. The data are given in Table 4. The population data of Guangdong and Sichuan province are compared with the prediction effect. These data are given in Table 5. An effect diagram is used to compare the effects of each model.

The LPGMWBO uses the first row of data in Table 4 to perform calculations through MATLAB in accordance with the steps in the forecast flowchart. The results are calculated to four decimal places. The calculation of other models is also completed through this software. The specific steps are as follows:(i)Data processingAccording to Definition 3,where . The 1-AGO sequences ,The mean sequence generated by consecutive neighbors of is given by(ii)Parameter estimationAccording to Theorem 3.1, the matrices B and Y are given byThe parameters list is given by(iii)Calculate the simulation value.By substituting parameter values according to Step 2,(iv)Calculate the simulation values and errors. The results are given in Table 6.

Table 6 shows that the six evaluation indicators of the LPGMWBO all offer the best results. To show the differences of various models more intuitively, the trend graph and effect comparison graph of the model simulation results in Table 6 and the original data are given in Figure 2. The APE values of the four points in the LPGMWBO are all low in Figure 2, so MAPE is the lowest.

The comparison models are the Verhulst model and the NGM (1, 1) model. The evaluation indexes are still MAPE, RMSPE, IA, and U1, the Shanghai data from 2013 to 2017 are used as fitting data, the population data of Guangzhou and Sichuan from 2012 to 2016 are used as fitting data, and the data from 2017 are used as forecast data. The specific results are given in Tables 79.

The six comparison indicators of the fitting effect in Table 7 show that the LPGMWBO is the best among the three models; Table 8 shows that the MAPE values of the LPGMWBO are the lowest, and the other five indicators of the LPGMWBO are also the best. In Table 9, the best fitting-effect model is the NGM (1, 1), and the best prediction-effect model is the LPGMWBO. The results for the other five comparison indexes of the LPGMWBO are slightly worse than those of the NGM (1, 1), but the effect of the LPGMWBO is good. According to Tables 79, error comparison charts are drawn, as shown in Figures 35. The LPGMWBO in Figure 3 has a one-year APE that is slightly larger, while the other three years are the smallest. Figure 4 shows that the APE values of the LPGMWBO in 2014 and 2016 were slightly higher, while the APE values in other years were relatively small. Figure 5 shows that the LPGMWBO has higher APE values in 2013, 2014, and 2016, resulting in the simulated MAPE value not being the lowest, but the predicted value in 2017 is close to the true value, indicating that the new model is also effective.

3.2. Analysis of Results

MATLAB software is used to calculate the entire simulation process for four actual cases, combined with the six evaluation index results of the three models. In the first and second cases, the fitting effect of the LPGMWBO is the best of the three grey prediction models. The value of the fitting MAPE is lower than 5%, which is an effective fitting. In Case 3, the fitting and prediction effect of the LPGMWBO is the best among the three models, and the six evaluation indicators are also the best. In Case 4, although the results of several indicators of the LPGMWBO are slightly worse than those of NGM (1, 1), they do not affect the effectiveness of the model. Therefore, the LPGMWBO is an effective grey prediction model that can be applied to population forecasting.

4. Applications

4.1. Data Description

The website http://data.stats.gov.cn/easyquery.htm?Cn=E0103, which was used in this study, provides the population data for Chongqing’s urban population from 2013 to 2017, as shown in Table 10.

4.2. Data Description

First, according to the data shown in Table 10, the parameters of the model are estimated through the LPGM and the least-squares method. Then, we calculate the maximum population size Nm, which is compared to the maximum population size Nm as calculated by the logistic population growth model, and the corresponding results are obtained. Second, the LPGM and LPGMWBO are calculated by using the data shown in Table 1. Then, the results are compared.

According to the logistic population growth model (4) and the population data from 2013, 2014, and 2015, the following equations can be obtained:

The maximum capacity of the numerical solution of the nonlinear equation can be calculated using MATLAB as follows: .

We use population data from 2013, 2015, and 2016 to calculate the population capacity as follows: . Similarly, the populations in 2013, 2016, and 2017 can be used to determine that the numerical solution is not a real root.

According to the maximum capacity in the logistic population growth model, the calculated maximum capacities in the three cases are not consistent with the actual situation.

Subsequently, is calculated according to the least-squares estimation of the LPGM; then, the maximum population is calculated, and the parameter list is as follows:

Therefore, we obtain the following:

Definition 1 leads to a maximum capacity of , and the results agree with the actual situation. The same data are used to calculate the maximum capacity of the logistic population growth model and the LPGM, which shows that the LPGM has better adaptability.

The following is an empirical analysis of the applicability of the LPGMWBO based on the data shown in Table 10. The calculation is based on the specific steps of the LPGMWBO shown in Figure 1. To illustrate the validity of the LPGMWBO, the MAPE values of the model are obtained according to the above steps and then compared with the experimental results of the LPGM, as shown in Table 11.

As shown in Table 11, the effect of buffering the original data is better than that of the original data. Thus, the effect of the LPGMWBO is substantially better than that of the LPGMWBO.

Using the data shown in Table 10, we illustrate the simulation results of the above two models in Figures 6(a) and 6(b).

As shown in Figure 6, the simulation effect of the LPGMWBO is better than that of the LPGM.

As shown in Table 11, the MAPE of the LPGM is approximately 4.5%. The grey model [29] is derived from the reference table. The LPGMWBO simulates the size of the urban population in Chongqing, and the accuracy in China is approximately 12%. Therefore, the population can be predicted using the LPGMWBO. The prediction results are shown in Table 12.

Table 12 shows that the projected population in all years does not exceed the maximum capacity, but the maximum population may be reached by 2020. When the maximum capacity is reached, the populations in 2021 and 2022 will decline, which is also consistent with the parameters of the logistic population growth model. More importantly, in the logistic population growth model (2), when , we can obtain the following by deriving through the two extremes of formula (1):

As shown in formula (42), when and , it can be determined that is an additive function. When and , it can be determined that is a subtractive function. According to the extreme value discriminant, reaches the maximum value when . Therefore, when the population reaches , the population growth rate is the fastest. To further illustrate the growth rate observed in Table 12, we provide Figure 7.

Based on the population trend chart shown in Figure 7, when the population reaches 7195.16 in 2019, the growth rate is observed to be the fastest from 2018 to 2020, which is consistent with the growth trend of the logistic population growth model. In addition, this result shows that the maximum capacity of the LPGM based on the logistic population growth model and the predicted population by the LPGMWBO based on the buffer operator are consistent with the ideal effect of the logistic population growth model. Therefore, the new LPGM and LPGMWBO are effective.

5. Conclusion

This paper starts with the logistic population growth model and combines the modeling mechanism of the grey prediction model to establish the LPGM. To further improve the accuracy of the model, a weakening buffer operator suitable for the power index is introduced to obtain the LPGMWBO; the maximum population capacity is estimated through the least-squares method, and the properties of the model are studied. The effectiveness of the LPGMWBO is demonstrated through six evaluation indexes of the three models in four practical cases.

The LPGMWBO not only solves the impractical situation in which the total population continuously increases, as in the Malthus model, but also solves problems such as the difficulty of obtaining a logistic solution and the possible distortion of the solution. Based on the empirical analysis, the MAPE of the LPGMWBO is approximately 4.5%, and the accuracy level is between one and two. Furthermore, the trend of the prediction result curve meets the ideal effect of the logistic population growth model. Finally, the urban population of Chongqing is predicted to peak in 2020 and may decline in the following several years, which could help Chongqing formulate social strategies. Thus, this paper makes the following two contributions: this paper improves the classical population prediction model and effectively predicts the actual regional population.

However, changes in the population size are affected by many factors, such as society, economy, and the population itself, and the population situation changes over time. In this paper, the population can be effectively predicted only from the perspective of population change without considering the impact of other factors. Therefore, the changes in economic, social, environmental, and other factors and the relationship between these factors and population changes could be used to build a more accurate multivariate population prediction model that could not only address the deficiency of our current work but also enhance the prospects of our future work. This study could help the country forecast and control the population with scientific and accurate methods and provides a reliable scientific basis for the government in making strategic population decisions.

Appendix

In this appendix, we provide information on the Malthus population growth model and the logistic population growth model.

A. Malthus Population Growth Model

As early as the end of the 18th century, Thomas Malthus, a British demographer, studied more than a hundred years of demographic data and concluded that, in the process of natural population growth, the relative increase is a constant; the relative increase is the ratio of the population increase per unit of time to the total population at that time.

Let the total population be at time and let the population growth rate (birth rate minus death rate) be . According to Malthus’s theory, in the process of natural population growth, is the increase in the population per unit of time and is directly proportional to the total population. Since the population growth continuously increases, it can be regarded as a differentiable continuous function. Therefore, the first model of the population increase is established as follows:

By separating the variables from the differential equation in (A.1), we can obtain the following:

In this model, when , the total population increases exponentially. In practice, population changes are examined annually. This model can also show whether the human living environment in a certain area is good, whether the human density is large, and, in the short term, whether the predicted and actual population statistics are mild; however, this approach is not suitable for predicting the actual population. Since and , this problem is infinite. Thus, the total population grows indefinitely, which is inconsistent with the actual population situation.

In fact, a key problem with the Malthus model, which is not consistent with reality, is the assumption that the natural growth rate of a population is a constant. On the contrary, this law is established only when the population is scarce and resources are abundant. When the growth reaches a certain degree and the total population reaches a certain base, negative factors hinder human development. For example, air pollution may occur with development. Other issues include environmental deterioration, the emergence of diseases that affect human lifespans, and adverse agricultural impacts that may lead to events such as food shortages and wars. Therefore, the Malthus model must be modified, and the following logistic population growth model is established.

B. Logistic Population Growth Model

In fact, there is a maximum population capacity, that is, , in a region or an environment; thus, the growth rate in the Malthus model can be slightly altered. When and , the solution tends to a fixed constant, that is, , which is called the environmental capacity. In addition, is the relative growth rate. It can be observed that the relative growth rate decreases as population increases. Assuming that is a quantity that is not affected by the environment and is only related to itself, it is called the intrinsic growth rate and is also known as the coefficient of life. When , the relative growth rate is 0, and the total population tends to the limit value . Therefore, the logistic growth model can be obtained as follows:

Formula (B.1) is a Bernoulli equation and can be solved as follows:

Based on formula (B.2), if and the total populations during two periods are known, we can obtain the following two equations:

The maximum population and the intrinsic growth rate can be obtained from the equations in formula (B.3) to predict the total population with the same spacing. This method uses only three types of data, which do not fully follow the law of population growth. Furthermore, this equation group is a nonlinear equation group, and analytically solving this group is difficult. There are many numerical solutions, and the results may be distorted. The growth model needs to be improved. The grey prediction model is simple and adaptable. This model can better address sudden changes in the parameters. Through data accumulation processing, the relationships in the data increase, the exponential relationship of a sequence increases, and the randomness of the data decreases. Then, the logistic parameters are estimated using the least-squares method. Consequently, a grey prediction model based on the logistic population growth model is established.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by Project of Humanities and Social Sciences Youth Fund of Ministry of Education of China (no. 19XJC630011), the Science and Technology Research Program of Chongqing Educational Committee (no. KJ1703057), the Humanities and Social Sciences Program of Chongqing Municipal Educational Committee (nos. 19SKGH043 and 18SKGH031), and the Chongqing Normal University Fund Project (no. 18XWB017).