Abstract

The spread of an epidemic is a typical public emergency and also one of the major problems that humans need to tackle in the 21st century. Therefore, the research on the spread, prevention, and control of epidemics is quite an essential task. This paper first briefly described and analyzed the development of COVID-19 and then introduced the basic epidemic models and idealized the population in the epidemic area by dividing them into four categories (Classes S, E, I, and R). After that, it set the relevant parameters of the basic SEIR model and the modified one and worked out the relevant differential equations and iterative equations. According to the feature of the epidemic situation and the changes in the number of contacts in different units of time, the epidemic data were substituted into the iterative equations for data fitting with an R Package. Then, analysis was performed on the epidemiological features such as the transmission time and epidemic peak and the epidemic trend was evaluated. Finally, sensitivity analysis was conducted on the parameters (government control and recovery rate), and the results showed that measures such as government restrictions on travel (reducing the contacts between virus carriers and susceptible persons) can effectively control the scale of the outbreak.

1. Introduction

At present, there are 64 major infectious diseases in the world, spreading in 82 countries and regions. In addition to the well-known influenza, measles and dengue fever are spreading in more than 20 countries and regions [1]. For many years, infectious diseases have been a major test to the medical and health systems in various countries in the world.

In 2019, a type of viral pneumonia appeared in many countries around the world, with the characteristics of “human-to-human transmission” [2]. The pathogen was officially named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and the pneumonia caused by the virus was named coronavirus disease 2019 (COVID-19) by the Coronaviridae Study Group (CSG) of the International Committee on Taxonomy of Viruses (ICTV). Since the first case of pneumonia of unknown cause was reported on December 8, 2019, a total of 115,653,459 cases and 25,718,823 deaths have been reported worldwide as of March 6, 2021 [3].

Thanks to the active medical treatment and preventive and control measures, the epidemic situation in China was effectively contained and the economic and social order was being recovered in an orderly manner as early as April 2020. However, the epidemic was still spreading rapidly in countries and regions outside China, turning Europe and North America into the hardest-hit areas.

Since the outbreak of COVID-19, a number of studies have analyzed the development of the epidemic based on epidemic dynamics models and related data, interpreted the epidemic trends in different regions of China, and put forward recommendations for epidemic prevention and control [4, 5]. In addition, Zhan et al. [6] established a virus transmission spectrum using COVID-19 data in Chinese cities and evaluated the development of the epidemic in cities in Italy and South Korea. Zhuang et al. [7] estimated the basic transmission rate of COVID-19 in Italy and South Korea using the maximum likelihood method and forecasted the epidemic situation at the early stage. Li et al. [8] estimated the basic infection rates before and after Korea’s prevention and control measures and analyzed the epidemic situation in Italy. Zheng et al. [9] used the state transition matrix model to analyze the development of the epidemic in South Korea and Italy. There are also studies that estimated the basic infection coefficient of COVID-19 [1013], but most of them were carried out before March 5, 2020, earlier than the time when the epidemic broke out on a global scale and turned into a pandemic.

To prevent and control COVID-19, different countries have adopted different measures, resulting in different development trends of the epidemic. For example, due to large-scale gatherings, the number of confirmed cases of COVID-19 increased significantly in South Korea in mid-to-late February [14], and later, thanks to active prevention and control measures, the epidemic was effectively controlled. As of April 4, 2020, there had been 10,237 confirmed cases in South Korea. Italy had fewer COVID-19 patients than South Korea at the beginning of March 2020; but after that, the number of new confirmed cases per day remained high, and as of April 4, 2020, the total number of confirmed cases had reached 124,632, much higher than that in South Korea. In addition, the United States, Spain, France, Germany, the United Kingdom, and other countries were also experiencing a surge in the number of confirmed cases. The pandemic situation was very severe.

The susceptible-infected-recovered (SIR) model is a classic model for studying epidemic dynamics. It was established by Kermack and McKendrick in 1927 using the dynamic method [15]. Yu et al. proposed an SIR model based on time-varying parameters and predicted the development of the epidemic with time-varying parameters [16]. Based on the traditional SEIR model, Geng et al. predicted the development of the novel coronavirus (2019-nCoV) pneumonia epidemic by adding new parameters such as the rate of infection in the incubation period and the rate of change of infected population and evaluated the role of relevant interventions [1719]. Yan et al. [20] introduced a time-delay process to construct an infectious disease model based on a time-delay dynamic system on the basis of the traditional dynamic model to predict the epidemic situation and evaluate the effectiveness of prevention and control measures. Bai et al. [21] established a nonautonomous dynamic model to predict the development trend of the epidemic by adding isolated susceptible persons and isolated latent persons into the SEIR model and proposed sensitivity analysis on effective regeneration number to emphasize the effectiveness of tracking isolation in epidemic prevention and control.

Based on the SEIR model, this study established and modified a basic SEIR epidemic model and performed data fitting and sensitivity analysis using an R Package. In order to realize the trend prediction of the epidemic, the prevention and control suggestions were put forward, and the feasibility of the measures was effectively evaluated, so as to provide a reference for the subsequent epidemic prevention and control.

2. Transmission and Distribution of SARS-CoV-2

2.1. Data Source

The data relating to COVID-19 in this paper were all extracted from the statistics published on the official website of the National Health Commission of the People’s Republic of China [22] from January 23 to March 14, 2020, including the numbers of (new) confirmed and (new) suspected cases, deaths, discharged patients, people under medical observation, and people discharged from medical observation nationwide every day, used to construct the SEIR model and analyze the epidemic situation. Considering the rapid spread of SARS-CoV-2, government agencies actively took effective preventive measures, putting the virus transmission into a controlled state. After effective quarantines, many community residents were restricted from going out, and few people traveled across cities and provinces, putting the virus transmission into a highly controlled state.

2.2. Data Analysis

From the map of epidemic outbreaks in China in Figure 1 (from the official website of China Centers for Disease Control and Prevention), it can be seen that Hubei Province is in the darkest color, showing that Hubei was the hardest-hit area of COVID-19 in China. The first case of pneumonia with an unknown cause was also found in Wuhan, Hubei. This provided important information for the traceability investigation of the virus and the quarantine and observation of close contacts.

Figure 2 is a pie chart based on the epidemic data of various provinces in China posted on the official website of the National Health Commission as of 24:00 on March 13, 2020. From Figure 2, it can be clearly seen that Hubei Province had the most confirmed cases, followed by Guangdong Province and Henan Province. Due to the strict isolation and travel restriction measures implemented in China, as of mid-March, all provinces except for Hubei have been cleared of COVID-19 cases.

Then, based on the case data published on the official website of the National Health Commission, a line chart was drawn. It can be seen from Figure 3, with the strong intervention of the government, that the new cases across China were on a decreasing trend. In particular, the number of new confirmed cases reached the peak on February 12 and then gradually decreased; the number of new suspected cases reached the peak on February 5, and after medical observation, it gradually decreased; the number of recovered cases continued to increase, and the number of deaths gradually decreased. Later, various data also showed that the outbreak was basically contained.

3. Review of Common Epidemic Models

The basic mathematical model for an epidemic is a mathematical model constructed based on the characteristics of the epidemic, used to study the transmission speed, spatial range, and transmission routes of the epidemic so as to effectively prevent and control the epidemic.

According to the conditions of different individuals, the total population in the epidemic area can generally be divided into the following four categories:(1)The susceptible: denoted as the S group, referring to the group of people who have not been infected in a certain period of time but are susceptible to infection after contact with virus carriers.(2)The exposed: denoted as the E group, referring to the group of people who have been in contact with the infectious and are in the latency stage of the disease. This is applicable to any infectious disease with a long latent period.(3)The infectious: denoted as the I group, referring to the group of people who have been infected with the infectious disease and can spread the disease to the susceptible, and turn them into the recovered or infectious.(4)The recovered, denoted as the R group, referring to the group of people who are cleared of the infectious disease and immune to the disease after being cured. If the immunity period is limited, the recovered will soon become the susceptible.

For the sake of intuitive and convenient research, Table 1 is a specific symbolic illustration diagram.

Suppose that the total population is . During the outbreak of the epidemic, despite the prevention and control measures imposed by the government, there are still some individuals going out. Suppose the number of persons that each individual comes into contact with when going out is and that the probability of such individual successfully spreading the virus to each contact is , then there will be newly infected individuals. The birth rate and mortality rate of the population during the outbreak period are not taken into account in this study.

3.1. SI Model

The SI model is used for the situation in which the susceptible, after being infected with the disease, cannot be recovered. An example of such diseases is AIDS. The process is shown in Figure 4.

Suppose the population is divided into two groups—S and I—and the total population is equal to the sum of the number of people in the S group and the I group; then, the following equations can be established for the number of newly infected cases and the number of healthy susceptible people reduced:

The original equations can be reorganized into the Bernoulli equations:

And then there is

3.2. SIS Model

The SIS model is used for the situation in which individuals, after being infected with and recovered from an infectious disease, will return to susceptible individuals, who are likely to be infected again. An example of such diseases is influenza. The process is shown in Figure 5.

The equations are established as follows:

They can be simplified into

And it is solved as follows:

3.3. SIR and SIRS Models

The SIR model is used for the situation in which individuals, after being infected with an acute infectious disease and recovered, will have antibodies and obtain permanent immunity. Examples of such diseases are smallpox and measles. The process is shown in Figure 6.

The equations are established as follows:

The SIRS model is used for the situation in which individuals, after being infected with an infectious disease and recovered, can be immune for only a limited period of time, and after that, they will become susceptible persons, with the risk of being infected again. The process is shown in Figure 7.

The equations are established as follows:

3.4. SEIR Model

In the SEIR model, the infected patients will experience a latent period in the beginning, meaning that after contacting with a virus carrier, they will not show symptoms until after a period of time, and by then, they become the carriers of the pathogen. After being recovered, they will have antibodies and not be infected again. Examples of such diseases are COVID-19, SARS, and so on. The process is shown in Figure 8.

The equations are established as follows:

From the above, it can be seen that it is actually a modified version based on the SIR model.

4. Prediction of COVID-19 Development Based on the SEIR Model

4.1. Basic Assumptions

COVID-19 has a latent period, so it is necessary to consider the individuals who have contacted the exposed. The following model assumptions are proposed for the transmission characteristics and changing trends of COVID-19 across China.(1)This study is only intended to analyze and predict regional outbreaks within a short period of time, so it is assumed that the different groups of people are subject to conservation relations in the model, that is, . What is more, the data used are those obtained after all provinces across China activated the first-level lockdown, so there is no population flow. Also, the inflow, outflow, births, and deaths of the population are not taken into account.(2)It is assumed that the initial number of infected persons is 1.(3)Individuals who have contacted the infected are put under medical observation for 14 days, during which time; they have no contact with the outside world.(4)The individuals who are confirmed cases are completely quarantined and can no longer infect others.(5)As mentioned above, if the COVID-19 patients, after being recovered, will no longer be isolated from others, the whole population can be divided into 4 groups of people, put in 4 independent compartments. The transforming relationships between them are shown in Figure 9.

In Figure 9, represents the rate at which the number of susceptible persons moves to the E group per unit of time; is the number of people diagnosed with COVID-19 per unit of time; and is the number of people recovered per unit of time. It should be noted that, based on our understanding of the actual situation, the exposed and infectious persons are both infectious. In the model, the patients refer to the individuals infected and hospitalized (including mobile cabin hospitals). The exposed, that is, the E group, actually includes those in the latent period and those who have been infected but have not been discharged from the hospital.

4.2. Parameters and Initial Value Setting
(1)Total population : the data of the total population was provided by the Hubei survey team of the National Bureau of Statistics of The People’s Republic of China. As of the end of 2019, the permanent population of Hubei Pro was 59.27 million.(2)Infection rate : according to the official website of the National Health Commission, based on the existing cases, the latent period of COVID-19 is about 7 days on average, but currently, the latent period has been extended. The duration of infection is , so it can be calculated that .(3)The determination of the expression of [23].(4) value: the value fitted by Xu et al. [24] based on data and the actual situation is adopted, that is, .
4.3. General Model and Analysis
4.3.1. Modeling

According to the parameters, it can be found that represents the proportion of people who are not infected; is the number of people who are contacted with and can be infected by patients, and is the number of people who have recovered. Then, based on the SEIR model, the daily increase in the number of people in each group can be expressed by the following equations:

The iterative formulas can be obtained as follows:

The corresponding parameters are set as follows: the initial number of the infectious , that of the exposed , and that of the recovered .

4.3.2. Model Solution and Analysis

Without travel restrictions, assuming that in the range of , , the result calculated by an R Package [25] is shown in Figure 10, where the inflection point of the epidemic appears at approximately two months and a half after the beginning of the outbreak. It can be seen that the numbers of the infectious and the exposed both increase first and then decrease and that the outbreak period is also relatively long. It can also be seen that the number of infected individuals does not conform to reality.

According to relevant reports, some exposed individuals who carry SARS-CoV-2 but do not show symptoms can still infect others. Therefore, the following improvements were made to the model.

4.4. Model Improvement and Analysis
4.4.1. Modeling

The exposed and the infectious can both infect others. Suppose the contact rate among people is the same, and that the transmission rate is β, then the equations can be improved; that is, the daily increase in the number of people in each group can be expressed

The iterative formulas are as follows:

According to the iterative formulas, the calculation results can be obtained.

4.4.2. Model Solution and Analysis

In the modified model, it can be seen from Figure 11 that the inflection point appears earlier, which, to a certain extent, verifies the feature that virus carriers can also infect others. This is also the reason why the epidemic situation is so difficult to contain.

4.5. Parameter Sensitivity Analysis
4.5.1. Sensitivity Analysis of

In 2.4, through statistical analysis, it is determined that . Below is an analysis of the changes in the model results when changes. This is to examine how a parameter affects the model results while other parameters remain unchanged. represents the initial average number of contacts, which reflects the government’s effort to control the epidemic.

(1) The Government Imposes Less Control. If the government imposes less control, the value of will increase. Assuming that , it can be seen from Figure 12 that the number of the exposed has increased significantly, so in the same area, the healthy population are more likely to get infected.

(2) The Government Imposes More Control. If the government imposes more control, then the value of will decrease. Assuming that , it can be seen from Figure 13 that the number of the exposed has decreased.

In the above two cases, the inflection point of the outbreak and the transmission period have changed, but not much. The above results also show that the government’s travel restrictions can truly effectively control the spread of the epidemic by reducing the number of contacts.

4.5.2. Sensitivity Analysis of the Recovery Rate

It can be clearly seen from Figure 14 that if the recovery rate is increased to γ = 0.5, the number of patients will be greatly reduced, the inflection point will appear earlier, and the transmission period will be relatively shortened. From this, it can be concluded that early detection, early isolation, early treatment, and at the same time improving the recovery rate of the infectious are very important to ending the outbreak as soon as possible.

5. Prevention and Control Analysis

From the previous SEIR epidemic model, it is known that both the exposed and the infectious can infect others. Since the contact rate among people is the same and the transmission rate is , the key to preventing and controlling the epidemic is to minimize , by taking the following actions:(1)Reduce : Reduce the source of infection by reducing the number of the susceptible. The main methods are vaccination and isolation to prevent infection.(2)Reduce : Currently, patients are treated in isolation to reduce further infections.(3)Reduce the infection rate : Further reduce individual contacts and improve sanitary conditions to block the spread of the virus. Common methods include frequent and scientific hand washing, wearing disposable gloves, wearing masks when going out, imposing community quarantine, and reducing public gatherings and shutdown of companies and schools.(4)Increase the recovery rate : speed up treatment, increase the recovery rate, and increase the immune population. Common measures include actively looking for specific therapeutic drugs, developing vaccines, enhancing physical fitness, and improving individual immunity. At the government level, actions to take include simplifying the drug approval process and speeding up drug development and clinical trials.

The above analysis shows that, before the vaccine is successfully developed, the traditional isolation method is indeed one of the most effective ways to prevent and control the epidemic. However, specifically, different isolation methods should be applied to different areas. For high-risk areas, such as Wuhan, absolute isolation is required, while for nonrisk susceptible areas, real-time protective isolation is required, such as isolation of streets, communities, and villages without COVID-19 cases. In addition, for different people, different methods should also be applied, like the isolation of susceptible individuals, isolation of contacts, and quarantine of suspected patients.

For the infectious group in the model, all suspected patients should be received and treated as much as possible. Those with fever and close contacts should be isolated, reported, diagnosed, quarantined, and treated as early as possible. Just as President Xi emphasized during the investigation on the prevention and control of COVID-19, the Chinese must fight and win the battle against the epidemic by mobilizing all resources and blocking the spread of the virus.

6. Conclusions

This paper used the basic SEIR epidemic model and then obtained the results with an R Package using the data to describe the transmission pattern and development process of COVID-19. The main work done for this paper included acquiring the epidemic data from the official website, plotting relevant charts, performing data analysis, and understanding the relevant situation. In this outbreak, it was found that Hubei Province was the hardest-hit area. Then, the basic epidemic models were described, including the SI model, the SIS model, the SIR (SIRS) model, and the SEIR model. The SEIR model was given the greatest attention and applied to the outbreak of COVID-19. Differential equations and iterative equations were established, and with these equations, the final results were obtained. After comparison of the calculated results against the reality and considering the special feature of this epidemic, the equations were modified and calculations were performed again. The results obtained were more consistent with the reality. Sensitivity analysis was also performed on two parameters. Through comparison of the results and based on the actual situation, prevention and control recommendations were proposed.

Data Availability

The data used to support the findings of this study are available from the author upon request.

Conflicts of Interest

The author declares no conflicts of interest.