Abstract

Overcrowding of emergency departments (EDs) is a problem that affected many hospitals especially during the response to emergency situations such as pandemics or disasters. Transferring nonemergency patients is one approach that can be utilized to address ED overcrowding. We propose a novel mixed-integer nonlinear programming (MINLP) model that explicitly considers queueing effects to address overcrowding in a network of EDs, via a combination of two decisions: modifying service capacity to EDs and transferring patients between EDs. Computational testing is performed using a Design of Experiments to determine the sensitivity of the MINLP solutions to changes in the various input parameters. Additional computational testing examines the effect of ED size on the number of transfers occurring in the system, identifying an efficient frontier for the tradeoff between system cost (measured as a function of the service capacity and the number of patient transfers) and the systemwide average expected waiting time. Taken together, these results suggest that our optimization model can identify a range of efficient alternatives for healthcare systems designing a network of EDs across multiple hospitals.

1. Introduction

Overcrowding in hospital emergency departments (EDs) is recognized as a serious problem in many countries around the world [1] and affected many hospitals especially during the response to emergency situations such as pandemics or disasters [2]. Overcrowding occurs when the arrival rate of patients exceeds the ED’s available capacity [3]. One contributing factor to overcrowding in the USA is a reduced supply of EDs; from 1995 to 2016, although the number of ED visits increased by 51 percent, the number of EDs decreased by 12 percent [4]. Other key factors which cause overcrowding in the USA are the aging population, limited access to medical care from other providers, the safety net, seasonal illness, surgical scheduling, and high utilization of ED for nonemergency care [5,6]. Overcrowding has some negative outcomes for both patients and service providers [7]. Patients may face prolonged pain and long waiting times which leave them unsatisfied [1, 8, 9]. According to the National Hospital Ambulatory Medical Care Survey in 2017 in the United States, the average waiting time at emergency departments for a patient to visit a physician, physician assistant, or nurse is about 40 minutes and around 17 percent of patients waited more than an hour [10]. In fact, up to 10% of patients can become frustrated from long waiting times and may leave the ED without treatment [11], which increases such patients’ risk of death or hospital readmission within the next seven days [12]. ED staff frustration is recognized as one of the negative effects of overcrowding on healthcare providers [13].

One potential solution to overcome the overcrowding problem and have quick and high-quality services in EDs is adding more resources to EDs [14]. However, such an approach is limited not only by operating budget constraints but also by limitations on available personnel and the size of the ED facility [14].

Introducing an incentive policy to accident and emergency departments by the UK government in 2000 was another attempt to reduce patient waiting time in such departments [15]. This policy requires 98% of patients to be discharged, transferred, or admitted to inpatient care within 4 hours of their arrival [15]. Large penalties were imposed for failing to meet the 98% target and some hospital managers even lost their jobs for this reason [16]. Gruber et al. showed that this policy reduced patient waiting time by 19 minutes and it also decreased mortality by 14% [17].

Transferring patients has also been discussed in some studies as an option to help address overcrowding [3, 18, 19] and was utilized in some areas facing large numbers of patients due to emergency situations, such as New York City [2]. Nezamoddini and Khasawneh [3] proposed a mathematical model to quantify the effect of transferring patients between hospitals on patients’ waiting time in a multihospital system.

In this study, we propose a novel mathematical model to capture the effects of transferring patients between hospitals on patients’ waiting time. Similar to Nezamoddini and Khasawneh [3], the objective of our model is to determine the number of servers in each ED and the rates of patient transfer between EDs, in such a way that the cost of the system is minimized. However, unlike [3], who did not explicitly account for queueing effects, our model includes concepts from queueing theory (QT) to account for delays in patients’ receiving service due to overcrowding.

The remainder of the paper is organized as follows. Section 2 presents a literature review on research examining ED overcrowding. Section 3 provides our mathematical modeling approach. Section 4 presents the results of numerical testing and sensitivity analysis. Section 5 provides a conclusion and suggestions for future work.

2. Literature Review

Overcrowding in EDs has been recognized as a problem for many years. Various solutions and methods have been applied to improve patient flow in EDs. In many operations research studies examining the overcrowding problem in EDs, the main question was how many resources should be allocated to each queue in an ED or to each hospital in a multihospital system, to reduce the patients’ waiting times. Some researchers have found that an optimized manpower allocation can reduce the patients’ waiting time in ED by up to 20% [3, 20]. Daldoul et al. [5] proposed a stochastic mixed-integer linear programming (MILP) model to optimize the number of staff and beds in each queue (six queues for six main activities) in an ED to minimize patients’ waiting time. El-Rifai et al. [21] also proposed a stochastic MIP model to find the optimal number of personnel for each shift to minimize patients’ waiting time. Izady and Worthington [15] proposed a heuristic algorithm that combined queueing and simulation models to determine the required number of each type of medical staff during each “staffing interval” to meet a 4-hour sojourn time target (98% of patients must be discharged, transferred, or admitted to inpatient care within 4 hours of arrival), where a “staffing interval” is the time interval utilized for analysis, during which the number of staff is constant (a representative time might be one or two hours in an ED) [15]. Izady and Worthington [15] applied their method in a generic ED and showed that significant improvement with respect to this target can be made even without an increase in total staff hours. Sinreich et al. [22] introduced two iterative heuristic algorithms, which combined simulation and optimization models for scheduling the work shifts of the ED medical staff. These authors’ algorithm shifted the resource capacity from low-demand hours to peak demand hours, and as a result, there was a significant reduction in patient waiting time as well as the peak utilization values of the ED medical staff.

Some studies examined the effects of transferring patients between hospitals in a multihospital setting [3, 18]. Nezamoddini and Khasawneh [3] found that transferring patients between hospitals can be an effective way to reduce patients’ waiting time. They used the concept of a capacitated network to model a multihospital system and allowed the nonemergency patients to be transferred between hospitals subject to capacity constraints on the maximum number of transfers allowed per unit time. Soni [18] developed rule-based patient transfer protocols and tested the protocols in a multihospital patient flow simulation model and found that effective patient transfer protocols can optimize the patient flow in a hospital system.

Regarding the solution techniques utilized, some researchers have used simulation models to capture the complexity and dynamic nature of processes in EDs. Cabrera et al. [14] used an agent-based simulation to model EDs. They concluded that although their simulation experiments helped to generate a better understanding of the problem, they were time-consuming even for a small problem. Hung and Kissoon [19] used discrete-event simulation (DES) to evaluate the effect of using an Observation Unit (OU) and patient transfer to other inpatient units on overcrowding in a pediatric emergency department (PED). They considered four scenarios representing combinations of regular PED operations with and without a five-bed OU and transfer mandate. They concluded that a combination of an OU and patient transfer mandate improved the waiting time compared to PED with neither an OU nor a transfer mandate. Moreover, their results showed that the simulated OU without transfer mandate had an occupancy rate of 73.1%; this rate dropped to 48.1% by applying the transfer policy, indicating a significant improvement in the occupancy rate of OU. Gul et al. [23] also used DES to analyze the effect of the patient surge in EDs after an earthquake. They first used Artificial Neural Networks (ANNs) to estimate earthquake causalities and generate inputs for the DES model. Then, the DES model used the ANN outputs to simulate a network of EDs and generate performance outputs for the corresponding EDs. After constructing the simulation model, a Design of Experiments (DOE) was conducted to assess the effects of different factors on the LoS in the ED and the utilization of ED resources. To show their framework, Gul et al. [23] used a network with five EDs located in one of the regions with the highest estimated injury rate after an earthquake in Istanbul, Turkey. The results from their study can be helpful for planning for the expected earthquake in Istanbul.

Some researchers have combined QT concepts and simulation to analyze patient flow [24, 25]. Alavi-Moghaddam et al. [26] showed that by using QT analysis (with discrete-event simulation to model and validate patient flow metrics), one can identify solutions that improve patients’ flow and reduce waiting times in EDs. Hu et al. [7] compared the use of QT with discrete-event simulation in modeling EDs. They reported that QT models had lesser data requirements and computational cost, due to QT models’ tendency to simplify the problems, while simulation models captured more details in systems but were more sensitive to changes of parameters. Thus, they suggested that a combination of both was the ideal approach to model such problems.

3. Modeling

Our model attempts to reduce the negative impacts of ED overcrowding in a multihospital system by making optimal allocation decisions in two areas: (1) the number of servers at each hospital’s ED and (2) the rate of nonemergency patient transfers between hospitals. To capture the nonlinear queueing effects, we utilize an approach based on that of [27], which allows for an MILP model to represent each ED as an M/M/C queueing system. Our research extends the model of [27] in that it allows for each queueing system (ED) to potentially transfer some patients to other EDs, which requires that we utilize a mixed-integer nonlinear programming (MINLP) model. Figure 1 presents such a notional three-ED system, showing the arrivals of patients into the system and transfers of patients between EDs.

The sets and indices, data parameters, and decision variables used in the MINLP model are as follows:Sets and indices(i): set of EDs, indexed by (ii): set of values considered for a number of servers, indexed by (iii): set of values considered for server utilization, indexed by (iv): set of patient types, indexed by , where denotes emergency, denotes nonemergency(v): set of time periods, indexed by Data parameters(i): number of servers associated with the set element (ii): server utilization associated with the set element (iii): cost per unit service capacity at ED (iv): cost per patient transferred ED to ED (v): waiting penalty cost, per unit time spent waiting (time in queue plus time in transfer), for patients treated at ED (vi): queueing penalty cost, per patient type in a queue, at ED (v): service rate at ED (vi): arrival rate, from outside of the system, of patient type at ED (vii): travel time required to transfer a patient from ED to ED (viii): maximum number of patients allowed to be transferred from ED (ix): total budget available for servers at ED (x): expected waiting time in queue for an ED having servers operating at a utilization rate Decision variables(i): expected number of patients in a queue awaiting service at ED (ii): expected number of patient type in a queue awaiting service at ED (iii): expected time in queue per patient treated at ED (iv): expected waiting time in system (time in queue plus time in transfer) per patient treated at ED (v): maximum utilization allowed at ED (vi): number of servers at ED (vii): effective arrival rate of patients into ED (viii): rate at which nonemergency patients are transferred from ED to ED; note by assumption(ix)

Note that this model makes the following assumptions:(i)The patient interarrival times follow an exponential distribution(ii)All patients who enter the system from the outside must be treated at some ED(iii)Due to the potential risks of transferring emergency patients such as heart rate changes, increased intracranial pressure, and respiratory rate changes [28], it is assumed that they are admitted immediately after arrival to the ED. However, nonemergency patients can be transferred between EDs(iv)The service time per patient follows an exponential distribution, and to simplify the model, it is not differentiated by patient type. However, it can be differentiated by patient type for future research(v)Each patient departs the system following service.Objective functionObjective function (1) minimizes the total system cost, defined as the sum of each ED’s service capacity cost, the cost of transferring patients between EDs, along with penalty costs associated with the average waiting time per patient at each ED, and the average number of patients waiting in a queue at each ED.ConstraintsConstraints (2)–(4) assign a unique number of servers and utilization levels to each ED.Constraint (5) calculates the expected waiting time in queue for patients at ED , based on the M/M/C queueing system with servers and utilization level of . Note that this can be computed a priori for all pairs utilizing the standard M/M/C formulae.Constraint (6) ensures that the utilization level at ED does not exceed .Constraint (7) computes the effective arrival rate into ED, comprised of both patients arriving into ED from outside of the system, and the net patients transferred into ED .Constraints (8)–(10) compute the total number of patients in the queue and then disaggregate this into the number of emergency patients and nonemergency patients in the queue, respectively.Constraint (11) allows only nonemergency patients to be transferred between EDs.Constraint (12) permits at most patients to be transferred from ED to ED .Constraint (13) limits the total cost for servers at ED to not exceed .Constraint (14) computes the expected waiting time in the system (time in queue plus time in transfer) per patient treated at ED .

4. Experimental Results

4.1. Example Problem

Consider the following example problem, similar in many respects to that presented in [3]. It is assumed that there are three emergency departments in the system, each having identical arrival rates of 5 and 5.5 emergency and nonemergency patients per hour, respectively. Each unit of service capacity costs $30 per hour. Transferring one patient between any pair of EDs costs $10 and takes 0.25 hours. The service rate at each ED is 0.5 patients per hour. A penalty cost of $2 per hour is assumed for patient waiting time in the system (time in queue plus time in transfer). Penalty costs are also incurred based on the average number of patients waiting in a queue at each ED, at a cost of $5 and $2 per emergency and nonemergency patient, respectively. Table 1 presents the sets of utilization values and the number of servers considered for each ED. The available budget for servers at each ED is assumed to be $1700, which is greater than the expense incurred if the maximum number of servers (56, from Table 1) was selected.

4.2. Computational Results

The mathematical model presented in Section 3 was coded in the GAMS 27.2.0 modeling environment and solved using the MINLP solver SCIP 27.2.0. The optimal solution has an objective function value of $2,195. No patients are transferred between EDs in this optimal solution. Table 2 presents the optimal values for the decision variables , and at each ED; note that these values are identical at each ED in this solution.

4.3. Sensitivity Analyses

To determine the effects of the various input parameters on the optimal solutions obtained by our MINLP, a Design of Experiments (DOE) was conducted; all statistical analyses were performed utilizing Minitab 17. In this DOE, input parameters were varied at only one emergency department (denoted ED1), and all parameters at the other two EDs remained unchanged from their previously tested baseline values, with one exception: values and were set equal to $1400, such that at the assumed value of server hour, up to 44 servers would be feasible at each of ED2 and ED3. In total, ten input parameters were examined in this DOE, with a resolution V fractional factorial design utilized for screening, using a single replicate for each point and zero center points. Table 3 presents the high and low levels tested for each input parameter in this DOE for ED1 (the values for the other two EDs correspond to the center point of the values in Table 3). For each of these 128 experiments, the MINLP model was solved using GAMS/SCIP to obtain the optimal values for all decision variables. Appendices A and B present the designs and responses, respectively, for these 128 experiments.

The following responses were tracked with respect to ED1: , and the number of patients transferred from and to ED1 ( and , respectively). The regression model specification considered all potential first and two factor interaction terms. The regression model selection was performed using a stepwise procedure, with the value threshold to enter and depart the model set equal to 0.05, with the necessary first-order terms retained to produce a hierarchical model. Appendices A, B, and C (see Supplementary Materials) present the fractional factorial designs, table of coded coefficients, and significant main effects and interaction terms for all responses. Appendix D presents a table of significant factors for each response containing the level of significance and the direction of effects, along with plots of significant two-way interactions (see Supplementary Materials, Figures D.1D.8).

The remainder of this section presents a detailed examination of two responses of particular importance to ED overcrowding: (expected waiting time in queue plus time in a transfer per patient treated at ED1) and (number of patients transferred to ED1).

4.3.1. Sensitivity Analysis for

Consider the response , the expected waiting time in queue plus time in a transfer per patient treated at ED1. The stepwise regression procedure described above returned the regression model (in uncoded units) presented in equation (17); this regression model had an adjusted R-squared value of 71%. Table 4 presents statistics on this (coded) regression model’s coefficients. According to this analysis, there are seven main effects and nine interaction terms significant at the level (factors and , while not significant individually, are included to retain a hierarchical model, since they appear in statistically significant interaction terms). Three of these main effects, , , and , are significant at the level, indicating that the expected waiting time plus time in the transfer is impacted considerably by changes to the cost per unit service capacity and the travel time between EDs (with time in system increasing as each of these parameters increases) and to the service rate (with time in system decreasing as this parameter increases). Figure D.5 in the Supplementary Materials presents interaction plots for the nine significant interaction terms. Observe that three interaction terms are significant at the level, namely, , , and . The latter two of these interaction terms somewhat mediate the effects of the travel time on the expected time in the system; on average, the reduced level of the arrival rate of patients from outside of the system accelerates the increase of the expected time in the system when the travel time increases. This would only be reasonable if this increased arrival rate of patients from outside of the system is impacting the likelihood of patient transfers between EDs, which will be examined next.

4.3.2. Sensitivity Analysis for

Consider the response , the number of patients transferred into ED1. The stepwise regression procedure described above returned the regression model presented in equation (17); this regression model had an adjusted R-squared value of 78%. Table 5 presents statistics on this (coded) regression model’s coefficients. According to this analysis, there are four main effects and six interaction terms significant at the level. Each main effect is significant at the level, indicating that the number of patients transferred into ED1 is impacted considerably by changes to the cost per unit service capacity and the arrival rate of both emergency and nonemergency patients from outside of the system (with the number of transferred patients decreasing as each of these parameters increases) and to the service rate (with the number of transferred patients increasing as this parameter increases). Figure D.8 in the Supplementary Materials presents interaction plots for the six significant interaction terms. Observe that the interaction terms and all magnify the main effects of these individual terms, with even greater decreases in the number of patients transferred into ED1 when either pair of these parameters are jointly increased. In aggregate, an increase in the arrival rate of emergency or nonemergency patients into ED1 from outside the system is associated with a decreased number of patients transferred into ED1, which partly explains the interaction effect discussed in the previous section, in which the reduced level of the arrival rate of patients from outside of the system accelerates the increase of the expected time in the system when the travel time increases. Recall that , the expected waiting time in queue plus time in a transfer per patient treated at ED1, does not account for the time in the system spent by patients transferred from ED1 to other EDs; the only transfer time that it accounts for is that of patients transferred into ED1.

4.4. Sensitivity Analyses on ED Size

To assess the effect of ED size on the number of transfers occurring in the system, a sensitivity analysis was performed examining three EDs of different sizes. The large ED has arrival rates of 10 and 11 emergency and nonemergency patients per time unit, respectively. The medium ED has arrival rates of 5 and 5.5 emergency and nonemergency patients per time unit, respectively. The small ED has arrival rates of 3.75 and 4.125 emergency and nonemergency patients per time unit, respectively. The optimization model is modified slightly here, to include only constraints (2)–(7), (11), and (14). The objective function is modified as represented in equation (18), deleting the final two summation penalty terms from objective (1). Rather than associating a financial penalty with delay times, we introduce a new constraint (19) which imposes an upper bound, denoted by , on the systemwide average expected waiting time in queue plus time in transfer, which can be computed as . We varied this upper bound across a range of values, from a minimum value of 0.0284 to a maximum value of 1.3913 (the systemwide average for the minimum cost solution if constraint (19) is not considered). In total, 26 different solutions were identified, constituting an efficient frontier for the tradeoff between objective function (18) and the left-hand side of constraint (19). All parameters were assumed to take the baseline values from Section 4.1 with two exceptions: we assume that the cost per unit service capacity at each ED is equal to 10 times the cost per patient transferred between EDs, say, $10 and $1, respectively. The potential numbers of servers considered at each ED were also modified from the values presented in Table 1; for this sensitivity analysis, was varied to include all integer values between 2 and 60. Table 6 presents the sensitivity analysis’ objective values. As it can be seen, the objective value decreases as the upper bound value increases. In fact, it implies that as the average expected waiting time in queue plus time in transfer in the system becomes more flexible, a fewer number of servers and fewer patient transfers are required in EDs. Therefore, the associated costs (equation (18)) decrease. The following figures present the sensitivity analysis’ expected waiting time in queue plus time in a transfer per patient treated (Figure 2), number of servers (Figure 3), ED utilization (Figure 4), and percent of nonemergency patients transferred (Figure 5).

These results demonstrate how the optimization model utilizes a variety of strategies to achieve a constrained systemwide average expected waiting time at minimum cost. Consider, for example, solutions 22 and 23. They achieve relatively similar performance, with respective objective function values of 840.06 and 837.79 and respective systemwide average expected waiting times of 0.7988 and 0.8733. The utilization at each ED is essentially unchanged across solutions 22 and 23, with 96%, 92%, and 92% utilization, respectively, at the large, medium, and small ED in each solution. However, the underlying structure has changed significantly, with solution 22 utilizing 44, 23, and 17 servers, respectively, at the large, medium, and small EDs, and very little patient transfer (1% of the nonemergency patients transferred from the small ED to each of the medium and large EDs). By contrast, solution 23 utilizes 60, 14, and 9 servers, respectively, at the large, medium, and small EDs (one fewer server, in total, than does solution 22), but extensive patient transfer (91% and 74% of the nonemergency patients from the small and medium EDs, respectively, are transferred to the large ED).

Across all 26 solutions identified, the optimization model utilized patient transfer extensively for nonemergency patients arriving at the small ED; on average, 25.3% and 4.4% of such patients were transferred to the large and medium EDs, respectively. The patient transfer was utilized less frequently for nonemergency patients arriving at the medium ED; on average, 6.5% and 0.1% of such patients were transferred to the large and small EDs, respectively. There were no instances across all 26 solutions in which nonemergency patients were transferred from the large ED to another ED.

5. Conclusions and Future Work

Overcrowding in hospital emergency departments (EDs) is a problem that affected many hospitals especially during the response to emergency situations such as pandemics or disasters. In this study, we propose a novel optimization model to address overcrowding in a network of EDs via a combination of two decisions: modifying service capacity to EDs and transferring patients between EDs. This model is similar to that presented in [3]; however, whereas the authors in [3] did not account for queueing effects, our model includes queueing considerations in a MINLP, capitalizing on the closed-nature form of M/M/C queueing effects, similar to the approach utilized in [27].

Computational testing was performed, using a Design of Experiments to determine the effects of changes to the various input parameters for a single ED (denoted ED1) on the optimal solutions obtained by our MINLP. Regarding the expected waiting time in queue plus time in a transfer per patient treated, the most significant main effects indicated that this response is impacted considerably by changes to the cost per unit service capacity and the travel time between EDs (with time in the system increasing as each of these parameters increases) and to the service rate (with time in system decreasing as this parameter increases), with interaction terms somewhat mediating the effects of the travel time on the expected time in system; on average, the reduced level of the arrival rate of patients from outside of the system accelerates the increase of the expected time in the system when the travel time increases. This would only be reasonable if this increased arrival rate of patients from outside of the system is impacting the likelihood of patient transfers between EDs. Examining this further, we find that for the number of patients transferred into ED1, the most significant main effects indicated that this response is affected significantly by changes to the cost per unit service capacity and the arrival rate of both emergency and nonemergency patients from outside of the system (with the number of transferred patients decreasing as each of these parameters increases) and to the service rate (with the number of transferred patients increasing as this parameter increases), with interaction terms between the cost and each arrival rate magnifying the main effects of each these individual terms. In aggregate, an increase in the arrival rate of emergency or nonemergency patients into ED1 from outside the system is associated with a decreased number of patients transferred into ED1, which partly explains the aforementioned interaction effect, in which expected time in the system is found to increase with increases in the travel time between EDs only when the arrival rate of patients from outside of the system into ED1 is at its reduced level.

Additional computational testing examined the effect of ED size on the number of transfers occurring in the system, considering three EDs of different sizes (denoted large, medium, and small). The MINLP was modified slightly here; rather than including a financial penalty for delay times in the objective, we introduce a new constraint imposing an upper bound on the systemwide average expected waiting time in queue plus time in the transfer. Computational testing varied this upper bound across a range of values, identifying an efficient frontier for the tradeoff between the modified objective function and the systemwide average expected waiting time. This optimization model utilizes a variety of strategies to achieve a constrained systemwide average expected waiting time at minimum cost, balancing changes to the numbers of servers at each ED with patient transfers across EDs. Across all points identified on the efficient frontier, the MINLP utilizes patient transfer extensively for nonemergency patients arriving at the small ED, somewhat infrequently for arrivals to the medium ED, and in no instances for arrivals to the large ED. Taken together, these results suggest that our optimization model can identify a range of efficient alternatives for healthcare systems designing a network of EDs across multiple hospitals. Moreover, the model can be helpful to have more balanced EDs with respect to the number of patients and patient waiting time in a network of EDs in case of emergency situations such as natural disasters.

Future work could extend this analysis by considering queueing systems other than M/M/C to represent the stochastic nature of patient arrivals and service times at EDs. Further, while this analysis models steady-state performance, which is useful for network design, an extension to transient system performance in nonsteady-state would allow for similar models to be used in a real-time dispatching environment. Finally, a more nuanced differentiation between patient types, which are modeled as being either emergency or nonemergency patients in this research, could allow for such an MINLP approach to be used to allocate special types of ED service (e.g., pandemic virus testing).

Data Availability

Supplementary materials, including data, will be posted at the University of Missouri’s data repository https://hdl.handle.net/10355/86722.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Supplementary Materials

Appendix A: a table of fractional factorial designs. Appendix B: a table of responses for fractional factorial designs. Appendix C: detailed statistical model outputs for responses. Appendix D: detailed statistical model outputs for responses. (Supplementary Materials)