Abstract

Estimators for the parameters of the Markovian multiserver queues are presented, from samples that are the number of clients in the system at arbitrary points and their sojourn times. As estimation in queues is a recognizably difficult inferential problem, this study focuses on the estimators for the arrival rate, the service rate, and the ratio of these two rates, which is known as the traffic intensity. Simulations are performed to verify the quality of the estimations for sample sizes up to 400. This research also relates notable new insights, for example, that the maximum likelihood estimator for the traffic intensity is equivalent to its moment estimator. Some limitations of the results are presented along with a detailed numerical example and topics for future developments in this research area.

1. Introduction

One of the major research interests in queuing theory is to study the performance characteristics of the queues as a function of their parameters, namely, the arrival rate λ, the service rate μ, or else the traffic intensity ρ. The mathematical problem dealt with here is how these parameters can be estimated using some statistical inferential method. This article focuses on the M/M/s queuing system, which in Kendal notation represents the Markovian number of arrivals, exponential (Markovian) service times, s identical servers working in parallel, and an infinite maximum capacity of users simultaneously allowed in the system, which is one of the classical basic queuing models [1]. From the above parameters and reminding that for M/M/s queues the traffic intensity is defined as the ratio , important performance measures can be derived, such as the empty system probability P0, the expected number of customers in the system L, the expected number of customers in the queue Lq, the expected time in the system W, and the expected time in the queue Wq.

One of the classical basic queuing models [1], M/M/s queues have many applications and consequently great necessity of inference. In fact, many real-life systems are approximately described by queuing models, including computer and telecommunication networks [24], manufacturing and service systems [57], and heathcare systems [810].

Classical inference results for queues (based on the maximum likelihood) are unknown before the results reported in parallel by Clark [11], with the development of the maximum likelihood estimates for λ and μ for Markovian single-served queues, that is, M/M/1 queues, in Kendal notation, and by Benes [12], for Markovian infinite queues, that is, M/M/∞ queues, in Kendal notation. More recently, Schruben and Kulkarni [13] proved that popular performance measures unfortunately do not have expected values and standard errors, Basawa and Prabhu [14] discovered some asymptotic properties of the estimators for the performance evaluation of M/M/1 queues, Zheng and Seila [15] showed that under a certain upper limit for the traffic intensity (), the estimators do exist and have the necessary properties, and Almeida et al. [16] proposed bias correction methods for classical estimators for M/M/1 queues.

However, probably because of the inherent computational difficulties, Bayesian inference has not gained enough interest until the pioneering results reported by Armero and Bayarri [1719], which were followed by the improvements from Armero and Conesa [2023], McGrath et al. [24], McGrath and Singpurwalla [25], Choudhury and Borthakur[26], and, more recently, Almeida and Cruz [27], Cruz et al. [28], Quinino and Cruz [29], Cruz et al.[30], and Choudhury and Basak, [31], among many others.

Almost all the literature that addresses estimation issues describes methods that require the continuous observation of the process over a fixed interval of time. Among the exceptions is the article by Basawa et al. [32], which considers the estimation for M/M/1 queues from the sojourn time data, and the articles by Choudhury and Borthakur [26] and Chowdhury and Mukherjee [33], in which the number of customers present in the M/M/1 queue at successive departure epochs is used to estimate the traffic intensity. In Ross et al. [34], a method that requires the numbers in the queue at successive time points, which are not necessarily equally spaced, has been developed. The specific results presented by Ross et al. [34] apply to M/M/s queues with large number of server s (in particular when , a limit that generates the smallest estimation errors).

Thus, this study aims to derive a maximum likelihood estimator for ρ in M/M/s queues, from samples of the number of clients in the system at arbitrary points, which has been shown to be conveniently equivalent to the moment estimator. Additionally, with the use the additional information from samples of the sojourn times, a maximum likelihood estimator for the arrival and the service rates is derived. To the best of the authors’ knowledge, these have not been done before.

Summarizing, the main contributions of this paper are as follows. The traffic intensity moment estimator is proved to be equal to the maximum likelihood estimator. Also, it is shown that if additional information of the sojourn times is used then estimators for the arrival and service rates can be deduced. Finally, it is demonstrated by extensive computational experiments that the method of moments is competitive to estimate arrival and service rates in M/M/s queues.

The rest of this paper is organized as follows. In the next section, the probabilistic model of the M/M/s queue is detailed, the respective maximum likelihood estimator for ρ is deduced, demonstrating its equivalence to the moment estimator, and the estimators for λ and μ are derived. In the following section, results from Monte Carlo simulations are presented to attest the efficiency of the estimators, along with an illustrative numerical example. In the last section, the paper is concluded, with discussions, final remarks, and the presentation of some topics for future research in this area.

2. Materials and Methods

2.1. Probabilistic Model

In M/M/s queues, the arrivals of the customers are assumed to follow a Poisson process with rate λ, which means that the times between successive arrivals are independent exponentially distributed random variables with mean 1/λ. Upon arrival, a customer goes immediately into service, if any of the servers are free, or else joins the queue if all s servers are busy. After service, a customer leaves the system and if there are any customers waiting, the next one in line enters the service. The successive service times are assumed to be independent exponential random variables with mean 1/μ.

Denoting by the number in the system at time t, then is a birth-and-death process with , if 1 ≤ n s; , if n > s; and , if . Defining as the traffic intensity (also known as the server utilization), it is required that ρ < 1, for the queue to be stable, which is a common assumption in queuing theory [1]. The traffic intensity ρ may also be seen as the average proportion of time that each of the servers is occupied (assuming that the jobs finding more than one vacant server choose their servers randomly). If the traffic intensity is such that , then the system is under equilibrium and it has a stationary distribution. From the theory developed for birth-and-death processes, the stationary distribution of the number of customers N in the system at the departure epochs is given by (for instance, see Gross et al. [1])in which is given from the usual boundary condition that the probabilities must sum to 1; that is,

2.2. Maximum Likelihood Estimator for the Traffic Intensity

In this section, the traffic intensity is the focus; that is, the ratio . To generate data, it is necessary to observe the system at arbitrary epochs. Such a scheme ensures that the data-generating process is consistent with the distribution given by (1). Assume that the number of customers in the queue is . If constitutes our sample of size , then the corresponding likelihood function isin which is the indicator function. It is pertinent to note here that this data-generating scheme must ensure the independence of the sample observations, provided that they are sufficiently spaced in time.

The maximum likelihood estimator for the traffic intensity is the value that maximizes the likelihood given by (3) for . Observe that (3) may be written as , in which α is a constant, and that the value that maximizes (3) also maximizesThe value that maximizes (4) is the maximum likelihood estimator, . Applying the natural logarithm to (4), we obtainMaking the derivative of (5) equal to zero will produceThe value that solves (6) is the maximum likelihood estimator, . However, because (6) is not algebraically simple, it is necessary to implement a numerical search to find , as we will show shortly. For our convenience, the software selected was MATLAB® [35], but many others could be used as well.

To better understand the behavior of (6), it is interesting to closely analyze some special cases of s. Table 1 illustrates some of these special cases for s = 1, 2, 3, and 4, in which . Observe that each polynomial is of a degree equal to the number of servers, s. Figure 1 illustrates the likelihood function, , and the derivative of its natural logarithm with respect to , , for s = 4 and sample value . The maximum likelihood estimated by the software is . It is noticeable that the functions were well behaved and seem to be unimodal. Other examples for different values of s were tested, and the results (not shown) were similar.

2.3. Traffic Intensity Moment Estimator versus the Maximum Likelihood Estimator

The moment estimator [36] for can be obtained by setting the analytical expression of the expected number of clients in the queuing system, L, equal to the average of the sample of the number of clients at the arbitrary epochs, . In accordance with Gross et al. [1], the expected number of clients in an M/M/s queuing system is given byTo obtain the moment estimator, , we set (7) equal to the average of the sample, . In other words, the moment estimator is the value that satisfieswhich should be solved preferably by numerical methods since finding an analytical solution does not seem to be trivial.

After some computational experiments (not shown), the authors observed that the results from the maximum likelihood and moment estimators were actually equal, which led to the conjecture that, in M/M/s queuing systems, the maximum likelihood estimator and the moment estimator for the traffic intensity are equivalent. Indeed, it is easy to verify that both estimators are equal by showing that an algebraic manipulation of (6) will produce (7):Thus, the moment estimator is in fact equal to the maximum likelihood estimator, which is very convenient because, in general, obtaining the moment estimator is simpler than obtaining the likelihood estimator and this facilitates the explanation of the estimation process. In addition, in this particular case, the moment estimator for the traffic intensity inherits the good asymptotic properties shared by all maximum likelihood estimators.

2.4. Arrival and Service Rate Estimators

If the sample related to the number of clients in the system at arbitrary epochs is collected and also the sojourn times of the clients, it is possible to estimate the arrival rate λ and the service rate μ as well. Define the sojourn time of a client in the system, which includes both the waiting time in line and the service time. In accordance with Gross et al. [1], the expected sojourn time W may be expressed asWith a sample of the sojourn times in the system, it is possible to estimate W as . Also with the estimate , which was obtained in the previous section, it is possible to estimate μ asNotice that in (11) may be calculated from (2) using the estimated traffic intensity, . Then, from the traffic intensity, , it is possible to estimate as

3. Performance Analysis of the Estimation Methods

3.1. Simulations

In this section, the performances of the estimation methods described earlier are evaluated. The algorithms were implemented in MATLAB [35], and the codes are available directly from the authors upon request for research and teaching purposes. A Monte Carlo simulation study was conducted as follows to evaluate the methodology described earlier in this paper.

Random samples of the number of customers in the arbitrary epochs were obtained from simulations of an M/M/s queuing system, with sample sizes , traffic intensities , and number of servers . Without loss of generality, the arrival rate was fixed to λ = 10 for all experiments, where the service rate is given by . With these values, it is possible to evaluate a broad range of different situations that are likely to occur in practice.

Table 2 shows the average values (means) and the standard deviations (SDs) of estimates for , , and , from (8), (11), and (12), respectively, obtained from a Monte Carlo simulation procedure, repeated 1,000 times. From Table 2, it is observed that, with the increase of the sample size, the difference between the estimates and the true values reduces and the standard deviation decreases.

3.2. Numerical Example

Now an application based on data collected in a large supermarket network in a region of interest is discussed for a better understanding of the presented methodology. The cashier's desks in this network are separated into normal desks and fast desks. The focus of our example is the latter, which serve exclusively consumers with up to 15 items. The primary interest of those responsible for the supermarket is to evaluate the time between 15:00 and 22:00 on Saturdays, mainly because there is a large volume of buyers at that time, and the volume has increased because the supermarket chain has an advertising agency conducting regular campaigns. The fear is that users will give up shopping or not return to this supermarket because of dissatisfaction with the sojourn time. The past experiences of those responsible for supermarkets indicate that the traffic intensity should not exceed 85% to minimize the risk of compromising current and future sales. Based on several previous studies, it is reasonable to assume that consumers arrive during a given period of time approximately according to a Poisson process and that the service times are exponentially distributed. The objective is to evaluate the traffic intensity, arrival rates, and the service rates for an M/M/6 queuing system, when the current number of servers (that is, 6) is insufficient.

In this scenario, 200 random observations of the number of customers in the system were collected, 28 at each of 8 consecutive Saturdays, at times sufficiently spaced and previously defined by the person responsible for data collection. The observed values (O) and frequencies (F) are presented in Table 3. For example, from the 200 observations, at 2 times, only 1 customer was found in the system and at 6 times, 2 customers were found.

Along with the observation of the number of clients in the system, the sojourn times for 200 randomly selected customers were also collected (which was performed simultaneously with the observation of the number of customers in the system). Figure 2 depicts the histogram for the observed results.

Using the collected data, the estimates from (8), (11), and (12) are =0.9013, =2.0046 clients/min, and =10.8247 clients/min, respectively. The data suggests that s = 6 servers are insufficient to handle the clients and to keep the traffic intensity below 85%. Using the previous estimates for μ and λ, we can verify that increasing the number of servers to s = 7 would reduce the traffic intensity to =10.8247/(7×2.0046) = 0.7714, which would result in an acceptable performance for the queuing system (that is, below 85%).

4. Conclusions and Final Remarks

The estimation of the traffic intensity ρ, the arrival rate λ, and the service rate μ, for Markovian multiservice queues, known in Kendall notation as M/M/s queues, is a challenging mathematical problem that was solved here by statistical inferential methods. For the M/M/s queues, even without knowledge of the arrival rate and queue length of service, it is possible to make inferences about the traffic intensity by collecting a random sample of the number of customers in the system at the arbitrary epochs. If, in addition to the number of customers, the sojourn times in the queue are also collected, then it is possible to accurately estimate the arrival and service rates by means of simple expressions, as demonstrated here. In fact, a comprehensive set of Monte Carlo simulations were performed to show that estimates for ρ, μ, and λ may be obtained that are very close to the true values, especially when the sample sizes are close to 400.

Finally, it is worthwhile to mention that future studies in this area could include extensions to more general single-server queuing systems, such as M/G/1 and G/M/1, and general multiserver queues, such as M/G/s and G/M/s. These are only a few of the many topics for future research in this area.

Data Availability

The data used to support the findings of this study are included within the article.

Disclosure

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this article.

Acknowledgments

This research is partially supported by the Brazilian agencies CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnológico), under Grant 305515/2018-7, and by FAPEMIG (Fundação de Amparo à Pesquisa do Estado de Minas Gerais), under Grant CEX-PPM-00564-17.