Abstract

We extend the analysis of queueing systems for real-life situations, where the arrival pattern of customers is unknown. In real systems, we must understand how the choice of a method of estimation influences the configuration of the system. Using kernel smoothing, we evaluate algorithms to estimate performance measures of a 𝐺𝐼𝑋/𝑀/𝑐/𝑁 system, including the invariant probability distribution of the number of customers in the system, the blocking probability, the average queue size, and the average client queue time. We successfully apply the method to the arrivals to a call center to plan and improve the performance of these important queueing systems.

1. Introduction

There is a large practical interest in investigating the behavior of general-arrival queueing systems, namely, those of 𝐺𝐼𝑋/𝑀/𝑐/𝑁 type, because when managing real queueing systems, the behavior of the arrival process generally is not Markovian. In Kendall [1] notation, in these queueing systems, the interarrival times are independent and follow a general distribution (𝐺𝐼), the service times follow an exponential distribution (𝑀), we have 𝑐 identical servers working in parallel, and a maximum capacity of 𝑁 users that are simultaneously allowed in the system, including those in service. Finally, 𝑋 is a random variable representing the size of group (bulk) arrivals. Such queueing systems could be used in situations where we have relative control over how the servers work, but we do not know beforehand how the customers arrive at the system.

Naturally, the mathematical model depends on the type of queueing system considered, and there are several methods to obtain such models. The most widely used methods are those that attempt to explain the density functions of interarrival and service times by means of parametric statistical models. Nevertheless, real data sometimes do not fit well into parametric models; instead, they produce intractable models. Exact results for performance evaluation of Markovian and some simple general queueing systems are known [2], but such systems sometimes are not found in real life.

An alternative approach is using nonparametric methods to study queueing systems. Nonparametric methods that use kernel smoothing have received much attention lately [3]. Kernel estimators provide a simple way of finding structure in data sets without imposing a specific parametric model [4], which gives us flexibility to handle virtually any data set. There is extensive literature discussing queueing systems [5] and kernel smoothing [4] as separate concepts, but virtually no study has brought the two concepts together.

The contribution of this paper is twofold. First, we implement algorithms to calculate performance measures of queueing systems, where the density of the interarrival time is estimated by the kernel method. Second, we evaluate the performance of these algorithms as a function of the kernel estimator, the smoothing window, the intensity rate, and the system size. We also present a case study in a call center that illustrates the usefulness of the method.

First, we present a literature review about queueing systems and kernel estimators as well as the fundamental concepts required to understand the proposed queueing system model. We also discuss the use of kernel estimators, the chosen models, and the issue of selecting the smoothing parameter. Then, we describe how we estimated system performance and present the comparative results of simulations with the different methods discussed. We then apply our methods to a call center case and end with our main conclusions and some ideas for future work in the area.

2. Literature Review and Fundamental Concepts

2.1. Previous Works

There are many situations in real life where queues occur and queueing models may be helpful. Recently, queueing models have been used successfully in manufacturing processes [6, 7], transportation [8], airports, ports, and product distribution systems [9], computer and telecommunication systems [10], call center modeling [11], and the analysis of health systems [12]. Queues may cause the quality of the services or the prices of the goods to rise or fall, depending on their efficiency [8], which may be estimated by means of the mathematical tools developed in queueing theory.

As we mentioned earlier, there is not much literature that incorporates both general arrival queueing systems and kernel smoothing. Takács [13] analyzed a closed solution for various systems that have nonspecific distributions, including some multiserver queues such as 𝐺𝐼/𝑀/𝑐 and 𝑀/𝐺/𝑐. Hokstad [14] established some closed form results to the 𝐺𝐼/𝑀/𝑐/𝑁 system. Chaudhry and Templeton [15] analyzed various types of queue with bulk arrivals. Zhao [16] proposed a closed form solution for the 𝐺𝐼𝑋/𝑀/𝑐 system, and Bareche and Aïssani [17] proposed a method to evaluate the proximity of 𝐺𝐼/𝑀/1 and 𝑀/𝑀/1 systems when the density of the interarrival time is estimated by kernel estimators.

Concerning kernel smoothing, Wand and Jones [4] introduced general fundamental concepts. Regarding the issues related to the asymmetry of the random variable under analysis (e.g., nonnegativity), Zhang et al. [18] proposed a boundary corrected kernel estimator based on pseudodata generation, transformation, and reflection around the 𝑌-axis. Chen [19] proposed the use of a gamma kernel to avoid boundary problems present in certain situations. Scaillet [20] studied the application of other asymmetric kernels. Bouezmarni and Scaillet [21] were concerned about the consistency of these asymmetrical estimators. For recent developments in the area of kernel smoothing and a thorough literature review, see Atuncar et al. [22] and de Lima and Atuncar [3].

2.2. The 𝐺𝐼𝑋/𝑀/𝑐/𝑁 Model with Partial Blocks

Vijaya Laxmi and Gupta [23] described a generalization of the system 𝐺𝐼/𝑀/𝑐/𝑁 when customers arrive in groups of size 𝑋 with 𝑃(𝑋=𝑖)=𝑔𝑖 (𝑖1) and mean 𝐸(𝑋)=𝑔. The 𝐺𝐼𝑋/𝑀/𝑐/𝑁 is a finite capacity system such that a customer that arrives to the saturated system is refused with a probability that we will call 𝑃BL, or partial blocks. This refers to the case in which an arrival group of a size greater than the remaining spots in the system is partially denied according to the number of the remaining vacancies until the system is complete. Let 𝐵𝑛 be the number of clients who were served between the arrival of the 𝑛th customer and its successor. Therefore, the number of clients the 𝑛th customer finds in the system at the arrival, 𝑌𝑛, would depend on 𝑋𝑛 and 𝐵𝑛 such that 𝑌𝑛+1=[min(𝑌𝑛+𝑋𝑛,𝑁)𝐵𝑛]+. Because 𝑌𝑛+1 depends only on 𝑌𝑛, 𝐵𝑛, and 𝑋𝑛 but not on 𝑌𝑛1, 𝑌𝑛2, and so on, the stochastic process {𝑌𝑛} is a first-order Markov Chain.

Vijaya Laxmi and Gupta [23] report that when the traffic intensity rate 𝜌=(𝜆𝑔/𝑐𝜇), where 𝜆 is the arrival rate and 𝜇 service rate, is smaller than 1, this Markov Chain has an invariant probability distribution of 𝜋𝑘=lim𝑛𝑃(𝑌𝑛=k), 𝑘=0,1,2, associated with the number of clients an arbitrary customer finds in the system at arrival. The 𝜋𝑘’s are often called prearrival probabilities.

Prearrival probabilities can be determined by the following system of linear equations:𝜋𝑘=𝑁𝑗=0𝑝𝑗𝑘𝜋𝑗,(2.1) for 𝑘=0,1,2,,𝑁1, and𝑁𝑗=0𝜋𝑗=1,(2.2) which gives𝑝0,0𝑝11,0𝑝𝑁,0𝑝0,1𝑝1,1𝑝10,𝑁1𝑝𝑁,𝑁1𝜋1110𝜋1𝜋𝑁=0001,(2.3) where 𝑝𝑗𝑘 are named transition probabilities such that𝑝𝑗𝑘𝑌=𝑃𝑛+1=𝑘𝑌𝑛==𝑗𝑖=1𝑃𝑋𝑛=𝑖,𝑌𝑛+1=𝑘𝑌𝑛==𝑗𝑖=1𝑔𝑖𝑃𝑌𝑛+1=𝑘𝑌𝑛=𝑗,𝑋𝑛==𝑖𝑖=1𝑔𝑖𝑃𝐵𝑛==min(𝑗+𝑖,𝑁)𝑘𝑁𝑗𝑖=max(1,𝑗𝑘)𝑔𝑖𝑃𝐵𝑛+=𝑗+𝑖𝑘𝑖=𝑁𝑗+1𝑔𝑖𝑃𝐵𝑛==𝑁𝑘𝑁𝑗𝑖=max(1,𝑗𝑘)𝑔𝑖0𝜋𝑗+𝑖,𝑘(𝑧)𝑑𝜏(𝑧)+𝑖=𝑁𝑗+1𝑔𝑖0𝜋𝑁,𝑘(𝑧)𝑑𝜏(𝑧),(2.4) where, 𝑗𝑁, 𝑘𝑁, 𝜋𝑗+𝑖,𝑘(𝑧) is the serving probability of (𝑗𝑘+𝑖) clients under the assumption of the interarrival time 𝜏𝑛=𝑧 and 𝜏(𝑧) is the interarrival time distribution.

We will analyze how 𝜋𝑗+𝑖,𝑘(𝑧) behaves for all possibilities of (𝑗+1) and (𝑘) in regard to the number of servers 𝑐.

(a) (𝑗+𝑖)𝑐 and 𝑘𝑐
When 𝑗+𝑖𝑐 and 𝑘𝑐, there will be more clients than the servers can handle in the entire interval. Because the service process is Markovian, we can treat the server group as a single unit that serves customers at a rate 𝑐𝜇𝑧 and a Poisson distributed transition probability 𝜋𝑗+𝑖,𝑘(𝑧)=𝑒𝑐𝜇𝑧(𝑐𝜇𝑧)𝑗+𝑖𝑘.(𝑗+𝑖𝑘)!(2.5)

(b) (𝑗+𝑖)<𝑐
When 𝑗+𝑖<𝑐, all clients within the system are being served and only 𝑘 customers will remain in the system to time 𝑧. Knowing that the probability of a service time greater than 𝑧 is 𝑒𝜇𝑧, we can describe this transition probability as a Binomial distribution 𝜋𝑗+𝑖,𝑘𝑘𝑒(𝑧)=𝑗+𝑖𝑘𝜇𝑧(1𝑒𝜇𝑧)𝑗+𝑖𝑘.(2.6)

(c) (𝑗+𝑖)𝑐 and 𝑘<𝑐
When 𝑗+𝑖𝑐 and 𝑘<𝑐, there will be (𝑗+𝑖𝑐) customers waiting and 𝑐 customers being served at the beginning of the interval, but there will be (𝑐𝑘) free servers at the end. Let 𝑦 be the interval ending immediately before (𝑗+𝑖𝑐+1) clients are served. If each service time is exponentially distributed with a rate 𝑐𝜇, then 𝑦 is gamma distributed with a shape (𝑗+𝑖𝑐+1) and rate a 𝑐𝜇. The other 𝑐 customers will be served in a time (𝑧𝑦), and only 𝑘 will remain. The transition probability of this subinterval will follow a Binomial distribution with success probability 𝑒𝜇(𝑧𝑦). The values 𝜋𝑗+𝑖,𝑘(𝑧) can be obtained by the convolution of these two variables 𝜋𝑗+𝑖,𝑘(𝑧)=𝑧0𝑒𝑐𝜇𝑦(𝑐𝜇)𝑗+𝑖𝑐+1𝑦𝑗+𝑖𝑐𝑐𝑘𝑒(𝑗+𝑖𝑐)!𝑘𝜇(𝑧𝑦)1𝑒𝜇(𝑧𝑦)𝑐𝑘=𝑐𝑘𝑒𝑑𝑦𝑘𝜇𝑧𝑧0(𝑐𝜇𝑦)𝑗+𝑖𝑐𝑒(𝑗+𝑖𝑐)!𝑘𝜇𝑦𝑒𝑐𝜇𝑦(1𝑒𝜇𝑦𝑒𝜇𝑧)𝑐𝑘=𝑐𝑘𝑒𝑐𝜇𝑑𝑦𝑘𝜇𝑧𝑧0(𝑐𝜇𝑦)𝑗+𝑖𝑐𝑒(𝑗+𝑖𝑐)!𝜇𝑦(1𝑒𝜇𝑦𝑒𝜇𝑧)𝑐𝑘=𝑐𝑘𝑒𝑐𝜇𝑑𝑦𝑘𝜇𝑧𝑧0(𝑐𝜇𝑦)𝑗+𝑖𝑐(𝑒(𝑗+𝑖𝑐)!𝜇𝑦𝑒𝜇𝑧)𝑐𝑘.𝑐𝜇𝑑𝑦(2.7) Therefore, from (2.4), we can get the transitions probabilities 𝑝𝑗𝑘 given by (2.4) when we take each transition probability 𝜋𝑗+𝑖,𝑘(𝑧) as 𝜋𝑁,𝑘(𝑧) when 𝑖>𝑁𝑗𝑝𝑗𝑘=𝑁𝑗𝑖=max{1,𝑘𝑗}𝛽𝑗+𝑖𝑘𝑔𝑖+𝛽𝑁𝑘𝑘=𝑁𝑗+1𝑔𝑖,𝑘𝑐,𝑁𝑗𝑖=max{1,𝑘𝑗}𝑉𝑗+𝑖,𝑘𝑔𝑖+𝑉𝑁,𝑘𝑘=𝑁𝑗+1𝑔𝑖,0<𝑘<𝑐,1𝑁𝑟=1𝑝𝑗𝑟,𝑘=0,(2.8) where 𝑉𝑗+𝑖,𝑘 and 𝛽𝑗+𝑖𝑘 are as follows: 𝑉𝑗+𝑖,𝑘=0,𝑗<𝑘<𝑐,0𝑘𝑒𝑗+𝑖𝑘𝜇𝑧(1𝑒𝜇𝑧)𝑗+𝑖𝑘𝑑𝜏(𝑧),𝑘𝑗𝑐,0𝑧0𝑐𝑘𝑒𝑘𝜇𝑧(𝑐𝜇𝑦)𝑗𝑐(𝑗𝑐)!𝑐𝜇(𝑒𝜇𝑦𝑒𝜇𝑧)𝑐𝑘𝛽𝑑𝑦𝑑𝜏(𝑧),𝑘<𝑐<𝑗,𝑗+𝑖𝑘=0𝑒𝑐𝜇𝑧(𝑐𝜇𝑧)𝑗+𝑖𝑘(𝑗+𝑖𝑘)!𝑑𝜏(𝑧),𝑗+𝑖𝑘0.(2.9) There is a relationship between the vector of prearrival probabilities 𝝅 and the vector of arbitrary time probabilities 𝐏, related to the number of people that an outside observer finds in the system. Viajaya Laxmi and Gupta [23] established a method that proved the relationship 𝑃𝑘=𝜌𝑐min{𝑘,𝑐}𝑔𝑘1𝑖=0𝜋𝑖𝑗=𝑘𝑖𝑔𝑗,0<𝑘𝑁,1𝑁𝑖=1𝑃𝑖,𝑘=0.(2.10) Performance measures (below) are used, including the average queue length 𝐿𝑞, blocking probability of an arbitrary customer 𝑃BL, and average waiting time in the queue 𝑊𝑞, to analyze the efficiency of queueing systems 𝐿𝑞=𝑁𝑖=0(𝑖𝑐)𝑃𝑖,𝑃BL=𝑁𝑖=0𝜋𝑖𝑗=𝑁𝑖𝑔1𝑘=𝑗+1𝑔𝑗,𝑊𝑞=𝐿𝑞𝑔𝜆1𝑃𝐵𝐿).(2.11)

2.3. Kernel Estimators

Suppose that we have a sample of the interarrival times, 𝑋1,,𝑋𝑛, with an unknown density 𝜏(𝑡). The kernel estimator is an analytical tool that provides an effective way of revealing the structure behind such a sample.

2.3.1. Gamma Kernel Estimator

Recently, Chen [19] suggested an asymmetric kernel with naturally varying shape as a way to avoid allocating weight for negative values. The gamma kernel estimators are always nonnegative, free of boundary bias, and achieve the optimal rate of convergence for the mean square error (MSE) in the nonnegative kernel estimator class. Bouezmarni and Scaillet [21] showed that this estimator is consistent and able to avoid boundary bias. Let 𝐾𝐺(𝑝,𝑞) be the gamma density function with parameters 𝑝 (shape) and 𝑞 (rate). The gamma kernel considered is𝐾𝐺𝑡𝑏𝑋+1,𝑏𝑗=𝑋𝑗𝑡/𝑏𝑒𝑋𝑗/𝑏𝑏(𝑡/𝑏)+1Γ[],(𝑡/𝑏)+1(2.12) where 𝑏 is a smoothing parameter that satisfies the condition 𝑏0, 𝑛𝑏 as 𝑛. The gamma kernel estimator iŝ𝜏(𝑡;𝑏)=𝑛𝑛1𝑗=1𝐾𝐺𝑡𝑏𝑋+1,𝑏𝑗.(2.13) The smoothing parameter 𝑏 is critical for the overall performance of the kernel estimator considered. A small 𝑏 leads to a relatively bumpy density, while a large one results in a smooth density. There are several methods to determine the best fit, from a minimization of the mean integrated squared error (MISE) of ̂𝜏(𝑡;𝑏) to the asymptotic behavior of the MISE (AMISE).

(a) Least Squares Cross-Validation (LSCV) Method
The least squares cross-validation (LSCV) method starts from the MISE expansion MISE{̂𝜏(𝑥;𝑏)}=𝐸̂𝜏(𝑥;𝑏)2𝑑𝑥2𝐸̂𝜏(𝑥;𝑏)𝜏(𝑥)𝑑𝑥+𝐸𝜏(𝑥)2𝑑𝑥.(2.14) The minimization of the first term is equivalent to the minimization of MISE{̂𝜏(𝑥;)}𝐸𝜏(𝑥)2𝑑𝑥=𝐸̂𝜏(𝑥;)2.𝑑𝑥2̂𝜏(𝑥;)𝜏(𝑥)𝑑𝑥(2.15) The right-hand side is unknown, because it depends on 𝜏. However, an unbiased estimator for this quantity is LSCV()=̂𝜏(𝑥;)2𝑑𝑥2𝑛𝑛1𝑖=1̂𝜏𝑖𝑋𝑖,;(2.16) where ̂𝜏𝑖=(𝑋𝑖;) is the density estimate based on the sample with 𝑋𝑖 deleted; this is often called the “leave-one-out” density estimator. A disadvantage of this method is that it suffers from high variation.

(b) Asymptotic Behavior of the MISE (AMISE) Method
An alternative parameter selector is to consider the asymptotic behavior of the MISE of the gamma kernel estimator. Chen [19] uses some aspects of the gamma distribution and a Taylor expansion to determine the MISE as follows: MISE(̂𝜏)=𝑏20𝑥𝜏1(𝑥)+2𝑥𝜏(𝑥)2+𝑑𝑥2𝑛𝑏𝜋10𝑥1/2𝜏𝑛(𝑥)𝑑𝑥+𝑂1𝑏1/2+𝑏2.(2.17) The asymptotic MISE disregards the last term; therefore, the optimal 𝑏  that minimizes the leading terms above is 𝑏AMISE=2𝜋10𝑥1/2𝜏(𝑥)𝑑𝑥4𝑛0𝑥𝜏(𝑥)+21𝑥𝜏(𝑥)2𝑑𝑥2/5,(2.18) where the functions 𝜏, 𝜏, and 𝜏 are unknown. These quantities are obtained from the fitted gamma density with parameters adjusted from the sample. This solution still requires further study, but our paper shows promising results.

2.3.2. Zhang et al. [18] Estimator

Zhang et al. [18] submitted a model that works particularly well when 𝜏(0)>0 and combines pseudodata creation, its transformation, and its reflection around the 𝑌-axis in the following three steps.

Step 1. Transform the original data 𝑋1,,𝑋𝑛 to 𝑔(𝑋1),,𝑔(𝑋𝑛), while keeping the original data, where 𝑔 is a nonnegative, continuous, and monotonically increasing function from [0,) to [0,).
Based on extensive simulations, the transformation that best suits a broad variety of densities is 𝑔(𝑥)=𝑥+𝑑𝑥2+𝐴𝑑𝑥3,(2.19) where 𝐴>1/3 and 𝑑=𝑓(0)/𝑓(0).

Step 2. Reflect the pseudodata, 𝑔(𝑋1),,𝑔(𝑋𝑛), around the origin.

Step 3. Based on the enlarged data sample, 𝑔(𝑋1),,𝑔(𝑋𝑛),𝑋1,,𝑋𝑛, define the new estimator as ̂𝜏𝑛1(𝑥,)=𝑛𝑛𝑗=1𝐾𝑥𝑋𝑗𝑋+𝐾𝑥+𝑔𝑗,𝑥0,(2.20) where is a smoothing parameter and 𝐾 is a symmetric probability function with support [1,1] like the Epanechnikov kernel 3𝐾(𝑡)=41𝑡2𝐼[1,1].(2.21) Notice that the transformation 𝑔 defined above is not available in practice, because 𝑑 is unknown. A good estimator can be obtained when 𝑑 is written as (𝑑/𝑑𝑥)log𝑓(𝑥)𝑥=0, 𝑑𝑛=log𝑓𝑛()log𝑓𝑛(0),(2.22) where 𝑓𝑛()=𝑓𝑛()+1/𝑛2, 𝑓𝑛(0)=max(𝑓𝑛(0),1/𝑛2), 𝑓𝑛()=(1/𝑛)𝑛𝑗=1𝐾((𝑥𝑋𝑗)/), 𝑓𝑛(0)=(1/𝑛0)𝑛𝑗=1𝐾0(𝑋𝑗/0), and 𝐾0 is a so-called endpoint kernel, satisfying 01𝐾0(𝑡)𝑑𝑡=1,01𝑡𝐾0(𝑡)𝑑𝑡=0,01𝑡2𝐾0(𝑡)𝑑𝑡0,0=11𝐾(𝑡)𝑑𝑡201𝐾0(𝑡)2𝑑𝑡01𝑡2𝐾0(𝑡)𝑑𝑡211𝐾(𝑡)2𝑑𝑡.(2.23) Zhang et al. [18] proved that for 𝑡, the effect of reflected pseudodata is insignificant, and the estimator can be reduced to the Parzen-Rosenblatt estimator ̂𝜏𝑛1(𝑡,)=𝑛𝑛𝑗=1𝐾𝑡𝑋𝑗.(2.24) They also established that 0̂𝜏𝑛1(𝑡)𝑑𝑡=1+𝑛𝑛𝑖=1𝑔(𝑋𝑖)/𝑋𝑖/𝐾(𝑧)𝑑𝑧.(2.25) Thus, ̂𝜏𝑛(𝑡) only integrates to 1 when 𝑑𝑛=0, so 𝑔𝑛(𝑋𝑖)=𝑋𝑖, or when 𝑋𝑖=0 for all 𝑋𝑖’s, because 𝑔𝑛(0)=0. However, when 𝑛, both limits of the second term will eventually converge to 0 and ̂𝜏𝑛(𝑡) will integrate to 1 asymptotically.
Zhang and Karunamuni [24] used the endpoint kernel 𝐾01(𝑡)=12(1+𝑡)2𝐼+𝑡[1,0](2.26) and showed that this kernel minimizes the MSE when estimating 𝜏(0). Therefore, 0=2 is approximately the optimal smoothing parameter for estimating 𝜏(0) except when 𝜏(0)=0.
Chiu [25] described a parameter selecting method that considered the optimal that minimizes the asymptotic MISE when 𝐾 is a symmetric probability function with up to the fourth moment being finite AMISE=𝐾(𝑥)2𝑑𝑥𝑛𝑥2𝐾(𝑥)𝑑𝑥2𝜏(𝑥)2𝑑𝑥1/5,(2.27) where the function 𝜏(𝑥)2𝑑𝑥 is unknown.
Chiu’s [25] “plug-in” method consists of estimating this quantity through the characteristic function of the sample 𝜑(𝜆)=𝑛𝑛1𝑗=1𝑒𝑖𝜆𝑋𝑗,(2.28) and calculating the optimal with the previous formula. The characteristic function of 𝜏 is 𝑒𝜑(𝜆)=𝑖𝜆𝑥𝜏(𝑥)𝑑𝑥.(2.29) By the inversion formula, we have 𝜏(𝑥)=(2𝜋)1𝑒𝑖𝜆𝑥𝜑(𝜆)𝑑𝜆,(2.30) thus, 𝜏(𝑥)=(2𝜋)1𝜆2𝑒𝑖𝜆𝑥𝜏𝜑(𝜆)𝑑𝜆,(𝑥)2𝑑𝑥=(2𝜋)1𝜆2𝑒𝑖𝜆𝑥𝜑(𝜆)𝑑𝜆2=𝜆𝑑𝑥4(2𝜋)1𝑒𝑖𝜆𝑥𝜑(𝜆)𝑑𝜆2=𝜆𝑑𝑥4[𝜏(𝑥)]2𝑑𝑥.(2.31) Using Parseval’s identity, we can show that 𝜆4[]𝜏(𝑥)2𝑑𝑥=(2𝜋)1𝜆4||||𝜑(𝜆)2𝑑𝜆.(2.32) Chiu [25] introduced a cutoff value Λ for 𝜆 such that |𝜑(𝜆)|2<𝑐/𝑛. Extensive computational experiments from Bessegato et al. [26] show that 𝑐=3 is the value that minimizes the estimator variance. The final “plug in” estimator is then AMISE=𝐾(𝑥)2𝑑𝑥𝑛𝑥2𝐾(𝑥)𝑑𝑥2𝜋1Λ0𝜆4||||𝜑(𝜆)2𝑛1𝑑𝜆1/5.(2.33)

3. Experimental Results

This section presents some results of simulations for 𝐺𝐼𝑋/𝑀/𝑐/𝑁 systems with partial blocks, where the interarrival time is estimated through the following kernel methods:(i)gamma kernel estimator with LSCV method,(ii)gamma kennel estimator with optimal 𝑏AMISE,(iii)Zhang et al. [18] estimator with Chiu’s [25] “plug in” method.

To evaluate the performance of the estimators above, we will compare the mean square error (MSE) of each estimated prearrival probability in the following way.

Step 1. Generate a sample of size 𝑛 of general interarrival distribution 𝜏.

Step 2. Calculate the mean service rate 𝜇=𝑔/𝜌𝑐E(𝜏).

Step 3. Estimate the optimal smoothing parameter or 𝑏.

Step 4. Use the kernel density method to estimate the theoretical density function 𝜏(𝑥).

Step 5. Find each estimated transition probability.

Step 6. Solve the linear system: ̂𝑝0,01̂𝑝1,0̂𝑝𝑁,0̂𝑝0,1̂𝑝1,11̂𝑝0,𝑁1̂𝑝𝑁,𝑁1111𝜋0𝜋1𝜋𝑁=0001.(3.1) The algorithm above was coded in R 2.8.0 (or earlier versions, see [27]) and is available upon request from the authors for educational and research purposes.
The experiments are based on random samples of interarrival times of size 𝑛=100. Slightly larger and smaller samples were also tested, but the results (not shown) are similar. The theoretical interarrival distributions considered in this experiment were(i)Weibull distribution with shape = 2 and rate = 20,(ii)gamma distribution with shape = 10 and rate = 2,(iii)gamma mixture distribution of 0.45 × gamma (5; 2) + 0.55 × gamma (30; 1).The Weibull density has 𝜏(0)0, the pure gamma has 𝜏(0)=0, and the gamma mixture is bimodal. We choose Weibull and gamma distributions because of their well-known flexibility for modeling real databases. For simplicity, the group size 𝑋 is constant and equal to 1. Two different numbers of servers were considered, 𝑐=5 and 10, with maximum capacities of 𝑁=20 and 25, respectively, (which result in buffers of fixed sizes equal to 15). Figures 1 and 2 show the results, where 𝜋𝑘 is the theoretical distribution of the number of clients and MSE(𝜋𝑘) is the mean square error of the estimates.

4. Discussion

In general, the errors decreased as the number of servers 𝑐 increased, and they were dependent on the theoretical distribution considered. We obtained the largest errors for the last distribution (gamma mixture). The LSCV method is better than 𝑏AMISE on the first and last distributions. Zhang et al.’s [18] estimator has the worst performance on the first and a competitive performance on the second distribution (gamma distribution). All the estimators performed well with the second distribution. The last distribution (gamma mixture) has a particular behavior. Although the errors are large for most of the estimates, the error is small for the blocking probability (i.e., 𝜋20, for the system with 𝑐=5, Figure 1, and 𝜋25, for 𝑐=10, Figure 2). The blocking probability is an important performance measure, because it indicates the fraction of costumers lost by the system.

Finally, simulations showed that when 𝜏(0)0 the gamma kernel method had the best performance. This suggests the Zhang et al. [18] method does not work well when 𝜏(0)>0. At the same time, its behavior with the bimodal density showed a very low MSE for probabilities near the maximum state. This implies a good estimation of the blocking probability and other performance measures.

The method used to select the smoothing parameter for the gamma kernel estimator had no effect. A better selector would have the function of its AMISE optimal parameter estimated like the “plug-in” method for symmetric kernels. The combined use of smoothing parameter selection and Bayesian techniques is promising (see de Lima and Atuncar [3]).

5. Application to a Real Call Center Data Set

We analyzed a real database of 7,761 phone calls made on April 7, 2006, from 8:00 am until 1:00 pm to a call center (data available from the authors upon request). We worked on a set of interarrival and service times to find the required minimum size of the server facility taking into account that the arrival rates are not homogeneous. In fact, larger interarrival times were detected toward the beginning of the day rather than throughout the day. This will lead to the adjustment of a different system for each hour of service that had an approximately homogeneous arrival rate. Using this data, we will model a 𝐺𝐼𝑋/𝑀/𝑐/𝑁 system with a kernel interarrival density estimate for each hour of service.

The data set is presented in seconds, which is the precision given by the data acquisition system. This constraint will lead to ties in the arrival times (i.e., more than one call can arrive in the same second). Therefore, we treated same-time calls as part of a single arrival group and fit a discrete probability distribution. Table 1 shows the observed frequency of group sizes 𝑋 and the fitted distribution 𝐷 by hour. These arrivals were observed in each second.

We used the gamma kennel-estimator with optimal 𝑏AMISE as the kernel density estimation method. This method gave estimated densities bounded at 𝜏=0 and illustrates the information we miss when data is rounded. We also considered the gamma kernel estimator with LSCV method, but it did not behave as well with discrete data. Table 2 shows the smoothing parameter calculated, and Figure 3 shows the estimated densities.

Service time distribution fits an exponential distribution with parameter 𝜆=0.003339 and standard error of 3.8×105. The behavior of the system then becomes restricted to the choice of the number of servers 𝑐 and capacity of the system 𝑁. In Table 3, the minimum number of servers required is set to maintain the stability of the system at each hour considered.

To optimize system performance, it is necessary to establish some criterion function for the adopted design. If we only consider the effective arrival rate (𝜆EF=𝜆[1𝑃BL]), we can describe this performance measure according to the number of servers and the maximum queue size. Figure 4 shows this relationship from 8 to 9 am. We saw similar results (not shown) for different periods. A good criterion would be the 𝐶=(𝐶1/𝜆EF+𝐶2𝑐+𝐶3𝑁), where 𝐶1, 𝐶2, and 𝐶3 are costs related to each parameter and defined by the system environment. As an example, if we have 𝐶1=500, 𝐶2=1 and 𝐶3=1, the state with minimal criterion would be 𝑐=19 and 𝑁=20. If cost 𝐶1 is raised to 600, the best configuration would be 𝑐=21 and 𝑁=21.

Figure 5 shows the invariant distributions estimated for other periods with different queue and server facility sizes. The method seems helpful for adjusting the maximum size 𝑁, and it is clear that the period plays a key role.

6. Conclusions

The main focus of this paper is on using kernels in the analysis of queuing systems. We studied the adequacy of the kernel estimator methods for calculating the invariant probability distribution and performance measures of queueing systems that have general interarrival distribution times with bulk arrivals. Discussions in simulations showed that the method is effective. Also, we successfully applied the method to the calls from a call center to plan and improve the performance of these important queueing systems.

Future research may take other directions—for instance, determining how the estimate of the prearrival invariant distribution moves away from its real value. One approach is from the variance of the bias and variance of each probability estimated. For example, the variance of an estimate of transition probability 𝑝𝑖𝑗 could depend on the variance of term ̂𝛽0, so we would need to findvar̂𝛽0=var0𝑒𝑐𝜇𝑧.̂𝜏(𝑧,)𝑑𝑧(6.1) This research can be applied to several areas of practical interest, including health and industry. For example, it could be critical in a medical emergency room that must optimize resource allocation.

Another possible future direction for research in this field involves the development of models to deal with dependency for the interarrival times. Such a case is particularly true in computer networks. Thus, in computer communications networks, people are interested in finding proper bounds of arrivals (see Li and Zhao [28]), with the assumption that arrivals are dependent (see Li and Zhao [29]), which are recently published results based on the so-called deterministic queuing theory (for details, see Le Boudec and Thiran [30]).

Funding

The Brazilian government funding agencies mentioned above had no role in the study.

Acknowledgments

This research is supported by CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnológico; Grants no. 201046/1994-6, 301809/1996-8, 307702/2004-9, 472066/2004-8, 304944/2007-6, 561259/2008-9, 553019/2009-0, 550207/2010-4, 501532/2010-2, and 303388/2010-2), by CAPES (Coordenação de Aperfeiçoamento de Pessoal de Nível Superior; Grant no. BEX-0522/07-4), by FAPEMIG (Fundação de Amparo à Pesquisa do Estado de Minas Gerais, Grants nos. CEX-289/98, CEX-855/98, TEC-875/07, CEX-PPM-00401/08, and CEX-PPM-00390-10), and by PRPq-UFMG (Pró-Reitoria de Pesquisa da Universidade Federal de Minas Gerais).