Abstract

A multiserver queueing system, the dynamics of which depends on the state of some external continuous-time Markov chain (random environment, RE), is considered. Change of the state of the RE may cause variation of the parameters of the arrival process, the service process, the number of available servers, and the available buffer capacity, as well as the behavior of customers. Evolution of the system states is described by the multidimensional continuous-time Markov chain. The generator of this Markov chain is derived. The ergodicity condition is presented. Expressions for the key performance measures are given. Numerical results illustrating the behavior of the system and showing possibility of formulation and solution of optimization problems are provided. The importance of the account of correlation in the arrival processes is numerically illustrated.

1. Introduction

Queueing theory is widely used for decision making about the resources needed to provide service in a variety of real life systems including contact centers, intelligent transportation systems, telecommunication networks, manufacturing and administrative systems, and banking. Due to the increasing technical and logical complexity of real life systems and diversity of the provided services, queueing models of these systems and their fragments become more and more involved. One of the essential features of modern systems, which should be accounted in queueing models, is that the parameters of the system operation may vary, for example, depending on time of a day or night, available amount of required resources, and possible sharing with some other systems. This fact gave rise to the progress in the study of the so-called queues operating in the random environment ().

In contrast to the classical queueing models, where the parameters and the distributions characterizing arrival, service, and other processes describing dynamics of the system are assumed to be fixed in advance, queues operating in the presuppose that some or all parameters may dynamically vary due to influence of some external medium called Early research in this topic was done by Gnedenko and Kovalenko [1], Yechiali and Naor [2], Yadin and Syski [3], O’Cinneide and Purdue [4], Purdue [5], Neuts [6, 7], and others. A brief history of the development of theory of queues in the , the reference list, and real life examples of queues in the can be found, for example, in the papers [812]. In [8], an unreliable retrial queue in a Markovian random environment is analyzed via matrix-analytic methods. Ergodicity condition is proved and approximate distribution of the number of customers in the system is computed. Optimization problem of choosing the arrival and service rates for each environment state is considered. In [9, 10], the finite source retrial queue operating in a random environment is studied. The arrival flow is described by the Markov arrival process () (for definition, more details, and usefulness in modelling in telecommunications, see [1315]), and the service time has a phase-type () distribution (for definition and more details, see [7]). It is assumed that the parameters defining the process and distribution depend on the state of the that is a continuous-time Markov chain with a finite state space. In [9], it is assumed that there is additional arrival process of negative customers. The arrival of the negative customer with equal probability goes to any busy server to remove the customer being in service. In [9, 10], the finite state multidimensional Markov chain describing the behavior of the systems is investigated. The algorithms for calculating the stationary state probabilities are elaborated. Main performance measures are obtained and the illustrative numerical examples are presented. In [11], the queue operating in the is investigated. The arrival flow is described by the batch Markov arrival process (). The system does not have a buffer. An arriving customer who did not succeed to find a free server upon arrival is lost. Due to possibility of batch arrivals, disciplines of partial admission, complete admission, and complete rejection are analyzed. The stationary distribution of the system states and the waiting time distribution are computed. Numerical illustrations are presented. In particular, it is demonstrated that reasonable engineering approximations of performance measures of the system may be very poor. In [12], the retrial queue operating in the is investigated. As in the previously described model, the system does not have a buffer. But the customer who did not succeed to find a free server upon arrival is not lost, but he proceeds to the retrial orbit, which is a location from which customers may attempt to gain or regain service. The intensities of repeating the attempts are also assumed to be dependent on the state of the In [12], the sufficient condition of stability of the system is proved, the stationary distribution of the system states is computed. The presented numerical examples illustrate a poor quality of the approximation of the main performance measures of the system by means of the simpler queueing models. An effect of possible smoothing the traffic and an impact of the retrial intensity are shown. Analysis presented in [12] is more complicated than the analyses in [9, 10] due to two evident reasons: (i) the state space of the multidimensional Markov chain describing the behavior of the system is finite in [9, 10] and is infinite in [12]; (ii) the generator of the multidimensional Markov chain has tridiagonal structure in [9, 10] (i.e., the chain is level dependent quasi-birth-and-death process) while it has more general upper-Hessenbergian structure in [12].

The main advantages of the model considered in this paper, compared to the ones analyzed in [11, 12] and to all the other papers devoted to multiserver queues operating in the , are as follows.(i)The model under study has more flexible service discipline that combines the features of loss systems, systems with a finite buffer and systems with retrials. The queueing systems that incorporate both normal queues and retrial orbits are called hybrid retrial queues. The importance of their investigation stems from the fact that many modern technologies of customers random access assume the existence of some places where the customers, who did not succeed to get access upon arrival, may be temporarily kept (in registers for handover customers in cells of mobile communication networks, in IVR (Interactive Voice Response) machines in call centers, etc.). Such hybrid systems were considered, for example, in [16, 17]. Queueing systems in with customers loss considered in [11] and with retrials considered in [12] are obtained as special cases of the hybrid retrial queue in considered in this paper.(ii)We suppose that the capacity of the service area of the system (the number of servers and places in the buffer) may depend on the state of the , while it is usually assumed that the capacity is constant and the influences only the parameters of the arrival and service processes. Note that the changes in service capacity may be considered as server breakdowns, among other possibilities.(iii)Customers balking, impatience (abandonment), and nonpersistence are taken into account. This allows us to account psychology of customers, use of visible queue option, information obsolescence during the waiting time, customer’s mobility, service provider’s competition, and so forth.

Due to the generality of the considered queueing model, it has a lot of possible applications for the investigation of a variety of real life systems. Besides the examples of application to the performance evaluation, capacity planning, and optimization of some systems, which coincide with the ones described in [12] (application to modelling hot spot in airport and wireless local area network), it is worth to mention important potential applications for work force management in call-centers. Fluctuation of the intensities of customer arrivals and retrials occurs here, for example, due to the well known existence of hours of low, middle, and peak load of the center. The servers correspond to the operators of the call-center and places in the buffer correspond to IVR machines. Impatience of customers staying in the buffer reflects the possibility to get the requested information directly from IVR without contacting to an operator. It is necessary to create the schedule of the work of the operators and IVRs to dynamically fit the number of the active operators and IVRs to the current level of the load of the call-center in such a way as to guarantee the best quality of the customer’s service under the existing restrictions imposed on the total number of available operators and their working schedule and on energy consumption by IVRs.

The rest of the paper is organized as follows. In Section 2, the mathematical model is described. The process of the system states is defined; its generator in a block matrix form is presented. In Section 3, the stability condition is derived and the problem of computation of the stationary probabilities of the system states is touched on. Formulas for computation of the steady state performance measures of the system are presented in Section 4. Numerical illustrations are given and briefly discussed in Section 5. Section 6 concludes the paper.

2. Mathematical Model

We consider a retrial multiserver queueing system with varying capacity and behavior of the customers. The structure of the system under study is presented in Figure 1.

The dynamics of the system depends on the state of the . The is defined by the stochastic process ,  , which is an irreducible continuous-time Markov chain with the state space and the infinitesimal generator

Under the fixed state of the , capacity of the system is equal to including servers, , and a waiting room (buffer) of size , Without the loss of generality, we assume that the states of the are enumerated in ascending order of the capacity of the system; that is, We also suggest that We call the servers and the buffer as the service area of the system.

Arrival of customers is defined by the Markovian arrival process (). The underlying process of this is ,  , where is the state of the and the process has a finite state space . Under the fixed state of the the process behaves as irreducible continuous-time Markov chain. The sojourn time of this chain in the state is exponentially distributed with the positive finite parameter When the sojourn time in the state expires, with probability , the process jumps to the state without generation of a customer, ,  , With probability , the process jumps to the state with generation of a customer, ,

The behavior of the arrival process under the fixed state of the is completely characterized by the matrices and defined by the entries

The square matrix of dimension represents the generator of the process ,  , under the fixed state , .

The average arrival rate under the fixed state of the is given as where is the invariant vector of the stationary distribution of the Markov chain , , under the fixed state . The vector is the unique solution to the system

The squared coefficient of variation of intervals between successive arrivals under the fixed state of the is given as

The coefficient of correlation of two successive intervals between arrivals under the fixed state of the is given as

Let us introduce the following matrices:

The averaged (over all the states of the ) intensity of input flow of customers is defined as where the vector is the unique solution of the system The squared coefficient of variation of intervals between successive arrivals is given as The coefficient of correlation of two successive intervals between arrivals is given as

We assume that, during the epochs of the transitions of process , , the states of the process , , do not change, and only the intensities of transitions of this process change.

If, at an arbitrary arrival moment, some server is free, an arriving customer starts the service. If all servers are busy, but the buffer is not full, the customer is placed to the buffer. If the buffer is full and the state of the is , the customer balks (leaves the system permanently) with probability or, with the complementary probability, moves to orbit and tries again later on.

If the transits from the state to the state , where , the capacity of the system (the number of available servers and/or places in the buffer) decreases. If the current number of customers in service and/or in the buffer exceeds the available number of servers and/or places in the buffer under the state of the , the redundant customers leave the service area. The customers leaving the service area go, independently of each other, into orbit with probability or leave the system permanently with the complementary probability.

When the state of the is , , each customer staying in orbit repeats the attempts to reach the service area after an exponentially distributed time described by the parameter , Customers retry independent of each other and are not absolutely persistent. If the attempt is successful, the customer leaves orbit and moves to service or to the buffer. Otherwise, the customer returns to orbit with probability With the complementary probability, the customer leaves the system permanently.

Customers in orbit are impatient. When the state of the is , , each customer may leave the orbit and the system after an exponentially distributed time described by the parameter ,   If the customers staying in the orbit are patient, we put

We assume that the probability of the service completion of a customer during the time interval is equal to when the state of the is ,

If the transits from the state to the state , where , the capacity of the system increases and the corresponding number of customers from the buffer, if any, occupies additional servers.

Mention that in the described queueing model the discipline of customers admission may be different under the various states of the For example, if and , no customers are admitted to the system if the buffer is full; that is, the system operates as the system with a finite buffer. If, additionally, , the system operates as the system with customers loss (Erlang loss model). If , the system operates as the usual retrial system where all arriving customers, who meet the service area full, go into orbit. In the presented analysis, it is accounted that can be equal to 0, that is, for some states of the service to customers are not provided at all.

In order to improve the readability of the paper, we collect and already introduced some new notation in Notation.

Consider the following:(i)let  , , be the number of customers in orbit,(ii)let , , be the state of the ,(iii)let , , be the number of customers in the service area,(iv)let , , be the state of the second component of the underlying process of customers arrivalsat the moment ,

It is easy to see that the process , , is the four-dimensional irreducible Markov chain. Let us enumerate the states of the Markov chain , , in the direct lexicographic order of the components . We call the set of the states having value of the two first components of the Markov chain the macrostate .

Let be the generator of the Markov chain , It is formed by the blocks , consisting of the matrices that define the intensities of the transitions of the Markov chain , , from the macrostate to the macrostate , The diagonal entries of the matrix are negative. The modulus of each entry defines the intensity of departing from the corresponding state of the Markov chain ,  .

Lemma 1. The generator has the following block structure: where
The nonzero blocks ,  , are defined as follows. Consider(i)where Hereinafter, the entry of the matrix , ,  , of dimension defines the probability that when the state of the RE changes from to (and the capacity of the service area decreases from to ) the number of customers in the service area changes from to and customers move to orbit. The matrices have the following nonzero entries. Consider The probability that customers go into orbit in the case when customers leave the service area is given by The matrix ,  ,  , of dimension , which defines the transition probabilities of the component when the state of the RE changes from to (and the capacity of the service area increases from to ), is equal to the matrix Consider(ii)where the entries of the matrices define the intensities of the transitions of components of the Markov chain leading to the arrival of customers into the orbit. These matrices are defined by the following formulas: Consider (iii)

Proof of the lemma is performed by means of the analysis of the intensities of all possible transitions of the Markov chain during the time interval having infinitesimal length. The generator is block upper-Hessenbergian matrix; that is, all the blocks below the first subdiagonal are equal to zero blocks. This stems from the fact that, at any moment, no more than one customer may leave the orbit, so the blocks are equal to zero when The number defines the maximal number of customers that can simultaneously move to the orbit at an arbitrary moment. We recall that one customer may arrive to the orbit if the buffer is full at the customer’s arrival moment and at most customers may arrive to the orbit if the jumps from state to state 1 when all servers are busy and the buffer is full. So, the blocks are equal to zero when This explains structure (11) of generator

The negative diagonal entries of the matrix define, up to the sign, intensities of the exit of the Markov chain from the macrostate This exit can happen due to a customer service completion, attempt of a customer from the orbit to reach the service area, departure of a customer from the orbit due to impatience, change of the state of the , and change of the state of the underlying process of arrivals. The nondiagonal entries of the matrix define intensities of transition of the Markov chain inside of the macrostate These transitions can happen due to the customer service completion, change of the state of the , and change of the state of the underlying process of arrivals. This explains form (13) of the block

Transition from the macrostate to the macrostate ,  , can happen when the state of the changes from to ,   Such a change of the state of the implies the possible reduction of the number of available servers and places in the buffer. This may cause departure of customers from the service to the orbit. Matrices ,  ,  , define the probability that when the state of the changes from to (and the capacity of the service area decreases from to ) the number of customers in the service area changes from to and customers move to the orbit. This explains formula (14). Careful analysis of probabilities, which form the matrices , , , leads to formulas (16).

Transition from the macrostate to the macrostate , , can happen when the state of the changes from to ,   Such a change of the state of the implies the possible increase of the number of available servers and places in the buffer. This explains formula (15).

The increase of the value of the first components of the Markov chain from to ,  , may happen if one customer arrives to the orbit from outside (for ) or customers leave the service area due to the reduction of the available space there. This explains formulas (18)-(19).

Finally, the decrease of the value of the first components of the Markov chain from to may happen if a customer makes an attempt from the orbit when the service area is not full or a customer leaves the orbit due to impatience. This explains formula (20). Lemma is proved.

Remark 2. It is easy to verify that the following limits exist: where is a diagonal matrix with the diagonal entries which are defined as the moduli of the corresponding diagonal entries of the matrix , The matrices , , have the following form: where indicates the Kronecker delta, Here, and are the diagonal matrices, the diagonal entries of which are defined as the corresponding diagonal entries of the matrices and , respectively.
According to the definition given in [18], the existence of the limits , , means that the Markov chain ,  , belongs to the class of continuous-time asymptotically quasi-Toeplitz Markov chains (). This fact allows us to use the results of [18] for derivation of the sufficient condition of the ergodicity of the Markov chain and computation of its steady state distribution.

3. System Stability and Stationary Distribution

As follows from [18], the sufficient condition for the ergodicity of is the fulfillment of the inequality where the vector is the unique solution to the system

Thus, to check whether or not the Markov chain is ergodic, it is necessary to substitute the matrices ,  , to system (26), solve this system, and verify the fulfillment of inequality (25). If this inequality is fulfilled, the Markov chain under study is ergodic. The solution of a finite system of (26) on computer does not meet any essential problems.

Mention that there exists an important particular case (the customers are impatient or nonpersistent at least for one state of the ) when ergodicity of the Markov chain can be established more easy than via solution of system (26) and verification of inequality (25). For this case, the following statement is true.

Theorem 3. If customers are impatient or nonpersistent ( or ) at least for one state of the RE, then the Markov chain is ergodic for any set of parameters of the queueing system under study.

Proof. Let be the set of the states of the for which at least one of inequalities or , is fulfilled, It can be verified that, in this case, by means of coordinated perturbations of the block rows and block columns, the matrix can be transformed to the canonical normal form Here, is the matrix obtained from the matrix by means of removing the block rows and block columns with numbers ,   is the matrix obtained from the matrix by means of removing the block rows with numbers , , and the block columns with numbers ,  , and This means that the matrix is reducible. So, it follows, from Theorem  2 in [18], that the sufficient condition for the ergodicity of the Markov chain can be rewritten in the form where the vector is the unique solution to the system and the matrices and are obtained from the matrices and by means of removing the block rows and block columns with numbers ,   It is easy to check that ,   and is the stochastic matrix. Consequently, Theorem is proved.

In the sequel, we assume that the ergodicity condition is fulfilled. Then, the following stationary probabilities exist:

Let us form the row-vectors as follows:

It is well known that the vectors , , satisfy the system where is the generator of , . System (33) is infinite and cannot be directly solved on computer. However, it can be successfully solved by means of the numerically stable algorithms developed in [18, 19].

4. Performance Measures of the System

Having computed the vectors of the stationary probabilities , , it is possible to compute a variety of the steady state performance parameters of the system.

The distribution of the number of the customers in orbit is

The average number of customers in the service area is

Here and throughout this section, formulas for the main performance measures contain the infinite sums. However, this does not create essential difficulties in computer implementation. It is well known that if the ergodicity condition is fulfilled, the stationary probability vectors converge in norm to zero vector when approaches infinity. So, computation of an infinite sum may be stopped if the summand becomes less than some preassigned value (e.g., ).

The average number of customers in the buffer is

The average number of busy servers is

The average number of customers in orbit is

The intensity of output of customers is

The probability that a customer arrives at the system when the buffer is full and leaves the system is

The probability that a customer arrives at the system when the buffer is full and goes into orbit is

The loss probability of a customer is

The probability of customers loss due to the decrease of the number of servers caused by change of the state of the is This formula is quite transparent. The right hand side of this formula represents the fraction. The denominator of this fraction is the average rate of customers arrival to the system. The numerator is the average rate of customers loss due to the decrease of the number of servers caused by the change of the state of the Such a loss occurs, with probability , every time when the state ,  , of the changes to the state ,  , while the number of customers in service area is greater than The number of the simultaneously lost customers is equal to if , equal to if , and equal to if Applying the formula of total probability, we get formula (43).

The probability of an arbitrary customer loss from orbit is

The probability that an arbitrary customer from orbit will make an attempt to receive service when the system is full and return to orbit is where

The probability that an arbitrary customer from orbit makes an attempt to receive service when the system area is full and leaves the system without service is

5. Numerical Results

Let us consider the following set of the system parameters. The number of the states of the is The generator of the is given by so the stationary probability of state 1 is and the stationary probability of state 2 is These probabilities are the components of the vector which is computed as the unique solution to the system ,

We assume that the arrival flow under state 1 of the is defined by the matrices and under state 2 of the it is defined by the matrices For both arrival processes, the coefficient of correlation is and the coefficient of variation is , The average arrival rate of customers under state 1 of the is , and the average arrival rate of customers under state 2 of the is

The rest of the system parameters under state 1 of the are as follows: Under state 2 of the , the parameters are as follows:

Because arrival rate is 2.5 times higher than while the service rate is only 1.25 times higher than , we may interpret state 1 as a normal mode of the system operation while we interpret state 2 as a congestion mode. To provide good quality of customers service, we should properly adjust the number of available servers to the mode of system. From economical considerations, let us assume that the average number of servers operating at an arbitrary moment of time should not exceed the predefined number The average number of servers operating at an arbitrary moment evidently is computed by formula Suppose that we have an opportunity to arbitrarily assign the numbers of available servers and that satisfy inequality aiming to provide good quality of customers service. It is intuitively obvious that the number of available servers under operation in the congestion mode should be not less than the number of available servers in the normal mode.

Let us vary the number of available servers under the different states of the , and , in such a way that and inequality (52) holds good.

Figures 26 illustrate the dependence of some performance measures of the system for values and varying in the described set when

It is seen, from Figure 2, that the average number of customers in orbit significantly changes (from about 0 to about 30) when and vary, and this number is maximal when both and are small and is minimal when about 9 servers are active in both modes.

Similar observations can be made about the average number of busy servers based on Figure 3. However, the value of is less sensitive with respect to the change of compared to

It is seen, from Figure 4, that the average number of busy servers also significantly changes (from about 0 to about 7) when and vary, and this number is minimal when both and are small and maximal when about 9 servers are active in both modes.

Because the capacity of service area (the number of servers plus the number of places in a buffer) under both the states of the is pretty large, 25, the probability that a customer arrives at the system when the buffer is full and leaves the system is not very high (about 0.08) even when the number of active servers is small; see Figure 5. This may seem surprising. But, this effect is easily explained by Figure 6. Due to the impatience of customers, the loss probability of customers is very high (close to 1) when the number of active servers is small. So, the system is never overcrowded. It is worth noting, based on Figure 6, that the loss probability of customers due to impatience essentially decreases with growth of and, especially, growth of

All the mentioned performance measures of the system are quite important. However, likely, the most important measure from the point of view of the system manager is the loss probability of an arbitrary customer because the loss of a customer implies the loss of a potential profit from his/her service. So, let us consider, namely, the loss probability as criterion of the quality of the system operation and find the optimal number of the servers under the different states of the , , and , under the condition that inequality (52) should be fulfilled.

Because, to build Figure 6, we have computed the probability for all feasible and , the solution of optimization problem is trivial. The optimal (minimal) value of the loss probability is when and The values of stationary probabilities under the optimal values and are presented in Table 1. Probabilities for are less than 0.001 and they are omitted here.

It is worth noting that if , that is, the number of active servers is constant and is not adjusted to the alternation of the normal and congestion mode, the value of the loss probability is about three times higher: So, the adjustment of the number of active servers to the current load of the system definitely makes sense.

To briefly illustrate the profound effect of correlation and variance of interarrival times in the arrival process, let us assume now that, instead of the flows with the coefficient of correlation 0.2 considered above, arrival flows are described by the stationary Poisson processes with the same mean arrival rates. These processes are defined by Let us repeat the experiment described above with these arrival processes having zero correlation and coefficient of variation equal to 1. We obtain that the minimal value of the loss probability () is achieved when and At the same time, under these values of and , is equal to =0.037619 for arrival flows with , So, the ignorance of the possible correlation in the arrival process gives relative error in the prediction of the value of equal to percent. This is not admissible in the performance evaluation and capacity planning of real life systems. Note that our experience of investigation of arrival flows in contact centers of several banks shows that these flows are indeed correlated.

6. Conclusion

We considered a multiserver queuing system, the dynamics of which depends on the state of the . Change of the state of the may cause variation of the parameters of arrival, service, retrial, impatience processes, number of available servers, and available buffer capacity, as well as the behavior of the customers. Evolution of the system is described by the multidimensional continuous-time Markov chain. The generator of this Markov chain is derived in a block matrix form. The ergodicity condition is presented. In particular, it is shown that if the customers are impatient or nonpersistent at least in one state of the , then the Markov chain is ergodic for any set of the system parameters. Expressions for the key performance measures are given. Numerical results illustrating the behavior of the system and showing the possibility of formulation and solution of optimization problems are provided. Positive effect of adjusting the number of active servers to the current load of the system is demonstrated. The importance of the account of correlation in the arrival processes is numerically illustrated.

Notation

:The service area capacity under the state of the
:The number of servers under the state of the RE
:The square matrices of size that characterize under the state of the
:The average arrival intensity of customers under the state of the
:The averaged arrival intensity
:The retrial intensity under the state of the
:The intensity of impatience of customers from orbit under the state of the
:The service intensity under the state of the
:The probability that a customer goes to the orbit in the case of its arrival when the buffer is full under the state of the
:The probability that a customer goes to orbit in the case of leaving the service area due to the decrease of its capacity under the state of the
:The probability that a customer returns to orbit after unsuccessful attempt under the state of the
:A column vector of appropriate size consisting of 1’s
A row vector of appropriate size consisting of zeroes
:The identity matrix of the corresponding dimension
:A zero matrix
and :The Kronecker sum and product of matrices, respectively (see, e.g., [20, 21])
The block diagonal matrix with the diagonal entries
:The square matrix of dimension defined by formula
:The square matrix of dimension with all zero entries except the entries , , that are equal to 1
:The square matrix of dimension with all zero entries except the entries ,  , that are equal to 1
:The square matrix of dimension with all zero entries except the entry .

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

The work is supported by Tomsk State University Competitiveness Improvement Program.