Abstract

We consider a multiserver queueing system with an infinite buffer and two types of customers. The flow of customers is described by two Markovian arrival processes (MAPs). Type 1 customers have absolute priority over type 2 customers. If the arriving type 1 customer encounters all servers busy, but some of them provide service to type 2 customers, service of one type 2 customer is terminated and type 1 customer occupies the released server. To avoid too frequent termination of service of type 2 customers, we suggest reservation of some number of servers for type 1 customers. Type 2 customers, who do not succeed to get a server upon arrival or are knocked out from a server, join the buffer or leave the system forever. During a waiting period in the buffer, type 2 customers can be impatient and may leave the system forever. The ergodicity condition of the system is derived in an analytically tractable form. The stationary distribution of the system states and the main performance measures are calculated. The Laplace-Stieltjes transform of the waiting time distribution of an arbitrary type 2 customer is derived. Numerical examples are presented. The problem of the optimal channel reservation is numerically solved.

1. Introduction

Queueing theory is the well recognized mathematical tool for solving the problems of design, capacity planning, performance evaluation, and optimization of many real life objects, especially in telecommunications, manufacturing, computer engineering, and so forth. The relevant literature is huge. The most part of the literature is devoted to queueing systems with homogeneous customers while it is a quite frequent real life situation when arriving customers have different importance to the system and various requirements to the quality of their service. So, an important part of queueing literature is devoted to so called priority queues. In such queues, customers of different types are arranged into several classes enumerated, for example, in descending order of their value (economical, social, etc.) for the system and customers from different classes have different treatment in the system. Customers from the classes having higher priority have preferences to others in access to the servers, if they are available or picking-up from the queue to service.

The existing literature concerning the priority queues is also huge. So, to essentially reduce the list of the relevant references, here we will cite only papers devoted to priority queues with a Markovian arrival process (MAP); see, for example, [1, 2]. Such an arrival process is the significant generalization of the stationary Poisson process and is very popular descriptor of the flows of customers in modern real life systems now. In contrast to the stationary Poisson process, which allows fitting of only the average value of interarrival times, the MAP allows fitting of also the variance and high order moments of the distribution of interarrival times and correlation of successive interarrival times. The single-server priority queues with the MAPs or a bit more general marked Markovian arrival process (MMAP) (see, e.g., [3]) were considered, for examples, in [48].

There are different kinds of priorities provided to important customers. The most well known are preemptive and nonpreemptive priorities. Nonpreemptive priority suggests that an arriving high priority customer cannot interrupt service currently provided to a low priority customer. Nonpreemptive priority plays a role only when a server finishes service and the next customers should start processing at this server. Preemptive priority suggests that an arriving high priority customer interrupts service currently provided to a low priority customer. This low priority customer leaves the server. It can leave the system without service permanently or try to get service later when a server will become available. Its service time may be started from the point of the interruption, from the last check point before the interruption, or from the early beginning with the same or different distribution of the service time.

The most popular variant of priority queues assumes existence of only two types of customers. Note that the important special cases of priority queues with a preemptive priority are queues with server breakdowns or interruptions. The breakdowns and interruptions can be interpreted as high level customers whose arrival causes termination of service of a usual (low level) customer. Short review of the recent papers related to this subject can be found in [9] where a multiserver queue with quite involved mechanisms of servers breakdowns and repair is analyzed with emphasis to evaluation of survivability of the system. In [1012], multiserver queues with nonpreemptive priorities are investigated. Model in [12] assumes self-generation of priorities. In [10], multiserver queue with nonpreemptive priority and impatient customers is investigated. Model in [11] assumes that a decision about admission of a nonpriority customer to the first station of a tandem queue is based on information about the current number of customers at the second station of a tandem.

In this paper, we consider a multiserver queue with a preemptive priority. The most close papers in literature are [13, 14]. It is supposed in [13] that there is a finite buffer for the nonpriority customers whose service was interrupted. In [14], it is assumed that the nonpriority customers whose service was interrupted go to orbit and retry for the service later on as the priority customers.

It is evident that the preemptive priority is much better comparing to the nonpreemptive priority from the point of view of high priority customers. However, the preemptive priority is much worse comparing to the nonpreemptive one from the point of view of low priority customers and from the point of view of the system resources utilization. Some work already done by a server is wasted when the service interruption occurs. As a trade-off, the discipline with the nonpreemptive priority can be used in combination with reservation of some servers exclusively for the service of priority customers. Queueing systems with nonpreemptive priority and reservation of some servers were considered, for example, in [1517]. In [15], such a system was used for analysis and optimization of the work of the cell of a mobile communication network. The models considered in [16, 17] are the dual tandem queues.

The obvious aim of the proposed combination of the reservation with the non-preemptive priority in [1517] was the desire to give more advantage to high priority customers comparing to an usual nonpreemptive priority discipline. In our present paper, we propose the combination of a reservation with a preemptive priority. Because the preemptive priority itself gives too much advantage to high priority customers, they do not need an additional reservation of the servers. So, motivation of the discipline considered in this paper is to improve quality of low priority customers by means of decreasing the frequency of service interruptions. This is important because interruptions may be quite offensive for low priority customers and also lead to the waste of the system resources.

Related model was analyzed in our recent paper [18]. In that paper, it is assumed that the arriving nonpriority customers are not registered by the system manager and the nonpriority customers who cannot enter the service immediately upon arrival or whose service was interrupted go to some virtual place called as an orbit and retry for the service later on. In this paper, we consider another type of arrival process and also assume that there exists registration of nonpriority customers and they are placed in an infinite buffer if they cannot enter the service immediately upon arrival or are interrupted. This assumption allows us to provide a detailed analysis of waiting time distribution for nonpriority customers while such an analysis is impossible for the system with retrials.

The results of analysis presented in our paper can be used for enhancement of operation of many real life systems. Let us briefly mention three of them.(i)Cognitive radio systems; see, for example, [1924]: in these systems, the high priority is assigned to the primary, licensed customers and the low priority is assigned to the secondary, unlicensed customers. The secondary customers may occupy the free servers (channels or subchannels), but the service of some secondary customers is terminated if a primary customer arrives and does not see a free server.(ii)Contact and call centers; see, for example, [2534]: in these systems, the high priority is assigned to the more important customers or voice requests and the low priority is assigned to the less important customers, e-mail requests, and work relating to providing a call-back option.(iii)Different technical, manufacturing, service systems where, to avoid possible starvation and increase a profit, the servers provide the service to some background or external customers in absence of the primary customers.

The rest of the paper is organized as follows. In Section 2, the mathematical model is described. Multidimensional Markov chain defining behavior of the system is introduced in Section 3. The infinitesimal generator of this Markov chain is written down there. The ergodicity condition and the stationary distribution of the system states are analyzed in Section 4. The expressions for the main system performance measures are given in Section 5. The Laplace-Stieltjes transform of the waiting time distribution of an arbitrary type 2 customer is derived in Section 6. Section 7 contains some numerical illustrations. Finally, Section 8 concludes the paper.

2. Mathematical Model

We consider an -server queueing model with an infinite buffer and two types of customers. Type 1 customers arrive according to the Markovian arrival flow MAP1. The MAP1 is defined by the underlying process , which is an irreducible continuous-time Markov chain with the state space . Arrivals occur only at the epochs of jumps in the underlying process . The intensities of transitions of the process , which are accompanied (not accompanied) by the arrival of a customer, are defined by the square matrix of size . The matrix is an infinitesimal generator of the process . The stationary distribution vector of this process satisfies the system of equations , . Here and throughout this paper, is a zero row vector, and e denotes a unit column vector.

Such arrival process was introduced as a versatile Markovian point process (VMPP) by M. F. Neuts in the 70s. The original development of VMPP contained extensive notations; however, these notations were simplified greatly in [2] and ever since this process bears the name Markovian arrival process. The class of MAPs includes many input flows considered previously, such as stationary Poisson (M), Erlangian (), Hyper-Markovian (HM), Phase-Type (PH), and Markov Modulated Poisson Process (MMPP). Generally speaking, the MAP is correlated, so it is ideal to model correlated and/or bursty traffic in modern telecommunication networks. For more information about the MAP, its properties, and special cases, see [1, 2]. Discussion of applicability of the MAP for description of real life information flows in modern telecommunication networks can be found in [35, 36]. Possibilities of fitting the real world arrival process by means of the MAP are discussed, for example, in [37].

The average intensity (fundamental rate) of the MAP1 is defined by . The coefficient of variation of intervals between customer arrivals is calculated as , and the coefficient of correlation of successive intervals between arrivals is given as .

Type 2 customers arrive to the system according to the MAP2. The MAP2 is defined by the underlying process , with the finite state space , and described by the square matrices and of size .

The average intensity of arrival of type 2 customers is given by ; there the vector is the stationary distribution vector of the process and is defined as the unique solution to the system , .

We assume that type 1 customers have absolute priority over type 2 customers. If there is a free server during a type 1 customer arrival epoch, this customer receives service immediately. If all servers are busy during a type 1 customer arrival epoch and there are type 2 customers receiving service, the service process of one type 2 customer is terminated and type 1 customer occupies the released server. The type 2 customer who was knocked out from the system goes to the buffer in the tail of the queue with probability and leaves the system with the complimentary probability . If all servers are occupied by type 1 customers during a type 1 customer arrival epoch, this customer leaves the system forever.

We assume that some parameter (threshold) is fixed, . Type 2 customers are admitted to the system if the number of busy servers is less than during type 2 customer arrival epoch. If the number of busy servers is greater than during an arbitrary type 2 customer arrival epoch, this customer goes to the buffer with probability and with the complimentary probability leaves (balks) the system.

Additionally, we assume that type 2 customers can be impatient and leave the buffer after an exponentially distributed time described by parameter , , due to lack of service. If we assume that type 2 customers are patient, we set .

The service time of type customers has an exponential distribution with the parameter , .

3. Process of System States

Let , be the number of type 2 customers in the buffer, let , be the number of busy servers, let , be the number of type 2 customers in service, let , be the state of the directing process of the MAP1, and let , be the state of the directing process of the MAP2 at the epoch . Here the notation like means that the variable takes integer values from the set .

The behavior of the system under study can be described in terms of the regular irreducible continuous-time Markov chain .

Let us introduce the following notation:(i) is the identity matrix, and is a zero matrix of appropriate dimension. If the dimension of a matrix or a vector is not clear from context, it is indicated as the suffix;(ii) and indicate the Kronecker sum and product of matrices, respectively; see, for example, [38];(iii) is the diagonal matrix with the diagonal entries or blocks ;(iv), , ;(v), ;(vi), are the matrices of size with all zero entries except the entries , and , which are equal to 1;(vii), are the matrices of size with all zero entries except the entries , , and , , which are equal to 1;(viii) is the square matrix of size with all zero entries except the entries , which are equal to 1;(ix), are the square matrices of size with all zero entries except the entry which is equal to 1.

Let us enumerate the states of the Markov chain in the direct lexicographic order of the components and refer to the pair as a macrostate.

Let be the generator of the Markov chain , consisting of the blocks , which, in turn, consist of the matrices of the transition rates of the Markov chain from the macrostate to the macrostate . The diagonal entries of the matrices are negative, and the moduli of the diagonal entries of these matrices define the total intensities of leaving the corresponding state of the Markov chain .

Lemma 1. The infinitesimal generator of the Markov chain , has the block-three-diagonal structure The nonzero blocks , have the following form: where

Proof of the lemma is implemented by the analysis of the Markov chain , transitions during the interval of infinitesimal length, and further combining the corresponding transition intensities in block matrix form.

4. Ergodicity Condition and Computation of the Stationary Probabilities

Further, we will separately consider the following two cases.

(1) Let us assume that ; that is, customers in the buffer are impatient. In this case, it is possible to verify that the following limits exist: where is a diagonal matrix with diagonal entries defined as the moduli of the corresponding diagonal entries of the matrix , .

The matrix is the block-diagonal matrix with the diagonal blocks , , , defined as follows: where , , , and are diagonal matrices with diagonal entries defined by the diagonal entries of the matrices , , and , respectively.

The matrices , , and have the following form: so, their sum is the stochastic matrix.

According to the definition given in [39], the Markov chain , , belongs to the class of so called continuous-time asymptotically quasi-Toeplitz Markov chains (AQTMC).

As follows from [39], the sufficient condition for the ergodicity of the AQTMC is the following condition: where the row-vector is the unique solution to the following system of linear algebraic equations:

Taking into account explicit values (6) of the matrices , , and , inequality (7) can be rewritten as , and system (8) is rewritten in the form , . Hence, ; so, inequality (7) holds true for each set of the system parameters. So, the Markov chain under consideration is ergodic and the queueing system under study is stable for any set of the system parameters.

(2) Let us assume now that . In this case, the blocks of the generator have the following form:

Thus, the blocks of the generator do not depend on the variable when and the Markov chain , belongs to the class of continuous-time quasi-Toeplitz Markov chains (QTMC) or -type Markov chains; see [40].

As follows from [40], the necessary and sufficient condition for the ergodicity of the QTMC is the fulfillment of the following inequality: where the vector is the unique solution to the system

It is easy to verify that system (11) can be rewritten in the form

By postmultiplying the left and right hand side of (12) by we obtain that

By postmultiplying the left and right hand side of (12) by we obtain that

It follows from (13) and (14) that the vector can be represented in the form where is a stochastic vector of size , is the invariant probability vector of the underlying Markov chain of the MAP1, and is the invariant probability vector of the underlying Markov chain of the MAP2.

Substituting the vector form (15) into (12), postmultiplying this equation by , and taking into account that ,, , , and , we obtain that the vector is the unique solution to the system where

It easy to verify that the matrix is the generator of two-dimensional Markov chain , defining the number of busy servers , and the number of servers occupied by type 2 customers , at the moment in the case when the system is overloaded. It is evident that when the system is overloaded the number of busy servers can vary in the interval . It follows from (16) that the vector , , , defines the joint stationary distribution of the number of busy servers and the number of servers occupied by type 2 customers when the system is overloaded.

Taking this into account, substituting the vector form (15) into inequality (10) and performing some algebra, we obtain the following inequality: where is the vector of size with all unit entries except the entry which is equal to zero.

Thus, we have proved the following assertion.

Theorem 2. If , the Markov chain , is ergodic for any set of the system parameters. If , the Markov chain , is ergodic, if and only if inequality (18) holds true where the vector is defined as the solution to system (16).

Remark 3. Condition (18) is intuitively clear and can be interpreted as follows. If the system is overloaded and type 2 customers are patient (), then type 2 customer can leave the buffer only if the number of busy servers becomes less than . The components , of the vector define the probability that during an arbitrary epoch the number of busy servers is and there are type 2 customers who receive service. So, the left hand side of (18) defines the intensity of the service completions when servers are busy (the intensity of customers leaving the buffer). The right hand side of (18) defines the total intensity of type 2 customers arrival into the buffer. When the system is overloaded, all admitted type 2 customers (the intensity of this flow is ) and type 2 customers whose service was terminated by arrival of type 1 customers (the intensity of this flow is ) arrive into the buffer. Thus, ergodicity condition (18) requires that, in the situation when the system is overloaded, the intensity of type 2 customers arriving to the buffer is less than the intensity of type 2 customers leaving the buffer.

Further, we assume that Markov chain , is ergodic. Then the following limits (stationary probabilities) exist: Here indicates the Kronecker delta.

Let us form the row vectors of these probabilities as follows:

It is well known that the probability vectors , satisfy the following system of linear algebraic equations:

System (21) is infinite, so it can not be solved on a computer by standard methods. To compute the probability vectors , in both the cases, and , the numerically stable algorithms presented in [39] can be used. The idea of these algorithms is as follows. Instead of solution of the system (21), another infinite system of linear algebraic equations for the probability vectors , is derived by means of sequential constructions of so called censored Markov chains (see, e.g., [41]) for the initial Markov chain , with different levels of censoring. This leads to numerically stable procedure for computation of the vectors .

5. Performance Measures

Having computed the vectors of the stationary probabilities , it is possible to compute a variety of the system performance measures.

The average number of customers in the system is

The average number of customers in the buffer is

The average number of busy servers is

The average number of busy servers providing service to type 1 customers is

The average number of busy servers providing service to type 2 customers is

The intensity of output flow of type 1 customers is

The intensity of output flow of type 2 customers is

The intensity of output flow of customers is

The loss probability of type 1 customers is

The loss probability of type 2 customers is

The loss probability of an arbitrary customer is

The probability of type 2 customer loss at the entrance to the system is

The probability that an arbitrary type 2 customer will be forced to terminate its service and go to the buffer is

The probability that an arbitrary type 2 customer will be forced to terminate its service and leave the system is

The probability that an arbitrary type 2 customer will leave the buffer due to impatience is

6. Distribution of the Waiting Time of an Arbitrary Type 2 Customer

Type 1 customers having preemptive priority do not wait for service. So, we analyze only the distribution of waiting time of an arbitrary type 2 customer. Speaking about the arbitrary type 2 customer, we do not distinguish type 2 customers arriving to the system from outside and customers arriving to the buffer due to the service force termination.

Let be the distribution function of the waiting time of an arbitrary type 2 customer in the system and let , be its Laplace-Stieltjes transform (LST).

To derive an expression for the LST , we use the method of collective marks; see, for example, [42, 43]. Let us tag an arbitrary type 2 customer and keep track of its staying in the system. According to the idea of the method of collective marks, has the meaning of the probability that no catastrophe from some virtual stationary Poisson flow of catastrophes with the intensity arrives during the waiting time of the tagged type 2 customer.

Let be the probability that no catastrophe arrive during the rest of the tagged customer waiting time conditioned on the fact that, at the given moment, the position of the tagged customer in the buffer is equal to , the number of busy servers is , the number of busy servers providing service to type 2 customers is , and the state of the process , is .

Let us enumerate the probabilities in the lexicographic order of the components , and and form the following column vectors:

Let us also introduce the following notation:where

Lemma 4. The vectors ,, can be recursively calculated as follows:

Proof. Based on a probabilistic sense of the LST, we obtain the following system of equations for calculation of the vectors : System (42) can be rewritten in the following matrix form: which can be further transformed to form (41). Lemma is proved.

Theorem 5. The LST of the distribution of the waiting time of an arbitrary type 2 customer in the system is calculated as follows: where

Proof. The proof follows from the law of total probability and a probabilistic sense of the LSTs.
The following situations are possible during the arrival epoch of the tagged type 2 customer.(i)The number of busy servers is less than and the tagged customer immediately receives service. The probability of this event is . In this case, the probability of no catastrophe arrival during the waiting time is equal to one.(ii)The number of busy servers is greater than and the customer decides to balk the system. The probability of this event is . In this case, the probability of no catastrophe arrival during the waiting time of the tagged type 2 customer is also equal to one.(iii)The number of busy servers is greater than and the customer decides to join the buffer. The probability of this event is . In this case, the probability of no catastrophe arrival during the waiting time of the tagged type 2 customer under the fixed values of the components , and is equal to .(iv)The tagged customer arrives to the buffer after the termination of its service by arriving type 1 customer. The probability of this event is . In this case, the probability of no one catastrophe arrival during the waiting time of the tagged type 2 customer under the fixed values of the components is equal to .
Using the law of total probability, now it is possible to easily verify the validity of the statement of the theorem.

Corollary 6. The average waiting time of an arbitrary type 2 customer can be calculated as follows:
Here, the column vectors are defined as blocks of the vectors , which can be calculated as

7. Numerical Examples and Optimization Problem

The purposes of this section are to demonstrate the feasibility of the proposed algorithms for computation of the key performance measures of the system, to give an example of numerical solution of optimization problem and to bring out some qualitative aspects of the considered queue.

Let the arrival process of type 1 customers be defined by the matrices The fundamental rate of this arrival process is , the coefficient of correlation of successive interarrival times is , the coefficient of variation of interarrival times is .

Basically, the arrival process of type 2 customers is defined by the matrices The fundamental rate of this arrival process is , the coefficient of correlation of successive interarrival times is , and the coefficient of variation of interarrival times is .

The rest of the system parameters are given by

To illustrate behavior of the key performance measures of the system, we will vary the threshold of admission strategy (recall that a type 2 customer is admitted to service only if the number of busy servers at its arrival instant is less than ) in the range and the intensity of the arrival process of type 2 customers in the range . Variation of the intensity is easily implemented by means of multiplying the matrices and by the corresponding factor.

Three-dimensional Figure 1 illustrates the dependence of the average number of customers in the system and the average number of customers in the buffer on and .

It is quite natural that both and increase when the intensity grows: the grow of the arrival rate under the fixed services rates causes presence of more customers in the system and in the buffer. But the increase of oppositely affects the values and . When grows, the value of increases because the growth of decreases the probability of type 2 customers balking (increases the rate of the flow of admitted customers). The value of decreases when grows because the growth of increases the percentage of type 2 customers that succeed to get access to service immediately upon arrival, without visiting the buffer. However, when becomes too close to , many type 2 customers are compelled to terminate service due to type 1 customer arrival and the value of stops decreasing.

Figure 2 illustrates the dependence of the probability of the type 2 customer loss at the entrance to the system and the probability that an arbitrary type 2 customer leaves the buffer due to impatience on and .

It can be seen that the value of sharply grows when decreases and increases. It is obvious because the decrease of and the growth of imply congestion in the system for type 2 customers. When is small or is large, type 2 customers practically have the chance to start service immediately upon arrival. In this situation, they balk with probability what explains the presence of almost plate part on the surface in Figure 2 at level 0.2. Behavior of correlates with behavior of in Figure 1 what is easily understandable because the rate of customers departure from the system due to impatience positively correlates with the number of customers in the buffer.

Figure 3 illustrates the dependence of the probability that an arbitrary type 2 customer will be forced to terminate service and will leave the system and the probability that an arbitrary type 2 customer is lost on and .

The plot illustrating behavior of the probability well illustrates the reasonability of reservation of some part of servers exclusively for service of priority customers. When the parameter approaches the value , that is, the number of reserved servers decreases, the probability drastically increases, especially when the arrival rate is not very small. The plot illustrating behavior of the probability is quite clear taking in mind that this probability is the sum of the probabilities , , and illustrated in the previous figures.

Figure 4 illustrates the dependence of the intensity of output flow of type 2 customers and the average waiting time of an arbitrary type 2 customer on and .

Behavior of is quite clear taking into account the illustrated above behavior of and formula . Behavior of is similar to behavior of in Figure 1. However, does not grow so quickly for small and large as the value grows. This is explained by the fact that it is accounted for in calculation of the average waiting time of an arbitrary type 2 customer that, by definition, some such customers do not visit the buffer at all (they are lost at the entrance to the system or are lucky to receive service without visiting the buffer) and, so, have zero waiting time.

Let us consider optimization problem. First of all, we have to formulate a cost criterion in terms of which the quality of system operation will be evaluated. Because type 1 customers have preemptive priority, the quality of their service is completely defined by the value of which does not depend on the arrival and service processes of type 2 customers. is constant for any values of the control parameter . So, cost criterion should include only characteristics of service of type 2 customers. Because the goal of the operation of almost any queueing system is to provide maximal profit to the system owner and this profit is proportional to the number of customers, who got service in the system, it is reasonable to include the output rate as the main component of the cost criterion. But if the cost criterion will include only the output rate , solution of optimization problem seems be trivial ; that is, no server reservation should be assumed. Any arriving customer should be admitted to the system if ergodicity condition for the system is fulfilled. But, as we see from the presented numerical results, the absence of servers reservation implies high values of the probabilities of losses of type 2 customers, in particular, the probability of force termination of service provided to type 2 customers. Such a termination can be quite offensive and frustrating for type 2 customers. Also the absence of servers reservation causes long average waiting time, especially waiting time of customers who really wait for beginning of service.

As the result of these considerations, we conclude that the reasonable form of the cost criterion (profit obtained during a unit of time) is the following one: where the performance indices , , , , and were introduced above, is the average profit earned by service of one type 2 customer, and , are the corresponding charged for customer loss and waiting. Our aim is to find the optimal value of the parameter that provides the maximal value of the cost criterion . It should be noted that maximization of has to be done with respect to for a fixed value of . We included to notation just to stress that the optimal value may be different for various values of .

Let us fix the same values of parameters of the system as above and the values of cost coefficients are fixed as follows:

Figure 5 illustrates the dependence of the cost criterion on and .

Presented table gives a bit more information about the behavior of the cost criterion on . For various , the table gives the optimal value of the parameter , optimal value of the cost criterion, value of the cost criterion for the system without servers reservation, the difference (the profit provided by means of the optimal reservation), and the ratio of the relative profit provided by means of the optimal reservation comparing to a strategy of customers access without servers reservation.

It is seen from Figure 5 and Table 1 that the profit and the relative profit provided by means of the optimal reservation comparing to a strategy of customers access without servers reservation can be significant.

8. Conclusion

We considered a multiserver queuing system with an infinite buffer and two types of customers one of which has preemptive priority over another. To reduce the frequency of forced termination of service of type 2 customers, we offered the threshold strategy of access of type 2 customers. Access of such customers is denied if the number of busy servers at the customer arrival moment is not less than the preassigned threshold value. For the fixed value of the threshold, we described behavior of the system by the multidimensional Markov chain. For this chain, the ergodicity condition is derived; the stationary distribution of the system states and the main performance measures are calculated. The Laplace-Stieltjes transform of the waiting time distribution of an arbitrary type 2 customer is derived. Numerical examples confirming the advantage of the proposed access strategy under properly chosen value of the threshold are presented.

Presented analysis may be extended to the cases of the MMAP (marked Markovian arrival process) of customers arrival (what allows to take into account possible correlation within arrivals of priority and nonpriority customers) and the BMAP (batch Markovian arrival process) of nonpriority customers, possibility to have an additional finite buffer for priority customers, and so forth.

In this paper we assume that the service time distribution for both types of customers is exponential. This is rather restrictive assumption. It is worth to note that in many practical situations available information about the service time is only the mean service time. In this case, there is no chance to estimate parameters of the service time distribution and assumption about exponential service time distribution is quite natural because it greatly simplifies the mathematical analysis. If the detailed statistics about the service time is available, more general distribution of service time can be suggested. The presented analysis could be more or less easily extended to the case of much more general phase type distribution (see [40]), which can be used to approximate many other distributions. However, in this case we meet the following difficulty. If service time distribution is exponential, it does not matter (from the point of view of the distribution of the number of customers in the system) in which busy server the service of nonpriority customer is interrupted. In case of phase type distribution, in situation when the service in some server should be terminated it is necessary to fix the rule of a choice of the concrete busy server in which service will be terminated; for example, the phase of the service at this server should belong to some fixed group of phases. The question of justification of a fixed rule is quite complicated. If this question will be answered, the problem of analysis of the corresponding queueing model can be solved by analogy with presented above analysis, but the dimension of the blocks of generator of the Markov chain, which describes behavior of the system, catastrophically increases with increase of the number of phases and (or) servers due to necessity of taking into account the current phase of service in each busy server or the number of servers providing currently the service at each phase. Solution of this problem may be simplified by means of a proper use of methodology by D. Lucantoni and V. Ramaswami by analogy with, for example, [44] or more involved methodology from [45].

In this paper we assumed that the service time distribution for both types of customers is exponential. This is rather restrictive assumption. But it is worth to note that in many practical situations available information about the service time is only the mean service time. In this case, there is no chance to estimate the parameters of the service time distribution and the assumption about an exponential service time distribution is quite natural because it greatly simplifies the mathematical analysis. If the detailed statistics about the service time is available, more general distributions of service time can be suggested. The presented analysis could be more or less easily extended to the case of much more general phase type distribution (see [40]) which can be used to approximate many other distributions. However, in this case we meet the following difficulty. If the service time distribution is exponential, it does not matter in which busy server service of nonpriority customer is interrupted. In the case of phase type distribution, in the situation when service in some server should be terminated it is necessary to fix the rule of a choice of the concrete busy server in which service will be terminated; for example, the phase of service at this server should belong to some fixed group of phases. The question of justification of the fixed rule is quite complicated. If this question will be answered, the problem of analysis of the corresponding queueing model can be solved by analogy with presented above analysis, but the dimension of the blocks of generator of the Markov chain, which describes the behavior of the system, catastrophically increases with increase of the number of phases and (or) servers due to necessity of taking into account the current phase of service in each busy server or the number of servers providing currently service at each phase. Solution of this problem may be simplified by means of a proper use of methodology by D. Lucantoni and V. Ramaswami by analogy with, for example, [44] or more involved methodology from [45].

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

This work was supported by MEST 2012-002512, NRF, Korea.