Abstract

In this paper, we analyze an M/G/1 priority queueing model with finite and infinite buffers under the -preemptive priority discipline, under which preemption decisions are made based on the number of high-priority customers. This priority queueing model can be used for the performance analysis of communication systems accommodating delay- and loss-sensitive packets simultaneously. To analyze the proposed model, we extend the method of delay cycle analysis and develop a queue length version of it for finite-buffer queues. Throughout our analysis, we demonstrate that by the proposed method the analysis of the complex priority queueing model can be reduced to that of simple delay cycles, so two different preemption modes of the queueing model can be dealt with in a unified way. The numerical study reveals that adjusting the decision variables and allows us to fine-tune system performance for different classes of customers, and operates as a primary control variable, regardless of the preemption mode and service-time distributions.

1. Introduction

In this paper, we consider an M/G/1 priority queueing model with a finite buffer for high-priority customers and an infinite buffer for low-priority customers. We call this an M/G/1/ priority queue. The priority queue operates under the -preemptive priority discipline. This is a flexible priority discipline in which the decision whether to preempt a low-priority customer in service is made based on the number of high-priority customers in the system.

In general, priority queueing models are extensively used for the analysis of communication and computer systems [16]. Priority queueing models with finite and infinite buffers have been studied mainly for the analyses of communication systems used to service heterogeneous streams of traffic simultaneously [712]. In such a system, there is loss-sensitive traffic as well as delay-sensitive traffic arriving at the system. To accommodate heterogeneous service needs of these two types of traffic, the loss-sensitive traffic is offered space priority with an effectively infinite buffer, while the delay-sensitive traffic is given time priority with the privilege of getting serviced earlier than the loss-sensitive traffic. However, all the studies mentioned above are restricted to priority queueing models with finite and infinite buffers under the nonpreemptive priority discipline. Although the nonpreemptive priority discipline is more straightforward to implement than the preemptive priority discipline, there is a major drawback that the service quality for high-priority customers may degrade severely when the service time of a low-priority customer is long or varies significantly. For this reason, there have been recent studies on preemption-based scheduling methods for communication systems that accommodate both high-volume multimedia traffic and delay-sensitive traffic [26]. In this line of thought, [13] has recently analyzed an M/G/1/ priority queueing model under the preemptive-resume priority discipline as well as under the nonpreemptive one. To do this, [13] developed a delay cycle analysis of single-class finite-buffer M/G/1 queues and combined it with the traditional delay cycle analysis of infinite-buffer M/G/1 priority queues. [14] also has studied M/G/1 priority queues under the preemptive-repeat-different priority discipline as well as under the preemptive-repeat-identical discipline.

In this study, we further extend the method proposed in [13] to a more sophisticated M/G/1/ priority queueing model under the -preemptive priority discipline. The -preemptive priority discipline was proposed in [15] to deal with the drawbacks of the classical nonpreemptive and preemptive priority disciplines. As mentioned in [15], both nonpreemptive and preemptive disciplines are extreme cases with respect to preemption conditions. During servicing of a low-priority customer, the former never allows high-priority customers to preempt the service, regardless of how many high-priority customers are waiting for their services in the system, while the latter always does preempt the service on the arrival of high-priority customers. To refine both extreme conditions for preemption, several preemption-based dynamic priority disciplines have been proposed [1519], and the -preemptive priority discipline is one of such priority disciplines. Under the -preemptive priority discipline, high-priority customers can preempt a low-priority customer in service when the number of high-priority customers in the system reaches , . Then, the preempted service of the low-priority customer is restored when the number of high-priority customers drops to , . [15] also showed that, in the M/G/1 priority queue with infinite buffers for both high- and low-priority customers under the -preemptive priority discipline, the performance measures of both classes of customers can be balanced and fine-tuned by adjusting the thresholds and . While [15] dealt with infinite-buffer M/G/1 queueing models under the -preemptive resume and -preemptive repeat-identical priority disciplines, [20] studied an infinite-buffer M/G/1 queueing model under the -preemptive repeat-different priority disciplines. There are also studies on discrete-time queueing models with geometric service times under the -preemptive priority discipline [21, 22]. However, all these studies are limited to priority queueing models with infinite buffers for both high- and low-priority customers. To the best of the author’s knowledge, there are no studies on priority queueing models with finite and infinite buffers under any preemption-based dynamic priority discipline, including the -preemptive priority discipline, rather than the classical nonpreemptive and preemptive disciplines. Only work on priority queueing models with finite and infinite buffers under a sophisticated priority rule other than classical priority disciplines is [11, 12], where an arriving customer can determine whether to join a finite high-priority queue or an infinite low-priority queue. However, these queueing models employ a nonpreemptive dynamic priority discipline without preemption, so they are different from our study because we consider a preemption-based dynamic priority discipline here. As mentioned above, since the classical nonpreemptive and preemptive priority disciplines are rigid and static, a more sophisticated preemptive-based dynamic priority discipline needs to be applied to service systems with finite and infinite buffers to balance service performance between the classes.

In this study, the delay cycle analysis of finite-buffer M/G/1 queues proposed in [13], where the waiting time distribution of the delay cycle is derived, is extended to develop a queue length version of the delay cycle analysis of finite-buffer M/G/1 queues. The reason for this is that the derivation of queue length distributions is more straightforward under the -preemptive priority discipline. This is due to the nature of the discipline depending on the queue length of high-priority customers. Using the proposed queue length version of the delay cycle analysis of finite-buffer M/G/1/ queues, we then analyze the structure of the effective service time of a low-priority customer under two different preemption modes of the -preemptive disciplines: the preemptive resume (PR) and preemptive repeat-identical (PRI) modes (see Section 2). Then, combining the developed finite-buffer delay cycle analysis with the traditional delay cycle analysis of infinite-buffer priority queues, we derive the customer loss probability and queue length distribution of high-priority customers and the queue length distribution of low-priority customers for M/G/1/ priority queues under the -PR and the -PRI disciplines.

Furthermore, we demonstrate by numerical examples that we can balance and fine-tune system performance measures between high-priority and low-priority customers by adjusting the decision variables of and . The impact of the thresholds and on the system performance is different between the preemption modes and between the service-time distributions of a low-priority customer (see Section 7). However, regardless of the preemption mode, the threshold can be used as a primary control variable to tune the system performances between the classes of customers, while the threshold as a secondary control variable to fine-tune the balance between the classes.

Compared to the previous studies [1315, 20], there are a couple of distinguishable contributions in this article. Unlike [15, 20], where infinite-buffer priority queues are considered, this study deals with a priority queue with finite and infinite buffers. Thus, the service-time structure of a low-priority customer is different from that in [15, 20], and the analysis of the service-time structure is more involved (see the first and second paragraphs in Section 4). With this new analysis, we can analyze the effect of the finite buffer size on the system performance under the -preemptive priority discipline. On the other hand, unlike [13, 14], where the waiting-time version of the delay cycle analysis is used, this study develops a queue length version of the delay cycle analysis of finite-buffer M/G/1 queues, because it is more suitable for the analysis of finite-buffer priority queues with sophisticate preemption decisions made based on queue lengths (see Section 4).

In summary, the novelty of this study is twofold. First, to the best of the author’s knowledge, this is the first study dealing with a queueing model with finite and infinite buffers under a preemption-based dynamic priority discipline other than the classical priority disciplines. Second, the method of delay cycle analysis of finite-buffer M/G/1 queue is extended to a queue length version of it. The proposed method is helpful for the analysis of complex priority queueing models with finite and infinite buffers because it can be so easily combined with the traditional delay cycle analysis of infinite-buffer queues that the analysis of a complex priority queueing model with finite and infinite buffers can be reduced to the analysis of several simple finite- and infinite-buffer delay cycles. As a result, we can analyze similar priority queues in a unified manner by exploiting a common delay cycle structure. We will demonstrate this point throughout our analysis.

2. Model and Notation

In this paper, we consider the following M/G/1 priority queueing model with finite and infinite buffers. There are two classes of customers, namely, class 1 and class 2, arriving at the system. We assume that class- customers arrive at the system according to a Poisson process with rate , . The service times of class- customers follow an i.i.d. general distribution , . The Laplace-Stieltjes transform (LST) of is denoted by , . While there is an infinite buffer for class-2 customers, there is a finite buffer of size , , for class-1 customers. Hence, when a class- customer is in service, there can be at most class-1 customers in the system (one in service and at most in the buffer). However, when a class- customer is in service, there can be at most class-1 customers in the system (only in the buffer).

Class-1 customers are given priority over class-2 customers according to the following -preemptive priority rule. Once the service of a class-2 customer has been started, preempting the service is not allowed if the number of class-1 customers in the system is less than a certain threshold , . As soon as the number of class-1 customers reaches , the class-2 customer service is immediately preempted, and a service period for class-1 customers begins. The server then works continuously for class- customers until the number of class-1 customers drops to another threshold , . As soon as the number of class-1 customers reaches , the preempted service of the class-2 customer is restored in one of the following two modes: the preemptive resume (PR) and preemptive repeat-identical (PRI) modes. Under the PR mode, the preempted service is resumed when it is restored, whereas under the PRI mode, it is completely restarted with the original service time of the class-2 customer. Notice that, when , no preemption occurs so that the -preemptive priority discipline reduces to the nonpreemptive priority discipline, and that when and , preemption occurs on every arrival of class-1 customers so that the -preemptive priority discipline reduces to the classical preemptive priority discipline. To exclude trivial cases, we assume throughout the paper. Since we only consider the loss probability and the queue length distributions, we assume that the service order between customers of the same class is one of impartial service orders such as the FCSF (first come first serve), LCFS (last come first serve), or ROS (random order of service) orders. Under any impartial service order between customers of the same class, the customer loss probability and the queue length distributions are unaffected by the service order between customers of the same class.

In this paper, we derive the queue length distributions of both classes and the customer loss probability of class 1. Let and , , denote the PGF (probability generating function) and mean of the number of class- customers in the system in a steady state. Let also denote the loss probability of class-1 customers in a steady state due to no vacancy in the buffer.

We also introduce a standard notation with respect to a random variable which denotes the length of a certain time period. can be one of the random variables in Table 1; the definitions of which will be given when they are first used in the subsequent sections. Throughout the paper, , , and denote the numbers of class-1 customers who arrive, who arrive and enter the system, and who arrive but are lost due to no vacancy in the buffer during the time period , respectively. Thus, . Let denote the LST of , and denote the following joint transform:

Hence, from the definition. We also define the partial transform as

Hence, and from the definition. In addition, we define and to be the PGF and mean of the number of class-1 customers during the period, to be the loss probability of a class-1 customer during the period, and to be the probability that a class-1 customer arrives during an period in a steady state.

3. A Queue Length Version of the Delay Cycle Analysis of Finite-Buffer M/G/1 Queues

Let denote the length of the standard busy period, starting with only one customer and terminating when the system becomes empty of customers, in the M/G/1/ queue where the buffer size is , the service time is , and the arrival rate is . Then, the joint transform of , , and can be obtained by the recursive equation (4) in [13], and the customer loss probability during the period is also given by the equation (5) in [13]. Hence, we assume that the moments of , , and , and the customer loss probability are given throughout this paper.

In order to derive the queue length distribution, we now extend the delay cycle analysis of single-class finite-buffer M/G/1 queues proposed in [13], where the waiting time distribution is derived rather than the queue length distribution. The reason why the queue length distribution is considered here is that the derivation of queue length distributions is more straightforward under the -preemptive priority discipline because the queue length plays a main role in the decision of whether to preempt.

Let denote the PGF of the number of customers during the period in a steady state. Also, let denote the busy period starting with customers and terminating when the queue size reaches , , in the M/G/1/ queue with and as its arrival rate and service time. Hence, is identical to . Observe that the period can be viewed as a delay cycle with the first service time as the initial delay and the remaining part of as the delayed busy period of the delay cycle (see Section 3 in [13]). Consider also the number , , of customers who arrive at the system during the initial delay . If , the delayed busy period following the initial delay starts with customers and lasts until the system is empty of customers. Thus, the delay cycle period is decomposed into the initial delay of and the delayed busy period of when . If , the delayed busy period following the initial delay starts with customers, due to the finite buffer, and lasts until the system is empty of customers. Thus, the delay cycle period is decomposed into the initial delay of and the delayed busy period of when . Therefore, if we let denote the PGF of the number of customers during the initial delay , and denote the PGF of the number of customers during the period, we have [13]

We now derive the term . Let denote the elapsed service time of and and the number of customers who arrive during . Then, from PASTA and renewal theory, we have

Observe that there are plus one customers in the system during the the initial delay in a steady state. More specifically, customers who arrive and enter the system during are in the buffer, and one customer who initiates the period is in service. Thus, is given by

We now argue that the period can be decomposed into the , , …, periods, . Without loss of generality, we can assume the LCFS service order when considering the length of a busy period and the queue length distribution during that busy period. From the definition of , there are customers in the system at the beginning of the period. Consider the th customer among those initial customers in the order of their arrivals (we assume that ties are broken arbitrarily). Under the LCFS order, the th customer, , can begin his/her service when there are only customers in the system. More specifically, in the order of their arrival, the 1st, 2nd, , th, th customers who were initially present in the system. Thus, the period can be decomposed into consecutive periods initiated by the th customers, , in the reverse order of their arrivals, that starts from the beginning of the th customer’s service and lasts until the number of customers in the system drops to for the first time. Note that these periods initiated by the th customer, , are stochastically identical to the standard busy periods , except for more customers in the buffer, because the buffer of size is already taken up by customers who arrive earlier than the th customer so that the effective buffer size is . Thus, the length of the period is expressed as and the LST of is given by [13] for . Note that, when , from the convention of the product. Also, from the same decomposition argument used for deriving (8), the PGF of the number of customers during the is given by for .

Plugging the expression of into (4) and rearranging the terms leads to

Note that is different from the PGF of the number of customers in the M/G/1/ queue, where we have to consider the number of customers during the idle period as well as that during the busy period . Thus, if we let denote the PGF of the number of customers in the M/G/1/k +1 queue, then we have

Furthermore, when and 1, we can obtain the PGF of the number of customers for the M/G/1/1 and M/G/1/2 queues as follows: which correspond the established results of the M/G/1/ queues [23], pp. 206–208.

Let denote the mean number of customers during the period. Then, from (6), (10), and (11), we have where

Notice that is different from the mean number of customers in the M/G/1/ queue with and as the arrival rate and service time because we need to consider the mean number of customers during the idle period. Thus, is given by

The PGF and mean of the number of customers in M/G/1/ queues can be derived with other established methods such as the embedded Markov chain technique and the supplementary variable technique. However, there are two reasons for developing the queue length version of the delay cycle analysis of M/G/1/ queues. One is that the proposed delay cycle analysis of finite-buffer M/G/1/ queues can be combined with the well-established delay cycle analysis of infinite-buffer M/G/1 priority queues so readily that the complicated analysis of M/G/1/ queues under a sophisticated priority discipline can rely on the well-established terminology and approaches (see Section 5). The other is that, as seen in (11) and (15), the PGF and mean of the number of customers are expressed in recursive equations only with their own terms. As a result, as the buffer size and the thresholds and increase or decrease by one, changes in the system performance can be recursively calculated. This makes it straightforward to perform a numerical study of M/G/1/ priority queues because there is no need to repetitively calculate the probabilities of the number of customers in the system for all values of , , and (see Section 7).

4. Service-Time Structure

In the -preemptive priority queue, three different random variables can be viewed as the service time of a class- customer: the gross service time , the completion time , and the occupation time . The gross service time is defined to be a total of service effort on the class- customer from the server to complete the class- customer service. If we assume the -preemptive resume discipline, the gross service time is identical to the original service time . However, if we assume the -preemptive repeat discipline, the gross service time may be different from the original service time because the service is completely repeated after every preemption. The completion time is defined to be a time from the beginning of the first service attempt of the class-2 customer until the class-2 customer service is completed. Note that is composed of and zero or more class-1 busy periods, which will be called service interruption periods, that begins when the number of class-1 customers reaches and the class-2 customer service get preempted, and ends when the number of class-1 customers drops to and the class-2 customer service get restored. The occupation time is defined as a time from the beginning of the first service attempt of the class-2 customer until the server is available for the next class-2 customer (if any). is composed of and zero or one class-1 busy period which begins if there are class-1 customers at the end of and ends when the system becomes empty of class-1 customers. Under the classical preemptive priority discipline, is identical to because it is impossible to service class-2 customers when there are any class-1 customers in the system. However, under the -preemptive priority discipline, it is possible to service a class-2 customer even when there are class-1 customers in the system, as long as the number of class-1 customers in the system is less than . See Figure 1 in [15] for the detailed structure of the gross, complete and occupation times under the -preemptive queue with infinite buffers for both classes, the overall structure of which is similar to the structure of , , and under the -preemptive queue with finite and infinite buffers.

However, due to the limited buffer for class-1 customers, a service interruption period of class-2 customers by class-1 customers after service preemption and a time from the service completion of the class-2 customer until the server is available to another class-2 customer (if any) are different from those in the -preemptive priority queue with infinite buffer for both classes. This causes the derivation of the distributions of and to be more involved than those in the case of infinite buffer for both classes, where every class-1 customer who arrive during the gross service time can be viewed to initiate an identical, standard M/G/1 busy period during or if we assume the LCFS order between class-1 customers. This makes it straightforward to derive the distribution of and from the distribution of [15], see Section 3. However, this is not the case for the finite-buffer case.

Let denote the number of preemption occurrences during the gross service time of a class-2 customer. Since under any impartial service order between customers of the same class, the lengths of , , , and are unaffected by the service order between customers of the same class, we can assume the LCFS order between class-1 customers without loss of generality. Assuming the LFCS order between class-1 customers, define to be the number of class- customers who arrive during and are serviced before the service completion of the class-2 customer, and the number of class- customers who arrive during but are serviced after the service completion of the class-2 customer. Therefore, the number of class-1 customers who arrive during is . Observe that under the -preemptive priority discipline and the LCFS order within the same class, class-1 customers who arrive during a service interruption period are serviced or lost during the same service interruption period because . Thus, when the class-2 customer service is completed, he/she leaves behind only class-1 customers when departing from the system.

While the joint transform , , was utilized in [15], a partial transform of and conditioning on will be employed here, because customers who arrive during do not initiate their own standard, identical M/G/1 busy periods any more in the finite buffer case. Since the class-2 customer service is preempted whenever the number of class-1 customers reaches , there can be at most class-1 customers after the completion of the class-2 customer service. Given , , define the partial transform of and to be

Observe that, since the LCFS order is assumed between class-1 customers, every service interruption period after preemption by class-1 customers consumes exactly class-1 customers from class-1 customers who arrive during . Thus, we have which leads to the following relationship [15]

The completion time is composed of the gross service time and service interruption periods by class-1 customers, and under the -preemptive priority discipline, a service interruption period starts with class-1 customers and ends when the number of class-1 customers drops to . Thus, if we let denote a busy period that begins with customers and ends as soon as the number of customers drops to , in the M/G/1/ queue with and as the arrival rate and service time, then a service interruption period during is identical to . Thus, the LST of the completion time is expressed as where according to our standard notation. Similarly, the occupation time is composed of and the following class-1 busy period initiated by class-1 customers, which is identical to the period. Thus, the LST of can be expressed as

Differentiating (20), (21), (22), and (8) and setting , , and gives where for , and and , , are given in the Appendix in [13].

We now derive the for both the -PR and -PRI disciplines.

4.1. -Preemptive Resume Discipline

In order to derive for the -PR priority discipline, the approach used for deriving the joint transform in [15] is adapted for . Under the -PR priority discipline, the gross service time is identical to the service time of the class-2 customer. Note also that, if , then no preemption occurs during , and that, if for , then preemption occurs times during , , and because every preemption and the corresponding service-interruption period consumes class-1 customers from class-1 customers. Hence, for , we have

This leads to for . From the complex number theory, for integers , , and , and , we have the following identity: where . Thus, we can rewrite for as where the first equality holds from the identity above and the following identity that for a complex number , a rational numbers , and an integer ; the second equality holds from the following identity that for complex numbers and and an integer ; and the third equality holds from . Hence, we finally obtain the for the -PR priority discipline as follows: where .

Given the distribution of , differentiating (34) and setting and gives , , , , and for the -PR priority discipline, which, in turn, gives , , , and from (23)–(28). In numerical examples in Section 7, we will demonstrate that these moments can be numerically obtained for a specific distribution of .

4.2. -Preemptive Repeat-Identical Discipline

In order to derive for the -PRI priority discipline, we introduce the following notation: Given and , we define the following partial transform of and as

Due to preemption by class-1 customers, the class-2 customer may attempt his/her service multiple times under the -preemptive priority discipline. We now partition into portions at each service attempt, and define to be a total of service effort on the class-2 customer from the server on the th service attempt of the class-2 customer, and to be the number of class-1 customers who arrive during , . Hence, and . In addition, let denote an Erlang random variable with rate and shape parameter , , and denote the following partial transform of when :

Then, we have for (see (8) in [15]).

We now consider the following two events at the first service attempt of the class-2 customer when . (1)Assume that class-1 customers, , arrive during the first service attempt. Then, the class-2 customer service is completed at the first service attempt and there is no preemption during . Hence, , , and . Also, since class-1 customers arrive according to a Poisson arrival with rate , we have

Thus, for , we have (2)Assume that class-1 customers arrive during the first service attempt. Then, the class-2 customer service is immediately preempted as soon as the number of class-1 customers reaches , and then after the service interruption period of , the service is freshly restarted with the identical service time . Notice that, if , then because the class-2 customer service is preempted whenever the number of class-1 customers reaches and is restored whenever it drops to . Let and denote the remaining gross service time and the number of preemption occurrences after the end of the th service attempt. Then, and . Let also the partial transform denote the following partial transform of and :for . Given , the event that is equivalent to the event that , i.e., class-1 customers arrive before the class-2 customer service is completed. Notice also that, given , , , and , and that, given , , , and are independent of because the class-2 customer service is completely restarted at the next service attempt and the arrival process are Poisson processes. Hence, we have for

Combining the above two cases, we have

We now derive the term in the equation for . Given , consider the following two events at the second service attempt of the class-2 customer. (1)Assume that the number of class-1 customers who arrive on the second service attempt of the class-2 customer is less than . Then, the class-2 customer service is completed at the second service attempt, so , and . Hence, for , we have(2)Assume that . Then, on the second service attempt of the class-2 customer, preemption occurs as soon as class-1 customers arrive during . Given , the event that is identical to the event . Hence, , and . Observe that and are stochastically equivalent to and because the service is restarted with the same service time and the arrival processes are assumed to be Poisson processes. Thus, we have

Combining the above two cases, we have for which leads to

Plugging the result of into the equation for and eliminating the condition on , we finally have the partial transform for the -PRI discipline as follows:

Given the distribution of , differentiating (47) and setting and gives , , , , and for the -PRI priority discipline, which, in turn, gives , , , and from (23) to (28). In numerical examples in Section 7, we demonstrate that these moments can be obtained numerically.

5. Loss Probability and Queue Length Distribution of Class-1 Customers

While the system is always stable for class-1 customers due to the finite buffer, it can be unstable for class-2 customers. Since only one class-2 customer is serviced during an occupation time, if , then the system is stable for class-2 customers.

Assuming first that the system is stable for class-2 customer, we derive the loss probability and the queue length distribution of class-1 customers. Note that if the system is stable for class-2 customers, then a class-1 customer may arrive at the system during an idle period, a busy period for class-1 customers, or a gross service time for a class-2 customer. Class-1 customers who arrive during an idle period or a gross service time initiate class-1 busy periods, which can be grouped into three types: A first type of class-1 busy periods is those that are initiated by a single class-1 customer who arrive during an idle period, and continue until there are no class-1 customers in the system. This type of class-1 busy periods is stochastically identical to the standard busy period in the M/G/1/ queue with and as its arrival rate and service time, which will be denoted by the busy period. A second type of class-1 busy periods is those that are initiated by class-1 customers who arrive during the gross service time of a class-2 customer and preempt of the class-2 customer in service, and continue until the number of class-1 customers drops to . This type of class-1 busy periods will be denoted by the busy period. A third type of class-1 busy periods is those that are initiated by class-1 customers, , who arrive during the gross service time but are not allowed to preempt the class-2 customer service, which begin on completing the class-2 customer service and continue until there are no class-1 customers in the system. This type of class-1 busy periods will be denoted by the busy period, .

We derive the loss probability that a class-1 customer who arrive at the system is lost due to no vacancy of the class-1 buffer when . Denote also , , , , and to be the probabilities that the system is in the idle period, in the gross service time, in the busy period, in the busy period, and in the busy period at an arbitrary time, respectively. Then, from Little’s law and PASTA we have where and . Note that the busy period occurs at a rate of , the busy period at a rate of , and the busy period at a rate of , . Thus, from Little’s law, we also have

From (48)–(52) and (26), we have

Furthermore, since we have

This, together with (48), leads to where the term , , can be obtained from Appendix A in [13], and the terms and from (23), (27), and the partial transform that are derived for each priority discipline in Section 4. When and , the -preemptive priority discipline reduces to the classical preemptive priority discipline, and from (25) and (26), we have . Thus, from (56), the loss probabilities and for the classical PR and PRI disciplines are given by which is identical to the loss probability in the M/G/1/ queue because class-2 customers do not affect the loss probability of class-1 customers under the classical preemptive priority disciplines. Also, this result corresponds to the following basic relationship in the M/G/1/ queue:

We now derive the PGF of the number of class-1 customers at an arbitrary time when . Denote , , , , and to be the PGF of the number of class-1 customers at an arbitrary time when the system is in the idle period, the gross service time, the busy period, the busy period, and the busy period, respectively. As there are no customers in the idle period, we have

Notice that, as we assume that , no class-1 customers who arrive during the gross service time are lost. Hence, from PASTA the number of class-1 customers at an arbitrary time during is identical in distribution to the number of class-1 customers immediately before a class-1 arrival during . During class-1 customers arrive, and customers preempt the class-2 customer in service and are serviced in a service interruption period, but customers are serviced after the class-2 customer service is completed. Since assuming the LCFS service order among class-1 customers does not affect the distribution of the number of class-1 customers, we can assume the LCFS service order between class-1 customers. Observe that under the LCFS service order, the class-1 customers observe class-1 customers in the system on their arrivals, respectively. Similarly, class-1 customers are serviced on every service interruption period, and under the LCFS service order, they are the latest arrivals before each service preemption and observe class-1 customers on their arrival, respectively. Thus, we have

In addition, and can be obtained from (9). Hence, the PGF is expressed as where can be obtained from from (10) and (11), from Appendix A in [13], and and from derived for each priority discipline in Section 4.

Remark 1. The PGF in (61) exhibits a decomposition form similar to one in the corresponding infinite-buffer M/G/1 queue. If , then the terms in (61) converges to the PGF of the number of customers in the corresponding standard M/G/1 queue, and to , the mean length of the busy period in the corresponding standard M/G/1 queue (see Lemma 1 in [13]). Thus, if , then (61) converges to the following well-know decomposition form of M/G/1 queues: which corresponds to the results for the -preemptive priority queue with infinite buffers for both classes (see (17) and (18) in [15]).

From (61), the mean number of class-1 customers is given by

We now consider the case when the system is unstable for class-2 customers, i.e., . In this case, the system cycle consists of only consecutive occupation times. Thus, we can obtain the loss probability of class-1 customers from the following simple relationship: which leads to

Furthermore, since the busy period occurs times and the busy period occurs times during a single occupation time, the PGF is given by

This also leads to

6. Queue Length Distribution of Class-2 Customers

When considering the queue length distribution of class-2 customers, we assume that the system is stable for class-2 customers, i.e., . If the system is stable for class-2 customers, the system cycle consists of the idle period and the overall busy period, and the overall busy period is initiated by either a single class-1 customer or a single class-2 customer and ends when the system becomes empty of customers of both classes. We will call these two types of the overall busy periods initiated by a class-1 or -2 customer the type I and II busy periods, respectively.

Let and denote the number of class-2 customers who are serviced during a type I and a type II busy period, respectively. Let also and denote the PGF of the number of class-2 customers in the buffer just after the beginning of the occupation time of a class-2 customer during the type I and type II busy periods, respectively. From the point of view of class-2 customers, the type I busy period can be viewed as a delay cycle consisting of an initial delay of the period, which is initiated by the class-1 customer arriving during the idle period and ends when the system becomes empty of class-1 customers, and the following delayed busy period in the corresponding M/G/1 queue with the occupation time and as its effective service time and arrival rate. Additionally, the type II busy period can be viewed as the standard busy period in the M/G/1/queue with and as its effective service time and arrival rate. Thus, from the well-known decomposition property of the delay cycle [24], Sec. 8-3, [25], Sec. 3-3 we have

Note that the probabilities that the overall busy period is of type I and II are and , respectively. Hence, if we let denote the PGF of the number of class-2 customers in the buffer just after the beginning of the occupation time of a class-2 customer in a steady state, then we have

Since a class-2 customer leaves the system in the completion time after the beginning of the occupation time, the PGF of the number of class-2 customers left behind by a class-2 customer who depart from the system is . Hence, from Burke’s argument and PASTA the PGF of the number of class-2 customers at an arbitrary time is given by

Thus, from (70), the mean number of class-2 customers is given by where and can be obtained from Appendix A in [13], and , , and from (25) to (28) and derived for each priority discipline in Section 4.

7. Numerical Examples

In this section, we perform a numerical study on the effects of the threshold and , the buffer size , the arrival rates and , and the distributions of the service times and on performance measures in the -preemptive priority M/G/1/ queue.

We first consider the effect of the threshold on the performance measures. In order to do this, we use the following numerical example: The arrival rates and are set to 1 and 1/2, respectively. The service time follows an exponential distribution with rate 4. Hence, , and . Since the distribution of can severely affect the performance measures due to service repetition under the -PRI discipline, we consider three different service distributions of with the same mean : an exponential distribution with rate , an Erlang distribution with rate and shape , an hyperexponential distribution with rates 8 and 2 and weights 2/3 and 1/3. The coefficients of variation (CVs) of the exponential, Erlang, hyperexponential distributions are , , and , respectively. Also, to see the effect of the threshold , varies from 1 to 5, while the threshold is fixed at 0. The buffer size for class-1 customers is set to 7.

Figure 1 shows the effect of on the loss probability , the mean queue length of class-1 customers, and the mean queue length of class-2 customers for the three different distributions of and the two preemption modes PR and PRI of the -preemptive priority discipline. As seen in Figures 1(a) and 1(b), as increases, and increase for all the combinations of the distributions of and the preemption modes. This is because a larger leads to less frequent preemption occurrences by class-1 customers, which is favorable to class-2 customer, but unfavorable to class-1 customers. For all the three distributions of , , and under the -PRI discipline are higher than those under the -PR discipline, except for . When and , the -PR and -PRI disciplines reduce to the classical PR and PRI disciplines, where class-1 customers can preempt service for class-2 customers as soon as they arrive. Therefore, the loss probability and the mean queue length of class-1 customers are not affected by whether the preempted service of a class-2 customer is resumed or repeated. However, when , class-1 customers have to wait while a class-2 customer is in service and the number of class-1 customers is less than . The differences in class-1 performance measures between these two disciplines for class-1 customers become more prominent for or , and reduce for a larger . This is because, for a sufficiently large , preemption rarely occurs, and both -PR and -PRI disciplines converge to the nonpreemptive discipline. On the other hand, as seen in the left and center panels of Figure 1, the higher CV of is, the higher and are for all values of , except for . This is because, when , the -preemptive priority discipline has nonpreemptible portion of the gross service time of a class-2 customer, and the variance of has negative impact on the performance measures of class-1 customers, as in the nonpreemptive priority discipline. Also, the higher CV of is, the more prominent the differences in class-1 performance measures between the -PR and -PRI disciplines are. The right panel of Figure 1 shows the effect of on the mean queue length of class-2 customers. As increases, decreases for all the combinations of the distributions of and the preemption modes. The effect of on under the -PRI discipline is more prominent than that under the -PR discipline because a smaller leads to more frequent preemption, the effect of which on is more amplified by service repetition under the -PRI discipline. As with the performance measures of class-1 customers, the effect of on is more prominent when the CV of is larger, and so is the differences in the effect of between the -PR and -PRI disciplines.

We now consider the effect of the threshold on the performance measures. In order to do this, we use the same numerical example as for the effect of , except that, to see the effect of the threshold , varies from 0 to 3, while the threshold is fixed at 4. Figure 2 shows the effect of on , , and for the three different distribution of and the two preemption modes. As seen in the left and center panels of Figure 2, as increases, and under the -PR discipline increase so slightly as the curves seem to be flat, and those under the -PRI discipline increase more rapidly for all the distributions of . This is partly because a larger leads to a shorter busy period for class-1 customers, which is favorable to class-2 customer, but unfavorable to class-1 customers. On the other hand, for a larger , the difference between and is smaller, which leads to more frequent occurrences of preemption. This alleviates the negative effect of shorter class-1 busy periods caused by a larger on the performance measures of class-1 customers under the -PR discipline. Thus, the effect of is almost flat for the -PR cases. However, under the -PRI discipline, this is not the case because more frequent occurrences of preemption result in a longer gross service time due to service repetition, which intensifies the negative effect of a larger on the class-1 performance measures. Also, the larger CV of is, the more prominent this negative effect is. Similarly, as seen in the right panel of Figure 2, as increases, under the -PR discipline decreases so slightly as the curves of seem to be flat because there is a mixed effect of shorter class-1 busy periods (service interruption time) and more frequent occurrences of preemption (occurrences of service interruptions) on the class-2 service performance. However, under the -PRI discipline, more frequent occurrences of preemption result in a longer gross service time due to service repetition, and the net result of shorter class-1 busy periods and more frequent occurrences of preemption becomes a prominent negative impact on the class-2 service performance, especially when the CV of is large.

We now consider the effect of the buffer size on the performance measures. In order to do this, we use the same numerical example as for the effect of , except that, to see the effect of , varies from 3 to 11, while the thresholds and are fixed at 3 and 0, respectively. Figure 3 shows the effect of on , , and for the three different distribution of and the two preemption modes. As expected, the buffer size has a great impact on the customer loss probability , while it has a mild impact on the mean queue lengths of both classes. As seen in the left panel of Figure 3, the differences in the impact of on between the preemption modes and between the CVs of are not prominent. This is because class-1 customer loss occurs only during the class-1 busy period, which is basically not related to the service distribution of and the preemption mode of , except for their effect on the probabilities , , and of the type of a class-1 busy period. On the other hand, as increases, the expected queue lengths of both classes increase, but are bounded by the corresponding values when the buffer size is infinite when .

We now consider the effect of the arrival rate on the performance measures. In order to do this, we use the same numerical example as for the effect of , except that, to see the effect of the arrival rate , varies from 0.2 to 1.0, while the thresholds and are fixed at 3 and 0, respectively. Figure 4 shows the effect of on , , and for the three different distribution of and the two preemption modes. As seen in the left and center panels of Figure 4, as increases, the loss probability rapidly soars for all the distributions of and the preemption modes, while the mean queue length rather linearly increases. This is because class-1 customer loss restricts the build-up of the class-1 queue. On the other hand, as increases, the mean queue length rises (see the right panel in Figure 4). This is because a larger causes a longer occupation time of a class-2 customer, which, in turn, leads to a large mean queue length of a class-2 customer. Furthermore, as in the effects of the thresholds and , the effect of on becomes more prominent under the -PRI discipline than under the -PR discipline, and so does it for a larger CV of . However, the differences in the effect of on the class-1 performance measure and between the CVs of and between the different preemption modes are less prominent than that on the class-2 performance measure . The reason for this is the following: As seen in (56) and (63), and are clearly related to the gross service time, the duration of which depends on the CV of and the preemption mode. Furthermore, as increases, the gross service time increase more rapidly for a larger CV under the -PRI discipline (see the left panel of Figure 5). However, the length of the busy period of class-1 customers increases far more rapidly than the gross service time for a sufficiently large . This impact of a longer class-1 busy period, which is identical for all the CVs of and the preemption modes, dominates the impact of a longer gross service time, leading to the less prominent effect of on and . Figure 5 confirms this explanation. Figure 5(a) shows that, as increase, the differences in between the CVs of and between the preemption modes become prominent, but the right panel shows that the difference in , which consists of and the class-1 busy periods, is relatively less prominent than that of because the portion of the class-1 busy periods in is far bigger than that of .

We now consider the effect of the arrival rate on the performance measures. In order to do this, we use the same numerical example as for the effect of , except that, to see the effect of the arrival rate , varies from 0.4 to 2.0, while the thresholds and are fixed at 3 and 0, respectively. Figure 6 shows the effect of on , , and for the three different distribution of and the two preemption modes. As seen in the left and center panels of Figure 6, as increases, the loss probability and linearly increase for all the distributions of and the preemption modes. This is because and are linear function of as seen in (56) and (63), and is the product of and . Furthermore, the slopes of and are steeper under the (N, n)-PRI discipline than under the -PR discipline, especially when has a large CV. This is because is larger under the -PRI discipline than under the -PR discipline when has a large CV. On the other hand, as increases, the mean queue length rises as in an infinite-buffer M/G/1 queue (see Figure 6(c)).

Finally, we consider the effect of the CV of on the performance measures. In order to do this, we use the same numerical example as for the effect of , except that, to see the effect of the CV of , the CV of varies from 0.25 to 1.0, while is fixed at 1/4 and the thresholds and at 3 and 0, respectively. For various CV values of , we use Erlang distributions with the same mean. Figure 7 shows the effect of the CV of on , , and for the three different distribution of and the two preemption modes. As seen in the left and center panels of Figure 7, as the CV of increases, the loss probability rapidly soar for all the distributions of and the preemption modes, while the mean queue length rises rather slowly. This is because the finite buffer size restricts the build-up of the class-1 queue. On the other hand, as the CV of increases, the mean queue length also rises (see the right panel in Figure 7) because a high CV of leads to a large variance of the length of the class-1 busy period, which results in a large variance of the occupation time, the effective service time of a class-2 customer. Figure 8 confirms this explanation. As the CV of increases, and are flat, but increases.

In summary, the thresholds and , the buffer size , the arrival rates and , and the CVs of and all affect the performance measures , , and . However, their effects are somewhat different from each other. One of the most prominent factors that have a significant effect on all the performance measures is the upper threshold . More specifically, it has a more prominent effect on the blocking probability and the mean queue length than on the mean queue length because of the finite buffer that restricts a build-up of the queue of class-1 customers. In addition, while the difference of the effects of on and between the two preemption modes is mild, those on is prominent, especially when the CV of is large. This is because is related to the preemption frequency of class-2 customers, which has a significant effect on the effective service time of a class-2 customer under the -PRI discipline when the CV of is large. Overall, this implies that the upper threshold can operate as a primary control variable to govern the system performance because data traffic in communication systems generally has a heavy-tailed distribution of the service time, and data may have to be entirely retransmitted when a preemption occurs, due to a time bound of transmission. Compared to the upper threshold , the lower threshold has a mild effect on all the performance measure when the CV of is not large. This is because the lower threshold has two mixed effects. One is shortening the length of a busy period of class-1 customers, which is favorable to class-2 customers. The other is causing frequent occurrences of preemption during the service of a class-2 customer, which is unfavorable to class-2 customers. This implies that the lower threshold may operate as a subordinate control variable to fine-tune the system performance when the CV of is small or mild. On the other hand, the buffer size has a very significant effect on the blocking probability for all the combinations of the service distributions of and the preemption modes, but it has a mild effect on the mean queue lengths and . The arrival rates and and the CVs of and all have a significant effect on the performance measures as they increase, because an increase in them causes a larger mean and/or a larger variance of the traffic load. However, in many cases, these factors are exogenous, and we cannot control them.

From the point of a system operator, the thresholds and and the buffer size are decision variables to tune the balance of system performance between the classes. While increasing generally incurs an investment cost, adjusting the thresholds and does not. The effect of on the system performance is prominent for all the distributions of and the preemption modes, while the effect of is rather subtle under the -PR discipline and can be negative under the -PRI discipline. Therefore, regardless of the preemption mode, the threshold can be used as a primary control variable to tune the system performances between the classes of customers, while the threshold as a secondary control variable to fine-tune the balance between the classes only under the -PR discipline.

8. Conclusions

In this paper, we analyzed an M/G/1 priority queueing model with finite and infinite buffers under the -preemptive priority discipline. This study has two main contributions. From a practical point of view, this study will help system engineers tune their service systems for different classes of customers with heterogeneous requirements. Previous studies on priority queueing models with finite and infinite buffers have been restricted to those under the nonpreemptive priority or the classical preemptive priority disciplines. The nonpreemptive and preemptive priority disciplines are both static and rigid. When the service times of low-priority customers fluctuate, the nonpreemptive discipline may cause a severe degradation of service performance for high-priority customers. On the other hand, when preempted services of low-priority customers have to be repeated completely, the fluctuation of the arrival rates and the service times of high-priority customers may cause a severe degradation of service performance for low-priority customers. Compared to these classical priority disciplines, as shown in Section 7, the -preemptive discipline is so flexible that whether to preempt a low-priority service is determined dynamically based on the queue length of high-priority customers, and we can adjust the thresholds and to balance the performance of high-priority customers and that of low-priority customers (see Section 7). This will help system engineers fine-tune their systems for different classes of customers with heterogeneous requirements, without any additional investment cost such as a cost to increase the buffer.

From a theoretical point of view, this study extended the delay cycle analysis of finite-buffer M/G/1 queues recently proposed in [13], and developed the queue length version of the finite-buffer delay cycle analysis. Delay cycle analysis is one of the standard methods to analyze M/G/1 priority queueing models. However, delay cycle analysis has been applied mostly to infinite-buffer M/G/1 priority queueing models, and few studies have used it to deal with finite-buffer priority queueing models because the delay cycle analysis of finite-buffer M/G/1 queues is more involved than that of infinite-buffer M/G/1 priority queues [13, 14]. One of the advantages of the proposed queue length version of the finite-buffer delay cycle analysis is that, when the priority discipline is a dynamic rule based on queue lengths, the analysis of the priority queue is more straightforward than the corresponding waiting-time version in [13, 14] (see Section 4). Another advantage is that, as in the waiting-time version of the delay cycle analysis of finite-buffer M/G/1 queues, the queue length version of the finite-buffer delay cycle analysis is so easily combined with the traditional delay cycle analysis of infinite-buffer queues that the analysis of a priority queueing model with finite and infinite buffers can be reduced to the analysis of several simple finite- and infinite-buffer delay cycles. As a result, similar M/G/1 priority queues with finite and infinite buffers can be analyzed in a unified manner by exploiting a common delay cycle structure among the similar priority disciplines. In fact, the two different preemption queues, the -PR and -PRI priority queues, was analyzed in a unified manner in this study (see Sections 46), and their performance measures were compared in Section 7. Therefore, it is expected that other priority queues with finite and infinite buffers that have a priority discipline based on queue lengths, such as M/G/1/ under the -preemptive repeat different priority discipline or batch-arrival or/and discrete-time priority queues under the -preemptive discipline, can be tackled with the proposed method.

Data Availability

The parameter input values used in the numerical study are described in Section 7 of the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.