Abstract

We are concerned with the analytic solution of multiserver retrial queues including the impatience phenomenon. As there are not closed-form solutions to these systems, approximate methods are required. We propose two different generalized truncated methods to effectively solve this type of systems. The methods proposed are based on the homogenization of the state space beyond a given number of users in the retrial orbit. We compare the proposed methods with the most well-known methods appeared in the literature in a wide range of scenarios. We conclude that the proposed methods generally outperform previous proposals in terms of accuracy for the most common performance parameters used in retrial systems with a moderated growth in the computational cost.

1. Introduction

A common assumption when evaluating the performance of communication networks is that users that do not obtain an immediate service leave the system without retrying. However, due to the increasing number of customers and network complexity, the customer behavior, in general, and the retrial phenomenon, in particular, may have a significant impact on the network performance. We can find numerous examples of retrial queues in communication networks. In addition to classical telephony networks [1], the effect of retrials in wireless networks [2, 3] is usually a more delicate matter. We can also find the effect of retrials in data transfers along Internet [4]. However, it must be noted that retrial queues can also be observed in a wider range of systems, such as communication protocols or in many queues in daily life where a customer retries its access to a queue hoping to experience a lower delay in the access to a free server. Regarding the appearance of retrials in communication protocols, we can highlight its common use in the performance evaluation of MAC protocols like CSMA/CD [5]. The retrial phenomenon is also essential in modeling call centers (see [6] for a wealth of references on this issue).

The modeling of repeated attempts has been the subject of numerous investigations. Two functional blocks are typically distinguished in models which consider retrials: a block that accommodates the servers and possibly a waiting queue, and a block where users that retry are accommodated, usually called retrial orbit. More concretely, we consider an queueing system (following Kendall's notation stands for a queue with exponentially distributed interarrival and service times and an infinite capacity waiting room) with retrials where, following the human behavior, users can abandon the system with certain probability after an unsuccessful retry, what is usually referred to as impatience. Therefore, the resulting model under study will have a nonhomogeneous (because the retrial rate depends on the number of users in the retrial orbit) and infinite state space. It is known that the classical theory [7] is developed for random walks on the semistrip with infinitesimal transitions subject to conditions of space homogeneity. When the space-homogeneity condition does not hold, that is, in the case described in this paper, the problem of calculating the equilibrium distribution has not been solved beyond approximate methods [8, 9]. Indeed, if we focus on the case of multiserver retrial queues, the absence of closed-form solutions for the main performance characteristics when [10] can be emphasized. Therefore, it is clear that in this case it is necessary to resort to approximate methods. These methods are usually grouped into three categories: approximations, finite truncated methods, and generalized truncated methods. Although all the mentioned categories are in fact approximations and, therefore, sometimes it is not clear which category a method belongs to, the first category is usually devoted to methods that can be useful in a certain domain of the system parameters or in special extreme cases (low retrial intensity, high blocking probabilities, etc.). Both the finite truncated and generalized truncated methods replace the original infinite state space by a solvable state space, that is, a model where steady state probabilities can be computed. However, while for the finite truncated methods the initial state space is replaced by a finite state space, in generalized truncated methods it is replaced by another infinite but solvable state space.

The main contribution of this paper is the development of two new generalized truncated methods that are able to effectively solve retrial systems with user impatience. These novel methods are inspired in the model proposed by Neuts and Rao [11]. Comparing the proposed methods with the most well-known approaches appeared in the literature, we conclude that the new methods generally outperform the previous proposals in terms of accuracy for the most common performance parameters used in retrial systems and under a wide range of scenarios. Moreover, we show that their computational cost increases moderately compared to the simpler methods. We also show that all the generalized truncated methods clearly outperform the finite truncated methods in terms of accuracy, as it was predicted in [10].

The rest of the paper is structured as follows. Section 2 describes the model of the system under study and Section 3 summarizes previous approaches appeared in the literature. In Section 4 we describe the two proposed methods, and their performance is evaluated and compared to previous approaches in Section 5. Final remarks and a summary of the results are provided in Section 6.

2. System Model

In the system under study (shown in Figure 1), users arrive following a Poisson stream with rate and contend for access to a system with servers, requesting an exponentially distributed service time with rate . Furthermore, we define the load of the system by . Without loss of generality, we consider that each user occupies one resource unit. When a new request finds all servers occupied, it joins the retrial orbit with probability , having considered an infinite capacity for the retrial orbit. After an exponentially distributed time of rate this user retries, being a successful retrial if it finds a free server. Otherwise, the user leaves the system with probability or returns to the retrial orbit with probability , starting the retrial procedure again.

The model considered can be represented as a bidimensional continuous-time Markov chain (CTMC) whose state space is defined bybeing the number of users being served and the number of users in the retrial orbit, constituting a level dependent Quasi Birth and Death Process (QBD) [7] whose transition diagram is depicted in Figure 2. In QBD related literature, the term level refers to a set of states with the same second coordinate. The infinitesimal generator of this process has the following infinite block tridiagonal structure:

The main characteristics of this model are its infinite state space () and also its space heterogeneity produced by the fact that the retrial rate depends on the number of customers in the retrial orbit.

2.1. Computation of Performance Parameters

The most common performance parameter used in retrial systems is the blocking probability (), which is defined as the probability of the system being in a state where arrivals are not accepted. Notwithstanding, other performance parameters can describe the behavior of retrial systems more accurately. Those performance parameters are the mean number of users in the retrial orbit , the immediate service probability (), the delayed service probability (), and the nonservice probability (). is defined as the probability of a user being served in its first attempt, as the probability of obtaining service but not in the first attempt, and as the probability of leaving the system due to impatience without having been served. Obviously, it must be met that .

3. Overview of Approximate Methods

As stated in Section 1, it is known [10] that neither a closed-form analytical solution nor a direct algorithmic computation of the steady state probabilities has been obtained for the model under study, so we must resort to approximate methods. Those approximations can be classified into three categories [10].

(i)Approximations: this category includes those solutions in which the originalintractable model is simplified making some assumptions that usually suppose a good approximation only in a certain domain of the system parameters or in special extreme cases.(ii)Finite truncated methods: these methods consist in replacing the original infinite state space by a finite one, so the resulting model is solvable.(iii)Generalized truncated methods: in this case, the original infinite state space is replaced by another infinite but simplified and solvable state space.

It must be noted that the difference between the category approximations and the rest is sometimes not clear because all the methods are in fact approximations, and usually the accuracy of a method is better in a certain domain of the problem.

3.1. Approximations

An intuitive first approach to solve retrial systems is to consider those retrials as new arrivals requesting the access to a server. With this approximation, known as loss method [12, Section 2.8], the system becomes a simple loss model whose solution can be obtained using Erlang's B formula with an arrival rate that is the sum of new users and retrials. Although this solution is very simple, intuitively its main drawback is the expected inaccuracy when the retrial rate is high.

On the other side, for high values of the retrial rate the system can be approximated by an queueing system, so that the blocking probability can be computed using Erlang's C formula. As the loss method and the approach are useful for low and high retrial rates, respectively, in [12, Section 2.8], an interpolation of the above two methods is proposed to obtain a more accurate solution. Hereafter, we will refer to this method as Int.

In [13], Greenberg and Wolff propose another approximation (denoted by GW) based on the assumption that the returning customers see time averages. Obviously, the approximation made in GW is expected to work more accurately when the retrial rate decreases, because when a customer is blocked and reattempts quickly, the probability of finding the system busy is high.

3.2. Finite Truncated Methods

The first finite truncated method was proposed by Wilkinson in [14]. This method, denoted by Wil, is based on the truncation of the state space of the QBD beyond a level , that is, the method restricts the maximum number of users in the retrial orbit to . Obviously, the method is expected to be more accurate as we use larger values for the truncation level , but its computational cost will also increase. Other authors present more efficient truncated methods, for example, in [15] the truncation is based on the exclusion of those states with negligible stationary probabilities.

There also exist other finite truncated methods that modify the truncated state space to introduce, in some sense, the effect of retrials. The model presented by Fredericks and Reisner [16], and called hereafter FR, reduces the dimensionality of the model to a one dimensional state space, eliminating the dimension corresponding to the number of users in the retrial orbit and introducing its effect as a new arrival rate that depends on the state of the system.

In a similar way, Marsan et al. [17] propose a method (denoted by Mar) that reduces the infinite state space to a finite one grouping all the levels of the QBD where there are users in the retrial orbit into a single level. In [18] we propose a generalized version of Mar method, called FM. Comparing FM and Mar, FM shows a substantial improvement in the accuracy at the expense of only a marginal increase in the computational cost. These two methods include the effect of the eliminated states by modifying the transition rates of the last level of the resulting QBD. The difference between both methods lies in the number of levels taken into account in the finite truncated model. While Mar groups all the states with retrials into a single level, FM defines a value so the aggregation is performed beyond level . Mar can be seen as a particular case of FM, where . Note also that by increasing the value of both the accuracy and the computational cost will also increase. Due to that aggregation two new parameters are introduced: and . The parameter denotes the mean number of users in the retrial orbit conditioned to those states where there are at least users in the orbit, that is, . The probability that after a successful retrial the number of users in the retrial orbit does not drop below is represented by . The expressions needed to compute those parameters are [18]

As there is a mutual dependence between and and the steady state probabilities (), to find these values we followed a fixed-point iterative procedure starting with and .

3.3. Generalized Truncated Methods

In this section, we discuss the use of generalized truncated methods which approximate the retrial queue by some infinite although solvable system. We comment separately on the method proposed by Neuts and Rao in [11] as the methods proposed in this paper are based on it.

The method introduced by Falin in [19] (denoted by Fal) is based on the assumption that the retrial rate becomes infinite when the number of customers in orbit exceeds a certain level . Therefore, the system is approximated beyond level by a standard queue, and the retrial rate will depend on the number of users in the retrial orbit as follows:

Artalejo and Pozo [10] proposed a method (denoted by AP) based on the queue, instead of the queue. Therefore, the retrial rate will depend not only on the number of users in the retrial orbit () but also on the number of busy servers () as follows:

3.3.1. NR Method

The method proposed by Neuts and Rao in [11], and called hereafter NR, which was proved to converge to the original model in [20], is based on the homogenization of the model beyond a given level of the QBD, which supposes to restrict the maximum retrial rate, that is,

Figure 3 depicts the system model used in NR method. The resulting approximated model can be solved by the matrix-geometric approach [7]. Letwhere is the rate matrix defined as the unique minimal nonnegative solution of the quadratic matrix equation . In order to compute matrix , we have used the logarithmic reduction algorithm as proposed in [9, Section 8.4], using a precision of for the iterative procedure. The steady state probabilities for states where are obtained by solving combined with the normalization condition

As is a finite matrix, this system can be solved by any of the standard methods defined in the classical linear algebra. Finally, the steady state probabilities of the states in the homogeneous part of the model are easily computed by .

4. Proposed Methods

4.1. HM1 Method

When the number of users retrying is higher than , the NR method considers the retrial rate to be . Obviously, this is a rough approximation that can be inaccurate in many cases. Therefore, we propose to approximate the retrial rate beyond level by an estimation of its mean value, that is, , keeping the idea of the homogenization: The physical meaning of the proposed approximation is founded on assigning to the retrial rate beyond level a value that is the mean number of users retrying, when there are at least users retrying. This value can be computed by using

However, it must be noticed that depends on the value of , which is a configuration parameter, and more importantly, on the steady state probabilities. Therefore, we obtain a set of nonlinear equations relating , , and the probability vectors .

To solve this system, we use an approximation based on the assumption that when the number of customers in the orbit is sufficiently high then it is very likely that all servers are being used. Hence, , where , and thus and no longer depend on :

However, as the relationship between and still holds, we deploy an iterative procedure as follows: (1) set ; (2) given compute using the logarithmic reduction algorithm, then compute using (4.3); (3) if go to step 2; (4) with the final values of and compute the steady state probabilities solving the finite QBD.

4.2. HM2 Method

We must notice that the approximation that performs HM1 () may not be very accurate when the blocking probabilty is low, although it is expected to outperform NR. For that reason, we propose the HM2 method, which computes the value of using (4.2). More specifically, HM2 method uses the iterative procedure sketched next: (1) set ; (2) given compute and the steady state probabilities solving the finite QBD, then compute using (4.2); (3) if go to step 2.

Obviously, the performance of the HM2 method is expected to be higher than that of the HM1 method, but the computational cost of HM2 is also expected to be higher due to the fact that in HM1 the values of are computed only once, and this is not true in HM2.

4.3. Computation of Performance Parameters

For the methods HM1 and HM2, we have used the next expressions to compute the performance parameters described in Section 2.1.

5. Results

This section is aimed to compare the proposed methods HM1 and HM2 with the most well-known approaches appeared in the literature so far. In order to evaluate and compare the proposed algorithms with previous approaches, we have studied a wide range of scenarios. Letting , we have studied different system loads by modifying and keeping and fixed. It must be noted that, due to the introduction of the impatience phenomenon, we will be able to consider values of . We have also studied different configurations of the retrial behavior by varying using in all cases . Note that it is not necessary to modify to have different loads in the retrial orbit, as we use for that purpose.

The comparison of HM1 and HM2 is done with FM and NR methods, which can be considered representative examples of a finite truncated method and a generalized truncated method, respectively. It must be noted that the resulting QBD has been solved using the algorithm proposed in [21] in all cases, although other algorithms have been proposed in the literature [22, 23]. For the iterative procedures in FM, HM1, and HM2 the accuracy has been chosen in all cases to be .

For comparing the results, we have used the relative error of different performance parameters, defined as for a generic performance parameter . In order to obtain an accurate enough estimate of which can be used as , we ran all methods with increasing and sufficiently high values of so that the value of had stabilized up to the 14th decimal digit. As an example and for the particular case and , Figure 4 depicts that the general trend is that the higher the value of the lower the relative error is. This is due to the fact that the system under consideration becomes more similar to the exact model as increases. As expected all methods converged to the same value in the performance parameters under study, . In Table 1 we show the values of and that are going to be considered as the exact values, obtained as described above. Obviously, to obtain those values we need higher values of as and increase. In general, when we use values of , the value of needed to obtain a tight estimation of increases significantly, so the computational complexity makes the resolution to be unfeasible from a practical point of view when . For that reason and since blocking probabilities between and are unacceptable for virtually any practical application, the comparison among different methods is done in a range of .

The different methods are compared by using a metric based on the minimum truncation level needed to obtain a certain relative error, which has been widely used in the literature [10, 11]. Table 2 shows the minimum value of needed to obtain a relative error lower than for , , and . Note that a wide range of operation points for the occupancy of the servers and the retrial orbit has been chosen. The number in bold indicates the lowest value of for all the models studied. Finally, it is important to note that immediate service probability has been omitted as it is the complementary value of in the system under study.

Results show that the best performance is obtained by HM2, followed by HM1, NR, and FM. This trend is obtained in almost every scenario studied and for all the performance parameters. It is not unexpected that HM2 is more accurate than NR and HM1 as these can be considered as approximations of HM2. As it could be expected generalized truncated methods outperform finite truncated methods, as predicted in [12]. More specifically, the NR method improves FM, as it requires a lower value of for obtaining a concrete relative error. HM1 method obtains better accuracy than NR method for the same value of for and , but this is not true for and , in which the inverse behavior is obtained. Finally, HM2 method clearly outperforms all methods, as it needs much lower values of to obtain the desired accuracy for all the performance parameters under study.

Nonetheless, as we are dealing with numerical methods some attention must be paid to the computational cost. Note that the same methodology for solving the QBD and the same precision for the iterative procedures have been used in all cases. In Figure 5 we show the time needed to solve the system when and for different values of . Results have been obtained running Matlab in an Intel Pentium Core 2. As observed, the generalized truncated methods are not only more accurate than FM, but also they are computationally more efficient. However, we must note that times are negligible from a human point of view in all the generalized truncated methods. If we compare the results with those obtained in Figure 4, we conclude that we can obtain high accuracies with small computation times, specially for the generalized truncated methods.

5.1. Scenario with no Impatience

Although the impatience phenomenon is part of the human behavior, some of the most well-known methods to solve retrial systems do not consider it. For that reason, in this section we consider the particular case of the proposed model in which , that is, we do not take into account user impatience. In this way we will be able to make a global comparison of all methods. Note that in this scenario, the load offered to the system () has to be less than one. Moreover, we have decreased the number of servers to , because the computational complexity of AP makes it infeasible to solve the system when servers. We have focused only in the blocking probability because when we do not consider the impatience phenomenon and, therefore, . We can classify all the methods described in Section 3 into two categories: those in which the truncation level is a configurable parameter and those that offer a unique solution. In the former the precision can be adjusted through the parameter while in the latter precision is an intrinsic feature of the method and hence it is fixed. In the first group we find Wil, FM, Fal, NR, AP, HM1, and HM2 while in the latter we find FR, GW, Loss, Int and Mar. In a first part, we study the performance of the latter, showing and the relative error for each method in Table 3. As we can see, errors are usually lower when we increase the time between retrials but, in any case, errors are unacceptable in most cases. As Mar is the best method in all of them and it is a particular case of FM in the sequel only the FM method will be considered as it yields an upper bound for the accuracy of fixed precision methods.

Focusing on the configurable methods, Table 4 shows the minimum value of needed to obtain a relative error in the blocking probability lower than . Note that the number in bold represents the best choice for each scenario. As we can see, in scenarios without impatience, generalized truncated methods (Fal, NR, AP, HM1, and HM2) also outperform finite truncated methods (Wil and FM). Among the generalized truncated methods, the best performance is obtained by HM2 and AP. While the first is specially useful for low values of , the latter is the best for high values of ().

In Figure 6 we show the computational cost associated to all the presented methods when and for different values of . Results show that the lowest computation costs are for the simplest methods (Wil, Fal, NR) followed by HM1 and HM2. Finally, and computationally speaking, the worst methods are FM and AP.

6. Conclusions

We proposed and compared two novel methods that effectively obtain the value of typical performance parameters in retrial systems with user impatience. As there are not closed-form solutions to these systems when there are more than two servers, approximate methods are required to solve such systems. The proposed methods are improvements of the method proposed by Neuts and Rao in [11] being also based on the homogenization of the state space beyond a certain level. The results show the better performance of generalized truncated methods compared to finite truncated methods in terms of accuracy. Comparing the generalized truncated methods among them, we conclude that the proposed HM2 method outperforms, in almost all cases, previous proposals in terms of accuracy for a wide range of scenarios and for the performance parameters studied, with moderated computation cost growths compared to the simpler methods.

Acknowledgments

This work was supported by the Spanish Government (30% PGE) and the European Commission (70% FEDER) through Projects TSI2005-07520-C03-03 and TSI2007-66869-C02-02 and by Cátedra Telefónica de Internet y Banda Ancha (e-BA) from the Universidad Politécnica de Valencia. Besides, M. Jose Domenech-Benlloch was supported by the Spanish Ministry of Education and Science under Contract AP-2004-3332.