Abstract

Aggregates of real-time traffic may experience changes in their statistical characteristics often manifesting non stationary behavior. In multi protocol label switching (MPLS) networks this type of the traffic is assigned constant amount of resources. This may result in ineffective usage of resources when the load is below than expected or inappropriate performance when the load is higher. In this paper we propose new algorithm for dynamic resource adaptation to temporarily changing traffic conditions. Assuming that network nodes may reallocate resources on-demand using automatic bandwidth adjustment capability of MPLS framework, the proposed algorithm, implemented at ingress MPLS nodes, dynamically decides which amount of resources is currently sufficient to handle arriving traffic with given performance metrics. This decision is then communicated to interior MPLS nodes along the label switched path. As a basic tool of the algorithm we use change-point statistical test that signals time instants at which statistical characteristics of traffic aggregates change. The major advantage of the proposed approach is that it is fully autonomous, that is, network nodes do not need any support from hosts in terms of resource reservation requests. The proposed algorithm is well suited for traffic patterns experiencing high variability, especially, for non stationary type of the traffic.

1. Introduction

Congestion is one of the major problems that degrades performance of networks. It occurs when network resources are insufficient for a current load or when load distribution is uncontrolled. The latter was the main problem of IP networks back in 1990s. Existing interior gateway protocols (IGPs) were topology-driven, took decision on routing packets based on network connectivity only and did not take into account resource requirements of traffic patterns. Traffic engineering (TE) capabilities of MPLS framework are intended to avoid the situation where congestion occurs as the result of inefficient resource allocation. TE attempts to provide best possible utilization of network resources avoiding noneven load distribution. Nowadays, MPLS is considered the vital part of QoS-aware networks.

Traffic patterns in packet-switched networks are characterized by deterministic trends similar to those found in classic telephone traffic. Particularly, it was demonstrated that there are clear daily variations in statistical parameters of traffic aggregates and these variations can be specific for different services [1]. They are often caused by preference of users to use a certain network service in a specific time of the day. Authors in [1] also noticed that there is a clear indication of the busy hour in link usage patterns. Irregular changes in statistical parameters of traffic aggregates were also reported in [24]. In [5], authors highlighted that operators' pricing scheme may also produce changes in traffic patterns resulting in additional source of nonstationarity. Besides commonly believed self-similar and long-range dependent properties (see [68] among others), nonstationarity is one of the major reasons for a traffic pattern to exhibit high variability.

In modern QoS-aware networks, resources for traffic aggregates are often allocated statically based on busy hour traffic expectations. In this case, high variability of a traffic pattern requires significant overprovisioning of network resources. Practically, high variability implies that there are large bursts in a traffic pattern. To serve such traffic with given performance metrics, sufficient amount of resources must be provided during this period. However, there are also long time spans during which the local average of the traffic pattern stays well below the global average. This implies that for a traffic pattern exhibiting high variability static resource allocation leads to ineffective usage of network resources. We also note that the busy hour traffic volume may not be known in advance making the decision about the required resources completely unclear.

The network always has up-to-date view of traffic aggregates. The straightforward way to deal with overprovisioning or underprovisioning of resources is to allow network nodes to dynamically choose the amount of resources that is sufficient for current arrival statistics of a traffic aggregate. The rest of resources can be temporarily assigned to other traffic aggregates. To deal with this problem, MPLS networks provide automatic bandwidth adjustment capability. It allows an MPLS label switched path (LSP) to automatically adjust bandwidth allocation based on its traffic load measured over the sampling interval of a certain length. Unfortunately, automatic bandwidth adjustment procedure does not take into account that statistical characteristics of traffic patterns may vary even during a single sampling interval. The length of the sampling interval is another important issue that affects performance of the resource allocation and requires additional administrative efforts. Dealing with time-varying traffic patterns, fully automatic bandwidth adjustment scheme that requires no additional administrative effort would be of a great benefit for network operators.

In this paper, we propose resource description and allocation scheme that dynamically reallocate resources to traffic aggregates in MPLS networks. Assuming that network nodes may reallocate resources on-demand using automatic bandwidth adjustment capability of MPLS framework, the proposed algorithm decides which amount of resources is currently sufficient to handle arriving traffic with given performance guarantees. We describe the structure of the proposed system and demonstrate its applicability using real traffic traces. As a basic tool of the algorithm, we use change-point statistical test. It allows to automatically determine whether statistical characteristics of a traffic pattern change and, if so, estimate new parameters of the traffic pattern. The major advantage of the proposed approach is that it is fully autonomous, that is, the network does not need any support in terms of explicit resource reservation requests. The proposed algorithm is especially wellsuited for nonstationary type of the traffic whose statistical parameters change in time. Presented numerical results reveal that the proposed approach may provide significant resource savings.

The rest of the paper is organized as follows. In Section 2, we briefly review automatic bandwidth allocation capability of MPLS networks. In Section 3, we consider dynamic nature of aggregated video traffic. Model for covariance-stationary behavior of aggregated video traffic is also proposed there. Change-point statistical test for detecting changes in arrival statistics is introduced in Section 4. The dynamic resource allocation system is proposed in Section 5. Numerical examples are given in Section 6. Conclusions are drawn in the last section.

2. MPLS Networks

2.1. Traffic Engineering in MPLS

Modern QoS-aware networks such as DiffServ, MPLS, or joint DiffServ/MPLS are specifically designed to be flexible enough to reallocate network resources in the best possible way, such that the required performance is provided using minimum amount of resources. Although MPLS is not considered a QoS framework for IP networks, it provides a number of advantageous features to network operators. According to MPLS, data are transmitted along the so-called label switched paths (LSPs) that can be explicitly chosen by network operators [9]. For this purpose, MPLS assigns short labels to network packets that describe how to forward them through the network. This feature enables effective traffic engineering (TE) capabilities [10] compared to classic approach that adopts usage of interior gateway protocol (IGP) metrics. Label switched paths (LSPs) are computed in traffic engineering databases (TED, [10]) in ingress routers that are filled by IGP advertisements. For this reason, IGP protocols were extended to support advanced metrics such as available bandwidth on a link [11, 12]. LSPs are then established using either resource reservation protocol modified to support traffic engineering (RSVP-TE, [13]) or label distribution protocol with constrained routing (CR-LDP, [14]).

Traffic engineering capabilities of MPLS allow to control the path that data packets follow in the network avoiding standard routing strategy of IGP protocols. They are intended to deal with the situation where congestion occurs as the result of inefficient resource allocation. TE tries to provide best possible utilization of network resources avoiding noneven load distribution. This is achieved by real-time monitoring of the traffic load on each network element and further tuning of traffic management attributes, routing parameters, and resources constraints.

2.2. Automatic Bandwidth Adjustment

Traffic aggregates in DiffServ, MPLS, and joint MPLS/DiffServ networks are still described statically using the token bucket mechanism. Automatic bandwidth adjustment capability of MPLS networks allows an LSP to automatically adjust its bandwidth allocation based on the measured traffic load. It works as follows. During the sampling interval of a certain length, the average bandwidth is monitored. At the end of the interval, an attempt is made to signal a new LSP with the bandwidth allocation set to the average value for the preceding sampling interval. If a new path is established, the original path is removed and the LSP is switched to the new path. If a new path is not established, the LSP continues to use its current path until the end of the next sampling interval, when another attempt is made.

While the approach described above is considered to be automatic in nature, it does not take into account possible time-varying behavior of traffic patterns. When the sampling interval is too large, there can be multiple changes in the mean value of traffic observations that may severely bias the statistics. On the other hand, too short sampling interval may burden the network with excessive signaling information. The approach that triggers LSP reestablishment when statistical characteristics of a traffic patterns do change could provide trade-off between the amount of signaling information and accuracy of bandwidth allocation.

2.3. Service Configuration

Reference configuration of the service we consider is presented in Figure 1, where AN stands for access network. We assume that there is a network operator providing network resources for a service provider. Service provider provides a QoS-constrained video distribution service to a number of users across the network. We assume that there is a specifically established LSP for video traffic aggregate and routers along the path of the traffic aggregate implement MPLS framework with dynamic bandwidth adjustment capability. The network operator is required to provide guarantees to this traffic aggregate in terms of mean losses and delays.

3. Time-Varying Nature of Aggregated Video Traffic

To represent traffic arrival process, we use video traffic traces available from the University of Berlin [15]. We consider video traffic using the granularity of a second. Although we use H.263 VBR traces, we have checked that conclusions stated in this paper remain valid for MPEG- VBR traces from the same traffic archive and MPEG-1 VBR sequences from University of Wurzburg [16].

3.1. Traffic Aggregate: Varying Number of Sources

We generated traffic traces simulating behavior of the video distribution system as explained below. We assume that at a certain instant of time there are no active sessions and this is the time instant when the system enters its operational state. After , session requests start to arrive to the system. Each arrival requests an arbitrary video sequence. We used uniform distribution over all available traces to determine the requested sequence. Interarrival times of sessions are geometrically distributed with a certain probability . We assumed that session arrival process is time-varying meaning that varies in time. This is a natural assumption for any service where the session arrival rate is different for different time of the day. We simulated 50 session arrivals. First 15 and last 15 arrivals were drawn from geometrical distribution with , 20 arrivals in the middle occur according to geometrical distribution with . Note that these parameters allow to mimic behavior of the system prior, during and after the busy hour.

Time-series of generated traces are shown in Figure 2. Observing these traces, it is natural to expect that statistics of the aggregated traffic are time-varying. Indeed, when the rate of session arrivals varies in time, it is unrealistic to assume that aggregated traffic may converge to covariance stationary process except for some segments during which the number of active sessions remains constant. For other choices of and different number of generated sessions, traces look qualitatively similar meaning that their statistical parameters do vary in time. Note that statistical characteristics of traffic aggregates may also vary due to other phenomena including long-range dependent, self-similar or nonstationary properties of individual streams. In this paper, the actual cause of variability is of no interest as long as change detection mechanism is able to detect points at which changes occur and subsequently estimate current traffic statistics.

An important property of aggregated traffic is that changes in the mean value lead to changes in the variance and vice versa. Figure 3 highlights this behavior. As one can observe, this dependency is almost linear allowing to track changes in one parameter only, either mean or variance.

3.2. Traffic Aggregate: Fixed Number of Sources

Consider statistical characteristics of the aggregated traffic when the number of multiplexed sources is kept constant. To obtain aggregated traffic patterns, we arbitrarily choose 9 traces out of the archive and multiplex them synchronizing starting times of all traces. We limited all traces to 1200 seconds.

Time-series of six aggregated traces are shown in Figure 4. Observing these traces, one may suggest that statistics of the aggregated traffic may not significantly vary in time. Indeed, when the number of video traffic sources is fixed and relatively large, it is straightforward to assume that the aggregated traffic may converge to covariance stationary process with normal distribution [5].

To support our hypothesis about covariance stationarity, we carried out a number of tests. First, we divided each trace into 6 nonoverlapping segments each of which contains exactly 200 observations. Mean values of these segments are shown in Figure 5. One may see that the mean value of single traces does not change in time significantly. For all traces, the ranges of mean values are less than 20% of global means of respective traces. These variations can be attributed to positive autocorrelation found in these traces. Finally, we tested our traces for homogeneity. To do so, each trace was divided into two samples having the same number of observations. In our case, this corresponds to 600 observations. Then, we applied test for homogeneity of two samples. This test allows to check hypothesis that two parts have the same distribution against alternative hypothesis, , that they have different distributions. Performed tests revealed that with level of significance two parts of the same trace have the same distribution.

Histograms of relative frequencies of considered traces and their approximations by normal distributions are shown in Figure 6, where , is the relative frequency corresponding to s bin, is the width of the histogram bin. These approximations suggest that aggregated traffic from fixed number of sources is normally distributed. The goodness-of-fit test performed with the level of significance confirmed this hypothesis.

Empirical normalized autocorrelation functions (NACFs) for all traces are shown in Figure 7. One may note that the memory of the process is short and limited to few lags. We approximate this behavior using a single geometrical term, for example, , . This approximation exactly captures lag-1 values of NACFs of empirical processes and does not significantly overestimate or underestimate autocorrelation coefficients for larger lags.

Presented results allow us to assume that when the number of multiplexed sources remains constant the aggregated video traffic composes realization of covariance stationary stochastic process. We note that our tests do not allow us to be statistically strict regarding covariance stationarity. Unfortunately, there are no universal and effective methods to statistically test whether given observations are stationary or not. However, Carried out tests confirm that the marginal distribution is normal and depends on the number of multiplexed traces only NACF tends to zero as time progresses confirming that the process is ergodic. All these conditions are necessary for an underlying process to be covariance stationary. In Section 6, we carry out another test that also confirms our hypothesis.

3.3. Model for Aggregated Traffic

To model aggregated traffic from fixed number of sources, we use covariance stationary autoregressive process of order one, AR(1). This process has normal marginal distribution and geometrically decaying ACF allowing us to suggest that it may produce fair approximation of empirical data [17]. These two properties of traffic patterns were found to produce the major impact on their service performance in a network (see [18, 19] among others).

A process is said to be autoregressive of order one if it is given bywhere and are some constants, are independently and identically distributed random variables having the same normal distribution with zero mean and variance .

For (1) to be weakly stationary, it is sufficient to have . In this case,where , , are some constants.

Mean, variance and covariance of AR(1) are related to , , and as

Parameters of AR(1) models are related to statistical data aswhere , , and are estimates of lag-1 autocorrelation coefficient, mean, and variance, respectively.

4. Change-Point Statistical Tests

4.1. Basic Principles

To differentiate between fluctuations of the aggregated traffic around a constant mean and those variations caused by changes in the mean value, we propose to use change-point statistical tests. There are a number of change-point detection algorithms developed to date. The common approach to deal with this task is to use control charts including Shewhart, cumulative sum (CUSUM), or exponentially-weighted moving average (EWMA) charts. The idea of control charts is to classify all causes of deviation from a target value into two groups. These are common causes and special causes. Deviation due to common causes is the effect of inherent causes affecting a given process. Special causes are not the part of the process, occur accidentally, and affect the process significantly. Control charts signal the point at which special causes occur using two control limits. If values of a certain function of initial observations are between them, process is in control. If some value falls outside, the process is out of control.

For detecting changes in aggregated traffic, we assume the following. We consider that common causes of deviation are those resulting in inherent stochastic nature of the multiplexed traffic from fixed number of video sources. Special causes are all those causing changes in parameters of in control processes. For example, among those are arrivals of new sessions adding new traffic and completions of ongoing ones subtracting some traffic. The control procedure is as follows. Initially, a control chart is parameterized using estimates of parameters of the aggregated traffic process. When change in a parameter occurs, a new process is considered as in control and the control chart is reparameterized according to statistics of this process.

Change-point tests often require observations to be independent. We have seen that aggregated video traffic is characterized by positive correlation. Autocorrelation makes control charts less sensitive to changes in the mean value. For detecting changes in the mean value of autocorrelated processes, two approaches have been proposed. According to the first approach, control limits of charts are modified to take into account autocorrelation properties. The idea of the second approach is to fit observations using a certain time-series model and subsequently test residuals. If the model fits empirical data well, residuals are uncorrelated and control charts for independent observations can be used. Performance of change-point statistical tests for autocorrelated data has been compared in [20, 21]. It was shown that modified control charts on initial observations perform better when autocorrelation is positive. Due to this reason, we use the first approach.

4.2. EWMA Control Charts

Let be a sequence of initial observations. The value of EWMA statistic at the time , denoted by , is given bywhere parameter is constant.

The reason to use EWMA statistical test is that it takes central part among other control charts. Although, according to (5), the most recent value always receives more weight in computation of , the choice of determines the effect of previous observations of the process on the current value of EWMA statistics. Indeed, when all weight is placed on the current observation, , and EWMA statistics degenerate to initial observations. As a result, EWMA control chart behaves like Shewhart one [22]. On the other hand, when the current observation gets only a little weight, but most weight is assigned to previous observations. In this case, EWMA control chart behaves similar to CUSUM one [23]. Summarizing, EWMA charts provide more flexibility at the expense of additional complexity in determining one more parameter . Due to inherent flexibility, we use EWMA control charts.

Assume that initial observations are taken from covariance stationary process with mean and variance and can be well represented by AR(1) process. If , it is easy to see that when . The approximation for variance of is given by [20]where is the parameter of AR(1) process.

The control limits are given bywhere is a design parameter whose values are tabulated.

4.3. Parametrization of EWMA Charts

To parameterize the EWMA control chart, a number of parameters have to be provided. Firstly, parameter determining the decline of weights of past observations should be set. The values of and determine the wideness of control belts for a given process with certain and . These four parameters affect behavior of the so-called average run length (ARL) value that is used to determine efficiency of a certain change detection procedure. ARL is defined as the average number of observation up to the first out of control signal. There are a number of methods to compute ARL value for given and . Tabulated values of in control ARL can be found in [20, 21]. Finally, and are not usually known in practice and must be estimated from empirical data. Therefore, estimates of and should be used in (7). As a result, for real-time implementation of EWMA test, there should always be a certain warmup period involving estimation of the mean. Finally, the first value of EWMA statistics is usually set to the global mean of observations or, if unknown, to the estimate of mean.

5. Resource Allocation System

5.1. Functionality of the System

The proposed resource description and allocation system should be implemented in ingress MPLS routers. We assume that ingress routers conform to MPLS specifications. The only important difference compared to MPLS functionality is the resource controller. The purpose of this controller is to monitor arriving traffic of different LSPs for possible changes in its statistical characteristics and manage the resource allocation for a given behavior aggregate. Another responsibility of the controller is to estimate the amount of resources required to serve incoming traffic with given performance metrics. It is important that this controller does not change the traffic pattern allowing it to proceed unaltered to the output port. We also note that no changes to interior nodes are required and they must strictly follow MPLS specifications.

The structure ingress node is shown in Figure 8. It is intended to operate as follows. Traffic policing unit is statically parameterized such that the highest possible load a content provider intends to pay for is allowed to enter the network. Those packets that conform to this specification proceed to the resource controller. This element monitors conforming traffic for possible changes in its statistical characteristics and estimates the amount of resources required to serve it with given performance metrics. Performance metrics must be configured manually. When change in traffic statistics occurs, resource controller estimates new resource allocation and advertises it to the resource allocation mechanism at the output port and to the MPLS TED associated with ingress node. RSVP-TE is then used to update resource allocation at all nodes along the path of LSP.

The structure of the resource controller is shown in Figure 9. Two major components of this system are the real-time traffic estimation module (TEM) and the performance evaluation and optimization module (PEOM). The system operates as follows. The TEM is responsible for detecting changes in arriving traffic statistics and dynamic estimation of the traffic state in terms of AR(1) model. To achieve that, traffic statistics must be observed in real-time and fed to the input of the change-point analyzer. Change-point analyzer tests incoming observations for changes in the mean value using EWMA statistical test. It is important to note that the change-point analyzer must signal the point when a change in arriving traffic statistics is detected. Otherwise, no actions must be taken by the TEM except for traffic monitoring. If a change is detected, traffic statistics are computed during the warmup period. Then a new model of the traffic is parameterized in the modeling block and fed to the input of PEOM.

According to the design of PEOM, traffic model is fed to the input of the decision module. Taking the required performance at the IP layer as another input, this module decides whether the current performance is satisfactory. If the performance is satisfactory, the system must further decides whether the amount of resources for a current state of the traffic is optimal. In order to take these two decisions, the module containing the performance evaluation framework is activated and supplied with the current traffic model. This module may implement the performance evaluation framework or just contain a set of precomputed performance curves corresponding to a wide range of traffic statistics. Due to the real-time nature of the performance control system, the latter approach is preferable.

Theoretically, when the input traffic statistics change, performance provided to aggregated traffic or optimal resource allocation providing a certain performance level or both change. However, in practical applications we always have a finite granularity of performance parameters and, therefore, we have to check whether a new traffic model does not satisfy performance requirements of aggregated traffic or results in not optimal resource allocations. If the performance is predicted to be unsatisfactory, the current traffic model is used to decide which resource allocation parameters provide both the required performance level and optimal usage of resources. Finally, if the resource allocation is not optimal but the performance is satisfactory, new optimal resource allocation must be computed. Obtained resource allocation parameters are then used till the next change in traffic statistics.

5.2. Estimating Resource Allocation

As a descriptor of the amount of resources required by an LSP, token bucket is used. The actual amount of resources in terms of the buffer space and outgoing link rate share is inferred from token bucket parameters at each node. According to the token bucket mechanism, the amount of traffic allowed in the time interval is upper bounded by , , where token rate is related to the outgoing link rate share, and bucket size is related to the buffer space. To parameterize a token bucket, we have to find a Fair satisfying performance criteria. Usually, there are infinite number of pairs satisfying the required loss performance. However, there is always upper bound on that also satisfies delay requirements. There are a number of approaches to estimate token bucket parameters using statistics of arrival process. Approaches range from approximations providing token bucket parameters for worst case traffic scenarios to exact ones involving solution of equivalent queueing systems [24, 25]. We are interested in simple approach providing a feasible option for online implementation.

To estimate parameters of the token bucket, we use overflow theory. Let be the cumulative arrival process and let be the amount of work arriving in the interval . The probability that the queue length exceeds is thenwhere is the link rate share assigned to a traffic aggregate.

Using lower bound approximation, we get

Denoting , we get

Note that the bucket size, , should be set such that the delay is equal to or less than the maximum allowable delay in a network element. Thus the task in (10) reduces to finding a suitable link rate share that should be assigned to a traffic aggregate. When is normal, the following approximation can be used [26]:where is the buffer overflow probability.

5.3. The Immediate Update Mechanism

According to our algorithm, new resource reservation should only be advertised to network elements when the resource controller already estimated a new allocation. This information is available at the end of the warmup period. For some traffic patterns (e.g., VoIP aggregates), this period can be long enough to receive inadequate treatment in a network. To ensure that the proposed algorithm continuously provides the desired level of performance, we propose to use following adaptive mechanism. When increase in the mean value is detected, a new resource allocation is immediately estimated and advertised to the output buffer and network elements along the path of the LSP. Since at the beginning of the warmup period we do not know traffic statistics yet, the choice of is somewhat arbitrary. One possible way is to estimate new resource allocation using mean and variance, , where , , and are the mean, control limit, and variance of previous in control segment, respectively. When the warmup period expires and statistics of new in control process are available, new resource allocation is computed and advertised.

The same algorithm can be used when the mean value decreases. Another approach is to advertise new resource allocation at the end of the warmup period only. Indeed, since the mean value decreases we still ensure that appropriate performance is continuously provided. Note that the immediate update mechanism results in additional signaling load consisting of RSVP-TE reservation updates and subsequent IGP-TE resource allocation advertisements [27].

6. Numerical Examples

6.1. Aggregated Traffic: Fixed Number of Sources

In Section 2, we assumed that aggregated traffic from fixed number of video traffic sources composes realization of the covariance stationary process. Let us now provide one more reason for this conclusion applying EWMA change-point test to first two traces demonstrated in Figure 4. We already found that this traffic is normally distributed and there is significant lag-1 autocorrelation. Due to these properties, control limits are computed according to (7). The warmup period used to compute control limits was set to 50 observations. EWMA statistics for different values of and are shown in Figure 10. Note that , correspond to in control ARL for the first trace and in control ARL for the second trace. Parameters and correspond to in control ARL for the first trace and in control ARL for the second one. One can observe that no change in the mean value of traffic observations is detected even though the ARL values for and are relatively small. These results suggest that the mean value of the aggregated traffic from fixed number of sources does not vary in time as required by covariance stationarity. Similar results have been obtained for other traces.

Note that for both traces and all values of , the parameter was set to 3. In general, to match a given ARL value, different values of require different values of . It was also found that usage of tabulated values may lead to many out of control signals even when there are no new session arrivals or departures. One of the reasons is that the monitored process may not be exactly covariance stationary and may occasionally contain some extreme observations (outliers). These observations may be of local significance only and may not affect the future evolution of the monitored process as well as the service process of arriving traffic. Since the EWMA change detection is assumed to operate in real time, each out of control signal starts a new warmup period. During this period, new control limits are estimated and the process cannot be monitored. From this point of view, we should detect only those changes that occur for sure.

Results presented in this subsection provide additional necessary condition for covariance stationary of the multiplexed traffic from fixed number of video sources. Our observations stay in control in EWMA chart constructed for AR(1) process suggesting that they can be realization of this process. Thus EWMA chart can be seen as a tool for testing stationarity of observations.

6.2. Aggregated Traffic: Varying Number of Sources

Consider now aggregated traffic from varying number of sources shown in Figure 2. EWMA statistics computed for these traces are shown in Figures 11(a) and 11(b) (, ). Figures 11(c) and 11(d) demonstrate the same statistics for first several session requests, where solid horizontal lines represent control limits and boxes represent session arrivals. The first box indicates the time instant when second session request arrives to the system.

Consider EWMA statistics in detail. We started the control chart when the first session arrives. It occurred at 1681 second for trace 1 and at 334 second for trace 2. The warmup period used to compute statistics of observations and control limits of charts was set to 50 observations. One may observe from Figures 11(c) and 11(d) that both processes stay in control when the second sessions arrive and remain in control until new traffic adds up to EWMA statistics and changes are detected. Note that there are gaps before EWMA statistics exceed the upper control limit for both traces. This is due to the memory of EWMA statistics that allows to avoid false signals but worsens reactive properties of the chart. In general, when the value of gets smaller, this interval becomes larger while the probability of false change detection decreases.

EWMA statistics computed for our traces with and are shown in Figures 12(a) and 12(b). The same statistics for first several session arrivals are shown in Figures 12(c) and 12(d). Note that changes in mean values of the aggregated traffic were successfully detected for these values of and .

6.3. Dynamic Resource Allocation

The proposed dynamic resource allocation system tries to provide bandwidth savings compared to busy hour reservation and conventional MPLS automatic bandwidth adjustment while keeping performance metrics of interest at the desired level. Consider how much gain we get applying the proposed algorithm.

Firstly, we compare performance of the proposed algorithm to that of the static busy hour resource reservation. The performance metric of interest is the amount of bandwidth required to transmit our traffic aggregates with a given overflow probability. Values of bandwidth allocation for our algorithm were computed using (11) such that the overflow probability is . Busy hour bandwidth allocation was chosen as the maximum bandwidth allocation computed by our algorithm. Note that it is different from the actual busy hour and represents the amount of bandwidth required to serve the traffic during the highest in control period of EWMA control chart. Parameter of control charts was chosen such that ARL was always kept at 500. We also used the immediate update mechanism.

Figure 13 demonstrates bandwidth allocation according to considered algorithms. Estimated values of bandwidth allocation are shown in Table 1, where average values are shown. Note that our algorithm always provides significant performance gain compared to busy hour resource reservation. For trace 1 our algorithm allows to save 71% and 70% of bandwidth when EWMA change-point detection with and is used, respectively. For trace 2, these numbers are 70% and 69%, respectively.

Next we compare performance of the proposed algorithm to that of MPLS automatic bandwidth adjustment. Performance metrics of interest are the amount of LSP reestablishments and the amount of required bandwidth according to these two approaches. Values of bandwidth allocation were computed using (11) such that the overflow probability is . To highlight influence of and on the performance of the proposed algorithm, parameter was set such that ARL values for are always three times higher than those with . For , was always chosen such that ARL is 300.

Figure 14 illustrates bandwidth allocation required by these two approaches. As one can notice, MPLS automatic bandwidth allocation always leads to higher required bandwidth for a given traffic aggregate. This is due to specific behavior of arrival statistics during sampling intervals over which the bandwidth allocation is estimated. This includes occasional outliers, drastic and gradual changes in arrival patterns. All these features lead to biased estimates of mean and variance that are further used to compute bandwidth allocation. Note that they also affect performance of the proposed algorithm.

It should also be stressed that the performance of MPLS bandwidth adjustment algorithm is highly sensitive to the choice of the sampling interval. When the sampling interval is too large, changes in arrival statistics are more likely to occur within a single interval. Large sampling intervals also lead to delays in LSP reestablishments with estimated bandwidth allocation. Due to presence of changes in arrival statistics, estimated bandwidth allocation may not be optimal at the end of intervals. On the other hand, when the sampling interval is too short, too many LSP reestablishments may burden the network with excessive signaling load.

Table 2 compares the amount of signaling load and average bandwidth required to transfer considered traffic aggregates for different lengths of the sampling interval and different values of . The average amount of bandwidth required to serve traffic aggregates according to our algorithm is considered as %. Parameters and denote the percentage of average bandwidth required by MPLS automatic bandwidth adjustment compared to our algorithm. First of all, observing Tables 1 and 2 we note that MPLS automatic bandwidth adjustment allows to significantly decrease the amount of bandwidth required to serve considered traffic aggregates compared to static busy hour bandwidth allocation. Although the performance gain is significant, it has observable limit. Indeed, when we decreased the length of the sampling interval 10 times (from 500 to 50 seconds) the average bandwidth required to serve traffic aggregates decreased by only 0.04% for trace 1 and 0.05% for trace 2. This is very small performance gain, especially, considering that we are close to theoretical and practical limits. Indeed, further decrease in the length of the sampling interval will lead to biased estimators of mean and variance and may overload the network with signaling load. On the other hand, as we observe, the proposed algorithm performs significantly better. For both values of it requires significantly less signaling load while still provides the same performance level in terms of losses. Noticeably that for our approach requires even less signaling than MPLS bandwidth adjustment with sampling interval set to 500 seconds.

Finally, influence of parameters and on the performance of the proposed algorithm is also seen from Table 2. Indeed, three times lower ARL that was used with led to almost three times higher number of change detections. Although in our particular case it did not lead to extremely biased bandwidth allocation, it is still noticeable. This effect is due to more frequent change detections in arriving traffic patterns leading to periods of uncertainty during which statistical parameters are estimated. During these periods we had to use the immediate update mechanism that may overestimate or underestimate the amount of required bandwidth.

It should be noted that, theoretically, the proposed algorithm cannot provide deterministic guarantees even if we use the immediate update mechanism and the target overflow probability is set to 0. The main reason is uncertainty introduced by warmup periods during which statistical characteristics of traffic aggregates are estimated. However, in all our experiments the cumulative length of warmup periods was significantly shorter compared to the overall length of in control periods.

7. Conclusions

We proposed the dynamic resource description and reservation algorithm for QoS-aware networks. The core of the algorithm is statistical change detection procedure. We demonstrated that if the local mean of the traffic pattern changes in time, the required amount of resources needed to serve this traffic with given performance metrics can be appropriately adapted using already available features of MPLS framework. The only required change is EWMA change-detection algorithm that should be implemented in ingress MPLS routers. The proposed algorithm is well suited for non stationary type of the traffic whose statistical parameters change in time. Numerical examples demonstrated that when traffic patterns exhibit nonstationary behavior the proposed approach allows to save significant amount of resources compared to busy hour resource allocation and conventional MPLS automatic bandwidth adjustment.

We also note that content distribution services considered in this paper are just example of applications that may have time-varying traffic characteristics. In principle, the proposed approach can be used for any type of the aggregated traffic that experiences high variability due to time-varying statistical characteristics.