Abstract
Scaling phenomena of the Internet traffic gain people's interests, ranging from computer scientists to statisticians. There are two types of scales. One is small-time scaling and the other large-time one. Tools to separately describe them are desired in computer communications, such as performance analysis of network systems. Conventional tools, such as the standard fractional Brownian motion (fBm), or its increment process, or the standard multifractional fBm (mBm) indexed by the local HΓΆlder function may not be enough for this purpose. In this paper, we propose to describe the local scaling of traffic by using on a point-by-point basis and to measure the large-time scaling of traffic by using E on an interval-by-interval basis, where E implies the expectation operator. Since E is a constant within an observation interval while is random in general, they are uncorrelated with each other. Thus, our proposed method can be used to separately characterize the small-time scaling phenomenon and the large one of traffic, providing a new tool to investigate the scaling phenomena of traffic.
1. Introduction
Consider an application that sends a series of packets from the source to the destination through the Internet. Suppose a traffic series passes through servers from the first server with the service curve to the th server with the service curve to reach the destination. Then, the communication from the first server to the th one can be expressed by Figure 1 (Li and Zhao [1], Li [2]), where is the arrival traffic accumulated within the time interval and is the departure traffic within .
Let be instantaneous arrival traffic, implying the bytes of a packet at time from connection at the input port of the server with the service curve . Then, the accumulated function regarding in the time interval is given by We now consider the aggregated traffic . By aggregated traffic, we mean the following: where is the positive number representing all connections at the input port of the server . In this research, traffic time series is in the sense of (1.2). The accumulated traffic within the interval is given by
In the field of traffic modeling, there are two categories of traffic models. One is deterministic modeling, more precisely, bounded modeling, and the other is stochastic modeling, see Li and Borgnat [3], Michiel and Laevens [4]. Scaling plays a role in all types of traffic models, see, for example, Willinger et al. [5], Feldmann et al. [6], Jiang [7], and Papagiannaki et al. [8]. There are two types of scaling phenomena in traffic. One is the small-time scaling and the other is large-time one, see, for example, Paxson and Floyd [9]. This paper aims at investigating two types of scaling phenomena of traffic for either the bounded modeling, say , and the stochastic modeling of .
Note that a commonly used model of in the wide sense stationarity is the self-similar process, that is, fractional Gaussian noise (fGn), see, for example, Stalling [10], McDysan [11], Pitts and Schormans [12], Leland et al. [13], Beran et al. [14], Tsybakov and Georganas [15], Willinger and Paxson [16], and Adas [17]. However, there is a limitation in fGn for the analysis of two scaling phenomena, namely, small scaling and large one, since it is indexed by a single parameter called the Hurst parameter , see Paxson and Floyd [9], Tsybakov and Georganas [15], Ayache et al. [18], Li and Lim [19, 20], and Li [21β24]. Therefore, two-parameter models of traffic are needed.
In this paper, we address two types of traffic models. One is the multifractional Brownian motion (mBm), see Li et al. [25]. The other is the 2-parameter bounded model introduced by Cruz, see [26, 27], Li and Zhao [28], Raha et al. [29], Jiang and Liu [30], and Boudec and Thiran [31]. The contributions of this paper are in two aspects. (i)We claim the small-time scaling phenomenon is independent of the large-time one and vice versa based on the model of Cruz.(ii)We propose the point of view to use mBm to analyze the scaling phenomena of traffic in this way. Describing the small-time scaling phenomenon by using on a point-by-point basis and to characterize the large-time scaling one by using on an interval-by-interval basis.
The rest of the paper is organized as follows. We will give the preliminaries regarding conventional time series in Section 2, aiming at pointing out why scaling is a topic in traffic of the fractal type. We will describe the reason why the small-scaling phenomenon of traffic is independent of its large-time one in Section 3. In Section 4, we will introduce a two-parametric model of mBm towards the scaling analysis of traffic based on the local HΓΆlder function . Finally, Section 5 concludes the paper.
2. Preliminaries
Traffic time series on old telephony networks is in the class of the Poisson processes, such as the Poisson one and its compound ones, see Erlang [32] and Brockmeyer et al. [33]. It has been successfully used in the design of infrastructure of old telephony networks for years, see, for example, Bojkovic et al. [34], Le Gall [35], Lin et al. [36], Manfield and Downs [37], and Reiser [38]. It is such a success on old telephony networks that it has almost been taken as an axiom for modelling traffic in communication systems, see Gibson [39], Cooper [40], and Akimaru and Kawashima [41]. Due to unsatisfactory performances of the Internet, such as traffic congestions, people began doubting about the models of the Poisson type. Accordingly, they began measuring and analyzing the traffic at different sites in the Internet during different periods of times for the purpose of reevaluating general patterns of traffic, see [9, 13, 14], Paxson [42, 43], and Traffic Archive at http://www.sigcomm.org/ITA/. Experimental processing real-traffic traces exhibited that traffic is in the class of fractal time series.
The early fractal model used for traffic modelling is the self-similar process with long-range dependence (LRD), that is, fGn with LRD. For this reason, we will address the preliminaries in this section in the aspects of conventional time series, stationary self-similar process, that is, fGn, and LRD processes.
2.1. Conventional Time Series
Let be a 2-order stationary random process, where is the th sample function of the process, where is the set of real numbers. We use to represent the process without confusion causing. Its mean in the wide sense can be expressed by Its autocorrelation function (ACF) can be written by In (2.1) and (2.2), the superscript implies that the mean and the ACF are computed by using spatial average. The mean and the ACF of a process expressed by time average are written by where the superscript indicates that the mean and the ACF are computed by time average.
The process is said to be ergodic if (2.5) holds, In what follows, we simply use to represent a random function in general.
Denote by the probability density function (PDF) of . Then, the probability is given by The mean and the ACF of based on PDF are given by (2.7) and (2.8), respectively, Let be the variance of . Then,
If , then it has the following properties.
Note 1. The PDF is light tailed. By light tailed, we mean that the integrals in (2.7) and (2.8) are convergent in the domain of ordinary functions.
Note 2. There exist and for if the PDF of is light tailed.
The Poisson distribution is an instance of light-tailed distribution, which expresses the probability of a number of events occurring in a fixed period of time if these events occur with a known average rate and independently of the time since the last event. In communication networks, one is interested in the work focused on certain random variables that count, among other things, a number of discrete occurrences (sometimes called βarrivalsβ) that take place during a time interval of a given length. Denote the expected number of occurrences in this interval by a positive real number . Then, the probability that there are exactly occurrences is given by the Poisson distribution below
Note 3. The ACF of with a light-tailed PDF decays fast. By βdecays fast,β we mean that is integrable in the continuous case and summable in the discrete case in the domain of ordinary functions.
Denote by the power spectrum density (PSD) of . Then,
Thus, we have Note 4 below, which is a consequence of Note 3.
Note 4. exists in the domain of ordinary functions.
The results in Notes 1β4 are usually assumptions for conventional time series as can be seen from Fuller [44], Box et al. [45], Mitra and Kaiser [46], and Bendat and Piersol [47]. We will explain below that all in Notes 1β4 may be no longer valid for LRD traffic.
2.2. Scaling Measures for Conventional Gaussian Time Series
A Gaussian process is completely determined by its second-order properties, more precisely, its mean and ACF, see Papoulis [48] and Doob [49]. Note that the mean of is a measure of the global property of . On the other side, the variance of measures its local property. These two points can be easily inferred from (2.3) and (2.9). For a Gaussian process with mean zero, one has Therefore, mean and variance or ACF are two essential numeric characteristics of a Gaussian process. In fact, if is Gaussian, then However, or mean of traffic time series may not exist in general due to LRD, see Li [22], which is a particular point of a time series with LRD (Beran [50, 51]). A simple explanation about this is in (2.13). In the case of , in (2.13) is indeterminate. Therefore, variance and mean are no longer suitable for measuring the local property and the global one of LRD traffic.
2.3. Correlation Time of Conventional Time Series
Correlation time is defined by (Nigam [52, page 74]) It is a measure relating to the scaling of a random function . It implies that the correlation can be neglected if , where is the time scale of interest [52]. As traffic is LRD, both the numerator and denominator on the right side of (2.14) do not exist. Therefore, correlation time that is a useful measure in conventional time series is inappropriate to be used in LRD traffic.
2.4. Brief of LRD Time Series
One says that is asymptotically equivalent to under the limit if and are such that (Murray [53]), that is, where can be infinity. It has the property expressed by In this sense, is called slowly varying function if for all .
A random function is said to be LRD if its ACF is nonintegrable, while it is called short-range dependent (SRD) if is integrable. This implies that is LRD if where can be either a constant or a slowly varying function. It is SRD if
Theoretically, any series whose ACF is nonintegrable are LRD. In the field of telecommunications, however, the term of LRD traffic usually corresponds to a hyperbolically decayed ACF. Its asymptotic expression for is often indexed by the Hurst parameter . That is,
Note 5. The tail of the PDF of LRD traffic is heavy according to Taqquβs theorem, see Abry et al. [54].
According to the Fourier transform in the domain of generalized functions (Kanwal [55], Gelfand and Vilenkin [56]), one immediately obtains the Fourier transform of the right side of (2.17) given by where stands for the operator of the Fourier transform. Therefore, for LRD traffic, we have
Note 6. LRD traffic is in the class of noise (Li [57]).
In summary, from the point of view of the assumption of Gaussian distribution, we say that the tail of the PDF of LRD traffic may be so heavy that its and do not exist. Owing to this meaning of the heavy tails, the ACF of traffic decays so slow in a hyperbolical manner such that it is nonintegrable. Consequently, a random variable that represents a traffic time series can be no longer considered to be independent, hence, LRD or long memory. On the other hand, the PSD of LRD traffic obeys a power law, see (2.21), hence, noise.
2.5. Brief of Self-Similar Time Series
A random function is said to be self-similar if it satisfies the definition of self-similarity given by where denotes equality in the sense of probability distribution.
Note 7. The concept of LRD differs from that of self-similarity (Li [23]).
Note 8. The self-similarity described by (2.22) is in the global sense.
The commonly used self-similar model of traffic is fGn in the stationary case and fBm in the nonstationary case. We will brief them in the next subsection.
2.6. fGn and fBm for Traffic with LRD
fGn is an only stationary increment process with self-similarity (Samorodnitsky and Taqqu [58]). We discuss it in this subsection towards exhibiting the limitation of fGn in describing two types of scaling phenomena of traffic.
Let be Brownian motion (Bm). Let be the fBm of the Weyl integral type with the Hurst parameter . Let be the Gamma function. Then,
The function has the following properties.(i).(ii)The increments are Gaussian.(iii), where .(iv). (v).
Thus, the ACF of , denoted by , is given by where
Denote by the PSD of . Then (Flandrin [59]) From the above, we see that either the ACF or the PDF of is time varying. Therefore, is nonstationary.
Note that is self-similar because it satisfies the definition of self-similarity. In fact, where denotes equality in the sense of probability distribution.
From (2.26), one sees that the PSD of is divergent at , exhibiting a case of noise, see Csabai [60] for the early work of noise in traffic theory. The relationship between the fractal dimension of fBm, denoted by , and its Hurst parameter, denoted by , is given by
Note that the increment series, , is fGn. Thus, the ACF of the discrete fGn (dfGn) is given by Since the ACF is an even function, we have where . Denote by the ACF of fGn in the continuous case. Then, where is used by smoothing fBm so that the smoothed fBm is differentiable.
The PSD of dfGn was derived out quite early by SinaΔ [61]. It is given by where and . The PSD of fGn is (see Li and Lim [62]) which exhibits that fGn belongs to the class of noises.
Note that can be approximated by , in fact, that is, the finite second-order difference of . Approximating it with the second-order differential of yields
From the above, one immediately sees that fGn contains three subclasses of time series. In the case of , the ACF is nonsummable and the corresponding series is of LRD. For , the ACF is summable and fGn in this case is of SRD. FGn reduces to white noise when .
Among LRD processes, fGn has its advantage in traffic modeling. For example, it can be used to easily represent two types of traffic series, namely, self-similar process and processes with LRD. Note that LRD is a global property of traffic. However, in principle, self-similarity is a local property of traffic, which is measured by fractal dimension .
Denote and the fractal dimension and the Hurst parameter of fGn, respectively. Then, one has (Li [23]) Therefore, one gets (Li et al. [63]) Hence, for fGn type traffic, the local properties of traffic happen to be reflected in the global ones as noticed in mathematics by Mandelbrot [64].
The above discussions exhibit that the standard fGn as well as fBm has its limitation in traffic modeling because it uses a single parameter to characterize two different phenomena, that is, small-time scaling and large-time one. The former is a local property and the latter is a global one.
3. Large-Time Scaling of Traffic Is Independent of Its Small-Time One
Traffic is greater than zero, that is, The above holds because is arrival traffic. In addition, where and are constants restricted by the IEEE standard without technical reasons except the need to limit delays. For instance, the Ethernet protocol forces all packets of to have bytes and bytes without considering the Ethernet preamble and header (Stalling [10]).
Due to the functionality of TCP, traffic appears βburstinessβ (see Tobagi et al. [65]) or intermittency and non-Poisson (Jain and Routhier [66], Jiang and Dovrolis [67], and Papagiannaki [8]). The burstiness has considerable effects on system performances, see, for example, Nain [68], Draief and Mairesse [69], NΓ©meth et al. [70], Li and Zhao [71], Jiang et al. [72], Wang et al. [73], and Starobinski and Sidi [74].
The following measure introduced by Cruz [26, 27] characterizes the bound of the burstiness of traffic The integral expressed in (3.3) does not make sense if for the continuous even in the field of Lebesgueβs integrals, see Bartle and Sherbert [75] and Trench [76]. However, it makes sense when it is considered in the domain of generalized functions. A simple way to explain (3.3) is where and is the Dirac- function. Equation (3.3) represents the burstiness bound of , which is a local behavior of traffic.
Note that is dependent. Therefore, we may rewrite (3.3) by the following expression: The above exhibits that traffic has highly local irregularity or high burstiness as observed by Feldmann et al. [6], Papagiannaki et al. [8], Paxson and Floyd [9], Jiang and Dovrolis [67], Willinger et al. [77], and Estan and Varghese [78]. Such a local irregularity considerably affects the polices or performances of telecommunication systems, such as queuing (see, e.g., Nain [68] and Draief and Mairesse [69]), end-to-end delay, see, for example, NΓ©meth et al. [70], Li and Zhao [71], Jiang et al. [72], Wang et al. [73], and Starobinski and Sidi [74], resource allocation (see, e.g., Gravey et al. [79]), anomaly detection (Tian and Li [80]), and admission control (Knightly and Shroff [81], Raha et al. [82], and Jia et al. [83]), just naming a few.
Another measure introduced by Cruz [26, 27] describes the bound of the average rate of traffic. It is given by Note that the bound of the average rate expressed above describes a global property of traffic. It implies that the bound of the average rate of traffic is robust as is a constant. This is in agreement with the experimental observations stated by Feldmann et al. [6], Willinger et al. [22], and Paxson and Floyd [9].
The above exhibits, taking into account (3.5) and (3.6) together, that the accumulated traffic within is bounded by Equation (3.7) implies that traffic has scaling phenomena in two folds. One is small-time scaling and the other large one.
Note 1. Parameter is independent of .
Note 2. From Note 1, we see that the small-time scaling of traffic is independent of the large-time one.
We now further explain the point in Note 2 from the point of view of fractal time series. Denote the autocorrelation function (ACF) of traffic by where is the lag. Then, for small lags, more precisely, for , if is sufficiently smooth on , is given by where is a constant and is the fractal index of . The fractal dimension of , denoted by , is given by see Adler [84], Hall and Roy [85], Chan et al. [86], Kent and Wood [87], Gneiting and Schlather [88], Lim and Li [89], and Li et al. [90]. The parameter is used to describe the local irregularity of traffic. It is in terms of small-time scaling of traffic, see Li [21β24] and Li and Lim [19, 20]. From (2.19), we have The parameter is utilized to characterize the global property, more precisely, LRD, of traffic from a view of fractals.
Note 3. Generally, is independent of .
Note 4. Owing to Note 3, we infer that the small-time scaling of traffic is independent of the large-time one in general.
The above discussions exhibit that it may be more flexible to characterize two types of scaling phenomena of traffic by using two independent parameters. One is for large-time scaling and the other for small-time scaling.
4. Applying mBm to the Scaling Analysis of Traffic
From the previous discussions, we suggest that it is natural to use two independent measures to describe two types of scaling phenomena that are independent of each other. Conventionally, fBm as well as its increment process, that is, fGn, is indexed by a single parameter , alternatively by . Thus, there is a limitation for them to independently characterize the scaling phenomena of two. This limitation was empirically noticed by Paxson and Floyd [9]. Lately, it was noticed by Ayache et al. [18] from the point of view of the multifractional Brownian motion (mBm).
In this research, we are interested in the work in mBm by Peltier and Levy-Vehel [91, 92] as well as Benassi et al. [93] to generalize the standard fBm by replacing the constant with the HΓΆlder function . Li et al. [94] applied to describe the multifractality of traffic. Although [91β93, 95] explained the local self-similarity characterized by using and Ayache et al. [18] discussed their method to measure the LRD of a random function, those works may not be enough for traffic because the small-time scaling is independent of the large-time one as we explained previously. As a matter of fact, it is quite awkward to use to describe two scaling phenomena of traffic because is linearly correlated with the fractal dimension with the expression [91, 92]. To overcome the difficulty to capture the large scaling phenomena of traffic in the global sense, we introduce the measure expressed by . Based on this, we propose our opinion like this; using to represent the small scaling of traffic on a point-by-point basis and to characterize the large scaling of traffic in the global sense, respectively. The key point of our opinion is that and are independent of each other.
In the rest of this section, we will brief the mBm in Section 4.1. Then, in Section 4.2, we will demonstrate the applications of and to real-traffic traces.
4.1. mBm of Type
Note that the above (2.27) implies that the local irregularity of a random function is globally the same. That, nevertheless, may not meet the real case of traffic. As a matter of fact, if of a traffic function is a constant, of in (3.3) is a constant too. This is a unifractal case, which is obviously in contradiction with real traffic as is time dependent, see (3.5).
One simple way to investigate the multifractality of traffic is to use mBm. Replacing the constant with a time-dependent function , where and is also called the local HΓΆlder exponent, see Peltier and Levy-Vehel [91, 92] and Benassi et al. [93], yields where is the standard Bm. The variance of is given by where Without lose of generality, one may normalize such that by replacing with .
Unless otherwise stated, denotes the normalized process in what follows. The explicit expression of the covariance of can be calculated by where With the assumption that is -HΓΆlder function such that one may have for . Thus, the local covariance function of the normalized mBm has the following limiting form for : The variance of the increment process for becomes which implies that the increment processes of mBm are locally stationary. It follows that the local Hausdorff dimension of the graphs of mBm is given by for each interval .
Regarding the computation of , we need a sequence expressed by the local growth of the increment process, where is the largest integer not exceeding . The local HΓΆlder function at point is given by (see Peltier and Levy-Vehel [91], Muniandy et al. [95], and Li et al. [94]) The local box or Hausdorff dimension denoted by is equal to That is,
4.2. Scaling Analysis of Traffic Using mBm
The function in (4.14) characterizes the local irregularity of traffic on a point-by-point basis or the small-time scaling of traffic.
Note that may be used to describe the LRD of traffic on a point-by-point basis, see Peltier and Levy-Vehel [91]. From a view of applications, it is desired to represent the LRD, which is a global property of traffic at large time scales, on an interval-by-interval basis. As a matter of fact, from a practical view of the Internet traffic, one is interested in the LRD measure, say , to investigate how traffic at time is correlated with that at apart from . Thus, the LRD at time on a point-by-point basis, that is, , may be difficult to be used in practice. In addition to this, since the local irregularity of traffic is independent of its LRD while linearly correlates to (see (4.13)), may be unsatisfactory to characterize the LRD property of traffic. Therefore, we propose the following expression to describe the LRD of traffic: where the subscript implies the mean.
Note 1. should be understood on an interval-by-interval basis.
Note 2. is uncorrelated with . Denote by corr as a correlation operator. Then, considering that is a constant, we have
Note 3. According to (4.13), we have Equation (4.17) exhibits that is completely correlated with .
We show two demonstrations of real-traffic traces named DEC-PKT-1.TCP and DEC-PKT-2.TCP that were recorded at Digital Equipment Corporation (DEC) in March 1995. Figure 2 plots its first 1025 data of traffic DEC-PKT-1.TCP, which is denoted by to imply the size of the th packet . Figure 3 shows its of the first 8193 data points. The value of for DEC-PKT-1.TCP equals to 0.756 in the range of . Figures 4 and 5 are plots for DEC-PKT-2.TCP, where . The plots in Figures 3 and 5 exhibit that traffic has highly local irregularity as discussed by Li and Lim [19] on an interval-by-interval basis.
5. Conclusions
The key idea in this paper is to describe small-time scaling and large-time one of traffic, separately. Following this idea, we have explained the limitation of the standard mBm in this regard because the local irregularity of traffic because of mBm linearly relates to its . To relax this restriction, we suggest to use to describe the local irregularity of traffic on a point-by-point basis for the small scaling phenomenon and propose to use , instead of , to represent the LRD of traffic for the large scaling phenomenon on an interval-by-interval basis, providing a promising candidate to study the scaling phenomena of traffic. The present results, in methodology, may be applied to random data in related issues, for example, those in [96β111], for the scaling analysis.
Acknowledgments
This work was partly supported by the National Natural Science Foundation of China (NSFC) under the Project Grant nos. 60573125, 60873264, 61070214, and 60870002, the 973 plan under the Project no. 2011CB302800/2011CB302802, NCET, and the Science and Technology Department of Zhejiang Province (nos. 2009C21008, 2010R10006, 2010C33095, Y1090592).