Abstract

The diversified service requirements in vehicular networks have stimulated the investigation to develop suitable technologies to satisfy the demands of vehicles. In this context, network slicing has been considered as one of the most promising architectural techniques to cater to the various strict service requirements. However, the unpredictability of the service traffic of each slice caused by the complex communication environments leads to a weak utilization of the allocated slicing resources. Thus, in this paper, we use Long Short-Term Memory- (LSTM-) based resource allocation to reduce the total system delay. Specially, we first formulated the radio resource allocation problem as a convex optimization problem to minimize system delay. Secondly, to further reduce delay, we design a Convolutional LSTM- (ConvLSTM-) based traffic prediction to predict traffic of complex slice services in vehicular networks, which is used in the resource allocation processing. And three types of traffic are considered, that is, SMS, phone, and web traffic. Finally, based on the predicted results, i.e., the traffic of each slice and user load distribution, we exploit the primal-dual interior-point method to explore the optimal slice weight of resources. Numerical results show that the average error rates of predicted SMS, phone, and web traffic are 25.0%, 12.4%, and 12.2%, respectively, and the total delay is significantly reduced, which verifies the accuracy of the traffic prediction and the effectiveness of the proposed strategy.

1. Introduction

Autonomous driving is one of the key scenarios in 5G networks. In order to achieve road safety of intelligent transportation systems (ITS), the ultrareliable and low-latency communications (URLLC) must be guaranteed in vehicular networks. Moreover, for vehicle-to-everything (V2X) communications, the large data transmission of diversified service requirements poses the challenges to improve the transportation efficiency of ITS [1]. Therefore, it is urgent to tailor the vehicular networks to cater to these different requirements that come from different services. Along this line of thought, multiple virtual networks are created via network slicing as a feasible way to meet its diverse needs [2].

The main function of network slicing is creating multiple logical separate networks based on public shared physic infrastructure [3]. Specially, these logical networks are independent with each other. Through network slicing, mobile network operator (MNO) can allocate network resources dynamically and flexibly to each logical network slice on demand [4, 5] so as to support extensive use cases with various performance and service needs. Network slicing can be realized by software-defined network (SDN) and network function virtualization (NFV) technologies [6], and the future network will evolve into the flexible and programmable network architecture gradually [7]. One important issue in network slicing is the scheduling policy which allocates limited resources dynamically to vehicles with various quality of service (QoS) requirements according to the traffic change and network state. Most of the existing researches about network slicing focused on core networks, and the requirements (i.e., QoS) for RAN slicing are usually assumed to be guaranteed perfectly. However, it may be unreasonable when the radio access network (RAN) is not considered due to the resource scheduling processing [8].

Through network slicing operation, multiple network slices will share the resources of the basic network according to their specific needs [9, 10]. The traffic characteristics of each network slice were analyzed and predicted in [11, 12], and the admission control decisions for network slice requests were then studied. In [13], the problem of QoS-aware joint admission control and network slicing was studied, and a heuristic algorithm was proposed to solve this problem. A model for orchestrating network slices based on service requirements and available resources was proposed in [14]. In addition, a Markov decision process framework was proposed to formulate and determine the optimal strategy of resource allocation for the 5G networks. In [15], a novel radio resource slicing framework for 5G networks was proposed; then, radio resources were allocated to different slices based on reinforcement learning. In [16], the slicing resource allocation problem was modeled as an online winner determination problem to maximize the social welfare of auction participants. Similarly, in [17], a novel auction mechanism-based network slicing strategy is presented in which resources and revenue were jointly optimized. In [18], the authors studied a simple dynamic resource sharing policy and indicate that the slices are able to maximize their carried loads subject to performance requirements by admission control to manage users’ performance.

However, due to the characteristics of the vehicular networks, such as stringent delay constraints, and complicated communication environments [19], the general network slice resource allocation strategy may be insufficient in such scenario. Fortunately, under the complex and changeable vehicular network environment, machine learning would be an effective solution to this problem. For the dynamic nature of 5G vehicular scenarios, reference [20] proposed an online learning algorithm, namely fast machine learning, to solve the problem of beam selection to achieve higher context-awareness and adaptability in millimeter-wave vehicular communications. Furthermore, in [21], the authors discussed the prospect of managing vehicular network resources by reinforcement learning, and some open issues are also highlighted. A novel resource allocation scheme based on deep learning was proposed in [22]; the proposed scheme can optimize resource allocations according to the changing demands and network dynamics in SDN-based vehicular networks.

With the accumulation of cellular traffic data and the development of machine learning and artificial intelligence [23], the idea of traffic prediction based on machine learning is becoming more and more popular in the field of communications [24]. In [25], the authors designed a hybrid deep learning model for spatiotemporal prediction, where the spatial dependence was modeled by autoencoder and the temporal dependence was captured by LSTM [26]. In [27], a deep transfer learning-based prediction architecture has been proposed to predict different service traffic more conveniently. Moreover, in order to allocate resources appropriately, literatures [28, 29] used the LSTM to predict future traffic in RAN network slicing.

Motivated by the above analysis, we use machine learning to predict the services traffic of each slice, so as to allocate radio resources to each slice to reduce the delay. In more detail, firstly, we propose a new radio resource management, namely, shared proportion fairness (SPF), to keep resource management in accordance with slicing vehicle activity, and then, we use it for resource allocation representation. Moreover, we formulated the system delay minimization problem of resource allocation as a convex optimization problem. Secondly, we use ConvLSTM, which combines CNN and LSTM, to model the temporal-spatial dependency of the slice service traffic in the vehicular communication networks. Using the ConvLSTM for traffic prediction, we can predict different service traffic to obtain the user load distribution. Finally, according to the predicted results, a primal-dual interior-point-based resource allocation strategy is used to explore the optimal slice weight.

The rest of this paper is organized as follows. Section 2 describes the system model and assumptions. Using the above model, the resource allocation problem is formulated as a convex optimization problem to obtain optimal slice weight allocation in Section 3; then, an LSTM-based resource allocation is presented to minimize the system delay. In Section 4, we propose a primal-dual interior-point-based resource allocation strategy to solve the optimal slice weight problem. Simulation results are provided in Section 5 to evaluate the performance of the proposed traffic prediction and resource allocation strategy, followed by concluding remarks in Section 6.

2. System Model

We consider the cellular network consists of Road Side Units (RSU) and network slices. And the sets of RSUs and slices are denoted by and , respectively. As shown in Figure 1, RSU is virtualized into three layers, namely RSU interface layer, RSU virtualization layer, and RSU virtual resource layer. Among them, the RSU interface layer provides related interface for each slice. The RSU virtualization layer implements the functions of slice management, SDN control, and slice coordination. Besides, the RSU virtual resource layer provides the virtual resources required by each slice, which are obtained from the resource sharing layer of the base station. Also, the base station controls the service traffic prediction of the entire system and schedules the resources among slices.

System states , , and represent the sets of vehicles that communicate with RSU on slice , communicate with RSU , and on slice , respectively. Specially, we use and denote the cardinalities of these sets, i.e., , and . Further, we assume that each vehicle communicates with only one RSU and connects to one slice.

In our model, the vehicle communicates with the RSU that provides it with the strongest SINR; thus, the downlink SINR can be expressed as where the spectrum of the RSU is set to 20 MHz, and the transmit power is set to 44 dBm [30]. The channel gain is related to path loss, shadow fading, and fast fading. The path loss is equals to [31], where and represent the distance between the vehicle and the RSU and carrier frequency, respectively, and we set . Shadow fading follows logarithmic normal distribution with mean square deviation 4 dB. Fast fading is Rayleigh distribution depending on vehicles speed. The noise depends on the noise spectral density and the noise figure .

According to the Shannon capacity formula, we get the spectrum efficiency of vehicle at RSU , which can be expressed as

The limited resources of each RSU are shared by all connected vehicles, so vehicle can be allocated a fraction of resources from RSU . The allocated resources may be some resource blocks or time slots. For simplicity, we use that represents the allocated resources to vehicle , which implies a proportion of total resources of RSU with . Therefore, the transmission rate of RSU to vehicle is formulated as , where denotes transmission rate when all resources of RSU are allocated to vehicle , and is bandwidth.

3. LSTM-Based Resource Allocation

In this section, we will describe the LSTM-based resource allocation. The shared proportional fairness resource allocation is first introduced, which can keep resource management in accordance with slicing vehicle activity. Then, the system delay minimization problem is formulated as a convex optimization problem. Finally, the ConvLSTM-based cellular traffic prediction is proposed at the end of this section.

3.1. Shared Proportional Fairness

For each slice in vehicular networks, we assume each slice is allocated a certain percentage of the radio resources, which is denoted by , so we have , and . Next, the vehicle gets a subweight from the serving slice that depends on the number of active vehicles, i.e., for a vehicle , where means the subweight of vehicle . Finally, The RSU allocates its resource to vehicles in proportion to their weights. Consequently, the transmission rate from RSU to vehicle can be obtained and written as

Considering there are many vehicles on slice at RSU so based on some further notations that are introduced in Table 1, the average transmission rate provided by RSU to slice is expressed as

According to (4), the average bit transmission delay (BTD) of the vehicle on slice can be given by where represents the average BTD of the vehicle at RSB on slice . Besides, we use and that denote the weighted inner product of the vectors and the weighted norm of a vector, respectively. denotes a diagonal matrix.

We assume the message e handling process follows the queue model [32]. Among which, the random variable of message arrival interval obeys general distribution, , while in the different time slots is independent and identically distributed. Its expectation , where is the arrival rate. The message service time is exponentially distributed, i.e., , and the mean value of the message service times depends on the number of allocated resource blocks. To facilitate the analysis of the system delay, is used to denote the average service time when only one resource block for the message processing. Similarly, we denote the average service time as when resource blocks for the message processing.

The average waiting delay of the vehicle is given by where , and can be obtained by solving the following equation:

As a result, the average waiting delay of the vehicle on slice is

According to formula and (5) (8), the total average delay of a vehicle on slice can be obtained by

3.2. Problem Formulation

In the real implementation of network slicing, each slice would provide a guarantee of service to the vehicles, that is, the total delay on slice does not exceed a deadline . In this subsection, we will explore how to obtain the optimal solution of minimizing system delay by allocating weight to each slice.

Considering the network with just one slice , so we have . To satisfy the deadline , from (9), we can obtain that where is the acceptable maximum load of slice .

Next, considering multislice networks, each slice has its self-requirement. According to (9) (10), each slice would satisfy the following constraint to meet their requirements:

Equation (11) can be written in a simplified form, i.e., where is share coupling vector of slice and can be expressed by

Our objective is to satisfy the requirements of vehicles in each slice and minimize the system overall delay, so the objective function is the summing average delay of all slices. Consequently, the optimization problem can be formulated as where is the optimization variable. Constraint ensures the requirements of slices can be satisfied. Constraints and state the weight of each slice is nonnegative and is constrained by total resources. We can see that the problem (14) is an inequality constrained convex optimization problem, the method to solve it is described in Section 4.

3.3. Cellular Traffic Prediction

In practice, searching the optimal weights of each slice through minimizing the system total delay according to the current load distribution will cause some delay; we define that as arranging delay . To reduce the arranging delay, traffic prediction is a feasible solution. Through traffic prediction, the system can acquire complicated traffic information in advance to calculate the optimal weighs for each slice. Hence, the arranging delay can be largely reduced.

ConvLSTM neural networks are adapted to predict cellular traffic. ConvLSTM networks can not only model the sequences information of cellular traffic accurately as same as Long Short-Term Memory (LSTM) networks [12] but also the local feature as same as convolutional neural networks (CNN). In short, it can easy to capture the spatial-temporal dependencies. Consequently, ConvLSTM networks are appropriate for predicting slice traffic in complex vehicular networks. This memory cell consists of cell states and three neural network units, i.e., input gate, forget gate, and output gate. For this specific framework, it is able to effectively store information chronically from long-term sequences.

As shown in Figure 2, the forget gate outputs a value to the cell according to the current input and past output , which determines what information would be abandon in past cell status now. The calculation formula of forget gate is given by

The input gate decides update when a new input comes to the ConvLSTM unit through a sigmoid function, which can be further effects present states , and expressed as

The output gate decides the output of this cell through a sigmoid function. After that, the output is given by where , , , , , and denote the output of forget gate, the output of input gate, cell status, cell input, and cell output, respectively, and , , , , , , , , , , , and are the parameters of the LSTM network. The functions and are sigmoid function and hyperbolic tangent function, respectively. In the above equations, the notation denotes the convolution operation, and the notation denotes the Hadamard product. Different from the common LSTM networks, the inputs or outputs in the ConvLSTM unit are all three-dimensional tensors. More specifically, the citywide service traffic data can be treated as a matrix or picture. Then, previous multiple service data are fed into the ConvLSTM networks to obtain future results. So, the multiply operation of common LSTM networks is replaced by convolution operation in ConvLSTM. The neural network can be accomplished via updating various parameters in each iteration, e.g., , , , , , , , , , , , and , so that the networks can minimize the error between forecasted values and ground trues.

4. A Primal-Dual Interior-Point Method-Based Resource Allocation Algorithm

In Section 3, the LSTM-based resource allocation is presented to minimize the system delay. Then, we will explore how to obtain the optimal slice weight of resources in problem (14) and apply the LSTM in resource allocation processing.

The resource allocation strategy includes two phases: the first phase is service traffic prediction by using machine learning described in the preceding section, and the second phase is the optimizing procedure that based on the primal-dual interior-point method. Based on the predicted results in the first phase, the slice weight distribution process can be performed in advance so that the delay is decreasing obviously.

Considering there are inequality constraints in the convex optimization problem (14), so it can be solved by the primal-dual interior-point method. We will transform the inequality constrained optimization problem as an equality constrained optimization problem so that the central path of this problem can be found. Therefore, we rewrite the problem (14) as

For simplicity, let , , where is the optimization variable. Considering the equivalent problem

Due to the number of slices is three in the simulation setting, so and . According to reference [33], the Newton step is given by the modified KKT equations where , is the inequality constrained function of the original problem, is its derivative matrix. and are dual residual and primal residual, respectively, and these residuals are used for the termination condition in the primal-dual interior-point method. The solution of (20) is the current primal-dual search direction .

Input: Initial , , scale factor , residual error , duality gap error ϵ, times observed load distribution
Output: Optimal solution
Phase 1: Predict service traffic
Training the ConvLSTM networks to obtain parameters , , , , , , , , , , ,
According to times observed load distribution predict sequences in the future
Phase 2: To obtain optimal slice weight
while True do
 Calculate initial value of the surrogate gap
ifthen
  break
end if
 Determine
 Compute primal-dual search direction
 Determine initial step length
whiledo
  Ensure satisfy the constraint condition
end while
while
do
  Determine backtracking search step length
end while
 Update search direction y
end while

The step length can be obtained by backtracking line search which is based on the norm of the residual. Through continuous iteration, the best solution will be returned when the present state satisfies the termination condition; the details are elaborated in Algorithm 1.

5. Performance Evaluation

In this section, we present numerical experiments to illustrate the performance of the proposed traffic prediction and resource allocation strategy. We use the cellular traffic dataset [34] of Milan, Italy, to train our neural networks; this dataset contains three categories of service traffic, i.e., SMS, phone, and web traffic, and we can deem it as three slices. The area of Milan is divided into a grid overlay of squares; the dataset records 1000 samples of each square with a temporal interval of one hour. We choose the 800 samples from this dataset as the training set, the rest of the samples as the testing set. In the simulation setting of ConvLSTM, the neural network is set to three layers, and each layer has three ConvLSTM cells. In the training stage, we set the learning rate as 0.01, batch size as 32. After 100 times iteration, the trained model is used to predict the three types of service traffic. Then, the primal-dual interior-point resource allocation strategy is used to solve the optimal slice weight allocation according to the predication. In simulation parameters, we set scale factor , residual error , and duality gap error .

Figure 3 shows the difference between predicted traffic and real traffic in a certain cell. The vertical axis of Figure 3 is the number of user access for service, and the horizontal axis of Figure 3 is the temporal dimension. It can be seen from Figure 3 that the temporal activities of three services show a strong daily pattern and follow weekly-periodic properties. Besides, the traffic volume expresses a slight difference among the three services. Specifically, the number of web access is always larger than that of the other two services. It can be seen that the service traffic shows a strong daily pattern. The average error rates of SMS, phone, and web traffic are 25.0%, 12.4%, and 12.2%, respectively.

Figure 4 shows the network slicing delay with and without prediction when slice weight is fixed, and the default of arranging delay was set to 20 ms. The system delay also exhibits daily characteristics. Further, we can get that the more the number of user access, the greater the total average delay of the system. It can be seen that the periodicity of service traffic is mapped dynamically to system delay completely. Using the service traffic prediction, the slice weight can be distributed on average 15.33 ms in advance.

The predicted results express system load distribution in the future; the optimal slice weight can be solved accordingly. In Figure 5, we present the optimal slice weight under various times. Especially, at the 12th, 36th, 60th, and 84th time slots, the optimal slice weights of SMS are 0.3011, 0.3028, 0.3001, and 0.3076, respectively; the slice weights of phone and web traffic are 0.3032, 0.3059, 0.3018, and 0.3015; and 0.3957, 0.3913, 0.3982, and 0.3819. The results clearly show that the optimal resource allocation problem is resolved under different times; the optimal slice weight is dynamic and varies with the service traffic pattern.

From the above results, it can be seen that the traffic prediction can be used to predict the service traffic in the future, so the user load distribution can be obtained. Therefore, the slice weight can be distributed in advance to save arranging delay. Furthermore, the primal-dual interior-point-based resource allocation strategy can calculate the current optimal slice weight distribution.

6. Conclusions

For slicing resource allocation problem in vehicular networks, this paper proposes an LSTM-based resource allocation, which contains two phases, i.e., traffic prediction phase and resource allocation phase. Moreover, we use ConvLSTM to capture the spatial-temporal dependencies of service traffic of each slice. Therefore, the user load distribution is obtained through traffic prediction. Then, based on the predicted user load distribution, we propose a primal-dual interior-point-based resource allocation strategy to solve the optimal slice weight problem for minimizing the system total delay. The resource allocation strategy proposed in this paper can make the slice weight be allocated in advance, thus greatly saving the delay. Moreover, with the development of AI, the research field of wireless communication will be impacted deeply. In the future, we will investigate the implementation of resource allocation with machine learning methods in vehicular network slicing.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there is no conflict of interest regarding the publication of this paper.

Acknowledgments

An earlier version of this article was presented at the 2020 IEEE/CIC International Conference on Communications in China. This work was supported in part by the National Natural Science Foundation of China (grant numbers 61801065, 61771082, 61871062, and 61901070) and the Science and Technology Research Program of Chongqing Municipal Education Commission (Grant No. KJQN202000603).