#### Abstract

Nowadays, the problem of road traffic safety cannot be ignored. Almost all major cities have problems such as poor traffic environment and low road efficiency. Large-scale and long-term traffic congestion occurs almost every day. Transportation has developed rapidly, and more and more advanced means of transportation have emerged. However, automobile is one of the main means of transportation for people to travel. In the world, there are serious traffic jams in almost all cities. The excessive traffic flow every day leads to the paralysis of the urban transportation system, which brings great inconvenience and impact to people’s travel. Various countries have also actively taken corresponding measures, i.e., traffic diversion, number restriction, or expanding the scale of the road network, but these measures can bring little effect. Traditional intelligent traffic flow forecasting has some problems, such as low accuracy and delay. Aiming at this problem, this paper uses the model of the combination of Internet of Things and big data to apply and analyze its social benefits in intelligent traffic flow forecasting and analyzes its three-tier network architecture model, namely, perception layer, network layer, and application layer. Research and analyze the mode of combining cloud computing and edge computing. From the multiperspective linear discriminant analysis algorithm of the combination method of combining the same points and differences between data and data into multiple atomic services, intelligent traffic flow prediction based on the combination of Internet of Things and big data is performed. Through the monitoring and extraction of relevant traffic flow data, data analysis, processing and storage, and visual display, improve the accuracy and effectiveness and make it easier to improve the prediction accuracy of overall traffic flow. The traffic flow prediction of the system of Internet of Things and big data is given through the case experiment. The method proposed in this paper can be applied in intelligent transportation services and can predict the stability of transportation and traffic flow in real time so as to optimize traffic congestion, reduce manual intervention, and achieve the goal of intelligent traffic management.

#### 1. Introduction

At present, almost all major cities have problems such as poor traffic environment [1] and low road efficiency. Large area and long-term traffic congestion occurs almost every day. The extremely congested traffic every day leads to the increase of people’s travel time and transportation costs and a great waste of time, energy, and money and increases the pressure of urban life [2]. In the development of urbanization [3], traffic congestion [4] is one of the main urbanization problems that need to be solved. The application of Internet of Things technology [5,6] and big data technology [7] in traditional industries has produced certain economic benefits. In the application of smart transportation and the construction of smart cities [8], traffic data can be quickly collected by using Internet of Things technology, and traffic flow prediction can be realized by using big data technology. Sensor technology [9], as an important data acquisition source in the Internet of Things, can realize fast and stable data transmission and fast storage and analysis [10]. Intelligent transportation can rely not only on the algorithm model [11], but also on the historical data of road traffic. Big data technology is used to process large-scale data and high-dimensional data [12] and capture the data in real time, quickly and effectively. Big data has four characteristics: magnanimity, diversity, efficiency, and variability. The combination of these four characteristics provides basic support for traffic flow data. Establish the corresponding intelligent traffic flow data analysis system based on big data analysis, extract the historical data of road traffic from the specific traffic system [13], analyze and process these data to form structured traffic flow data, and then store the analyzed structural data in the corresponding target server database [14]. Finally, real-time, effective, and accurate traffic flow data are obtained through Internet of Things technology. This system can quickly analyze the changes of various types of data, accurately predict these changes, and screen the information accordingly. In the intelligent traffic flow prediction model based on intelligent data analysis and application, intelligent technology integration is realized through Internet of Things technology and sensor application [15], such as fiber Bragg grating sensor [16], positioning technology [17], RFID application [18], and wireless sensor technology [19]. How to use big data technology and Internet of Things technology in the field of intelligent transportation has become an important technical means of transportation application in the future. Through intelligent transportation technology, traffic accidents can be handled immediately, cost can be saved, and the system can respond quickly and be handled in time. Under the Internet of Things technology, wireless network information has a strong antiinterference and high positioning accuracy. Therefore, the application of big data and Internet of Things technology in traffic flow forecasting can alleviate urban traffic pressure and is one of the important indicators in realizing smart cities. Therefore, Internet of Things and big data can be widely used in modern cities, and the development of traditional industries can also be realized.

#### 2. Application of Internet of Things and Big Data in Intelligent Traffic Flow Prediction

##### 2.1. Traffic Flow Characteristics

The transportation system is huge and complex, and there are many factors that can cause the poor operation state of traffic flow, such as the changeable weather and the occurrence of accidents. Small changes in the external environment can have a great impact on it, and the performance state of each face is complex, random, and irregular, with obvious dynamic characteristics. Therefore, it becomes very difficult to accurately predict. Hence, it is necessary to comprehensively consider the prediction and analysis of traffic flow, generally from spatial and traffic timeliness. The spatial characteristics of traffic flow means that when observing traffic flow, the parameters of traffic flow at the observation site are inseparable and the traffic state at the observation site is generally directly affected by the overall traffic flow. When the distance of the predicted road section increases, the influence degree decreases; when the distance of the predicted road section decreases, the degree of impact becomes greater.

According to the traffic flow in the past, the change has periodicity, which exists every day, every week, every quarter, and even every year. Looking at the traffic flow every day, it is not in a balanced state as a whole, but obviously has great fluctuations, such as morning peak and evening peak. During these two peak periods, the traffic flow and road congestion are extremely serious compared with the normal time, which is unmatched by the traffic flow in general time. Similarly, if one year is used to analyze the periodicity of traffic flow, the periodicity of traffic flow in each quarter is also very different. Therefore, the traffic flow will roughly show periodic changes due to external factors. Therefore, it is precisely because the traffic flow shows periodic characteristics that the prediction of traffic flow becomes feasible.

The randomness in the time characteristics of traffic flow is mainly reflected in the short-term traffic flow. In the whole transportation system, people mainly manipulate vehicles, whether drivers driving or pedestrians walking on the road, which will affect the whole transportation system. People's behavior cannot be predicted. Sometimes just because of a small action to run the red light, it may cause an irreparable situation; the diversity of vehicle types will also lead to the change of traffic flow. These are unpredictable events, which makes the randomness of traffic flow stronger. Randomness is also a factor that must be considered in real traffic prediction, which increases the difficulty of traffic flow prediction.

Jiao tong traffic flow prediction will predict the traffic flow data of the road section in the past period, process and analyze the real-time dynamic data collected by wireless sensors and other devices, and use the characteristics of the data to predict the traffic flow of the current road section in the future, as shown in Figure 1.

To predict the circulation flow in the future, it must be based on historical data. Firstly, historical traffic data are collected for data preprocessing, the system starts training the data, calculates the prediction model through real-time data, obtains the prediction data, and finally converts the analysis results into charts and other visual results to respond to users.

###### 2.1.1. Traffic Flow Prediction Method

Now, there are generally three methods to realize traffic flow prediction, namely, parameter prediction method, nonparametric prediction method, and combined parameter prediction method, which are shown in Figure 2.

Traffic flow forecasting methods are divided into parametric methods, nonparametric methods, and combined forecasting methods. Among them, parametric methods can be divided into linear and nonlinear styles. The linear method can be subdivided into time series method, historical average algorithm, smoothing algorithm, and filtering algorithm; nonlinear methods can be divided into wavelet analysis, catastrophe theory, and chaos theory; nonparametric methods include support vector machine, nonparametric regression, neural network, and fuzzy logic; combined forecasting method is the combination of two or more forecasting methods to achieve the effect of joint forecasting. It adopts the clustering combination model, decomposition combination model, and prediction result fusion model technology.

##### 2.2. Intelligent Transportation Technology in Internet of Things and Big Data

The application of Internet of Things technology in intelligent transportation has become the main way to relieve traffic pressure. Integrating Internet of Things technology into intelligent traffic flow prediction can make traffic roads smarter and make cities develop faster and more harmoniously. The key technologies are shown in Figure 3.

In Figure 3, intelligent transportation technology in Internet of Things mainly includes data collection, data analysis, data processing, data storage, and data sharing.

The problem to be solved in the road traffic flow data analysis service environment is how to timely analyze the diverse characteristics of traffic flow data, so as to provide accurate services. It should have the ability to enhance service value, mine data value, and reduce response time.

After big data analysis, it will analyze, process, and calculate the object data according to relevant algorithms and provide the service. The basic process of the big data analysis service mode is shown in Figure 4 (online mode) and Figure 5 (offline mode).

This mode has high real-time requirements and slightly small data throughput. The collected data will not be stored, and the results will be calculated directly in memory. Finally, the analysis results will be directly transformed into visual results to respond to the user. For the offline big data service mode, the biggest difference from the online mode is that it will store the collected historical data and real-time data in the disk. When the data needs to be processed, it will load, process, and calculate in the memory, and finally convert the analysis results to the user. This offline service mode is characterized by low real-time requirements and deep mining of data.

##### 2.3. Multiperspective-Based Learning Method

For the same thing, if multiple people describe it, then each person’s description methods will be different. Hence, describing the same data from different angles or ways is the method of multiperspective data learning. Data have multiple feature sets. The data obtained from multiple feature sets have different attributes, and these attributes are revealed from different levels and different perspectives. Such data are called multiperspective data. In multiview data learning, there are various analysis technologies, among which the canonical correlation analysis (CCA) and collaborative training technology are the most representative.

CCA is a multivariate statistical method of the cross-covariance matrix, which reflects the overall correlation through the correlation between various variables. *U*_{1} and *V*_{1} represent two sets of variables and are linearly combined to reveal the dependence of the variables on the whole. According to relevant research, canonical correlation analysis acts on the data represented by two or more perspectives and finds two linear transformations for each perspective. Assuming that there are data sets and , the formula is as follows:

and are the two projection directions to maximize the linear correlation coefficient, and is defined as follows:

and are the average of the two perspectives, defined as follows:

That is, the objective function of typical correlation analysis technology can be obtained:

The corresponding Lagrange function is as follows:

If for and , take the derivative respectively and make it 0, then the formulas are as follows:

Using these two formulas, we can get

, If , is reversible. Then you can get :

Similarly, the generalized eigenvalue decomposition problem can be obtained as follows:

Finally, in order to make , if the relationship between the eigenvalue and the correlation coefficient is clearer, the rewriting objective function is defined as follows:

You can get

is a positive correlation solution which affects the correlation of the projection; therefore, the range of must vary between [−1, +1].

Finally, according to the calculation results, the biggest feature of using the canonical correlation analysis is that there will be an overfitting of relevant features, and the normalized useful mode will be used, i.e., the function is defined as follows:where and are in the change of [0, 1], and the latest statistical analysis believes that regularization is a reasonable standard method to control the projection direction, which will maximize the correlation between the two perspectives and minimize the difference in square loss in each perspective.

##### 2.4. Prediction of Traffic Flow

The so-called traffic flow prediction, through the use of information technology on historical data, finds the rules and predictions of traffic operation, thus effectively reducing communication, congestion, and accidents.

In this paper, the Markov model is used to effectively study and analyze the historical data of AC traffic.

In the Gaussian–Markov model, vectors show normal characteristics.

Mathematical expectations are as follows:

The variance covariance matrix is as follows:

By calculating the expected value of the random vector *Y* for formula (1):

The variance covariance matrix is as follows:

In formula (17), is of unbiased estimation.

In formula (18), is the negative correlation coefficient.

K-order multivariate linear regression is defined as follows:

The regression model is defined as follows:

The least squares method is defined as follows:

#### 3. Optimization and Improvement Algorithm

##### 3.1. Kalman Filtering Algorithm

The Kalman algorithm is realized by using a recursive method to deduce the previous state and the current state of the data. According to the existing prediction model, the Kalman algorithm shows the state and observation equation:

is dynamic noise, is the observation noise sequence, and its statistical vector is defined as follows:where is the variance matrix of dynamic noise, is the variance matrix of observation noise, and is the Kronecker function.

is the set initial value, and the relation between and denotes independence.

If set target initial state , is OK, then get Tminimum square difference.

Calculated gain is as follows:

##### 3.2. BP Neural Network Algorithm

###### 3.2.1. BP Neural Network Topology

The BP model passes through a multilayer feedforward neural network. The BP network has multilayer hidden units. Most neural networks adopt the change form of the BP network. It is different from perceptron in the network structure. Signal forward transmission is the main feature of the BP network, which is shown in Figure 6.

The realization steps of the BP network can be obtained from the topological structure diagram of the BP three-layer neural network. First, initialize the network, provide training samples, calculate the output layer by layer from the input layer, such as the hidden layer and the output layer, carry out the vehicle output, and then, correct the weight to judge whether the prediction error reaches the error accuracy.

##### 3.3. Kalman-BP Neural Network Algorithm

The neural network prediction method can play a great role in intelligent traffic flow prediction, and it is one of the most important methods. Similar to the neural network prediction method, our method combines the Kalman algorithm and the BP algorithm and gives full play to the advantages of the two algorithms. It mainly introduces the filtering idea into the BP network, and the performance is better after the algorithm is optimized. After the above analysis, we can get the basic flow of neural network prediction as shown in Figure 7.

The prediction process of the neural network can be divided into data preprocessing, network construction, network training, and network prediction. Data preprocessing refers to the input and normalization of data (training data and test data).

#### 4. Experimental Simulation

The experimental data will collect the traffic volume observation data of a certain section of the highway in one year, one week, and one day respectively. The observation scale is one hour traffic flow, a total of 336 total data, 312 training data and 24 test data were obtained. Combining these field collected traffic flow observation data, the experiment will carry out data preprocessing, network construction, network training, and network prediction, and analyze the role of the Kalman filter algorithm in intelligent traffic flow prediction and analysis system from these four aspects.

##### 4.1. Data Analysis

The experiment will start from one month. The changes of traffic flow from three angles of one day and one hour are analyzed. The first is the analysis of the monthly traffic flow change, as shown in Figure 8.

In Figure 8, the traffic flow increased rapidly from January to February, reaching 64000 vehicles in February. From February to August, the growth was relatively stable. It showed a downward trend from August to September and from October to November. The traffic flow from September to October and from November to December is large, and the traffic flow increases rapidly.

Next is the analysis of daily traffic flow change, as shown in Figure 9.

It can be seen from the change in Figure 10 that the traffic flow increases slowly from Monday to Thursday, the traffic flow fluctuates greatly from Thursday to Sunday, the traffic flow on Friday is the least, and the traffic flow on Saturday and Sunday increases sharply.

The analysis of the hourly traffic flow change is shown in Figure 10.

In Figure 10, the hourly traffic flow in a day is less in early morning, gradually increases at 6:00, and reaches the peak from 16:00 to 19:00 with the fastest growth. It means that there is a peak period from 16:00 to 19:00 every day.

Using the traffic flow data of the above three periods as the historical reference data, traffic volume changes periodically in days. The volume first increases gradually, reaches the peak at a certain time, then decreases gradually, and oscillates before reaching the peak. In terms of peak traffic volume, it gradually increases from Monday, increases steadily and slowly on Tuesday, Wednesday, and Thursday, increases sharply on Friday and Saturday, and decreases sharply on Sunday; moreover, the total traffic volume on Friday and Saturday is also larger than usual.

##### 4.2. Comparison of Prediction Error Value

To verify the efficiency of the proposed method, this paper compares the errors between the predicted values and the measured values of the Kalman algorithm, BP algorithm, and Kalman BP algorithm, as shown in Figure 11.

The error value of monthly traffic flow is shown in Figure 12.

Under the monthly traffic flow prediction, the Kalman BP algorithm compared with the BP algorithm and the Kalman algorithm, the Kalman BP algorithm has the lowest error value.

##### 4.3. Algorithm Operation Speed

The operation speed of the algorithm is very important. It is the basis for the algorithm. Its purity means that the system can run correctly and stably, greatly reducing the number of system maintenance, and judging the quality of a system depends on the stability of the algorithm.

Finally, we also compared the operation speed of traffic flow prediction of the three algorithms in three time periods, calculated in milliseconds (MS), as shown in Figures 13–15.

#### 5. Conclusion

In the field of intelligent transportation, it is predicted that the current processing research stage will realize the development of intelligent transportation in smart cities. The Kalman-BP combined algorithm proposed in this paper can improve the prediction accuracy of the overall traffic flow and lower the error value. It is pointed out in this paper that there are difficulties in predictive multivariate data fusion, and higher performance optimization algorithms are needed to solve such problems. Future research works, aiming at real-time data for effective prediction, can improve the accuracy of real-time traffic prediction and lower error rate. It also considers multitask and multiplatform data real-time prediction technology supported by Internet of Things, which can be better applied to urban traffic and intelligent optimization of traffic flow.

#### Data Availability

The experimental data used to support the findings of this study are available from the corresponding author upon request.

#### Conflicts of Interest

The authors declare that they have no conflicts of interest.

#### Acknowledgments

This research was supported by the Key Projects of Natural Science Research in Colleges and Universities of Anhui Province, China (KJ2018A0749 and KJ2020A0971) and the Projects of the Professional Construction Fund of Tongling Polytechnic (06202003).