#### Abstract

This paper presents the model and algorithms for traffic flow data monitoring and optimal traffic light control based on wireless sensor networks. Given the scenario that sensor nodes are sparsely deployed along the segments between signalized intersections, an analytical model is built using continuum traffic equation and develops the method to estimate traffic parameter with the scattered sensor data. Based on the traffic data and principle of traffic congestion formation, we introduce the congestion factor which can be used to evaluate the real-time traffic congestion status along the segment and to predict the subcritical state of traffic jams. The result is expected to support the timing phase optimization of traffic light control for the purpose of avoiding traffic congestion before its formation. We simulate the traffic monitoring based on the *Mobile Century* dataset and analyze the performance of traffic light control on VISSIM platform when congestion factor is introduced into the signal timing optimization model. The simulation result shows that this method can improve the spatial-temporal resolution of traffic data monitoring and evaluate traffic congestion status with high precision. It is helpful to remarkably alleviate urban traffic congestion and decrease the average traffic delays and maximum queue length.

#### 1. Introduction

The traffic crowds seen in intersection of urban road networks are highly influential in both developed and developing nations worldwide [1]. Urban residents are suffering poor transport facilities, and meanwhile the considerable financial loss caused by traffic becomes a large and growing burden on the nation’s economy, including costs of productivity losses from traffic delays, traffic accidents, vehicular collisions associated with traffic jams, higher emission, environmental pollution, and more. The idea that the improvements to transport infrastructure are the efficient way has been central to transport economic analysis, but in fact this problem cannot be resolved with better roads [2–4]. Intelligent transportation systems (ITS) have been proven to be a scientific and efficient solution [5]. Comprehensive utilization of information technology, transportation engineering and behavioral sciences to reveal the principle of urban traffic, measuring the traffic flow in real time, and try to route vehicles around them to avoid traffic congestion before its formation promotes a prospective solution to resolve the urban traffic problem from the root [5–7].

Nowadays, in an information-rich era, the traditional traffic surveillance and control methods are confronted with great challenges [8, 9]. How to get meaningful information from large amounts of sensor data to support transportation applications becomes more and more significant [6, 10]. Modern traffic control and guidance systems are always networked in large scale which need real time, traffic data with higher spatial-temporal resolution that challenges the traditional traffic monitoring technologies such as inductive loop, video camera, microwave radar, infrared detector, UAV, satellite image, and GPS [11]. The state-of-the-art, intelligent, and networked sensors are emerging as a novel network paradigm of primary relevance, which provides an appealing alternative to traditional traffic surveillance approaches in near future [12], especially for proactively gathering monitoring information in urban environments under the grand prospective of cyber physical systems [13, 14]. Wireless sensors have many distinctive advantages such as low cost, small size, wireless communication, and distributed computation. Over the last decade, many researchers have endeavored to study traffic monitoring with novel technologies, and recent research shows that the tracking and identification of vehicles with wireless sensor networks for the purpose of traffic surveillance and control are widespread applications [15–19].

Traffic research currently still cannot fully express the intrinsic principle of traffic congestion formation and predict under which conditions traffic jam may suddenly occur. In the essentials, urban traffic is a typical self-driven many-particle huge system which is far from equilibrium state, where the traffic flow is a complicated nonlinear dynamic process, and the traffic congestion is the spatial-temporal conglomeration of traffic volume in finite time and space. In 2009, Flynn et al. have conducted some theoretical work to model traffic congestion with macroscope traffic flow theory and obtained some basic results in congestion prediction [20], which are regarded as a creative solution of traffic equations proposed in 1950s and reported as a great step towards answering the key question that is how can the occurrence of traffic congestion be avoided. Based on current research, the congestion status of traffic flow is expected to be evaluated in real time and higher precision to support traffic control.

Traffic light control at urban intersection can be modeled as a multiobjective optimization problem (MOP). In UTCS (Urban Traffic Control System) such as SCOOT/SCATS/REHODES system, it always employs single loop sensor or double loops as vehicle detector deployed at upstream of the signalized intersections. Generally, in current traffic control strategies, optimization objectives include stop of vehicle, average delay, travel time, queuing length, traffic volume, vehicle speed, and even exhaust emission [21]. The traditional traffic detection is *Eulerian sensing* which collects data at predefined locations [22], and the sensors cannot be deployed in large amount as compared to the huge scale of urban road networks for sake of budget restriction and maintenance cost; as a result the data such as vehicle stops and delays of individual’s vehicle is difficult to be achieved accurately. In the essentials, comparing to existing criteria mentioned above, the traffic congestion is a directly relevant factor and the root reason. Introducing a method to evaluate the degree of traffic congestion and proposing into the optimization model of traffic light control promote a feasible approach to improve traffic control performance.

In this paper, we studied the intrinsic space-time properties of actual traffic flow at the intersection and near segments and build an observation system to estimate and collect traffic parameters based on sparsely deployed wireless sensor networks. We are interested in understanding how to evaluate and express the degree of traffic congestion quantitatively and what the performance for traffic signal control would be if we take into account the traffic congestion factor as one of the objectives in timing optimization.

The rest of the paper is organized as follows. The current studies on traffic surveillance with wireless sensor networks are briefly reviewed in Section 2. The observation model based on traffic flow theory and traffic flow parameters estimation algorithm based on wireless sensor networks are described in detail in Section 3. The traffic congestion evaluation model and congestion factor based signal phrase optimization algorithms are discussed in Section 4. The performance is analyzed based on simulation and experimental results in Section 5. Finally, a conclusion and future works are given in Section 6.

#### 2. Related Works and Problem Statement

Several research works on traffic monitoring with wireless sensor networks have been carried out in recent years. Most of them have focused on individual vehicle and point data detection, where the traffic spatial-temporal property is not an issue in these circumstances. Pravin et al. creatively applied the magnetic sensor networks to vehicle detection and classification in Berkeley PATH program from 2006 and obtained high precision beyond 95% [12, 23]. In 2008, UC Berkeley launched a pilot traffic-monitoring system named *Mobile Century* (successor project is known as *Mobile Millennium*) to collect traffic data based on the GPS sensor equipped in cellular phones [22]. They found that 2–5% point data provided by mobile sensors is sufficient to provide information for traffic light control, and their conclusion motivates the research to collect traffic data and control traffic flow via sparsely deployed sensor networks in this paper. Hull et al. studied the travel time estimation with Wi-Fi equipped mobile sensor networks [24, 25]. Bacon et al. developed an effective data compress and collection method based on sensor networks using the weekly spatial-temporal pattern of traffic flow data in TIME project [26]. But in current research there are some important aspects out of consideration. Few considerations are given to the intrinsic space-time properties and operation regularity of actual traffic flow and traffic congestion formation. How to evaluate traffic congestion quantitatively with sufficient precision and real-time performance and be introduced as an objective to support control optimization in traffic light control? How to combine traffic surveillance sensor networks with traffic control system to analyze future traffic conditions under current timing strategies and try to avoid traffic congestion before its formation.

The discipline of transportation science has expanded significantly in recent decades, and particularly traffic flow theory plays a great role in intelligent transportation systems [27–29]. The typical models include LWR continuum model [30] and Payne-Whitham higher model [31]. From the physical view of traffic flow, the spatiotemporal behavior is the fundamental propriety in nature. In previous work, the vast majority of inductive techniques were focused on state-space methodology that forecasts short-term traffic flow based on historical data with relatively small number of measurement locations [32–34]. Limited amount of work has been performed using space-time model [35], and the resolution or precision is insufficient for the purpose of traffic light control. In 2008, Sugiyama et al. explained the formation process of traffic congestion by experimental observations [36], and based on this, Flynn et al. built a congestion model to explain and predict traffic congestion with macroscope traffic flow theory in 2009 [20], which is regarded as a solution of traffic flow equations and a great step towards answering the key question that how can the occurrence of traffic congestion be avoided.

The goal of this paper is to estimate traffic parameters based on sparsely deployed sensor networks, evaluate the degree of traffic congestion, and obtain a quantitative factor to express the spatiotemporal properties of traffic flow in real time. Based on this, introduce the congestion factor to the optimization model of traffic light control. In this paper we use *Lagrangian* detection [37]. Not only detect point data via imperfect binary proximity sensor network [38], but also try to estimate the time-space properties along the road segment based on scattered sensor measurements. The deployment of sensor networks is shown in Figure 1, where denotes traffic data such as velocity and density. Based on this, the congestion status and evaluation criteria can be studied from the comprehensive scope. The sensor network is expected to monitor real-time traffic data, to predict the subcritical state, and to control traffic signal to avoid the traffic jams before its formation.

The urban road network can be modeled as a directed graph consisting of vehicles and edges . Let be the length of edge . The spatial and temporal variables are road segment and time , respectively. On a given road segment and time , the traffic flow speed and density are distributed parameter system in time and space. While vehicle labeled by travels along the road segment with trajectory , the sensor measurements and are discrete and instant values, as shown in (2.1), and here is the sensor node number. The problem of traffic flow information monitoring can be transformed to estimate traffic parameters in given spatial and temporal variables with these discrete values (Nomenclature and symbols are available in Table 1):

#### 3. Traffic Monitoring and Data Estimation

In this section, we firstly describe the intrinsic characteristic of traffic flow and then propose a method to estimate traffic parameters based on scattered data collected by sparsely deployed sensor networks.

##### 3.1. Continuum Traffic Flow Theory and Theoretical Models

The continuum model is excellent to describe the macroscopic traffic properties such as traffic congestion state. In 1955, Lighthill and Whitham introduced the continuum model (LWR model) [30] based on fluid dynamics, which builds the continuous function between traffic density and speed to capture the characteristics such as traffic congestion formation. In 1971, Payne introduced dynamics equations based on the continuum model and proposed the second-order model (Payne-Whitham model) [31]. Consider the Payne-Whitham model defined by (3.1) (conservation of mass) and the acceleration equation, written in nonconservative form as (3.2): where and denote the space and time, respectively, and are the traffic flow speed and density at the point and time , respectively, is traffic density in unit of vehicles/length, is delay, and is traffic pressure which is inspired from gas dynamics and typically assumed to be a smooth increasing function of the density only, that is, . The parameter denotes the equilibrium speed that drivers try to adjust under a given traffic density , which is a decreasing function of the density with and . Here is the desired speed on empty road, and is the maximum traffic density at which vehicles are bumper-to-bumper in the traffic jam. In MIT model of self-sustained nonlinear traffic waves, the relationship between and denotes as the following. Here denotes free flow speed, and is the traffic flow density in congestion state:

In (3.1), the is flow production rate, and for road segment with no ramp , for entrance ramp , and for exit ramp . Assume the velocity of vehicle traveling from the given intersection during the green light interval is , and the intervals of green light phase are ; thus the flow production rate can be denoted as follows:

Based on the exact LWR solver developed by Berkeley [39], we can obtain the solutions of traffic equations with given initial parameters. That means the operation status and future parameters of the traffic flow can be predicted and analyzed on a system scale.

##### 3.2. Signal Processing for Traffic Data Estimation Based on Sensor Networks

In this paper, we employ high sensitive magnetic sensor, as shown in Figure 2(a), to detect vehicles. Given that the detection radius is , sensor node detects travelling vehicle with the ATDA algorithm developed by UC Berkeley [12], which detects vehicle presentence based on an adaptive threshold, and estimates vehicle velocity with the time difference of up/down thresholds and the lateral offset [12, 23], as shown in Figure 2(b).

**(a)**

**(b)**

Where is sensor separation, is the raw data, which will be sampled as sensor readings in discrete values and transformed to after noise filtering. is the threshold at detection interval , and is the corresponding detection flag. The instantaneous velocity can be estimated by (3.5). Here time and are the moments when magnetic disturbance signals exceed the threshold continuously with count and , respectively:

In actual applications, for sake of cost, the sensor node number is expected as few as possible [40], so there need a tradeoff between sensor number and measurement precision. In this paper we try to improve the traffic detection exactness based on the spatial and temporal relations of sampled data. The main idea is to estimate the lost traffic information based on the limited sensor readings with traffic flow model and numerical interpolation. Assuming the temporal-spatial scales are and , the vehicle trajectory and observation time are dispersed into and sections, respectively. Consequently the two-dimensional domain is transformed to a grid mesh, as shown in Figure 3, which can be denoted by (3.6) for an arbitrary location and detection time. Where is grid point and the and are spatiotemporal scales that can be denoted as and ,

For sensor reading in grid cell may be considered as a detection unit on location , and there is a single sensor node which takes effect in time interval . To take into account the disconnected vehicle queue under unsaturated state, here the sensed traffic flow speed is defined as the average velocity of all vehicles that pass the detection point in predefined interval. In actual applications, the traffic data is typically collected in 20 s, 30 s, 1 min, or 2 mins.

The sensor network is sparsely deployed, and the total number of sensor node is . We denote by the actual speed of the th vehicle travelling from the th sensor in the detection grid , is the estimated speed calculated from sensor measures, is the average speed in detection grid, and are the first and last vehicles in detection interval, respectively, and is the theoretical speed based on the continuous traffic flow model. The actual and estimated traffic flow speed can be denoted by the following equations:

Assume that we have trajectories of a certain number of vehicles in an observation interval. If the scale is small enough, it could be inferred that the traffic flow speed keeps unchanged in the unit gird. And consequently the partial differential equations (3.1)–(3.4) can be rewritten in an approximated way, such as (3.8). Here the subscripts and indicate space and time, respectively:

With the scattered measurements as boundary initial values, the traffic data can be estimated by numerical interpolation based on the approximated traffic equations, as shown in Figure 4. For instance of traffic flow speed detection, denote by and the estimated and actual velocities of th () vehicle on sensor (), respectively. The estimation error is , which can be formulated as

There are many evaluation criteria for error optimization; we use the same objective function as that in [41], which has the expression of (3.10) as follows. Here is the objective function, and is the mean square error (MSE) of traffic parameter estimation for all vehicles on sensor . And the purpose of optimal estimate algorithm is to minimize the total MSEs of all sensors:

Assume point data is obtained in detection area , and is the corresponding value given by traffic equations. The noise root-mean-square error between model outputs and measured data can be denoted as (3.11), which is a two-dimensional random field, and we assume it is unbiased:

The velocity change in real traffic flow is continuous. To eliminate noise, we introduce the smoothing factor with the minimum sum of squares of the second derivative, as shown in (3.12). Where denotes two-dimensional square detection area,

The traffic data estimation can be transformed to a two-dimensional data fitting problem with time-space constraints based on scattered measurements. To solve the conditional extremum problem based on (3.11) and (3.12), we can use the similar method in [42] based on Lagrange multiplier and finite elements method.

#### 4. Congestion Factor Based Signal Optimization

In this section, we focus on traffic congestion evaluation and signal optimization. Based on traffic flow theory, the traffic flow near signalized intersections and connecting links can be modeled as entrance and exit ramps. The traffic light control algorithm will generate a shock wave at the stop line of the lanes, from the beginning of red signal phase, which will affect the traffic state in future. We introduce congestion factor to evaluate the degree of traffic congestion, and cost function to represent the influence of current timing phase on next phase. The result is helpful to optimize signal control.

##### 4.1. Traffic Congestion Evaluation and Congestion Factor

The traffic congestion without external disturbance is an unsolved mystery. Knowing that traffic on a certain road is congested is actually not very helpful to traffic control system, and the information about how congested it is and the process it formed is more useful. There is much novel research about traffic congestion prediction and evaluation in last decades [43, 44]. Flynn et al. studied these phenomena and introduced the traffic congestion model named *Jamitons* [20], in which the traffic congestion is modeled as traveling wave. Based on the traffic model described in (3.1)-(3.2), the traffic congestion can be expressed and denoted in a theoretical way. Assuming the speed of traveling wave is , with introducing the self-similar variable defined by , the traffic equations in Section 3.1 can be rewritten, and (4.1) holds:
where is the speed of the traveling shock wave, and traffic flow density and speed can be denoted as function of , , . The subcritical state can be predicted by (4.1), where denotes the subcritical condition. To solve these equations, we select the shallow water equations [45] denoted as (4.2) to simplify the problem:

Applying this assumption to (4.1) and the LWR model denoted by (3.1) and (3.2), (4.1) can be rewritten as (4.3). Here is a constant denoting the mass flux of vehicles in the wave frame of reference:

The subcritical condition is therefore denoted as (4.4). If this equation is satisfied, the traffic congestion is inevitable to occur. The density will reach immediately when traffic conditions exceed the subcritical state:

The road can be regarded as share resource for vehicle and traffic flow link, and according to Jain’s fairness index for shared computer systems, the quantitative congestion factor can be defined based on the traffic congestion model, as (4.5). Here indicates the lane number, is the locations coordinate with origin starting from stop line, and the traffic density is sampled in discrete values with fixed frequency. The congestion factor indicates the general congestion state on the whole road segment, which is a number between 0 and 1, and larger value means more crowded:

Considering an intersection with four phases numbered , , *,* and , as shown in Figure 5, the phase timing can be denoted as (4.6). Here and represent the minimum and maximum green times, respectively, and is the effective green time of phase :

Under the scenario of traffic flow stops by red signal, for instance of lane during signal phase , the traffic flow from west to east will be blocked from the beginning of phase , and the interval is . The corresponding cost function on lane is denoted as (4.7). Here is timing adjustment step length, and and represent congestion factor on lane of traffic flow under blocking status by signal and normal condition with green light, respectively. The normal condition can be simulated based on (3.1) and (3.2) with initial values detected by sensor networks at time , where . And traffic parameters can be predicted by resolving the traffic equations:

With the Matlab implementation of an exact LWR solver [39], we can build a virtual simulator of traffic flow scheduling to analyze the traffic equations, congestion factor, and cost function in a theoretical way, based on given initial conditions. For traffic flow of a straight lane, consider two scenarios that traffic flow runs continuously and blocked by red signal at time , the congestion factor and cost function can be simulated. The result is shown in Figure 6.

**(a) Density factor**

**(b) Congestion factor**

##### 4.2. The Multiobjective Optimization Model for Signal Control

The problem of traffic timing optimization for an urban intersection in a crowded city has been previously approached in much research [46, 47], and the existing traffic signal optimization formulations usually do not take traffic flow models in consideration. The variables on a signalized intersection and connecting links of phase are shown in Figure 7. We define and to be the inflow and outflow, respectively, and define and to be the demand flow and exit flow during the phase in an interval , where is the timing adjustment step, and is a discrete index. Define and as the saturation flow for green and yellow times of phase at intersection . indicates the signal, and means green light and means red light. To simplify the problem we just optimize the phase timing, with assumption that phase order is kept unchanged, four phases, as shown in Figure 5, transfer in the presupposed order , , , and .

Based on the dynamics of traffic flow, the control objective of the dynamic model is to minimize the total delay and traffic congestion factor. To minimize,

With constraints subject to

For a given time window , based on constraints of (4.10), the timing problem can be separated into () subproblems. We can solve these problems and obtain noninferiority set of optimal solutions and then merge them to get a new noninferiority set of optimal solutions, which is the solution of the original problem. In this paper we use MOPSO-CD (*Multiobjective Particle Swarm Optimization Algorithm* using crowding distance) to find the optimal timing.

##### 4.3. Traffic Flow Detection and Control Algorithms

Based on the above model and computational method, the overall block diagram of traffic data detection and control algorithm is shown in Figure 8. It employs magnetic sensor and detects magnetic signature based on ATDA. The individual vehicle data is collected in time window , and traffic flow speed is monitored at regular intervals. The scattered point data , contains all sensor readings that will be used to approximate the traffic equation and numerical approximation obtained. Finally we can get the traffic data and , which is expected to provide data to traffic control and evaluate traffic congestion.

The traffic congestion state can be evaluated based on (3.9), and we can obtain the congestion factor in every segment near the intersection. At the same time, a cost function in next control phase can be calculated with a traffic scheduling simulator which is based on traffic equations and LWR solver. When we give priority to different possible directions and block traffic flow on other directions, the overall delay cost from alternative timing strategy will be taken into consideration before making the final signal, and the optimal timing can be obtained by solving a MOP. Finally, the traffic controller will choose the optimal timing scheme. This process operates in a circulation and in an adaptive way.

#### 5. Simulation Result and Performance Analysis

The model and algorithms are simulated based on VISSIM platform. The traffic flow data is generated with the Mobile Century field test dataset [22, 48] and LWR solver [39]. VISSIM is a microscope, time interval, and driving behavior based traffic simulation tool kit. It supports external signal control strategies by providing API with DLL. The simulation tool will invoke the *Calculate* interface with presupposed frequency. And user can obtain the signal control related data in this interface.

With the DLL and COM interfaces, we designed a software/hardware in the loop simulation platform based on VISSIM, as shown in Figure 9.

The traffic data for simulation is based on Mobile Century dataset. Traffic data near three intersections is used to simulate traffic data collection and timing phase optimization. The traffic network is shown in Figure 10.

We select a fixed coordinate without sensor and try to estimate traffic parameters with the method proposed in this paper based on proximity sensor readings. The estimation precision under different smooth factor is shown in Figure 11. The performance is better when compared to traffic prediction based on BP neural network.

In the control simulation, we analyzed the performance by two scenarios: control with delay constraint only and combining delay with traffic congestion factor together as the optimization objective, and compare the performance with fixed time control. On the same traffic flow dataset, the performance is illustrated in Figure 12. The criteria include average delay and the maximum queue length. The result shows that congestion factor based control optimization can increase the performance with lower average waiting time and shorter queue length.

**(a) Average delay**

**(b) The maximum queue length**

#### 6. Conclusion and Future Research

In this paper we study the traffic flow congestion evaluation and congestion factor based control method using sparsely deployed wireless sensor network. Taking into consideration the traffic flow intrinsic properties and traffic congestion model, try to obtain optimal phase timing with as few sensor node as possible. The main idea is to study the congestion and its influence on future traffic flow, combine traffic equations with the optimization function, to obtain the numerical solution of the traffic equations via approximate method, and finally to refine traffic sensor data based on data fitting. The model and algorithms are simulated based on VISSIM platform and *Mobile Century* dataset. The result shows better performance, and it is helpful to decrease average delay and the maximum queue length at the intersection.

Current research is limited to single intersection and simple segments with continuous traffic flow. Future research should focus on complex segments and even road network, such as ramp, long road with multi-intersections. And the traffic control strategy, road capability, and dynamics caused by incidents need to be taken into consideration in actual applications. Furthermore, complex traffic flow pattern simulation and traffic control strategies on a networked scale among multi-intersections and arbitrary connecting segments in road network are also an important aspect in next step.

#### Acknowledgments

This work was supported in part by the National High Technology Research and Development 863 Program of China under Grant no. 2012AA111902, the National Key Technology R&D Program of China under Grant no. 2011BAK02B02, the National Natural Science Foundation of China under Grant no. 60873256, and the Fundamental Research Funds for the Central Universities under Grant no. DUT12JS01.