Abstract

Floating car data are beneficial in estimating traffic conditions in wide areas and are playing an increasing role in traffic surveillance. However, widespread application is limited by low-sample frequency which makes it hard to get a complete picture of a vehicle’s motion. An accurate and reliable reconstruction of a vehicle’s trajectory could effectively result in a higher sampling frequency enabling a more accurate estimation of road traffic parameters. Existing methods require additional information such as nearby vehicles, signal timing strategies, and queue patterns which are not always available. To address this problem, this paper presents a method used with low-sample frequency data to reconstruct vehicle trajectories through intersections, without the need for extra information. Furthermore, the additional parameters for the speed-time curve distributions for deceleration rate and acceleration rate are generated. A piecewise deceleration and acceleration model is developed to calculate the acceleration rate for different travel modes in the trajectory. The distribution parameters of the acceleration data for each travel mode are then estimated using a new Expectation Maximization (EM) algorithm. The acceleration statistics are then used to reconstruct the corresponding parts of the trajectory. Compared to the reference trajectories (truth), the test results show that the method developed in this paper achieves improvement in accuracy ranging from 16 to 67% over the commonly used linear interpolation method. In addition, the proposed method is not very sensitive to the sampling interval of the floating car data, unlike the linear interpolation method where the error grows rapidly with increasing sampling interval.

1. Introduction

The potential for the contribution of floating car data to traffic state monitoring has attracted significant research in the past decade [13]. However, constrained by the penetration rate of equipped vehicles and cost of data storage and transmission, the sampling interval of floating car data can be low, restricting their application. Liu et al. evaluated the sensitivity of delay measurement to sampling frequency of floating car data, showing that delays estimated from data sampled at a 10-second interval are consistent with those from a 5-second interval for 74% of the cases. However, when the sampling interval is 60 seconds, the consistency drops to 37% [4]. Patire et al. investigated the question of how much GPS data are needed for a traffic information system capable of providing accurate speed (and thus travel time) information. They point out that this question must address issues of data quality in terms of sample rate and penetration rate [5]. Bucknell et al. investigated the relationship between penetration rate and sampling rate, concluding that, in general, increasing sample frequency is more beneficial when the current penetration rate is low [6]. High-resolution trajectory data are ideal for traffic state monitoring. However, the majority of trajectory data (especially those for commercial use) today are collected at a relatively low sampling frequency. Hence, it is necessary to develop methods to reconstruct vehicle trajectories from low-frequency floating car data. Researchers have conducted various studies on vehicle trajectory reconstruction, classified into two categories. One category aims at eliminating the trajectory noise caused by GPS positioning error, while the other attempts are to reconstruct the vehicle’s trajectory from low-frequency floating car data.

For trajectory error correction, Huang et al. propose an approach for predicting vehicle trajectory by using information from nearby vehicles to construct the local driving environment [7]. Fard et al. present a simple two-step method based on wavelet analysis for filtering errors and reconstructing trajectories. Wavelet transform is employed to identify and modify the outliers [8]. Marcello et al. develop an approach with a multistep procedure based on the information of traffic kinematics and vehicle dynamics [9]. Since high-frequency data cannot be collected at a large scale, it is impossible to provide high-accuracy traffic monitoring of the road network.

To reconstruct the trajectory between spares probes, Sun et al. apply the Variation Formulation (VF) method to obtain the complete picture of traffic flow using sample vehicle trajectories. However, to properly apply the VF method, the shockwave boundaries have to be known, which is estimated from the fixed sensors data [10]. Wan et al. propose Expectation Maximization (EM) and Maximum Likelihood (ML) methods to reconstruct trajectories between two consecutive transit bus GPS updates. The path is divided into short segments and the segment travel time is allocated iteratively with the EM method. The maximum likelihood trajectory is generated based on travel time statistics [11]. In the method, it is assumed that the traffic signal timing is known. Hao et al. propose a model investigating all possible driving mode sequences between data points. Detailed trajectories are reconstructed based on the optimal driving mode sequences [12]. Hao et al. develop a modal activity-based stochastic model for reconstructing the trajectories from sparse data [13]. However, in their approach, the distribution parameters of four types of modal activity are estimated with the second-by-second trajectory in the NGSIM Lankershim dataset. The NGSIM data consist of detailed vehicle trajectory and are collected using digital video camera. The precise location of each vehicle on a 0.5 to 1.0 kilometer section of roadway is recorded every one-tenth of a second. The applicability of this method to different locations and traffic conditions is still to be proven. Wang et al. proposed an approach for trajectory reconstruction with sparse probe data for estimating control delay. The method describes the deceleration and acceleration as a piecewise constant deceleration process and acceleration process, respectively. This method has the limitation of poor stability [14].

The problem for trajectory reconstruction is that only a few discrete sampling points are available. Furthermore, the correlation between sampling points is weak as a result of low-frequency sampling. Therefore, in the current methods, additional information is required for reconstruction. For example, Huang et al. exploit information from proximate vehicles. Wan et al. use signal timing strategies and queue patterns to assist in the trajectory reconstruction. In the approach proposed by Hao et al., the parameters for deceleration and acceleration are estimated from second-by-second vehicle trajectory data. However, in reality, the extra data are not always available. Hence, this paper proposes a trajectory reconstruction method for signalized intersections with low-frequency floating car data.

In this paper, it is assumed that the vehicle driving mode through an intersection evolves in a given pattern, that is, cruise ) – deceleration ) – idle ) – acceleration ) – cruise ). More precisely, a deceleration (acceleration) rate-based piecewise process is used to model the driving mode. Based on this assumption (i.e., the 5 driving modes), a model is developed to calculate deceleration and acceleration rate from selected historical data. An Expectation Maximization (EM) algorithm is specified to estimate the corresponding distribution parameters. For given low-frequency sample points, when the distribution parameters of deceleration and acceleration are available, the time period and distance for each of the five modes are calculated by solving a Constrained Quadratic Programming (CQP) problem.

In this paper, unlike the current methods, only low-frequency floating car data are used, with no additional or extra information. Furthermore, in addition to position-time curve, speed-time curve, and distribution parameters of acceleration (deceleration) rate are obtained. By reconstructing the trajectory of the low-frequency trajectory, more accurate and valuable information could be extracted from trajectories, enabling floating car data to play a more important role in traffic state monitoring.

The paper is structured as follows. The next section presents the methodology followed by the results and their analysis and conclusions.

2. Methodology

Different from the current methods, the method developed in this paper reconstructs the trajectory in two parts, deceleration process and acceleration process (i.e., not the trajectory between two consecutive updates). Traffic signal timing and historical queue length data are not needed. The main factors that may influence trajectory reconstruction include traffic conditions, number of sample points, and a vehicle’s stop position relative to the intersection. To better reconstruct the low-frequency trajectory, the numbers of sample points and stop positions are investigated in the first instance. It should be noted that in our statistics even if there are many sample points with zero speed, the number of points is recorded as 1. For an arterial street with signalized control, we generate the number of sample points and stop positions of floating cars for different periods of the day over a timespan of three weeks. The results are shown in Figure 1.

As shown in Figures 1(a) and 1(b), whether in the peak hour (7:00-9:00 am) or off-peak hour (10:00-12:00 am), the distribution is skewed to the left. This means that, for the floating car through the intersection, there are more cases generating 3, 4, and 5 sample points than 6, 7, and 8 points, with the most likely being 4. Figures 1(c) and 1(d) show the vehicle stop position histogram, exhibiting a bimodal distribution. There is no obvious difference between the peak hour and off-peak hour. However, in both figures, the vehicles rarely stopped between 100 and 300m. The data used for the analysis here were collected by taxis. Hence, the gap in the stop position histograms is attributed to the traffic volume. When the traffic volume was low, the taxi drivers preferred to stop close to the intersection by changing lane. However, when the traffic volume was high, all the lanes were mostly occupied with other vehicles, so that the taxis had to queue. Hence, in most cases, the taxis stopped relatively far from the intersection. This led to the vehicle stopping either close to or far from the intersection. From the histogram, it could be inferred that the maximum queue length of the intersection was about 400m.

A vehicle’s motion is disrupted at the point where the speed is zero. Hence, the trajectory reconstruction is divided into two processes: deceleration and acceleration.

The schematic of this research is shown in Figure 2, for reconstructing the maximum likelihood trajectory from all possible trajectory sets with sparse sample points A, B, and C. Figure 3 presents the reconstruction steps, with the main elements of estimation of the distribution parameters for different modes and maximum likelihood trajectory reconstruction.

2.1. Acceleration Rate Estimation for Different Modes

The model for calculating the acceleration and deceleration rates with the low-frequency floating car data originates from previous work by the authors on the estimation of vehicle control delay [13]. The model is developed on the basis of a piecewise constant deceleration model and a Simple Platoon Advancement (SPA) model. As illustrated in Figure 4, deceleration and acceleration are both divided into three parts. For the deceleration process, the first stage represents a vehicle travelling at the free-flow speed. The second and third stages represent vehicle deceleration rates and in m/s2. Similarly, for the acceleration process, the first and second stages are represented by acceleration rates and . The third stage represents a vehicle travelling at the free-flow speed. Hence, the vehicle driving pattern through the intersection is as follows: cruise ) – deceleration1 ) – deceleration2 ) – idle ) – acceleration1 ) – acceleration2 ) – cruise ).

Based on the model and driving pattern for a low-frequency trajectory, deceleration rates a2 for mode2 (deceleration1) and a3 for mode3 (deceleration2) and acceleration rates a5 for mode5 (acceleration1) and a6 for mode6 (acceleration2) can be calculated by solving a constrained nonlinear programming problem as follows.

For a typical low-frequency trajectory of the probe vehicle shown in Figure 5, the vehicle travels at the free-flow speed at Point 1 and then decelerates between the point in the dotted box and Point 2. At Point 3, the vehicle has stopped and is waiting for the traffic light to turn green. From Point 4 to Point 5, the vehicle accelerates to reach the free-flow speed. Points 1-5 cover the entire motion process of the vehicle through the intersection. The deceleration onset and acceleration end points correspond to the points in the dotted boxes.

The deceleration rates and are calculated with the developed model.

Let be the time interval between and and let be the time interval between and . The time and distance between Point 1 and Point 2 are expressed by the following equations:where T is the sampling interval of the probe vehicle data and is also the time travelled between Point 1 and Point 2. and are the instantaneous velocities at Point 1 and Point 2, respectively. In this paper, the sample interval is 30 seconds. is the distance between Point 1 and Point 2; . In (1) and (2), there are three unknown parameters. The only solution cannot be obtained from this nonlinear equation group. Therefore, to obtain and , expressions 1 and 2 are converted into a nonlinear programming problem with the objective function as follows:subject toBy solving the nonlinear programming problem, the values of and could be calculated. For the acceleration model, the acceleration rates of and are calculated as follows:subject towhere v4 and v5 are the instantaneous speeds at Point 4 and Point 5, respectively, =(v4+v5)/2, and L45 is the distance between Point 4 and Point 5.

With a2, a3, a5, and a6, the trajectory can be reconstructed. However, the model only works for the scenarios where there is a sample point before deceleration and in deceleration. For the acceleration process, the requirement is that there is a sample point in acceleration and after acceleration. Nevertheless, probe vehicle updates normally occur at random positions and times, so there may be no sample point in or before deceleration in a trajectory. To address these issues, we propose an approach to reconstruct the maximum likelihood trajectory to alleviate the effect caused by the randomness of sampling. The essential parameters of our method are distribution parameters of deceleration rate and acceleration rate for different modes.

2.2. Distribution Parameters Estimation of Acceleration Rate for Different Modes

In this section, the valid trajectories are selected from the historical trajectory data. A valid trajectory has a sample point before deceleration and in deceleration and in acceleration and after acceleration. By modeling and solving the nonlinear programming problem (see (4) and (6)), the acceleration rate for modes 2, 3, 5, and 6 for each valid trajectory is obtained. To avoid errors by using these directly to calculate the distribution parameters of acceleration rate for different modes, we adopt an Expectation Maximization method to iteratively calculate the acceleration rate for each mode. It is assumed that the deceleration rate and acceleration rate are independent from each other. For the idle and cruise modes, the mode time mainly depends on the signal plan, traffic condition, and intersection placement. In this paper, it is assumed that the idle and cruise mode travel times t are uniformly distributed. where i = 1, 4, 7 correspond to cruise , idle , and cruise modes.

For the deceleration and acceleration modes, it is assumed that the acceleration rate follows a Gaussian distribution. The travel time t that a vehicle spends under the deceleration and acceleration modes is the product of speed variation divided by corresponding time period:where j = 2,3,5,6 correspond to deceleration1 (mode2), deceleration2 (mode3), acceleration1 (mode5), and acceleration2 (mode6). Each valid trajectory provides an observation of deceleration rate and acceleration rate for different modes. Assume that there are M floating cars passing through the intersection. Suppose that k⩽M is the trajectory sequence number. The accelerate rate for each mode is calculated.

E Step. For mode j (j=2, 3, 5, 6), we calculate the mean and standard deviation of the corresponding acceleration. Under the assumption that the acceleration rate follows a Gaussian distribution, the probability density function for is determined aswhere is the acceleration rate for mode i of kth trajectory,  j=2,3,5,6. For ease of calculation, the log-likelihood function of is used.

M Step. In this step, for each valid trajectory, deceleration rates and acceleration rates for different modes are recalculated such that the likelihood function is maximized. For the deceleration process of each trajectory, its likelihood function can be represented assubject to the equalitywhere is travel time of mode1 for the kth trajectory, and are the speeds at the first sample point and second sample point for kth trajectory, respectively, , , , and is the distance between the first sample point and the second sample point. In the M step, and are considered to be constants, and is fixed. Therefore, the terms containing them can be ignored when maximizing the likelihood function. This is a Constrained Quadratic Programming (CQP) problem. The solution of this CQP problem is the travel time of mode1 and the distribution parameters of acceleration rate for mode 2 and mode 3.

In a similar way, the likelihood function for the acceleration process is formulated as subject to the equalitywhere is the travel time of mode7 for the kth trajectory, and are the speeds at the fourth and the fifth sample points for the kth trajectory, , , , and is the distance between the fourth and the fifth sample points. It should be noted that the M step is run separately for each trajectory. When the M step ends, the method goes back to the E step to update the mean and variance based on all trajectories. When the difference between consecutive iterations is below a threshold, the iteration process stops.

2.3. Maximum Likelihood Trajectory Estimation

With the distribution parameters of acceleration for each mode being known, the maximum likelihood trajectory of a given low-frequency trajectory is estimated. As a result of low-frequency sampling, the sample points may be located at any position of the intersection. Besides, the number of sample points in a trajectory is uncertain. To better capture the difference, we define scenarios for the deceleration stage of a low-frequency trajectory each with different numbers and reconstruct the maximum likelihood trajectory for each.

Scenario 1. 1 sample point before deceleration, none in the deceleration stage.

Scenario 2. 1 sample point before deceleration, 1 in deceleration.

Scenario 3. Multiple sample points before deceleration, none in the deceleration stage.

The scenario definitions for the acceleration stage are similar to those for deceleration.

Scenario 1. 1 sample point after acceleration, none in the acceleration stage.

Scenario 2. 1 sample point after acceleration, 1 in acceleration.

Scenario 3. Multiple sample points after acceleration.

Here the sample points refer to the points where the speed is above zero. Different trajectories contain different numbers of sample points and different combinations of scenarios. The trajectory reconstruction is divided into two parts: deceleration process and acceleration process. The reconstruction method is the same for both processes; hence, only the deceleration process reconstruction with different sample points is illustrated here.

(1) Scenario 1 for Deceleration Trajectory. It is assumed that P1 is the sample point where the speed is above zero; P2 is the sample point where the speed is zero. The distribution parameters of deceleration rate for mode 2 and mode 3 are and . In order to reconstruct the maximum likelihood trajectory of deceleration process, the following is formulated and solved:subject toIn scenario 1, v2=0, v1-=-v2=v1/2. The solution to this problem generates the travel time for mode1 and deceleration rates of mode2 and mode3. =t1+v1/(2a2); =t1+v1/(2a2)+v1/(2a3). This information is used to construct the speed-time curve and the position-time curve, that is, trajectory of the vehicle. v is the speed of the vehicle and y is the position of the vehicle at time t. According to the model, the expressions for v and y are as follows:

(2) Scenario 2 for Deceleration Trajectory. As shown in Figure 6(b), there is a sample point P1 before deceleration and another, P2, in the deceleration stage of a trajectory. In this scenario, the trajectory between P1 and P2 is reconstructed with the following expressions: subject toHowever, a part of the deceleration is unclear. To reconstruct the entire deceleration process, it is assumed that a vehicle decelerates at rate from Point P2 to Point P2′ as shown in Figure 6(b). The time and position when the vehicle is stationary can be calculated and this estimated point is regarded as a virtual sample point P2′. If the position of P3 is different from that of point P2′, this indicates that the estimated trajectory has an obvious error. However, the estimated trajectory could be corrected and the estimation error reduced. Keep the obtained acceleration rates and unchanged and assume that a vehicle travels at speed :The solution of this problem generates the travel time of mode1 and acceleration rate of mode2 and mode3. The speed and position at time t of the vehicle are obtained as in scenario 1. The trajectory reconstruction for scenario 2 is then completed.

(3) Scenario 3 for Deceleration Trajectory. For scenario 3 as shown in Figure 6(c), there is no sample point in deceleration and there are two sample points before deceleration. The trajectory reconstruction between P2 and P3 is the same as scenario 1. However, for the trajectory between P1 and P2, the assumption that a vehicle travels at a constant speed may not hold. Because the probe data are based on a fixed time frequency, more sample points represent a longer time at the intersection. Under this circumstance, a vehicle may experience fluctuation in speed before deceleration, especially in congested traffic. Hence, we propose a piecewise constant acceleration rate method to construct the trajectory between P1 and P2. This method adapts to various traffic conditions and the constant speed motion is a special case. Firstly, another virtual point P is assumed with the corresponding speed which is the average speed between P1 and P2. It is assumed that a vehicle’s speed at point P1 is ; the vehicle’s speed reaches at time at a constant acceleration rate. Then the vehicle’s speed becomes at the same acceleration rate at timestamp T. The speed and position are then determined by the following expressions:T is the sample interval. After solving the problem, the speed and position of the vehicle between P1 and P2 are obtained. Combined with the trajectory between P2 and P3, the trajectory of the deceleration process is constructed.

The trajectory reconstruction of the acceleration process is similar to that for deceleration. For a trajectory, when deceleration and acceleration reconstructions are completed, the stop position and duration can be calculated. The low-frequency trajectory reconstruction is then completed.

3. Field Experiments

In this paper, the site for experiment site was a portion of the south bound carriageway on Songshan Road, spanning the intersection of Huanghe and Songsha Roads in the Nangang District of the city of Harbin in China (Figure 7). To estimate the distribution parameters of deceleration rate and acceleration rate, accumulated historical data collected by taxi from January to June 2017 are used. There are about 66700 taxis in Harbin in 2017, proving a good coverage of the road network. The typical sample frequency of the taxi is 1/30 Hz. As the type of the floating car in this paper is a taxi, to guarantee the statistical significance of the results of the distribution parameter estimation, the trajectories that included pick-up and drop-off points were excluded. The low-frequency trajectory passing through the targeted intersection for scenario 2 is selected to estimate the distribution parameters of deceleration rate and acceleration rate.

To test the accuracy of the proposed method, a field experiment was conducted in the experiment site. Four probe vehicles were equipped with GPS receivers to collect high-frequency GPS data at 1Hz. The vehicles were driven in the north-south direction (red arrow direction in Figure 7) traversing straight through the intersection repeatedly during the periods of 06:00-11:00 and 13:30-19:00 on a normal weekday, capturing the morning and evening peaks and off-peaks, respectively. To obtain the possible trajectories of the vehicles through the intersection, the vehicles joined the flow at different locations. The process lasted for six hours generating 189 valid high-frequency GPS tracks.

The statistical results for acceleration are shown in Table 1.

As can be seen in Table 1, in the deceleration process, the vehicle’s acceleration in the first stage is lower than that in the second stage. However, in the acceleration process, the vehicle’s acceleration in the first stage is higher than that in the second stage. The standard deviation for mode4 is the smallest. The standard deviation is a function amongst others of differences in driving behavior and habits of the driver.

4. Result Analysis and Discussion

4.1. Trajectory Reconstruction for Different Sample Points with the Proposed Method

With the distribution parameters of acceleration rate for each mode, the trajectory from the low-frequency floating car data can be reconstructed with the proposed method. To test the proposed method’s performance, the estimated trajectories are compared with the observed trajectories achieved in a field experiment. Figure 8 shows this comparison for three typical low-frequency trajectories, each with a different number of sample points and combinations of sample scenarios. Figure 8(a) represents the trajectory with 4 sample points, scenario 1 for deceleration trajectory and scenario 3 for acceleration trajectory. Figure 8(b) illustrates trajectories with 5 sample points, scenario 2 for deceleration trajectory and scenario 3 for acceleration trajectory. Figure 8(c) illustrates trajectories with 7 sample points, scenario 3 for deceleration trajectory and scenario 3 for acceleration trajectory.

From Figure 8, it can be seen that in general the error between the estimated trajectory and the observed trajectory is small for the most part of the trajectory, which indicates the effectiveness of our method. For the trajectory in Figure 8(b), the performance of the method is the best. For the trajectory in Figure 8(a), the trajectory error between the third and fourth sample points is relatively large, while for the trajectory in Figure 8(c), the large estimation error occurs in the first and third sample points. To explain this phenomenon, Figure 9 shows the estimated and observed speed-time curve of the three trajectories. It can be seen that, for the trajectories in Figures 8(b) and 8(c), the vehicle experiences another deceleration after accelerating. For the trajectory in Figure 8(c), the vehicle even experiences another stop (inconsistent with our assumption) and experiences speed fluctuation before decelerating resulting in the relatively large trajectory estimation error. In general, the proposed method provides a good reconstruction of the low-frequency trajectory in all the cases investigated.

To further demonstrate the significance of the proposed method, the estimated trajectories were compared with trajectories that were created with the linear interpolation method. In the linear method, it is assumed that the distance the vehicle travels between any consecutive updates is fixed. As illustrated in Figure 10, the trajectories estimated by the proposed method have a better performance than the trajectories created by the linear interpolation, especially for the estimation of the stop period. In addition, the proposed method captures the change in the trajectory caused by the change in speed, while the linear-interpolation method treats all the trajectories in a similar way. From Figure 10, it can be seen that the proposed method’s performance is obviously better than the linear-interpolation method, especially for the deceleration and acceleration processes of each trajectory, which indicates the effectiveness and practicality of our method. Note that, even in the cases where the selected trajectory is relatively complex, with fluctuations in acceleration and deceleration, the estimated trajectory is close to the observed trajectory for the most part.

To quantitatively analyze the estimation results, the Mean Absolute Error (MAE) between the estimated and ground truth trajectories is calculated. The MAE is defined as follows:where is the estimated position and is the actual position at time step i. Hence, the estimation error of a group of trajectories, MMAE, is defined aswhere MAEi is the estimation error of the ith trajectory and m is the number of trajectories. The MAE between the estimated and ground truth trajectories is calculated. Besides, the stop period is also an important evaluation indicator. TAE is defined as the absolute error of the estimated time period and the actual time period for each trajectory. Hence, MTAE is defined as the mean of the TAE of a group of trajectories. To test the ability of the proposed method to reconstruct trajectories generated in different traffic conditions and stop positions, the trajectories were classified and counted according to the speed of deceleration onset point and stop position. The MMAE and MTAE values for each category of trajectory are listed in Table 2.

Compared with the linear interpolation method, the proposed method generally provides better estimation in terms of both MMAE and MTAE. For the proposed method, when the speed at the deceleration onset point ranges from 0 to 15 km/h, which indicates that the traffic is congested, the MMAE and MTAE are obviously larger than those for other speed ranges. However, for the linear-interpolation method, there is no obvious difference between each speed range. For the proposed method, the MMAE ranges from 3.47 to 10.53 with MTAE from 3.8 to 9.9. For the linear-interpolation method, the MMAE range is 9.35-13.9 and the MTAE range is 9.7-13.1. When the speed range at the deceleration onset point is 45-60 km/h, the proposed method achieves smaller MMAE and MTAE than the other speed intervals. This may be because when a vehicle travels at a relatively high speed, the travel pattern is in line with the modal structure in our proposed method. Conversely, when the vehicle travels at a low speed, more sample points are generated and the vehicle may experience stop and go and even pass through the intersection in two or more signal cycles. This circumstance is not consistent with our model’s assumption. This could explain the differences in performance of the proposed method under different travel speed ranges. It should be noted that the worst performance appears when the vehicle travels at low speed and stops at the furthest distance from the intersection. This is the case for both the proposed and linear interpolation methods.

In this paper, the number of sample points of the trajectory is an important factor for trajectory reconstruction. To give more insight into the proposed method’s performance with different number of sample points, the MMAE for different number of sample points and travel speed ranges can be visualized in Figure 11.

Figure 11 shows that, for a specific stop position range, when the number of sample points is below 6, the MMAE is not affected by the number of sample points. However, when the number is above 6, as the number increases, the MMAE increases. The further the stop position from the intersection, the larger the error. When the number of sample points is below 7, the stop position range has no effect on the MMAE. However, when the number is above 6, the MMAE increases with the distance from the stop position to the intersection.

In this paper, the low-frequency floating car data is sampled at 30s. The effect of higher sampling intervals is investigated through a sensitivity analysis employing both the proposed and linear interpolation methods.

In Figure 12, “P” represents the proposed method and “B” the linear-interpolation method, with the number before them being the sample interval in seconds. The point inside each box represents the mean value. In Figure 12(a), it is shown that the linear-interpolation method is more sensitive to the sample interval. By contrast, the proposed method has a better performance in stability. For the sample intervals 30, 40, 50, and 60 the corresponding MMAE values from the proposed method are 7.2, 9.0, 10.5, and 11.7m. The corresponding MMAE values for the linear-interpolation method are 11.0, 13.5, 15.4, and 17.0m. It can be seen that the difference in performance between the proposed and linear-interpolation methods increases with the length of the sample interval. Figure 12(b) presents the MTAE for different sample intervals for the proposed method and the linear-interpolation method. As the length of the sample interval increases, the MTAE for the linear-interpolation method shows greater growth and includes frequent abnormal values. Hence, the estimation results from the proposed method are more reliable than the linear-interpolation method.

5. Conclusions

In this paper, a novel method is proposed to reconstruct a vehicle’s trajectory from low-frequency floating car data. Under the assumption that the vehicle travels in a certain pattern, a model is developed to estimate the distribution parameters of deceleration rate and acceleration rate with selected historical data. With the distribution parameters as prior information, trajectories for different scenarios are constructed. Compared to a reference trajectory (truth), the proposed method achieves encouraging accuracy, with MMAE ranging from 3.47m to10.53m, higher than the commonly used linear-interpolation method which that MMAE ranging from 9.35m to 12.51m. In our method, extra information is not required and, in addition, the speed curve and distribution parameters of the deceleration rate and acceleration rate are obtained. The proposed method, therefore, paves the way for a widespread application of floating car data in transport.

The presented work has established a framework for vehicle trajectory reconstruction that could be modified or solve other traffic problems in the future. To improve and extend the application of our model, future work will include the following cases:(1)Vehicles experiencing stop-and-go behavior when the queue at an intersection is long. In this case, the vehicle movement may deviate from our assumed driving pattern, increasing trajectory estimation error. We will extend our model to account for the limitations of our assumptions to improve its robustness.(2)Queue length at an intersection is long such that vehicles pass through the intersection in two or more signal cycles. In this case, we do not recommend the application of our method. We mine multicycle vehicle trajectories to capture the vehicle movement patterns and use them to improve our model.(3)Queue profile at the intersection. Combined with the traffic shockwave theory, the queue length and control delay of the intersection could be effectively estimated. We will update our model to incorporate traffic shockwave models.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

The work is supported by the National Natural Science Foundation of China (NSFC) (Grant no. 51578198).