Abstract

The car-free carrier platform is a product of the rapid development of the modern logistics industry and has a vital strategic value for promoting the construction of a country’s comprehensive transportation. However, due to the unreasonable platform pricing model, the industry is currently in a bottleneck period. In order to solve this problem, we established a gray correlation model to calculate the degree of correlation between each characteristic index and platform pricing based on the massive historical transaction data of a certain platform and performed K-means clustering on the results to discover the main factors affecting platform pricing. Based on the abovementioned results, we created a pricing optimization model based on the BP neural network, with the structure of 8-13-1 to predict the freight pricing of the order and test the prediction results. The test shows that the goodness of fit (R2) of the predicted value is close to 1, and the prediction error range is less than 3.7%, which proves the accuracy and effectiveness of the BP neural network model and provides an effective reference for the optimization of the pricing model of the car-free carrier platform.

1. Introduction

Road freight is a key part of comprehensive transportation construction and the main carrier for the rapid development of the modern logistics industry. However, while the road transportation continues to open up, many drawbacks have appeared, and the development status of “small, scattered, and chaotic” has gradually formed, and the development of the transportation industry has entered a bottleneck stage. Under the development environment of “internet + logistics,” the “car-free carrier” model in the freight market has emerged, and car-free carrier companies have flourished.

The car-free carrier platform is a platform-type road freight intermediary who signs a transportation contract with the shipper as the carrier, assumes the responsibilities and obligations of the carrier, and then entrusts the actual carrier to complete the transportation task. This type of platform has the advantages of strong resource integration ability, wide brand effect, and convenient network promotion. Car-free carrier platforms need to intervene in freight transactions and earn freight difference. Compared with the direct transaction between the shipper and the carrier, the car-free carrier builds an information platform to promote information communication between the two parties, implements management model innovation, integrates and reconfigures bilateral channels, provides customers with standardized services, and improves transportation efficiency. Reduced transportation costs standardized the operation of the road freight industry. Countries with great transportation needs such as China provide opportunities for the development of car-free carrier platforms. Therefore, how to scientifically price the freight tasks of the car-free carrier platforms has strong research value and practical significance [15].

However, due to short development time and unbalanced resource allocation, the pricing mechanism of most car-free carrier platforms is imperfect, or even has major defects, and lacks strong theoretical support. How to better improve the pricing mechanism of the carrier business has become a major obstacle currently restricting the development of this industry, and an effective solution is urgently needed. Therefore, we take China as an example to study the status quo of the development of car-free carrier platforms and for in-depth analysis of the main factors that affect the pricing mechanism of the platform. And on the basis of the current mainstream pricing model, we propose to establish a transportation pricing optimization algorithm based on machine learning.

2. Current Research Status

The pricing strategy and mode of transportation routes are of vital importance to the transportation industry market, and they are also an important component of marketing. Georg and Stefan inspected transportation prices through three labor supply models and studied the impact of changes in the optimal level of transportation taxes on the above models [6]. Harald and Bjørn constructed an improved freight revenue assessment framework based on the number of economic orders and detailed transportation costs, combined with the limitations of the size and capacity of transportation vehicles [7]. Pavón and Rizzi took Santiago, Chile, as an example, assuming the relevant parameters. A simulation analysis of the interaction between the city’s transportation price, transportation service provision, and cost recovery was performed [8]. Xuan and Lee established the Stackelberg model in the process of studying the transportation pricing problem with discount in the land port system composed of a land port corresponding to multiple shippers, obtained the global optimal solution by using a single breakpoint discount pricing scheme, and finally tested the effectiveness of quantity discount pricing in various situations [9]. Luis et al. took the Madrid metropolitan area of Spain as the research object and established a new optimization algorithm to optimize the four types of transportation policies and deeply explore the potential through the establishment of a land-use model and a transportation interaction model [10]. Based on marginal cost pricing rules, by simulating the user interaction process at the microlevel and introducing external factors such as the prolonged waiting time caused by passenger behavior, a pricing model of the best bus fare is constructed [11]. On the basis of game theory, Feng and Shi conducted an in-depth study on the transportation price and service quality of railway and freight transportation and then established the Hotelling model [12]. With the increasing development of China’s transportation industry, car-free carrier platforms have begun to emerge in recent years. Since the development of car-free carrier platforms in China is still in the exploratory stage, there are relatively few documents available for reference. Xiong and Su established a multiple linear regression model and an improved GA-BP model, then built a neural network decision tree prediction model based on the first line pricing and related data, and achieved the maximum rationalization of pricing by adjusting the value of information entropy [13]. In addition, other documents mainly use game models, linear regression, fitting, and other traditional mathematical model methods to study the pricing of car-free carrier platforms. For example, Nie and Li used asymmetric information theory to construct a pricing model from the perspective of game [14].

Also, most of these documents only put forward relevant suggestions on the optimization of the platform’s pricing mechanism from the policy level. Due to the limitations of traditional mathematical models, fewer factors are considered when constructing the pricing model of the platform. At the same time, it lacks the support of valid data and fails to achieve quantitative analysis. Therefore, the current research in this area generally has strong subjectivity. A scientific and reasonable route pricing model is crucial to the operating performance of the car-free carrier platform and the operational efficiency of the freight market. Therefore, the establishment of a comprehensive and perfect cargo consignment pricing model is of great significance and value to Chinese society.

3. Research Ideas

Based on the collected historical transaction data of a car-free carrier platform, this article has solved the pricing problem for actual carrier users from the perspective of car-free carriers and establishes an optimization algorithm for platform pricing. This paper has used the gray correlation model and the K-means clustering model to explore the main factors that affect the pricing of the platform and built the BP neural network model with multilayer feedforward on this basis. With the help of error backpropagation thinking, the optimal pricing decision for the platform’s orders is made through machine learning in order to promote the rapid transaction between the two sides of the platform, reduce the time cost of the transaction, greatly improve the operating efficiency of the platform, and maximize profits.

4. Data Selection and Processing

This article selects 15,498 historical successful transaction order data from a certain car-free carrier platform, and the data come from https://www.bestfriend.com. Each order includes 20 characteristic indicators, as shown in Table 1.

After entering the data, we found that some of the data are abnormal. Therefore, we preprocess the data in the following ways: (i) eliminate the abnormal order data which have exactly the same value of each characteristic indicator or too many missing values. (ii) For the order data with individual missing values, Lagrange interpolation is used to complete the missing values. (iii) Unify the format of all time data. (iv) Use the one-hot method to perform vector encoding processing on all discrete data such as , and normalize all continuous data.

At the same time, in order to ensure the rigor of the research, we also need to make the following assumptions to eliminate the interference of other factors on the research results: (i) at the current stage, the car-free carrier platform aims to accelerate transactions and lower carrier costs. (ii) Every task of the platform will be carried out. (iii) The route tasks are all fixed vehicle tasks; that is, a task needs to be completed by one vehicle of a certain type of vehicle, regardless of the load sharing task.

5. Model Establishment and Empirical Analysis

5.1. Mining the Main Factors That Affect the Pricing of the Platform—Based on the Gray Relational Model

The model of gray correlation, that is, the degree of correlation between factors is measured according to the degree of similarity or difference in the development of many factors. It reveals the characteristics and degree of dynamic correlation of things. Analyzing the development and change of a system provides a more accurately quantitative measure. The basic principle of the model is to conduct an association analysis on the similarity between the independent variable and the dependent variable. The higher the similarity between the two curve shapes, the higher the correlation between the two. Conversely, the lower the similarity between the two, the lower the degree of association; otherwise, the correlation is lower [1519]. The algorithm process is as follows.

Let be the 20 characteristic indicators in the data we selected, and are the corresponding orders’ pricing. Make to be the gray correlation degree between . The expression is shown as follows:

Among formulas (1) and (2), is the resolution coefficient, and the value range is (0, 1).

The calculation steps of the gray correlation degree are as follows:Step 1: initialize the sequence data:Step 2: calculate the difference of the dependent variable influencing the factor sequence:Step 3: calculate the maximum value M and the minimum value m in :Step 4: calculate the correlation coefficient as follows:Step 5: calculate the gray correlation degree:

Through formulas (3) to (9), we have obtained the gray correlation degree of each characteristic indicator and platform line pricing and sorted it, as shown in Table 2. At the same time, we use the K-means clustering model to cluster 20 feature indicators [2024]. According to the AIC and BIC, the 20 feature indicators are clustered into four categories, as shown in Figure 1.

According to the calculated gray correlation degree between each characteristic indicator and platform pricing and the K-means clustering result, from Table 2and Figure 1, we can see that the total mileage of the route , planned time , vehicle tonnage , vehicle length , and urgency of demandall have gray correlation coefficients higher than 0.75 with platform pricing and are grouped into one category, indicating that these five indicators are significantly related to platform pricing, which affects the platform line pricing major factor, but the gray relational algorithm cannot deeply analyze the inherent nonlinear relationship between variables. Therefore, based on the above conclusions, with the help of machine learning algorithms, we have deeply explored the internal connection between various indicators and platform transportation pricing.

5.2. Optimal Model of Transportation Pricing Based on BP Neural Network Algorithm

In order to maximize the rationalization of transportation pricing, the platform needs to work out an order price that can best match the final transaction price in the shortest possible time. While satisfying the interests of both parties on the platform, it greatly reduces the time and transportation costs spent in the quotation process. The BP neural network algorithm is a machine learning algorithm with integrated learning based on the principle of error backpropagation. We use the back propagation algorithm to adjust the weight and thresholds through the training error to make it reach the best state. Through the training of training samples, this can effectively reduce the output error caused by the algorithm and to realize the efficient learning of nonlinear relations in the sample data [2528]. Therefore, we built a multilayer feedforward neural network model based on the BP algorithm to achieve machine deep learning of the final transaction price in the order data and built a pricing optimization algorithm that maximizes the platform’s pricing. Finally, the model’s pricing forecast results are tested to prove the rationality of the pricing model.

5.2.1. Model Algorithm and Principle

The structure of the BP neural network is divided into input layer, hidden layer, and output layer. The input layer and output layer are both one-layer structure, but the hidden layer can be one or more [2935]. In this article, we constructed a model with 3 hidden layers. Under normal circumstances, the relationship between the number of hidden layer networks (n1) and the number of input layer neurons (n2) is shown as follows:

The structure of the BP neural network and the neuron model are shown in Figures 2and 3.

At the same time, based on the number of training sample sets, the hidden layers, weights, and the thresholds can be precisely calculated. According to Table 2and Figure 1, we believe that, among the 20 characteristic indicators, the 8 characteristic indicators clustered into the first two categories will have a significant impact on the pricing of the platform. Since other indicators are less relevant to pricing, we do not consider them. Therefore, the independent variable dimension of the neural network constructed in this article is 8, so the number of neurons in the input layer is 8. The dependent variable is a one-dimensional vector, which is the final transaction price of the order. Therefore, this paper constructs a network structure of 8-17-1.

Then, we trained the neural network, setting the abort error condition and normalizing the data in the training set. After this, inputting it into the neural network. Secondly, the neuron transfer functions of the hidden layer and the output layer are constructed using the tangent function of the output range [-1,1], and the LM method was chosen as the training algorithm of the BP network; the neural network training is completed [3641]. The specific training process is as follows:Step 1: setup initialization:We need to set the maximum number of training times, error conditions, initial weights, and thresholds of the neural network. Different initial weights and threshold settings will have a significant impact on the training effect of the network.Step 2: sample training:(1)Input training sample set , where q = 1, 2, 3, ……, Q(2)For each input sample, calculate the output value of each neuron in sequence as follows: , , and .Step 3: error backpropagation:Calculate the equivalent error of each layer of neurons, from the output layer to the input layer, return to Step 2 after obtaining the equivalent error, and perform forward propagation calculation and error backpropagation for other training sample pairs until all the training samples have gone through the above calculation steps.Step 4: adjust the connection weight of each layer:For the connection weight from the hidden layer to the output layer, the correction amount for each step is , which satisfies the following relationship:According to formulas (10) and (12), for the weight () from the input layer to the hidden layer, the correction amount for each step is , shown as follows:At this time, satisfies the following formula:Through formulas (13) to (15), the correction amount of the connection weight between each layer in the neural network can be obtained, as shown in the following formula:Step 5: forward calculation:Return to Step 2, and then perform forward calculations based on the weights and thresholds calculated above. If the accuracy requirements are met, that is, for each training sample, its output layer neurons all meet the following accuracy requirement:Among formula (17), , where ε is the given accuracy requirement value; when the accuracy requirement is met, the neural network training is completed; otherwise, it continues to iterate until the requirement is met.Step 6: test the results:Substitute the normalized test set into the trained BP neural network, and finally, get the test results.

5.2.2. Empirical Analysis of the Model

After testing, the optimal neural network structure selected for this question is 8-13-1. The input layer has 8 nodes, representing 8 characteristic indicators that affect transportation pricing, the hidden layer has 13 nodes, and the output layer has 1 node, which is the value for forecast pricing.

We use the collected 15,498 order data as samples and divide the training set, test set, and validation set according to the ratio of 70%, 15%, and 15%. Among them, the number of training sets is 11,211, the number of validation sets is 2402, and the number of test sets is 2403.

After repeated calculations, we set the learning rate of the model to 0.01, the learning momentum constant to 0.001, and the maximum number of training times to 1000. Then, we trained the training set data, and the process is shown in Figure 4.

Then, based on the training results of the above data, we selected the Levenberg–Marquardt algorithm to establish the BP neural network transportation pricing model and performed gradient descent on the data. The iterative test process is shown in Figure 5.

From Figure 5, we can see that when the test reaches the 57th time, the mean square error (MSE) reaches the minimum value, and the best model corresponds to the minimum MSE value.

5.2.3. Validity Test of the Model

We have used the trained BP neural network model to predict transportation pricing and test the prediction results. We perform regression fitting on the predicted value of transportation pricing and test the goodness of fit (R2). The fitting result is shown in Figure 6.

It can be seen from Figure 6that the goodness of fit (R2) between the predicted value of the test data and the measured value of transportation pricing is higher than 0.99, and the effect of the model fitting is good. It shows that the established BP neural network has high stability, accuracy, and reliability. At the same time, according to the charts of the error histogram and the forecast comparison (Figures 7and 8), we can see that the predicted results of the model for transportation pricing are highly consistent with the actual transaction price of the order, which once again illustrates the accuracy and high applicability of the model, and we have realized the optimized design of the transportation pricing model.

6. Conclusion

This paper took the car-free carrier platform in China as the research object, collected the historical order data, extracted relevant influencing factors, and built a platform route pricing optimization model based on the BP neural network algorithm. The main conclusions of this research are as follows.

By establishing a gray correlation model, we calculated the degree of correlation between each characteristic index in the historical transaction order and the final transaction price, established a K-means clustering model to cluster the above calculation results, and found “total mileage of the route,” “vehicle tonnage,” “vehicle length,” “transport level,” and “planned time” are the main factors that affect the pricing of the transportation route of the car-free carrier platform.

Based on the above results, we have established a BP neural network prediction model with the number of neurons in the input layer, hidden layer, and output layer being 8, 13, and 1, respectively. Through the training of the training set data and the error backpropagation calculation, the neural network is tested, and the model is optimized when the number of iterations reaches the 57th time. Finally, we test the prediction results of the neural network algorithm according to the fitting coefficient R2 and the prediction error value and found that the model fitting effect was good, and R2 was close to 1. And the error range between the predicted price and the actual transaction price is generally less than 3.7%. It shows that the prediction accuracy of this model is high, stable, and reliable. The optimized forecast of platform transportation pricing is realized.

Based on the above research conclusions, we put forward the following three suggestions:(1)Car-free carrier platforms should change the transportation pricing system based on traditional regression prediction models and use more accurate and time-efficient artificial neural network models for pricing(2)When setting prices, the platform should pay more attention to the influence of factors such as transportation mileage, relevant parameters of transportation vehicles, and the urgency of transportation(3)It is suggested that the noncar carrier platform can adopt the mobile quotation mechanism, and the transportation pricing can be dynamically adjusted with the change of time to increase the transaction rate of the platform

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The author declares that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This study was funded by the Humanities and Social Sciences Research Major Project of the Education Department of Anhui Province (SK2017A0452), the Teaching and Research Fund Project of the Education Department of Anhui Province (2020jyxm0017), “First-Class Course” of Anhui University of Finance and Economics (acylkc202008), and the Teaching and Research Fund Project of the Anhui University of Finance and Economics (acjyyb2020011 and acjyyb2020014).