Green Intelligent Transport SystemView this Special Issue
Modeling of a Small Transportation Company’s Start-Up with Limited Data during Economic Recession
This paper presents a modeling method for analyzing a small transportation company’s start-up and growth during a global economic crisis which had an impact on China which is designed to help the owners make better investment and operating decisions with limited data. Since there is limited data, simple regression model and binary regression model failed to generate satisfactory results, so an additive periodic time series model was built to forecast business orders and income. Since the transportation market is segmented by business type and transportation distance, a polynomial model and logistic curve model were constructed to forecast the growth trend of each segmented transportation market, and the seasonal influence function was fitted by seasonal ratio method. Although both of the models produced satisfactory results and showed very nearly the same of goodness-of-fit in the sample, the logistic model presented better forecasting performance out of the sample therefore closer to the reality. Additionally, by checking the development trajectory of the case company’s business and the financial crisis in 2008, the modeling and analysis suggest that the sample company is affected by national macroeconomic factors such as GDP and import & export, and this effect comes with a time lag of one to two years.
Transport infrastructure is critical to economic development of a country and can provide competitive advantage. Within China there is a diversity of transport companies which could be split between with large-scale transport enterprises (LTEs) and small- and medium-sized transportation enterprises (SMTEs). There are many distinct differences between the two types of enterprises. Generally, LTEs, more geographically spread, attract larger enterprises and resource management is not as critical. SMTEs especially in their start-up period need to be more agile in the use of their financial resources to ensure survival. This is particularly true during a period of economic change. This paper focuses on SMTEs since they are the main suppliers of road transportation services in China. They necessarily play the role of first-and/or-last kilometer carriers for door-to-door logistics services. Some 225 259 enterprises and 4 595 600 individually owned businesses were engaged in transportation, storage, and postal services in 2008 . The total value of the top 50 logistics enterprises, about 475.6 billion Yuan, is only 0.53% of the national added value of the logistics industry . Compared with LTEs, SMTEs face much greater risk, especially during an economic downturn. The survival of a business entity depends heavily on its ability to anticipate and prepare for change rather than wait for it and then react to it .
Demand forecasting is so significant for SMTEs that it can help to improve equipment utilization and establish smarter operational and investment strategies. Realistically, business owners estimate the service demand from their past experiences which can be wrong or misleading. Almost all freight demand analysis usually accounts for a whole country, a region, or a corridor between cities by integration or by mode and is usually to do with public transportation planning [4–6]. Two previous research articles which examined SMTEs’ demand forecasting tackled less than truckload (LTL) for both short-term and long-term forecasting. In , it is believed that the combination of neural networks and traditional time series analysis is good for forecasting short-term logistic demand for an LTL carrier. A small-to-medium-sized road transport service which offered a limited collection and delivery service for small and large consignments in a number of geographical zones of equal area is reported in .
The objective of this study is to model the start-up and growth of a newly established truck transportation company during the economic recession whose main business is seaport containers and bulk inland transportation. It ultimately aims to help SMTEs to look for ways of improving equipment utilization in the short to medium term by forecasting the orders and the trucks required, as well as the impact of financial crisis. The main feature of our work is its use of limited data to analyze and forecast a small transportation company’s business as it starts up in a rapidly changing environment.
The context of the model has two distinct elements: enterprise start-up and the short time series. Obviously, most enterprises in their start-up period need to grow their business to an economic scale. If they fail to do so the long time survival of the enterprise may be under question. Hence, the model selected will have to accommodate such growth. The second element reflects the need for quick appraisal of likely demand, so that appropriate economic strategy can be developed by the enterprise. This differentiates start-up SMTEs from LTEs where there is possibly longer times series and higher correlation of performance with macroeconomic factors. The modeling proposed attempts to cope with these two elements.
The data covers all the business orders of a small truck transportation company during a 40-month period (January 2008 to April 2011). Registered at the end of 2007, the small truck transportation company currently has a fleet of 30 trucks and 30 drivers. Some trucks are container trucks suitable for container transportation and others are bulk trucks designed for heavy long-distance bulk transportation services. This company mainly provides services for pier container transportation and inland bulk transportation. The container transportation service is mainly provided for container customers who have exporting and importing businesses at three ports located in Shenzhen, namely, Yantian, Shekou, and Mawan. In order to make use of the empty containers at these ports, a small part of the bulk is transported for the container customers in container trucks at the same price as bulk transportation. Most of the rest is carried by bulk trucks for different customers.
All the business orders from this company are classified according to the categories of the customer industry (see Table 1). “Money” in this paper represents the gross income which equals the orders multiplying the unit price. The customers come from eight industries, and 15 logistics customers contribute 42.7% of the total money, three machine customers account for 40.9% of the orders and 21.9% of the gross income, and the remaining 14 customers from the food, toy, chinaware, material, craft, and electronics industries account for 34.123% of the orders and 35.432% of the total gross income.
Further analysis shows that there are two patterns as regards gaps in orders, regular and irregular. Regular orders are made either continuously on weekdays and broken at weekends and vacations (shown in Figure 1(a)) or discretely and regularly (shown in Figure 1(b)). Irregular orders are made randomly, as shown in Figures 2(a) and 2(b). The customers who make regular and irregular orders are listed in Table 2.
(a) Continuous, only broken at weekends and vacations
(b) Discrete and regular
(a) Occasional orders
(b) Relatively longer span and irregular
In order to find out whether the economic recession of 2008 had an effect on the company’s business, further analysis of the orders and money per month was conducted for both regular and irregular businesses. It should be noted that the Chinese New Year usually falls in February when almost all businesses are at their lowest level of production. A three-year period is considered and the cycle starts from February of the first year to January of the last year. Therefore, “1” in Figure 3 represents February 2008 and “36” means January 2011.
The plots of the irregular orders and money per month shown in Figure 3 describe two factors. (1) The irregular business was affected by the economic recession which hit China between the third and fourth quarters of 2008. The downturn began in October 2008. (2) Orders declined rapidly from October 2008 and eventually disappeared.
Originating in the developed countries, the economic crisis soon spread to China, as shown by the decreasing number of orders. Those companies involved in overseas markets were affected first, in the third quarter of 2008. Pure OEM (original equipment manufacturing) and excessive reliance on overseas market were the two major factors which made these companies fail in the recession rapidly. Those companies relying on the home market were affected by a time lag, because of reduced business with the export-oriented companies and the declining purchasing power of workers laid off by those companies.
The regular orders and money per month in total are given in Figure 4, from which conclusions can be drawn as follows.(1)There is obvious periodicity by year both for orders and for income.(2)There is an increase in trend, but the slope is gradually decreasing.(3)Orders and money surprisingly increase after the arrival of the recession.
Some weak companies ceased trading almost immediately, but the robust ones survived. Businesses concentrated on those surviving companies.
Every coin has two sides. The recession had both negative and positive impacts on the company’s business. Whereas the business trade was slowing down, the quality of the company’s customers was better after the recession and a large number of irregular businesses disappeared. Graphs of the orders and income have similar shapes, so the regular group is the focus of the following work.
3. Modeling and Forecasting
Most modeling and forecasting approaches with limited data are about rapidly changing industries like motion pictures, telecommunications, or new products with a short history [9–13]. They suggest that combining ARIMA and diffusion models can improve one-year-ahead predictions, especially in the high technology market. The drawback, however, is the need for enough historical data points to create a time series. The necessity of judgment in forecasting with insufficient data for statistical methodology was discussed in . The key is to impose a structure with methods such as surveys of intentions or expectations, judgmental bootstrapping, structured analogies, and simulated interaction. A case study about fashion products, generally characterized by high demand uncertainty, high stock out costs, and a high risk of obsolescence, proposes that preorder data and judgments can be used to overcome the lack of data in demand forecasting. Advance order data can be obtained by allowing a selected group of customers to preorder at a discount from a preview catalogue. Judgments can be obtained from purchase managers or other company experts .
A significant improvement in nonlinear time series analysis (NTSA) has been seen since the 1990s . Tong listed the following as the five most promising directions: the interface between NTSA and chaos, the nonparametric/semiparametric approach, nonlinear state space modeling, financial time series (in both discrete time and continuous time), and nonlinear modeling of panels (e.g., spatially distributed) in time series. A polynomial spline approximation was employed to estimate the functional coefficient regression models for nonlinear time series . A time-varying coefficient time series model with a time trend function and serially correlated errors to characterize the nonlinearity, nonstationarity, and trending phenomena was discussed in . A time series can be decomposed into trend, cycle, and seasonal and random components . That is, moving averages, isolating seasonal factors and seasonal adjustments are common methodology for treating seasonal adjustment of time series.
No single model or combination model has been proved to outperform the rest. In order to model and forecast the small truck transportation company’s start-up and growth after the economic recession from the limited data described in Section 2, relative variable analysis and time series analysis were used to predict the orders and turnover. Meanwhile, correlation analysis, and polynomial and logistic curve models were also employed to find the trend by combining seasonality adjustment to predict order number and turnover.
Simple linear regression and binary linear regression are two basic versions of the generalized linear model (GLM) proposed by  as a way of unifying various other statistical models, including linear regression, logistic regression, and Poisson regression.
Linear regression was employed to search for the relation between orders and import & export in months or orders and import & export and GDP in quarters for GDP only available in quarters. The general form of a linear regression model is where is the dependent variable, denoting the orders. is the predictor variable with a subscript , and is the number of variables. is noise.
In statistics, signal processing, econometrics, and mathematical finance, a time series is a sequence of data points, measured typically at successive times at uniform time intervals. Time series forecasting is the use of a model to predict future values based on previously observed values. Methods for time series analyses may be divided into two classes: frequency-domain methods and time-domain methods. The former include spectral analysis and, recently, wavelet analysis; the latter include autocorrelation and cross-correlation analysis.
Our time series analysis began by regrouping the data, splitting the distance into five sections and business into two kinds. The first model was the polynomial, as in the following:
Then for an alternative of the logistic curve regression
Therefore, the turnover forecast is as the following for both trend forecast models: where is average price or value of distant and type , the orders of distant and type , the noise, is the time by year, and 1 = the starting year, 2008 in this case. and are coefficients of the logistic model in subgroups of a distance (= 1, 2, 3, 4, 5) and a type (= 1, 2). (= ) represents the time by month.
Any time series is made up of systematic components, such as a trend, cycle, and seasonal and random elements, which are by definition unpredictable. Therefore, a set of time series data can be decomposed into trend, cycle, and seasonal and random components. The four elements can be combined in either an additive or a multiplicative model : where represents the dependent variable of orders, the trend component, the seasonal factor, and the random element.
Moving averages are often used to isolate the trend. We propose, however, to extract the trend directly from the original data with both the polynomial and logistic curve models and then to decompose the combination of the cycle and seasonality by the additive model in (5). Considering the close relationship between transportation and economics, we can ignore the cyclical component by taking the seasonality as a cycle of 12 months. The cycle was previously defined as running from February to January of the next year, as the lowest demand usually falls in February.
We take the additive model, (5), and rewrite it as
Let be the combination of the seasonal and random components and then divide by the trend to get the seasonal ratio (including the random). Hence, the seasonal factor can be obtained by averaging the three ratio components (February 2008 to January 2009, February 2009 to January 2010, and February 2010 to January 2011) for each month , . For example, for February, the average seasonal ratio is as follows: where (=1,2,…,36) represents months from the beginning in the series. Now the forecasting value of can be obtained by Then the total orders in (10) and (4) should be changed to is the trends forecast by the polynomial or logistic model.
First, simple linear regression analysis is taken to model the relationship between the national export & import and the orders. The result is given in Figure 5.
The predicted regular orders per month .
Then the binary linear regression of the orders and imports & exports and GDP per quarter is presented in Figure 6.
The predicted regular orders per quarter .
As the relative analysis cannot give satisfactory results even intuitively, we turn to the time series analysis. The whole regular business is separated by distance and truck type when the time series analysis is chosen. Distance is separated into five categories according to the rates and transport time, and container and bulk are the two types of services. Variable represents the different segmentation of the distance shown in Table 3. Type variable represents container and the bulk.
The result shows that almost all the container transport orders are included in distance groups 1, 2, and 3, with just one out of 7755 regular container orders from February 2008 to January 2011 in container group 4. Most bulk transport orders are included in distance groups 1, 3, and 5. As the data of bulk groups 2 and 3 are insufficient for modeling, we merge bulk groups 1, 2, and 3 into a new bulk group 1 and bulk groups 4 and 5 into a new bulk group 2.
Thus, , and only the remaining five subgroups need to be estimated. The results of the polynomial trend models and seasonal ratios in Table 4 are listed in the following calculations and shown in Figures 7 and 8:
Putting (12) into (11) we can calculate the money (turnover). The results show that (1) the container and the bulk have linear and quadratic trends, respectively and (2) the results are acceptable until March 2011, but the linear trend just increases and the quadratic trend abruptly decreases after April 2011, which is not in line with the actual situation.
The order-prediction models are given in (13), with the second bulk group showing a polynomial trend:
Results of calculation of seasonal ratios are presented in Table 5.
The results of the polynomial and logistic models are compared in Figure 11. The predictions given by the two models are not very different at present but the logistic trend will be more significant in the future.
In conclusion, the initial modeling of the orders cannot immediately account for the demand in terms of exports & imports or GDP. The polynomial trend does not give a sound prediction in the sample, and the logistic modeling of the orders seems appropriate. The forecasted total money is presented in Figure 12.
Model selection depends primarily on two aspects. One is the forecast in the sample. The second is the prediction out of the sample. The forecasts of the polynomial model decrease abruptly out of the sample and the logistic has a better performance in this aspect. Therefore the logistic model outperforms the polynomial model out of the sample.
Now let us turn to the analysis in the sample. There are some specific criteria, such as the coefficient of determination and -test, which measure the goodness-of-fit to discriminate which model is better than the others with regard to linear regression models. This is not the case with nonlinear regression models (NLRMs), however. According to , we cannot use the -test (to test the significance of an individual coefficient) or the -test (to test the overall significance of the estimated regression) because we cannot obtain an unbiased estimate of the error variance from the estimated residuals. Furthermore, the residuals (the difference between the actual values and the estimated values from the NLRM) do not necessarily sum to zero, ESS and RSS do not necessarily add up to the TSS, and therefore may not be a meaningful descriptive statistic for such models. An alternative of pseudo square to is proposed by  as follows: where is the observation number, = regress, and , where are the estimated values from the (fitted) NLRM. Results are shown in Table 6.
From Table 6, the values of of orders prediction by the two models are nearly the same. When it comes to the income prediction, the value of of the logistic model has a slight advantage over the polynomial model. As previously described, the logistic model is better than the polynomial one in predicting performance out of the sample. The comparison in the sample shows, however, that performances of the two models are very similar. On balance, however, the logistic model is better than the polynomial.
5. Conclusions and Discussion
This paper introduces a small truck transportation company’s start-up whose main business is pier containers and whose minority business is long-distance bulk transportation. We have attempted to elucidate how our analysis led to useful information for SMTE owners. Modeling the forecasting of order numbers of both kinds of businesses and the total turnover contributes to establishing the company’s strategy for investment and operations.
Initial modeling of orders by simple regression and binary regression models cannot immediately account for demand in terms of exports or GDP. The results show that the company’s business is certainly affected by the national macroeconomic factors such as GDP and import & export, and this effect comes with a time lag of one to two years since transportation service demand is derived demand. The seasonal fluctuation of orders, however, is much more dramatic than that of the national GDP and total import & export. It may be the main reason why the regression models did not perform well in forecasting. Competitive environment factors such as customers and competitors affect performance directly for SMTEs.
There are insufficient data about customers and competitors to support the simple regression model. Therefore, the time series analysis is the inevitable choice. We segmented the data by business and transportation distance to get a set of time series data. The polynomial and the logistic trend models combined with additional seasonal components were employed to fit the data of different segments. Both fitted the sample data very well, but the polynomial trend does not give sound forecasting for the fast decreasing trend out of the sample. From the perspective of goodness-of-fit, the polynomial trend model is still acceptable.
We have to say that forecasting in future is obviously at variance with reality. On the one hand, small transportation companies are not necessarily bound to continue to grow into large ones because of indifferent marketing and the lack of economies of scale. On the other hand, most small transportation companies can survive for a relatively long period thanks to their flexible operation mode. Usually a polynomial trend model does not work well for unlimited increasing or decreasing trends while higher-order coefficients are positive or negative, respectively. One of the reasons why the logistic modeling of the orders seems appropriate is that it has given the upper limit of business. Acceptable goodness-of-fit and forecasting performance can only be provided by the logistic model, even though the performance of the two models in the sample is nearly the same to solve the question in this paper.
Unexpectedly, the analysis presented the process of the financial crisis’s effect on SMTEs. First, irregular business disappears immediately the crisis emerges; at the same time, the regular business increases rather than decreases and then slows down with a time lag of one to two years, which is established by the logistic model. Within the effective distance of the truck transportation, the total transportation demand is relatively stable and the market is competitive or contestable. The growth rate of most small and medium transportation enterprises will become slower after their quick growth period and have an upper limit scale of business.
Our approach can cope with the different impetus of SMTEs’ growth and a period of financial stress. One would expect that in less periods of financial stress then growth might be more rapid for SMTEs but not that they necessarily will continue to grow rapidly. With further data one could explore how later SMTEs might develop in more benign economic contexts. We believe, however, that the proposed model would still achieve a reasonable fit.
This work was supported by the National Natural Science Foundation of China (NSFC) under Grant no. 61203162. It was also supported in part by the Science Progress and Innovation Program of Hunan DOT under Grant no. 201244.
National Bureau of Statistics of China, “The second economic census data bulletin (no.1),” 2009, http://www.stats.gov.cn/tjsj/pcsj/jjpc/2jp/indexch.htm.View at: Google Scholar
National Development and Reform Commission, National Bureau of Statistics of China and China Federation of Logistics & Purchasing, “2008 National Logistics operation Situation Bulletin,” 2009, http://www.sdpc.gov.cn/jjyx/xdwl/t20090306_264999.htm.View at: Google Scholar
T. J. Fite, G. D. Taylor, J. S. Usher, J. R. English, and J. N. Roberts, “Forecasting freight demand using economic indices,” International Journal of Physical Distribution & Logistics Management, vol. 32, no. 4, pp. 299–308, 2002.View at: Google Scholar
J. Harris, Fuzzy Logic Applications in Engineering Science, Intelligent Systems, Control and Automation: Science and Engineering, Springer, Dordrecht, The Netherlands, 2006.
B. G. S. Hardie, P. S. Fader, and M. Wisniewski, “An empirical comparison of new product trial forecasting models,” Journal of Forecasting, vol. 17, no. 3-4, pp. 209–229, 1998.View at: Google Scholar
A. Mukherjee and V. Kadiyali, “Forecasting in rapidly changing environments: an application to the U.S. motion picture industry,” Cornell University Johnson School Research Paper Series 10-07.View at: Google Scholar
G. Madden, R. Cooper, and R. Fildes, “Theoretically-motivated long-term forecasting with limited data,” Working Paper 240, The MIT Center for Digital Business, 2008.View at: Google Scholar
K. C. Green and J. S. Armstrong, “Demand forecasting: evidence-based methods,” Working Paper, University of Pennsylvania Department of Marketing, 2005.View at: Google Scholar
M. Barrow, Statistics for Economics, Accounting and Business Studies, Pearson Education, London, UK, 5th edition, 2009.
J. A. Nelder and R. W. M. Wedderburn, “Generalized linear models,” Journal of the Royal Statistical Society, Series A, vol. 135, no. 3, pp. 370–384, 1972.View at: Google Scholar
D. N. Gujarati, Basic Econometrics, McGraw-Hill, New York, NY, USA, 4th edition, 2003.