Abstract

As passengers are proved to be preference heterogeneous in air travel, this paper tries to model the air ticket purchase behavior incorporating market segmentation. In the research, a latent-NL model, established on the latent class structure and the nested logit model, integrates the personal features as well as the purchase preferences into the forecast of segment-specific purchase probability. In order to calibrate the model, a stated preference survey is designed with the choice profiles using real service information, and the survey is conducted in four cities in China for data collection. The results show that the proposed model provides an effective approach for predicting the air travel demand in particular for air ticket pricing, and the estimation results outperforms the traditional-nested logit model with higher goodness-of-fit. Besides, the model is then adopted to test the efficiency of different pricing strategies, showing its advantages in improving the flight revenues.

1. Introduction

The air ticket price is acknowledged as the key determinate of airline profits in terms of affecting air travel demand [1], it has been constantly optimized to satisfy passengers’ needs since the market liberalization of civil aviation [2, 3]. For the airlines, the pricing strategies are traditionally proposed on the assumption that passengers are homogeneous or have the same needs to all the flights as well as other alternative travel modes [4], which may due to the purpose of convenient data collection and easy estimation [5].

However, reported by recent literatures, passengers show the distinct classified preference associated with their travel purpose [6], experiences or even demographic characteristics such as age, income, and occupation [7]. So, it is reasonable to believe that the air transportation market is actually composed of groups of passengers whose air ticket purchase preferences are in similarity [8]. In the sense, the pricing strategies should be proposed on the group-based air ticket purchase behavior.

A lot of studies have focused on the air ticket purchase behavior, the most widely adopted method is the discrete choice model [9]. According to the results, the multinominal logit model (MNL) and the nested logit model (NL) as well as the mixed logit model have the advantages in predicting the probability of purchasing an air ticket with specific price and other characteristics (such as departure and arrival airports, flight time, and on-time performance) [10, 11]. However, the discrete choice models are constructed on the assumption that passengers’ preferences are homogeneous, thus the modelling results may overestimate some travelers’ price endurance or underestimate others’ price tolerance, which may cause the loss in flight demand as well as the profit [7]. Therefore, the group identification method, also recognized as the market segmentation method, should be linked to the discrete choice model for predicting the segment-specific air ticket purchase behavior.

The concept of market segmentation was arisen by Smith in the middle of 19th century, which helps the merchants to diversify their products as well as sales strategies [12]. For the market segmentation in civil aviation, the conventional way is to divide the population statistically regarding passengers’ personal features or travel purpose [13]. Although such method is easily conducted, but the simple classification may not be enough to reveal the complex heterogeneity in the air ticket pricing. For instance, passengers who have higher income may not choose the expansive air tickets in business trip for the limited budget [14], so setting higher air ticket prices for business travelers may be a wrong strategy in some situations. Hence, it is necessary to construct a specialized market segmentation approach for the air ticket pricing by incorporating both of the personal features and the purchase preferences. As the purchase preferences can be well modelled by the discrete choice model, we should find an appropriate market segmentation method, which can be combined with the discrete choice models in our research.

Given the existed market segmentation approaches, the clustering model as well as the latent class model are the two well-known methods. The clustering method uses the statistical correlation to empirically describe the market segmentation, so it is hard to be linked to the econometrical models. However, the latent class model can do the model-based classification which gives out the probability that a passenger belongs to a specific market segment [15], and it is estimated based on the probabilistic distribution of the survey data. In this sense, the latent class model should be a suitable method to be linked to other probabilistic models such as the discrete choice model [1619].

Therefore, our paper aims to combine the latent class model and the discrete choice model to identify the market segments and to predict the segment-specific purchase behavior for air ticket pricing. Although the latent-discrete choice models have been applied in other fields of research [20, 21], there are also two main challenges in our study: the first is how to construct the suitable latent-discrete choice model with appropriate segment membership function and utility function for air ticket pricing; the second is to find out how to use the estimated results in improving the air ticket pricing strategies.

To solve the abovementioned challenges, our paper is organized as follows. Section 2 reviews previous studies about the latent class model and discrete choice models; Section 3 establishes the latent-discrete choice model for interpreting the segment-based air ticket purchase; Section 4 introduces our data collection and the case in our study; Section 5 shows the estimation results of the proposed model, and Section 6 gives out the sensitivity analysis; finally, Section 7 tries to summarize the limitations and our future research studies.

2. Literature Review

To establish the latent-discrete choice model, we should firstly make clear the mechanism of the latent class model as well as the discrete choice model. The latent class model was originally proposed by Kamakura and Russell [22] to distinguish the households with different brand preferences. In the model, the classification is identified on two probabilities: segmental probability and the conditional probability. To be specific, the segmental probability represents the chance that an individual belongs to one defined group [23], and it can be estimated by maximizing the loglikelihood of the joint probability of the observable variables [16]. Then, the joint probability is calculated using the posterior probability of an individual who has one specific feature on the condition that he or she belongs to one segment. Moreover, the segment number of the latent class model is determined randomly according to the requirements of the research as well as the efficiency evaluation criteria [24].

Then, for the discrete choice model, it has been widely applied in the research of air travel behavior since Skinner [25] used the MNL model to analyze the airport choice behavior within a multiairport region. For evolving about 40 years, a lot of researchers have used the MNL, NL, Probit, and mixed logit models to analyze passengers’ airport choice, air route choice, airline choice, etc. in different contexts [26]. As the air ticket purchase behavior is closely associated with the passengers’ flight choice, here we will refer to the literatures about flight-choosing behavior which use the discrete choice structures.

In the existed literatures about flight choice, most studies only incorporate the service attributes of different flights; however, the car, train, and even the bus may have influences on the choice preferences [27]. Therefore, the prices and other service characteristics of the substitutable travel modes should also be taken into account in the proposed air ticket purchase model. Because there exists strong correlation between different alternatives within one travel mode [28, 29], the nested logit model with a hierarchical structure is more suitable to analyze the air ticket purchase behavior in our research.

Based on the latent class as well as the discrete choice model, some literatures have established the joint structure integrating the two models to predict passengers’ heterogeneous travel behavior [3032]. For the research studies related to the civil aviation, Wen and Lai [33] applied the latent class model into the MNL structure to predict passengers’ air carrier choice. According to their results, the proposed latent-MNL model has advantages over the multinomial logit model for better fitness. Meanwhile, their another study constructed latent class nested logit model in the travel mode choice and proved that the model could relax the IIA (Independent from Irrelevant Alternative) property of logit models [24]. Although the latent nested logit has been applied to analyze the travel behavior [34], this paper will construct a specialized latent-NL model for predicting the segment-specific air ticket purchase for the airlines’ air ticket pricing.

When the latent-NL structure is confirmed, the next important work is to choose the variables of the model. For the latent class part, passengers’ demographic characteristics and travel experiences are commonly selected to identify the market segmentation [35]. In the nested logit part, the air route, flight time, air fare, and flight frequency as well as the ground access time are the main factors influencing air ticket purchase [36, 37]. Furthermore, service attributes of other travel modes such as the ticket price, the departure time, and in-vehicle time would also be incorporated [38].

Besides, the data collection is another task of establishing the latent-NL model, as reported by existing literatures, and the stated preference survey (SP) has the advantages of collecting experimental data when the real travel data is hard to obtain, but the revealed preference survey (RP) could explicitly reveal the choice behavior in reality [39]. Considering the difficulty in collecting the previous real travel data in reality, we decided to use the SP survey designed with profiles using real travel alternatives and information in our research.

Based on the abovementioned literature review, Section 3 will introduce the proposed latent-NL model and the estimation method.

3. Modelling Approach

3.1. Model Structure

The structure of the latent-NL model is as shown in Figure 1. In the figure, the whole demand market would be classified into several segments based on the latent class structure, and each segment would have its own air ticket purchase model in terms of the nested logit model. With the model, we could obtain the probability of a passenger who belongs to one segment and purchases a flight with the specific air ticket price.

The aforementioned probability would be calculated on two functions: one is the segment membership function of the latent class structure and the other is the utility function of the NL model. It needs to point out that the parameters of the two functions are jointly estimated with the mechanism in Figure 2. We can see that the joint estimation is the key to construct the latent-NL model, thus to determine the market segmentation and the segment-based air ticket purchase model simultaneously.

3.2. Model Construction

In the model construction, the probability of individual i who belongs to segment s purchases alternative j (Pij|s) which is written as follows:

In the model, the variables are defined as follows:Pij|s: probability of individual i who belongs to segment s and purchases alternative jxij: dummy variable, equals to 1 when individual i chooses alternative j and 0 otherwiseyis: dummy variable, equals to 1 when individual i belongs to segment s and 0 otherwise

So, represents the probability that individual i purchases the ticket of flight j, on the condition that individual i belongs to segment s, which can be calculated by the utility function of the nested logit model; gives out the probability that i belongs to segment s, which can be calculated by segment membership function of the latent class structure.

First, for the probability , it is constructed on the segment membership function, and we define the related variables as follows:uis: the benefits of individual i belonging to segment smi: variables of describing the personal features of individual iγs: the parameters of mi for segment sδs: the error variable of the benefit uis

The segment membership function is formulated as equation (2), which establishes the relationship between uis and the personal feature variables mi. Meanwhile, the probability of individual i belonging to segment s is as equation (3):

Second, is predicted in the nested logit frame. The structure of the hierarchical air ticket purchase behavior is described in Figure 3. In the figure, there are four nests and two levels, and within level f, each nest represents one travel mode. All the nests are set under the same scenario, and the alternatives within a nest are attributed to several related service attributes. Within the first nest for the aviation mode, three alternatives are set in level h to represent the peak-hour flight (6:00–9:30; 17:00–19:30) and the normal daytime flight (9:30–17:00) as well as the evening flight (19:30–24:00). It needs to point out that j in equation (1) donates each alternative in level h in the figure.

Besides the “Air” nest, in the nest “Train,” we set two alternatives which are the “normal train” and “high-speed train.” The normal train is the traditional train of which the speed is lower than 300 km per hour, and the speed of high-speed train is over 300 km per hour. As the prices of the normal train and the high-speed train are fixed in China, we do not set multiple alternatives under the “normal train” and the “high-speed train.” Meanwhile, we set “private car” and “rented car” as the choice alternatives of the nest “Car” considering that the travel fees of the two kinds of cars are different. For the nest “Coach,” we set the “daytime coach” and “overnight coach” by taking the ticket price differences into account.

According to the hierarchical structure, substitutable relationship exists between alternatives within every nest. Hence, the error variances in the utility functions of these alternatives are not independently and identically distributed. Therefore, to capture this purchasing process, we develop a two-level nested logit model on the substitutable relationship between the alternatives. The related variables are demonstrated as follows:h, f: levels within the nested logit hierarchical structureUj|s: utility of jth alternative cognized by passengers in segment sJ: the volume of alternativesUq|s: utility of qth nest cognized by passengers in segment sQ: the volume of the nests in the hierarchical structureVj|s: observable portion of the utility function Uj|sVq|s: observable utility of qth nest cognized by passengers in segment szj: observable variables in the utility function Uj|szq: observable variables in the utility function Uq|sβjs: parameter of observable variable zj in the utility function Uj|sβqs: parameter of observable variable zq in the utility function Uq|sαs: constant variable of the utility function Uj|sεj|s: error variance of Uj|s, assumed to be i.i.dθf|s: inverse of scale parameter, and 0 < θf|s < 1Γqf : the maximum utility of alternative j within q in level fhj|s: one individual who belongs to segment s choose alternative j in h levelfq|s: one individual who belongs to segment s choose alternative q in f level

On the basis of the nested logit model, the utility function of alternative j for segment s is expressed as equation (4), the utility function of nest q for segment s is as equation (5), and the probability is as equation (6). And we can see that is formulated on two related probability: one is the probability that passenger i who belongs to segment s choosing nest q in level h in Figure 3 and the other is that passenger i chooses one alternative j on condition that i chooses the nest q:

On the basis of the nested logit model and latent class model, we can rewrite equation (1) based on equations (3) and (6) to develop the equation of the probability that individual i belonging to segment s purchases the ticket of j, which is written as follows:

3.3. Model Estimation

To estimate the parameters (βjs, βqs, αs, γs, θf|s, θf|s, and γs) of the proposed latent-NL model, we do the estimation on maximizing the loglikelihood value of using the expectation maximization (EM) algorithm.

It needs to point out that the volume of segment s is fixed before the estimation, which means that we should find out the most suitable segment number with several times of estimation work. Regularly, the segment number and the segment membership need to be determined by testing the model fitness measured by AIC (Akaike Information Criterion) and BIC (Bayesian information criterion), which are written as equations (8) and (9):

In the equations, LL is the maximum value of the loglikelihood in the estimation, K is the number of parameters in the proposed model, and N is the sample size. Except for model fitness, these two indicators also consider the number of parameters to prevent overfitting of the model [40]. As BIC places more penalties on the number of parameters than AIC, BIC is recognized as a better criterion to measure the fitness of the latent class model than AIC and loglikelihood.

4. Survey Design

As the statistical trip data may not contain passengers’ demographic characteristics which are important for market segmentation, here we should use the survey to do the data collection. Generally, there are two main types of survey design: the revealed preference (RP) survey and the stated preference (SP) survey. The RP survey does the interviews based on the respondents’ travel experiences but may not be able to lead them to memorize the detailed choice situations in the past, so it is hard to use the RP data to predict passengers’ potential choice decisions in other situations.

In the contrary, the SP survey does not interview the past trips, but it provides experiments for collecting choice decisions in different scenarios. The experiments are selected out from the profiles which are generated by combining the travel scenario and the choice alternative as well as the service attribute. Commonly, the profiles are designed to reveal the real situation but they may be not existed in reality, so the SP survey data would lead to overestimation [4143]. Therefore, to update the SP survey, we choose to create the virtual travel scenario for each profile, but to set the service attributes using the real information of flight, train, car, and coach, thus to make sure that the respondents could give the choice results as close as to reality.

4.1. Survey Design

The survey comprises of two parts. The first part collects the data about personal features of respondents, including income, occupation, age, and air travel experience, such attributes are the variables in the segment membership function in equation (2). The second part of the survey gives out three experiments in which the trip scenarios and choice alternatives are listed. In the experiment, the alternatives are described by several attributes which are the variables of utility function in equation (4).

Table 1 gives an overview of the selected variables related to the personal features, which are the variables of the segment membership function, the values of the variables are set according to the suggestions from the airline company. It needs to explain that the purchase dates for business and leisure trips are the duration between the ticket purchase date and the departure date.

Besides, Table 2 demonstrates the environmental variables of the utility function describing the trip scenario for all nests. In the survey, we choose four origin cities to deliver the questionnaires, Dalian, Shanghai, Nanjing, and Chengdu, and choose 50 cities in China as the trip destinations. The respondents would receive the questionnaires in which the trip origin is the city that he/she lives in. Meanwhile, when the origin and destination are confirmed, the travel distance would be calculated and the travel purpose would be set for the survey.

Then, in Table 3, the variables describing the choice alternatives are listed. In the table, 5 to 8 are the generic variables [44] for all alternatives, and then 9 to 15 are specific variables for different nests. The values of the variables are set using the real information collected from websites and the operating data provided by airports. In this sense, there is no need to set the levels of attributes in the SP survey design, and the real travel information would help the respondents make the choice as close as to the reality.

Totally, 213 profiles generated, 43 profiles for Dalian, 79 profiles for Shanghai, 59 profiles for Nanjing, and the rest for Chengdu; the profiles are partly shown in Figure 4. In the figure, each line represents a choice scenario with different alternatives attributed to specific variables using real service information. Based on the generated profiles, each questionnaire will be set with three experiments which are randomly selected from the profile set.

4.2. Data Collection and Descriptive Analysis

Then, the questionnaires were distributed to respondents face to face in four cities where four airports are located in; they are Dalian, Shanghai, Nanjing and Chengdu, from October 25th to 31th in 2019. As the survey duration does not cover the holiday as well as the starting or ending of school season, we would confirm that this period is valid for conducting the survey without the special service characteristics and abnormal choice decisions. The scale of the four airports are different, Shanghai Pudong airport is one of the largest airports in China. Dalian, Nanjing, and Chengdu airports are the regional hub airports in different areas. Meanwhile, the four cities are well reachable by other travel modes (train, car, and coach), making the selected cities the suitable cases for validating our proposed model.

In the end, 153, 122, 169, and 136 valid questionnaires were collected in Shanghai Pudong, Dalian, Nanjing, and Chengdu airport, respectively, collecting 1740 sets of choice data, and the response rate was 87.3%. A preliminary descriptive analysis is conducted. It shows that 26.1% of the respondents are young travelers (≤30), 48.3% of them are middle-age person (>30&≤55), and others are old people (>55). Besides, 36.2% of the respondents only travel by air for once a year, 12.1% of them have over 8 times of air travel per year, and others’ air travel frequency drops in the range between 1 to 8 times per year.

On the basis of the survey data and the proposed latent-NL model, we could conduct the estimation work. Section 5 introduces the estimation results.

5. Estimation Results

Before estimating the latent-NL model, we firstly try to estimate the parameters of the variables in the nested logit structure and set the goodness-of-fit of the NL model as the benchmark in our research.

5.1. Results of Nested Logit Model

Based on the nested logit structure, the parameters are estimated using the maximum likelihood estimation method. The alternative “Coach 2” is chosen as the referent alternative, and the associated constant is zero. Results are shown in Table 4. The goodness-of-fit is satisfactory, with the pseudo R2 equal to 0.35. The inclusive value parameter is 0.54 for “Air” nest, 0.43 for “Train” nest, 0.48 for “Car” nest, and 0.61 for “Coach” nest. All the inclusive value parameters fall within 0 and 1, which supports the nested logit structure of the air ticket purchasing model.

In the table, the variables written in italic type are the constants of the utility functions. By comparing the constants of different alternatives, we could sort the travel modes in a preference-descending sequence for long-distance trips: air, high-speed train, normal train, car, and coach. Then, for the variables related to personal features, the coefficients of age, position, and long-distance travel frequency are statistically significant, showing that the three variables may have strong influences on the market segmentation. Meanwhile, the negative sign of long-distance travel frequency states that passengers with frequent travel experiences may have higher requirements on the service quality of the travel alternatives.

For the coefficients of the environmental variables, the travel purpose is the significant one in the estimation; its positive sign indicates that passengers would obtain gains from the trip. Meanwhile, the travel distance is significant but seems that it does not have strong impacts on the alternative choice, which may be due to the development of the high-speed travel mode. In addition, it can be easily found that passengers put more concerns on the money costs than the travel time according to the coefficient value and the significant status. Besides, comparing the coefficient of air fare and those of other alternatives, the same rise in ticket price would lead to larger decrease in the air travel utility, showing that passengers have much more critical concern on the air fare increases.

Another significant coefficient is for the delay time, its value shows higher negative impacts of the delay time on the ticket purchase than the access time and even the fare of some travel modes. Moreover, the insignificance of the coefficients for rented car fare as well as coach fare may be due to the reason that the two travel modes are not popular for long-distance trip, especially when the travel distance is longer than 800 km.

5.2. Results of Latent-NL Model

To determine the model specification with the best fit, we tested a variety of models by varying the segment number. Table 5 summarizes the goodness-of-fit of two-, three-, and four-segmented model, based on which we could determine the model specification by comparing the AIC and BIC value, and the values are calculated using the maximum loglikelihood value (LL) based on equations (8) and (9). It can be seen that the lowest BIC value appears when the segment number equals to three; hence, the three-segment latent-NL model is the best choice of our research statistically.

The estimation results of the three-segment latent-NL model are shown in Table 6, and the first segment (S1) is set as the base of segment two and three (S2 and S3) in the estimation. In the following paragraphs, we will introduce in great detail of the segment-based characteristics of market, and further compare the results across different segments with the purpose of testing segment-specific attitude in air ticket purchase.

In the table, the first column demonstrates the label of the variables, the second column shows the coefficients of the NL model in Table 4, and the rest columns list the coefficients estimated for different segments (S1 to S3) within the latent-NL model. From the results, the respondents in our research are classified into three groups which comprise 41.04%, 37.89%, and 21.07% of the population, respectively. The pseudo R2 for each segment is listed in the last line of the table, indicating that the latent-NL model can explain the air ticket purchase behavior more precisely than the NL model.

The coefficients of latent variables are listed in the fourth and fifth column; S1 does not get such coefficients because it is the base in the estimation. Then, taking all the coefficients into joint consideration, we could title each segment according to the segment-specific features as follows.

5.2.1. S1: Low-Fare Seekers

The distinctive characteristic of S1 is that the memberships have highest negative attitudes to the travel fare but lowest negative attitudes to the travel time; hence, the passengers within the group are defined as the low-fare seekers.

As per the personal features, 79.6% of the group members lie in the age thresholds of ≤22 and >55, 36.8% of them do not have any job or retired, 87.6% of the employed members are first-line workers, and the average salary may not exceed 100 thousand RMB per year. Besides, the long-distance travel frequency is less than 3 times per year and the frequent travel purpose is concentrated in leisure and family. In addition, 92.5% of the members purchase the tickets for business and leisure trips at the dates about 30 and 60 days, respectively, in advance of the departure, indicating that they may be the customers enjoying the early bird prices.

Besides the feature of low-fare seeking, members in S1 also have other segment-specific characteristics in the ticket purchase. For example, the constants of the utility functions state that they prefer using air and train in long-distance journey but do not show the favorite mode; moreover, the travel purpose seems to have weaker impact on the purchase decision than that it has in other segments, demonstrating that passengers in this group would choose the cheapest alternative regardless of the travel purpose; furthermore, the group members may not concern a lot about the service quality such as delay and departure time, that is because the low cost and high service quality could not exist simultaneously.

In general, members in S1 are attributed to low-fare seek and early purchase; thus, they might be treated as the target passengers of airlines at the start-up time of air ticket booking horizon.

5.2.2. S2: Experienced Waverers

Compared to S1, members in S2 show lower negative attitude to the travel fare but stronger negative attitude to the travel time, so they inherently prefer air and high-speed train in the choice. However, seen from the constants of utility functions, the group members do not show obvious preferences to air or high-speed train, declaring that their purchase decisions depend on, to large extents, the service attributes such as ticket price, departure time, and delay time. In the sense, they may easily switch between air and high-speed train with the variations of service attributes, so members in S2 can be called waverers in the market.

Besides, when we check the social-economical features of the group, 82.4% members drop in the age range between 30 and 40, 86.1% of them are employed, and 45% are the middle-level managers with their salary over 100 thousand per year. Because of the high employment rate, 45.7% of the members commonly travel for business and commonly buy the tickets 7–15 days in advance of the trip. More importantly, their average travel frequency (5–8 times per year) is much higher than that in S1; thus, we can define the group passengers in S2 as experienced waverers.

Because of the cumulated experiences, passengers in S2 may be familiar with the flights as well as the high-speed trains to various destinations, so the coefficient for the destination is significant. In the circumstance, the members would have the ability to quickly compare the service attributes of the two modes in their mind without asking for lots of information. Apart from the prices, other service attributes are also highlighted by the members in S2; for instance, they have strong negative attitude to the flight delay and also care about the departure time.

As summarized above, the experienced waverers have higher preferences to air travel than the ones in S1, but they are also interested in using high-speed train. Meanwhile, they concern a lot about the service quality of the travel modes, so the way of airlines to capture them is to optimize the price strategy and improve the service quality at the same time.

5.2.3. S3: Loyal Flyers

Comparing to S2, members in S3 show highest inherent preferences to the air travel, indicating that they are the loyal passengers of airlines. The formation of loyalty is due to several reasons; first is the high income. To be specific, 42.5% of the group members are leadership managers and 52.7% of which could earn the income over 400 thousand RMB per year, so they could afford the air ticket prices even if the prices are very high just before the flight departure. The second reason is due to the time value. As declared by the coefficient of travel time, it is easy to find that the group members have the strongest negative attitude to the time costs, which is because 62.6% of them need to travel for more than 8 times every year with business purpose; thus, the aviation which can provide quick and comfortable connections becomes the first choice.

Due to the loyalty, members in S3 would not easily shift to other modes in reality. However, they also have higher requirements on the airlines’ service quality, for example, they have lowest tolerance of flight delay and prefer early morning and evening flights for the convenience of day-time work. Besides, they are strict on the access time as well as the comfort during the trip as 23% of the group members are with the age over 55. Therefore, members in S3 are the airlines’ loyal passengers, and they would not easily switch to other modes for high prices but would leave for weak services.

On the basis of previous analysis, it is clear that airlines’ markets are statistically composed of three segments of passengers which could be defined as low-fare seekers, experienced waverers, and loyal flyers. According to the segment-specific ticket purchase behavior, the low-fare seekers pursue cost saving; the experienced waverers who highlight both of the fare and the service quality may easily switch between the air and the high-speed train; the loyal flyers who habitually prefer the flights would put more emphasizes on the service quality than on the prices. In the sense, the air ticket pricing strategies would be set according to the different price attitudes kept by the segment members. In Section 6, we would do the sensitivity analysis for testing the segment-specific reaction to the change of ticket prices.

6. Discussion

The sensitivity analysis is conducted in three situations: increasing the price of peak-hour flight by 10%, decreasing the price of normal day-time flight by 10%, and decreasing the price of evening flight by 10%. Under each situation, we will calculate the changes in the purchase probability to the changes in the travel fare for each market segment. Table 7 illustrates the calculation results. We need to note that, when changing the travel fare of a specific alternative, the changes in the choice probability of other alternatives should be the same. This phenomenon is due to the independence from irrelevant alternative (IIA) property of discrete choice models.

In the table, columns 3 to 5 demonstrate the segment-based changes of the choice probabilities, and the probability can be also recognized as the market share of different alternatives. In situation 1, the peak-hour flight may totally lose 1.59% market share with 10% increment of its air ticket price. In situations 2 and 3, the normal day-time flight as well as the evening flight would win 10.96% and 11.38% market share, respectively, for cheaper air fare. When we go through the market share variations in each segment, it is easy to find out that the largest win and loss exists in S2, proving that the members in S2 are the experienced waverers. Meanwhile, passengers in S1 also do large contributions in improving the market share of the normal day-time flight and the evening flight, which is because they are low-fare seekers. However, because of the price insensitive, only a few members within S3 change their choices. Therefore, the pricing strategies should be proposed on the segment-specific price sensitivity.

Apart from the segment-specific attitude to the travel fare, the segment membership volume would also be essential for air ticket pricing. In order to do further interpretation, we create an experiment to compare the flight revenues resulted from different pricing strategies. The information of all the alternatives in the experiment is listed in Table 8.

Based on the abovementioned service attributes, we fix the ticket prices of all alternatives except for that of Flight 2 and then use the surveyed personal features and the estimated latent-NL model to calculate the revenues of Flight 2 with different pricing strategies. Table 9 declares the results.

For the strategies in Table 9, each of them contain four classes attributed to different air ticket prices and seat numbers; we call this kind of strategies as class allocations for airline marketing. Comparing the sold seats and revenues in the table, Strategy 2 would be better than Strategy 1 for the higher average revenue per seat. The success of Strategy 2 is due to the reason that it captures the passengers who would pay higher prices for the same seats. In other words, the class allocation set by Strategy 2 is proposed on the right distribution of passengers with different segment-specific price-sensitivity. Therefore, the market segmentation would have great effects on improving airlines’ profits with its usage in air ticket pricing, or more precisely, in the class allocation.

7. Conclusion

Regarding the fact that passengers’ air ticket purchase behavior is relevant to the heterogeneous preferences, this paper established an air ticket purchase model incorporating the market segmentation. In the model construction, the discrete choice model which had advantages in predicting purchase preferences was integrated into the latent class structure which could classify the market segments, thus to established a latent-NL model for predicting the segment-specific air ticket purchase behavior.

The proposed model was estimated using the SP survey data, and three groups of passengers were identified, titled with low-fare seekers, and experienced waverers as well as loyal flyers. With the comparison to the common classifications such as the “business-and-leisure” used in practice, our proposed approach could distinguish the passengers according to their purchase predispositions derived from the personal features and the purchase preferences. Because of the precise segmentation, the estimated segment-specific purchase models had been proved with advantages over the nested logit model for better fitness, indicating that the latent-NL has advantages in predicting the air ticket purchase behavior. Therefore, by constructing the latent-NL model with the careful variable selection as well as the survey design, we found an effective way to predict air ticket purchase behavior especially for the job of air ticket pricing.

Then, on the basis of the sensitivity analysis, we made clear that passengers’ feedbacks to the price fluctuations would vary across the segments. Hence, the same percentage change in the air ticket price would lead to the diverse variations in the demand of different flights which were preferred by passengers with differentiated price sensitivities. In this sense, the airlines could use the proposed model to precisely evaluate the effects of varying the air ticket price in improving the flight market share. Furthermore, the estimated latent-NL model was adopted to compare the revenues of one flight under different pricing strategies. The results proved that the market segmentation would support the airlines to predict the demand volume for each price class, thus to capture the high-value passengers. Therefore, our second contribution was that we proved the importance of market segmentation in the air ticket pricing and gave out the benefits of adopting the latent-NL model in raising the flight revenue.

However, there are some limitations existed in our work. First, the SP survey was conducted within only one week, which might lead to the difficulty in covering all kinds of ticket purchase behavior. Although 4 cities were selected to deliver the questionnaires with the purpose of offsetting the limitation leaded by the short survey duration, we would try to collect much more survey data to verify our results in the future. Second, apart from the demographical features, the experiences, and the service attributes, some other factors such as the passenger location, the airport scale, and the seasonal factor may also affect the airlines’ market segmentation in air ticket pricing, and all these factors should be incorporated in the future work.

Data Availability

The survey data used to support the findings of this study have not been made available because the data will be used to support another ongoing research, and meanwhile the data only can be used when permitted by the respondents as there are some private information incorporated in the data.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This research was sponsored by the National Key Research and Development Project (Grant no. 2018YFB1601200) and National Natural Science Foundation of China (Grant nos. 71701094 and 51608268).