Abstract

In the case of insufficient quantitative data, qualitative evaluation information is very important for investment decision-making. However, if it is completely based on qualitative evaluation information, the results may be subjective. In response to this problem, this paper proposes a method, namely, probabilistic hesitant fuzzy cross-efficiency evaluation (PHFCEE) method, based on probabilistic hesitant fuzzy theory, cross-efficiency data envelopment analysis (CEDEA). This method uses probabilistic hesitant fuzzy sets to collect qualitative evaluation information and then uses the cross-efficiency DEA model to fuse quantitative data and qualitative information. And finally, an investment portfolio is built based on the cross-efficiency value and its variance. In addition, this article gives the specific operating steps of the PHFCEE method and uses the construction of a portfolio of 10 stocks in the China CSI 300 Index as an example to illustrate the effectiveness of this method.

1. Introduction

The investment portfolio can effectively diversify risks [1]. At present, a large number of suitable tools and sophisticated models have been used for portfolio construction and selection research [2]. However, most of the methods that have been constructed require sufficient quantitative data [3, 4]. In fact, in the research process, there are situations where important indicators are difficult to quantify, there is no data, or there is little data [5, 6]. For example, the level of the team is difficult to measure with quantitative data, and the company that has just gone public has no financial data, etc. In addition, quantitative data are almost all historical data, which has certain limitations and one-sidedness in reflecting expected information and uncertain information. Based on the consideration of the above issues, we constructed the PHFCEE model and apply it to portfolio selection. We use expert evaluation to measure some qualitative indicators and then use CEDEA to fuse quantitative data and expert evaluation information and finally build an investment portfolio based on cross-efficiency values and constraints. This article will use probability fuzzy theory to describe the expert’s evaluation information. It is an interesting method to combine qualitative evaluation with quantitative data to construct a portfolio. Because experts have knowledge, experience, wisdom, and thinking and judgment capabilities that far exceed those of ordinary investors, they can grasp the essence and key points from complex information and give a more objective evaluation. The quantitative data can effectively reflect historical information and further ensure the objectivity of the model results.

At present, according to the type and quantity of data required by the model, the research of portfolio model can be roughly divided into the following three categories: First, a portfolio model that requires a large amount of time-series data to calculate statistical indices. Representative methods are Markowitz’s portfolio model and its extended models, such as Huang [7], Zhao and Xiao [8], and Kuzmanovic et al. [9]. Second, the investment portfolio model proposed is based on the nonstatistical model, which requires relatively little data. Nonstatistical methods used in portfolio models include neural networks [10], CEDEA [11], genetic algorithms [12], and fuzzy theory [13, 14]. Third, some qualitative portfolio models were developed based on fuzzy environments, such as Wang and Zhu [15], Chiarawongse et al. [16], and Zhou and Xu [3, 4].

However, fuzzy data is usually used to describe uncertain information provided by experts, investors, or decision-makers [17, 18], which is subjective. Therefore, we believe that integrating some quantitative data into the model can build a more effective investment portfolio. Fuzzy theory is widely used in various fields because it can describe uncertain information. In the field of investment, interval fuzzy sets [1921], hesitant fuzzy sets [22, 23], intuitionistic fuzzy sets [24, 25], and probabilistic hesitant fuzzy sets [18, 26] are often used to describe all kinds of evaluation information. These documents provide a theoretical basis for this article to describe qualitative information using fuzzy theory. Taking into account the inconsistent preferences of the expert group when making complex multiattribute decision-making, this paper chooses to use probabilistic hesitant fuzzy sets to describe the evaluation information of the expert group.

In addition, there are differences between the evaluation information of experts and the measurement methods of quantitative data. Therefore, it is necessary to find a model to fuse the two kinds of information. Sexton et al. [27] proposed a CEDEA model that can consider both self-evaluation and mutual evaluation, which provides an effective method for this paper to evaluate efficiency using data covering both quantitative and qualitative information. The CEDEA model has the characteristics of dimensionless, and it is also a method of measuring the overall organizational efficiency on multiple metrics and synthesizing a single index. These characteristics provide a theoretical basis for the fusion of data in this paper. In addition, the CEDEA model has been applied to portfolio evaluation or selection [11, 28, 29]. These articles provide a valuable reference for us to use the calculation results of cross-efficiency DEA to construct investment portfolios.

Based on the above background and literature review, this paper will combine probabilistic hesitant fuzzy theory, CEDEA model, and mean-variance to construct a new portfolio method. Its main function is to construct a portfolio based on quantitative data and qualitative evaluation information. As an illustration of the methodology presented in this article, we report a case study involving 10 companies in the China CSI 300 Index. In addition, the possible advantages of this article are as follow: First, the model uses both quantitative data and qualitative evaluation to describe important information about investment products or projects, which will show product features more comprehensively. This will help decision-makers or investors to make more accurate judgments. Second, the model uses the cross-efficiency DEA model to fuse two forms of data. Third, the model can be used for portfolio construction with only one data form or mixed data information form.

The rest of this article is structured as follows: Section 2 introduces the basic knowledge of hesitant fuzzy, probabilistic hesitant fuzzy, and CEDEA model. In Section 3, we build a method, named PHFCEE model, and introduce the steps to use it for portfolio selection. Section 4 provides a practical case to demonstrate the effectiveness and operability of this model. Section 5 introduces the conclusions, shortcomings, and prospects of this article.

2. Preliminary Concepts

As an effective multicriteria decision-making method, the cross-efficiency DEA is often used in portfolio selection and construction. But when using this method, it is mostly based on objective statistical data, and the statistical data has certain one-sidedness and limitations. In order to overcome this shortcoming, this paper introduces the probabilistic hesitant fuzzy data with subjective information into the CEDEA model to form a new method of portfolio construction based on comprehensive information. This part will briefly introduce the basic theories and concepts involved in the new method, that is, hesitant fuzzy, probabilistic hesitant fuzzy, and the CEDEA model.

2.1. Hesitant Fuzzy Set (HFS) and Probabilistic Hesitant Fuzzy Set (PHFS)

To solve the problem of inconsistent preferences of experts in complex multiattribute decision-making, Torra [30, 31] proposed the hesitant fuzzy set, whose core idea is that all the preferences of experts are taken as membership values. This concept is expressed as follows:

Definition 1 (see [30, 31]). Given any nonempty set , the hesitant fuzzy set defined in the collection can be expressed aswhere , is an element in the set , is the hesitant fuzzy element, is the degree of membership, and arranged in ascending order, and represents the sort position of . is the set of , and is the set of .

Example 1. If an expert’s rating on the return on assets of an enterprise is 0.5 or 0.6, the corresponding hesitant fuzzy element can be expressed as .
According to Definition 1 and Example 1, all membership degrees in HFS have the same importance or the same probability. However, when evaluating assets, it is usually carried out by multiple experts, and experts have a certain tendency to score. Example 2. If the expert’s score for a certain company’s return on assets is more inclined to 0.6, this means that the degree of membership of 0.6 is more important or the probability of occurrence is greater. Example 3. Suppose that the set of scores given by Expert 1 and Expert 2 on the return on assets of a certain enterprise are and , respectively. Then, the hesitant fuzzy element composed of the comprehensive information of Expert 1 and Expert 2 is denoted as . Among them, 0.6 appears in both score sets, indicating that it has a more important degree.
From the above analysis, it can be seen that the degree of membership in the HFS, that is, the expert score, fails to reflect the tendency of the expert score and the number of experts. This will result in the loss of a large amount of decision-making information. Therefore, this paper further uses PHFS to synthesize the evaluation information of the expert group. This concept can be expressed as follows.

Definition 2 (see [32]). Given any nonempty set , the probabilistic hesitant fuzzy set defined on set can be expressed aswhere , is an element in the set , is the hesitant probabilistic fuzzy element, is the degree of membership arranged in ascending order, and represents the sort position of. When , the probabilistic hesitant fuzzy set degenerates into the hesitant fuzzy set .

Example 4. Based on Example 3, the probabilistic hesitant fuzzy element can be expressed as , where the membership degrees are 0.5, 0.6, and 0.7, respectively. The probability of 0.5 is 0.25, the probability of 0.6 is 0.5, and the probability of 0.7 is 0.25. It is found that the probability of 0.6 is higher than other membership degrees.

2.2. Cross-Efficiency DEA Model (CEDEA)

CEDEA can not only effectively measure efficiency, but also provides a method to systematically choose a weight when there are multiple metrics. All these provide the basis for this article to build a portfolio based on objective data and subjective judgments. The calculation steps of the CEDEA model are briefly described as follows [27].

Suppose that there are as many as DMUs whose efficiency needs to be measured, denoted as ; each DMU evaluated has kinds of inputs and kinds of outputs, denoted as and , respectively; the weights of inputs and outputs are and , respectively.

First, solve the efficiency value of based on the linear programming form of the CCR model [33]:where is the efficiency value of and , are the m kinds of inputs of and , respectively, and , are the n outputs of and , respectively, and are the weights of input and output, respectively, , and .

Second, the optimal weights and of calculated in model (3) are substituted into the following equation to calculate the mutual evaluation efficiency value between and :

Third, to calculate the cross-efficiency value of , the formula is as follows:

In addition, calculate the degree of deviation of cross efficiency of , that is, the variance. The formula is as follows:

This section briefly introduces the definitions of hesitant fuzzy set and probabilistic hesitant fuzzy set and the application steps of the CEDEA model. This provides a theoretical basis for constructing the PHFCEE method.

3. PHFCEE Model Construction and Portfolio Selection

At present, portfolio selection and efficiency evaluation are mostly based on objective statistical data, ignoring the one-sidedness and limitations of statistical data. However, the probabilistic hesitant fuzzy theory can effectively deal with the fuzzy subjective evaluation information of experts. Therefore, this paper introduces it into the CEDEA model and puts forward the PHFCEE model. The essence of the model is to take the multiangle and multifactor fuzzy evaluation data of the expert group as the output or input in the CEDEA model at the same time. Then, the portfolio is constructed based on the efficiency value and its variance, which is estimated by the PHFCEE model.

In the whole method, first of all, each expert evaluates the attributes of the asset to form a hesitant fuzzy set, and then based on all the hesitant fuzzy sets obtained by the expert group, a comprehensive hesitant fuzzy set is formed. Finally, the probabilistic hesitant fuzzy set is formed by calculating the probability. Definitions 46 are proposed to show more clearly how the model integrates information and obtains the probability of membership. The relevant definitions are as follows.

Definition 4. Given that is a nonempty set. The comprehensive hesitant fuzzy set defined on set can be expressed aswhere ( is the comprehensive hesitant fuzzy element and represents a hesitant fuzzy element.

Definition 5. Given any comprehensive hesitant fuzzy element , let , where membership degrees can be defined as follows:where is membership degrees of the comprehensive hesitant fuzzy element, corresponds to the hesitant fuzzy element of the kth expert, , is the ranking of the membership degree, which is sorted by default from small to large, and , are the number of membership degree and comprehensive membership degree, respectively.

Definition 6. Given an arbitrary probabilistic hesitant fuzzy element , let such that the probability corresponding to its membership degree can be defined aswhere represents the frequency of the same membership value in the hesitant fuzzy elements of the attribute and represents the sum of the number of all the subhesitant fuzzy elements.
Because the CEDEA model cannot effectively deal with the fuzzy data of probabilistic hesitant degree, this paper obtains its expected value based on probability to represent the expert group’s score of asset attributes. Above, the nonempty collection defined represents the property collection of the asset.

Definition 7. Given an arbitrary probabilistic hesitant fuzzy element , the score function is defined as

Definition 8. Given any nonempty set , defining the corresponding set of scores on the set, it can be expressed asThen, the score set of all assets can be represented by a matrix, and at the same time, :where it corresponds to one by one. represents the attribute of the asset; then corresponds to the comprehensive score given by the expert group.
Secondly, based on the above definition and formula, the PHFCEE method containing probabilistic fuzzy information data is constructed, and its linear programming form is as follows:where is the optimal efficiency value of the asset (), and have kinds of inputs of assets, and have kinds of the objective statistical data output of assets, , are kinds of the subjective fuzzy data output of assets, is input weights, and and are output weights.
Through model (13), the optimal efficiency value and the corresponding optimal weights , , are obtained, and the mutual evaluation efficiency of , as shown in Table 1, is further calculated.The cross-efficiency of and its variance are calculated as follows:Finally, the cross-efficiency value and its variance are defined as the return and risk of financial assets, respectively [11]. Based on the mean-variance framework, a portfolio selection method including probability model information and cross efficiency can be established. We provide the portfolio selection method of maximizing return and the portfolio selection method of minimizing risk, as follows.
The revenue maximization under constraint conditions:Risk minimization under constraints:where is the weight of the , that is, the investment proportion of the corresponding financial assets; l is the number of assets; the purpose function is to maximize the return and minimize the market risk ; the constraint conditions mean that the expected return of the portfolio should not be lower than the average return, and the risk should not be higher than the average risk, so the sum of the investment proportion of the assets is 1.
To better illustrate the operability and practicability of the above method, this paper gives specific steps. Suppose there are as many as l decision-making units (, ), among which there are inputs (, ), and there are objective outputs (, ) and subjective fuzzy inputs (, ). At the same time, fuzzy input corresponds to the comprehensive evaluation of attributes of the decision-making units by the expert group (), in which the attributes are (, ), k members of the expert group (, ), the comprehensive score is (, ), the hesitant fuzzy set is , the comprehensive hesitant fuzzy set is , and the probabilistic hesitant module set .(i)The first step is to obtain the hesitant fuzzy set of all attributes of the decision-making unit from the members of the expert group.where.(ii)the second step, according to Definition 4 and Definition 5, the hesitant fuzzy set of the expert members is combined to form a comprehensive hesitant fuzzy set for all attributes of all decision-making units. And then,where.(iii)The third step is to calculate the probability of the membership degree in the comprehensive hesitant fuzzy element according to Definition 6, that is, equation (10), to form a probabilistic hesitant fuzzy set as follows:(iv)In the fourth step, according to Definition 7 and Definition 8, the probabilistic hesitant fuzzy setis calculated, and the comprehensive evaluation information of the expert group is obtained, which is shown as follows:where .(v)In the fifth step, , , () is substituted into model (12) to estimate the optimal efficiency value of the evaluated asset and its corresponding optimal weight , , .(vi)In the sixth step, the optimal weight of is substituted into equation (13) to calculate the mutual evaluation efficiency of (), as shown in Table 1.(vii)In the seventh step, the crossover efficiency values and variance are calculated according to equations (14) and (15), as shown in Table 2.(viii)In the eighth step, the cross-efficiency value and variance estimated by the cross-efficiency DEA are defined as the return and risk of the investment, respectively, and then the investment weight is calculated based on the investor’s investment purpose model (16) or (17). Model (16) maximizes the profit and model (7) minimizes the risk.

4. Illustrated Example

Investors often build their portfolios based on quantitative data and information and lack of adoption and analysis of qualitative information. The main reason is that qualitative information is difficult to quantify. To solve this problem, the method of expert evaluation is introduced to evaluate the attribute information that cannot be quantified, and the fuzzy methods of hesitant and probabilistic hesitant are used to collect and quantify the evaluation information with preference. To fuse and analyze the quantitative data and qualitative information, this paper further introduces the CEDEA model, which has the advantage of dimensionless. Based on this, this paper proposes the PHFCEE model to measure the cross-efficiency (benefit) and variance (risk) containing quantitative and qualitative information. Finally, the calculation results of the PHFCEE model are substituted into the portfolio model constructed in the third part to calculate the final portfolio components and weights.

To demonstrate the effectiveness of the method proposed in this paper, we select 10 stocks in the CSI 300 index for a case study. The corresponding enterprises (code/variable life) are Industrial and Commercial Bank of China (601398/DMU1), China Pacific Insurance (601601/DMU2), Shanghai Pharmaceutical (601607/DMU3), China Zhongzhi (601618/DMU4), China Life Insurance (601628/DMU5), China Construction (601668/DMU6), Shanghai Electric (601727/DMU7), Everbright Securities (601788/DMU8), China Exemption (601888/DMU9), and Zijin Mining (601899/DMU10). In addition, because the general CEDEA model cannot deal with data with negative values, this paper excludes enterprises with missing values and negative values when screening sample data. Finally, the ten enterprises shown above are formed as samples. According to the annual report of the enterprise in 2020, the relevant financial data are obtained to calculate the input and output indicators required by the PHFCEE model. Indicators that will reflect the operational ability, solvency, and growth ability of the enterprise [11, 34, 35] are considered as input to the PHFCEE model. The profitability of the enterprise and the market reaction of the stock is taken as the output of the objective data of the model. In addition, this paper also takes the output of the subjective data of the model through the evaluation of experts from the aspects of enterprise prospect, enterprise innovation, team level, and so on. Specific information on the relevant indicators is shown in Table 3.

In Table 3, except for the expert evaluation section, the corresponding financial data in other sectors can be calculated from the annual report data published by enterprises between March and April each year. This paper takes the annual report data of 10 enterprises in 2020 as an example. The expert evaluation section consists of a team of three experts to evaluate the selected enterprises from three aspects: enterprise prospect, enterprise innovation, and team level, in which the evaluation result of each expert is expressed in the form of the hesitant fuzzy set. The evaluation result of the expert group is expressed by the comprehensive hesitant fuzzy set, and the evaluation result of the expert group considering the importance of membership is expressed by the probabilistic hesitant fuzzy set. According to the specific steps of the method given in the third part of the article, the analysis of this case is as follows.

The first step is to obtain the evaluation results of 10 enterprises by three experts from three aspects: enterprise prospect, enterprise innovation, and team level, using hesitant fuzzy set , , and Express. The correspondence is shown in Tables 46, respectively. The second step, based on Definitions 4 and 5, merges the hesitant fuzzy elements for the same attribute in the hesitation fuzzy set and of the three experts and finally combines the comprehensive hesitation fuzzy set, which contains the evaluation results of all attributes of 10 enterprises, as shown in Table 7.

The third step is to calculate the probability of each membership degree in the comprehensive hesitant fuzzy set according to equation (10), to form a probabilistic hesitant fuzzy set that contains more expert group evaluation information, as shown in Table 8.

The fourth step is based on Definitions 7 and 8 and probabilistic hesitant fuzzy set . The comprehensive scores for each attribute of 10 enterprises are calculated, respectively, and the results are shown in Table 9.

In the fifth step, the input and output data of 10 enterprises are substituted into model (12) to estimate the optimal self-evaluation efficiency value and the optimal weight , , , and substitute it into model (13) to calculate the mutual evaluation efficiency between enterprises (). The estimated results run through MATLAB are shown in Table 10. In this table, the value on the diagonal is the self-evaluation efficiency of the enterprise, and the nondiagonal data of each row is the mutual evaluation efficiency of other enterprises to the corresponding enterprises of the bank.

The sixth step is to calculate the cross-efficiency value and variance between enterprises according to equation (15) and equation (16) and sort them from large to small, as shown in Table 11.

According to Table 11, the three enterprises with the largest cross-efficiency values are DMU1, DMU5, and DMU2, and their corresponding variances are 0.2031 (10), 0.3457 (4), and 0.3428 (5), respectively. Thus, it can be seen that except for the relatively high return and relatively low risk of enterprise DMU1, the income and risk of other enterprises show different characteristics.

In the seventh step, the cross-efficiency value and variance are regarded as the risk and return of the investment, respectively, and substituted into model (17) or model (18) to calculate the investment weight . It is assumed that there are no taxes, no transaction costs, no short selling, and full liquidity when constructing the investment portfolio.

In this paper, 5, 6, 7, and 8 enterprises with high cross-efficiency are selected to construct the investment portfolio, and the investment weights under different constraints are obtained, as shown in Tables 12 and 13. From the observation, we can see something important. First, with the increase of the number of stocks in the portfolio, the weight of the corresponding stocks of each enterprise is decreasing accordingly. Second, no matter how many stocks are included in the investment portfolio, whether the purpose of the portfolio construction is to maximize the return or minimize the risk, the investment weight of the enterprise DMU1 is the largest. Third, under the condition of income maximization, the investment right of DMU1 is more important than the investment weight of risk minimization.

Combined with the analysis of Tables 11 and 12, when investors aim to maximize returns, each portfolio will increase the investment weight of stocks with higher returns on the premise of not exceeding the average risk. Similarly, combined with the analysis of Tables 11 and 13, when investors aim to minimize risk, each investment portfolio will increase the investment weight of less risky stocks on the premise that the return is not lower than the average return. The above results are consistent with the facts, which shows the rationality of the portfolio construction model.

In addition, the return and risk of the portfolio are calculated according to the required weight and compared with the return and risk of the equal weight investment; the results are shown in Table 14. The analysis shows that the investment weight calculated based on model (16) and model (17) is lower than the equal weight investment in terms of return and risk. The investment return of income maximization is higher than that of risk minimization, and the risk of risk minimization is lower than that of return maximization. The comparison results in Table 14 are in line with expectations, which once again illustrates the rationality and effectiveness of the portfolio construction in this paper.

Compared with the quantitative portfolio selection methods such as modern portfolio theory [3638], the PHFCEE model proposed in this paper uses more comprehensive information, and then the derived results meet the real needs of investors and companies because this model not only uses quantitative data reflecting historical information but also uses hesitant fuzzy sets to describe some qualitative factors that cannot be measured with quantitative data. Compared with the qualitative portfolio model proposed by Zhou and Xu [3, 4] and others, the given model also uses some quantitative financial data to ensure the objectivity of the given results. The incomplete rationality of experts could lead to subjectivity in the evaluation information provided by experts and decision-makers, which can further affect the optimal results. Then, we add objective quantitative data into the model. Concerning these comparisons, the investment portfolio derived by this model could be more realistic and reasonable than that got by the other two types of models.

5. Conclusions

A portfolio approach that utilizes qualitative information is an effective venture capital technique when quantitative data are scarce or unavailable. However, the approach of building a portfolio using only qualitative information is somewhat subjective. To solve this problem, this paper proposes the PHFCEE model. The acquisition of qualitative information and the effective fusion of two data forms are the key technologies for portfolio construction. Then, we introduce probabilistic hesitant fuzzy set and the CEDEA model. Next, based on the cross-efficiency value and its variance, the investment portfolio ratios considering the investors’ risk preferences are constructed. Lastly, a real example has been provided to demonstrate the feasibility of the proposed models.

The mixed-data portfolio model based on the probabilistic hesitant fuzzy environment in this paper has some disadvantages, such as simple modeling rules, simple probabilistic calculation, and excessively ideal constraints. Therefore, in future work, the research will further consider the mixed-data portfolio model under the environment with incomplete or no-probability information and intend to develop a new probability solution algorithm. It will be worthwhile and interesting work.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was supported by the Natural Science Foundation of China (nos. 72071176 and 71840001), the Humanities and Social Sciences in Universities of Jiangsu Province, China (no. 2019SJA0337), and the Natural Science Research Project of Universities in Jiangsu Province, China (no. 19KJB120008).