#### Abstract

We address the well-known “factor zoo” problem in the Chinese stock market. By replicating a generation of pricing factors, we verify that the Liu–Stambaugh–Yuan four-factor model subsumes its counterparts in the Chinese A-share market. We further construct a characteristic library and apply the double-selection LASSO approach to explore whether significant anomalies add information beyond current pricing factors. We find that some anomalies indeed play a significant role in pricing cross-sectional returns, but the improvement over the Liu–Stambaugh–Yuan four-factor model is limited.

#### 1. Introduction

Recent studies have proposed a series of factor models to price cross-sectional expected returns. As indicated by Cochrane [1], there exists a zoo of factors that can potentially explain expected stock returns. Identifying the truly effective factors is an active topic in asset pricing. Once a new factor is constructed, researchers evaluate whether it provides incremental information relative to mainstream factor models; see, for instance, Hou et al. [2], Racicot and Rentz [3], Racicot et al. [4], and Racicot et al. [5]. Hou et al. [2] show through various spanning tests that the q-factor model dominates all other pricing factors. Racicot and Rentz [3], Racicot et al. [4], and Racicot et al. [5] study variants of the Fama and French [6] model using a new generalized method of moments estimator (GMM$_d$) and find that only the market risk factor is consistently significant. These papers also find that augmenting the Fama and French [6] model with the illiquidity factor of Pástor and Stambaugh [7] does not add explanatory power. Other researchers may reach different conclusions in the future, but studies in this field have one thing in common: they all depend on WRDS, a large and comprehensive data library for the US market. It remains an open question whether these powerful factors are effective in emerging markets such as the Chinese stock market.

As the second largest economy in the world, China has developed its capital market vigorously in recent years, and total market value exceeded 8 trillion dollars at the end of 2019. Even though Chinese stocks are becoming more appealing to global investors, few studies focus on asset pricing in the A-share market, especially on which factors drive the cross section of Chinese stock returns. The political and economic environments are quite different from those in the US. Moreover, China restricts foreign investors from participating directly in its domestic stock market. Hence, the factor models developed for the US may not all work in China. We aim to determine the most effective factor models for the A-share market over the past 20 years and to explore whether there exist undiscovered latent pricing factors.

Using Chinese A-share data from July 2000 to December 2019, we start by replicating a generation of factor pricing models from the recent literature, including the Fama–French six-factor model (FF6, henceforth), the Hou–Xue–Zhang q5-factor model (q5, henceforth), the Stambaugh–Yuan four-factor model (SY4, henceforth), the Daniel–Hirshleifer–Sun three-factor model (DHS-3, henceforth), and the Liu–Stambaugh–Yuan four-factor model (LSY4, henceforth). We independently construct a database based on the work of Qiao [8] and Hou et al. [9]. We find that the LSY4 factors generate a maximum Sharpe ratio of 2.16, significantly higher than that of the other models. In contrast to the US market, the investment factors carry hardly any risk premium for A-share stocks, regardless of whether we use the q-factor proposed by Hou et al. [2] or the CMA factor proposed by Fama and French [10]. These two factors generate average returns of only 0.07% and −0.10% across 20 years, respectively. Profitability factors, however, do show significant risk premiums. The ROE factor of q5 performs better than Fama and French's RMW, since the former is constructed with quarterly updated ROE rather than annually updated ROE. Consistent with this result, we find that for SY4 and DHS-3, which are based on mispricing and investor behavior, respectively, only the components related to profitability measures have significant risk premiums.

Next, we test the pricing power of these models. Following Hou et al. [9], we construct 104 anomalies and obtain their value-weighted long-short return spreads. By calculating the alphas with respect to the various pricing models, we show that LSY4 can explain up to 98 anomalies, far more than the other models. Whether measured by average alphas, average *t*-statistics, or Gibbons–Ross–Shanken statistics, LSY4 always achieves the lowest value, indicating its outstanding performance in pricing A-share stocks. When we turn to spanning tests, it is not surprising that LSY4 can explain all factors except for the post-earnings-announcement drift (PEAD) factor. For example, the powerful q-factor recommended by Hou et al. [2] is totally subsumed and has a tiny alpha of −0.06% (*t* = −0.38). However, PEAD survives LSY4 with a significant alpha of 0.72% (*t* = 3.35).

Finally, we explore whether other potential risk factors can be added to LSY4 in future research. In contrast to the previous risk premium perspective, we focus on SDF loadings in this section. We adopt the double-selection LASSO approach proposed by Feng et al. [11] to evaluate the marginal contribution of other latent risk factors. This approach accounts for model selection mistakes in finite samples and tests the importance of a given factor beyond a set of benchmark factors. The factors we test include significant anomalies in the A-share market found by Qiao [8] as well as anomalies with significant LSY4 alphas in the previous section. We choose the benchmark factors as the union of the aforementioned pricing factors. The empirical results show that 8 out of 18 anomalies provide incremental information, most of which describe trading frictions, such as idiosyncratic volatility per the FF3 model, daily trading volume, and volatility of turnover. As a robustness check, we also extend the number of benchmark factors and replace the test portfolios, but the conclusion changes little. However, we observe only a slight improvement when we add these newly found factors to the LSY4 model. This indicates that the current factor zoo offers limited scope for pricing A-share stocks better than LSY4. We suggest that future research focus on constructing new characteristics in order to develop new models.

The contribution of our paper is twofold. First, we fill a gap in asset pricing studies of the Chinese market. As discussed above, few studies apply frontier asset pricing models to the Chinese market. Although the data library for the A-share market is not comparable to WRDS, we believe it is sufficient for a thorough study. Unfortunately, most researchers still focus on testing the Fama–French factors, ignoring the rapid development of the literature in this field. Moreover, these studies do not even reach a consensus, owing to their differing samples. Examples of such studies include Hu et al. [12], Jiang et al. [13], and Cheung et al. [14]; some papers emphasize the importance of the value factor, while others hold the opposite opinion. A notable exception is the recent study by Liu et al. [15], which can be regarded as a pioneer in exploring factor models suitable for China. They construct a three-factor model using market return, size, and the *E*/*P* ratio and find that their model plus a turnover factor can explain 10 significant anomalies in the Chinese market after excluding the smallest 30% of stocks. Nonetheless, this model still needs time to be universally accepted since it is newly published. Our paper builds on the work of Liu et al. [15] and provides more robust evidence to support it.

Second, we follow the trend in the recent asset pricing literature of using machine learning techniques to search for latent pricing factors. As more and more factors are discovered, researchers face a “multidimensionality” problem. Thus, a strand of literature takes advantage of variable selection approaches to screen for effective factors. Representative works include Han et al. [16], Freyberger et al. [17], and Feng et al. [11], all of which use LASSO regressions to exclude redundant factors. Inspired by their work, our paper applies LASSO to estimate SDF loadings based on a data library constructed by Qiao [8], who followed Hou et al. [9] and replicated 231 anomalies with Chinese stocks. Given some unique characteristics of the A-share market, the categories and number of significant anomalies differ substantially from those in the US market. Thus, we pick out these significant anomalies and test whether they contribute beyond the existing pricing factors using a double-selection LASSO approach. We hope our findings will inspire future asset pricing studies.

The rest of the paper is organized as follows. Section 2 describes the data and the construction of the pricing factors. Section 3 reports the main empirical results from comparing asset pricing models, including anomaly regressions, spanning regressions, and GRS tests. Section 4 introduces the double-selection LASSO approach and applies it to evaluate other potential risk factors. Section 5 concludes.

#### 2. Data

##### 2.1. Data Source and Filters

We collect trading data and financial data from the China Stock Market & Accounting Research Database (CSMAR). Our sample period is from July 2000 to December 2019. We do not use data before June 2000 because the cash flow statements of A-share stocks begin at the end of 1997, and the calculation of some factors requires data from several previous consecutive quarters. In the A-share market, the income statement and the cash flow statement only report flow data between two recent quarters. Thus, we recover stock values of accounting data by subtracting values from the prior quarter. Note that Chinese listed firms are required to release quarterly reports only after 2002, so we subtract flow data semiannually before 2002.

Our sample includes all A-shares, and we impose several filters to exclude stocks that (1) belong to the financial industry, (2) have been listed for less than six months, (3) have fewer than 15 trading days in the last month, or (4) have fewer than 120 trading days in the past 12 months. In addition, as Liu et al. [15] argue, the smallest 30% of stocks in the A-share market are potential shell targets in reverse mergers, which contaminates their returns. Thus, we also exclude the smallest 30% of stocks each month to avoid this shell value contamination.
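The filters above can be sketched for a single monthly cross-section as follows. This is a minimal illustration, not the authors' code: the function name, the argument names, and the choice to compute the 30% size cutoff over all stocks in the cross-section (rather than over the survivors of filters (1) to (4)) are assumptions.

```python
import numpy as np

def apply_filters(is_financial, months_listed, days_last_month,
                  days_past_year, market_cap, size_cut=0.30):
    """Boolean mask of stocks kept in one month's cross-section (hypothetical helper)."""
    is_financial = np.asarray(is_financial, dtype=bool)
    months_listed = np.asarray(months_listed)
    days_last_month = np.asarray(days_last_month)
    days_past_year = np.asarray(days_past_year)
    market_cap = np.asarray(market_cap, dtype=float)

    keep = (~is_financial                 # (1) drop financial firms
            & (months_listed >= 6)        # (2) listed at least six months
            & (days_last_month >= 15)     # (3) at least 15 trading days last month
            & (days_past_year >= 120))    # (4) at least 120 trading days in past year

    # Assumption: the size cutoff is the 30th percentile of all stocks this month.
    cutoff = np.quantile(market_cap, size_cut)
    return keep & (market_cap > cutoff)
```

The mask would be recomputed every month, since both the trading-day counts and the size cutoff change over time.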

##### 2.2. Factor Pricing Models

This section briefly introduces the factor pricing models we compare in this paper, including the Fama–French six-factor model (FF6), the Hou–Xue–Zhang q5-factor model (q5), the Stambaugh–Yuan four-factor model (SY4), the Daniel–Hirshleifer–Sun three-factor model (DHS-3), and the Liu–Stambaugh–Yuan four-factor model (LSY4). The construction details for each factor are provided in the Supplementary Appendix.

###### 2.2.1. FF6 Model

Fama and French [6] propose their five-factor model by adding two additional factors, CMA and RMW, into the well-known 3-factor model [18]. CMA is the average returns on stocks with low investment minus the average returns on stocks with high investment. RMW is the average returns on stocks with high operating profitability minus the average returns on stocks with low operating profitability. Fama and French [10] further add the sixth factor, UMD, to control the momentum effect found by Jegadeesh and Titman [19]. UMD is the difference of returns on winner and loser portfolios.

###### 2.2.2. q5 Model

Hou et al. [20] propose the famous q-factor model, and Hou et al. [2] augment it to the q5-factor model, whose factors include MKT, $R_{\mathrm{ME}}$, $R_{\mathrm{ROE}}$, $R_{\mathrm{IA}}$, and $R_{\mathrm{EG}}$. In parallel with the Fama–French factors, $R_{\mathrm{ME}}$, $R_{\mathrm{ROE}}$, and $R_{\mathrm{IA}}$ correspond to SMB, RMW, and CMA, respectively. However, they are derived from the q-theory instead of cash-flow discounting. The fifth factor, $R_{\mathrm{EG}}$, describes expected investment growth.

###### 2.2.3. SY4 Model

Stambaugh and Yuan [21] do not construct their pricing factor by sorting stocks based on a specific characteristic; instead, they combine rankings of stocks with respect to 11 documented anomalies. The two new factors in addition to size and market factors, MGMT and PERF, are return spreads between underpriced stocks and overpriced stocks.

###### 2.2.4. DHS-3 Model

Daniel et al. [22] propose a model that supplements the CAPM with two behavioral factors. The financing factor FIN captures long-horizon mispricing, which is induced by investors’ overconfidence and exploited by managers’ decisions to issue or repurchase equity. The post-earnings-announcement drift factor PEAD captures short-horizon mispricing, which is induced by investors’ limited attention and underreaction to earnings surprises.

###### 2.2.5. LSY4 Model

Liu et al. [15] re-examine the size and value effect in China and construct new factors suitable for the A-share market. Compared to Fama and French [18], they replace the book-to-market ratio with earnings-to-price ratio and propose a new factor named VMG. Moreover, to subsume more anomalies, they add the fourth factor PMO based on one-month abnormal turnover. PMO represents the return spread between stocks about which investors are relatively pessimistic and stocks about which investors are relatively optimistic.

##### 2.3. Test Anomalies

Hou et al. [9] list a total of 447 anomalies published in top finance and accounting journals. However, one characteristic can generate several anomalies through different holding periods. Thus, their 447 anomalies are derived from 190 characteristics, which can be classified into 6 categories: *momentum, value versus growth, investment, profitability, intangibles,* and *trading frictions*. Following the details disclosed in the online appendix of Hou et al. [9], we replicate the 104 characteristics that are available and have the fewest missing values in the A-share market. Qiao [8] performs similar work, but our characteristics encompass theirs and are more diverse in each category. We show their acronyms and definitions in Table 1.

One problem is how to combine characteristics with different frequencies. We need to merge the values of low-frequency characteristics with monthly stock returns. There are three cases: (1) for annually updated characteristics, returns from July of year *t* to June of year *t* + 1 are matched with characteristics constructed in December of year *t* − 1; (2) for quarterly updated characteristics, returns in the current month are matched with the latest available characteristics, which are constructed with data published in the most recent financial report; (3) for monthly updated characteristics, returns in the current month are matched with characteristics constructed at the end of the previous month. Then, we sort stocks into deciles on each characteristic and calculate the value-weighted decile returns for the next month. The deciles are rebalanced monthly. The anomaly return is earned by holding the decile portfolio with high expected returns and selling short the decile portfolio with low expected returns.
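The decile sort and long-short spread described above can be sketched for one month as follows. This is an illustrative sketch under stated assumptions: the function and argument names are hypothetical, deciles are assigned by rank over all stocks in the cross-section, and the signal is assumed to be oriented so that a higher value means a higher expected return.

```python
import numpy as np

def decile_spread(signal, next_ret, mktcap, n_bins=10):
    """Value-weighted high-minus-low decile return for one month (hypothetical helper)."""
    signal = np.asarray(signal, dtype=float)
    next_ret = np.asarray(next_ret, dtype=float)
    mktcap = np.asarray(mktcap, dtype=float)

    # Rank stocks on the signal and map ranks to bins 0..n_bins-1.
    ranks = signal.argsort().argsort()
    bins = (ranks * n_bins) // len(signal)

    # Value-weighted return of each decile over the following month.
    vw = np.array([
        np.average(next_ret[bins == b], weights=mktcap[bins == b])
        for b in range(n_bins)
    ])
    return vw[-1] - vw[0], vw
```

Calling the function month by month and stacking the spreads gives the anomaly return series; the sketch requires at least `n_bins` stocks so that no decile is empty.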

#### 3. Comparing Factor Models

##### 3.1. Summary Statistics

We start by reporting the in-sample performance of individual factors. Table 2 displays summary statistics for each factor model. The sample period is from July 2000 to December 2019. We find that no Fama–French factor is significant and that the model's maximum Sharpe ratio, which is the annualized Sharpe ratio of the tangency portfolio spanned by these factors, is only 1.25. Specifically, the value effect measured by HML delivers an average return of only 0.37% (*t* = 1.43) per month. The other factors SMB, RMW, and UMD are even weaker, with returns of 0.35% (*t* = 1.40), 0.24% (*t* = 1.11), and 0.23% (*t* = 0.86), respectively. Consistent with Liu et al. [15], the investment factor CMA is very weak in the A-share market, since its mean return is even negative. Likewise, the q-factors, which show strong explanatory power in the US market, do not earn the risk premiums we expected. Similar to Fama and French's CMA, the q5 investment factor $R_{\mathrm{IA}}$ has a mean return close to zero, about 0.07% (*t* = 0.49) per month. The addition of the expected growth factor $R_{\mathrm{EG}}$ is likewise of little use; it yields a tiny premium of only 0.18% (*t* = 0.85). However, we find that the q5 profitability factor $R_{\mathrm{ROE}}$ is much stronger than RMW, with an average return of 0.87% (*t* = 4.11) versus 0.24% (*t* = 1.11). Although both are return spreads sorted on firms’ ROE, $R_{\mathrm{ROE}}$ uses quarterly data and is updated every three months. This indicates that stocks in the A-share market are more sensitive to recent changes in earnings, and that firms with high profitability indeed generate substantially higher stock returns, which is exactly the main conclusion of Jiang et al. [13].
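For reference, the maximum Sharpe ratio of a set of factors is the Sharpe ratio of their in-sample tangency portfolio, $\sqrt{\mu^{\top}\Sigma^{-1}\mu}$, annualized by $\sqrt{12}$ for monthly data. The following is a minimal sketch; the function name and the use of the sample covariance with the default degrees-of-freedom correction are assumptions, not details taken from the paper.

```python
import numpy as np

def max_sharpe(factor_returns, periods_per_year=12):
    """Annualized maximum Sharpe ratio spanned by the factors (hypothetical helper).

    factor_returns: T x K array of monthly factor (excess or long-short) returns.
    """
    f = np.asarray(factor_returns, dtype=float)
    mu = f.mean(axis=0)
    cov = np.atleast_2d(np.cov(f, rowvar=False))
    # Squared Sharpe ratio of the tangency portfolio: mu' Sigma^{-1} mu.
    sr2 = mu @ np.linalg.solve(cov, mu)
    return np.sqrt(sr2 * periods_per_year)
```

With a single factor, the expression reduces to the absolute value of the usual annualized Sharpe ratio, which is a convenient sanity check.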

This fact can also be seen from anomaly-based factors. We find that the maximum Sharpe ratio for Stambaugh and Yuan’s mispricing factors is only about 0.73 on an annualized basis, the lowest in Table 2. As listed in the Supplementary Appendix, MGMT includes various investment measures, whereas PERF includes various profitability measures. Thus, it is not surprising that the average return of MGMT is nearly zero (0.01%, *t* = 0.07), while the average return of PERF is significantly positive (0.63%, *t* = 2.12). Likewise, for DHS-3, the post-earnings-announcement drift (PEAD) factor, which is constructed from anomalies reflecting earnings surprises, earns on average 0.98% (*t* = 4.97) per month. However, the financing factor FIN delivers a premium of only 0.12% (*t* = 0.84). Finally, the LSY4 factors show the most significant risk premiums, with a maximum Sharpe ratio of 2.16. VMG effectively reflects the value effect; it earns an average return of 1.09% (*t* = 5.04), the highest among all factors. The sentiment factor PMO is also profitable and yields on average 0.81% (*t* = 3.68).

Interestingly, Table 2 also shows that every pricing factor exhibits leptokurtosis and fat tails, since all kurtosis values are greater than 3. This is consistent with the Jarque–Bera statistics: the null hypothesis that the time series follows a normal distribution is rejected at the 1% significance level for all factors except two, one of which is PEAD.
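As a reminder of how the normality test works, the Jarque–Bera statistic combines sample skewness $S$ and kurtosis $K$ as $\mathrm{JB} = \frac{T}{6}\left(S^2 + \frac{(K-3)^2}{4}\right)$ and is asymptotically $\chi^2(2)$ under normality, so the 1% critical value is about 9.21. The sketch below is illustrative; the function name is hypothetical and the moments use the population (divide-by-$T$) convention.

```python
import numpy as np

def jarque_bera(x):
    """Return (JB statistic, skewness, kurtosis) for a return series (hypothetical helper)."""
    x = np.asarray(x, dtype=float)
    n = x.size
    d = x - x.mean()
    m2 = (d ** 2).mean()                  # second central moment
    skew = (d ** 3).mean() / m2 ** 1.5    # sample skewness
    kurt = (d ** 4).mean() / m2 ** 2      # sample kurtosis (normal = 3)
    jb = n / 6.0 * (skew ** 2 + (kurt - 3.0) ** 2 / 4.0)
    return jb, skew, kurt
```

A kurtosis above 3 signals fatter tails than the normal distribution, which is the pattern reported for the factors in Table 2.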

Table 3 reports the correlation matrix of all pricing factors. For clarity, SMB in Table 3 corresponds to the size factor in LSY4. We are mainly interested in how the LSY4 factors are related to the others. Not surprisingly, SMB is nearly equivalent to the size factor in the q-factor model, with a correlation of 0.92. The value factor VMG is highly correlated with the profitability factors: it has a correlation of 0.68 with RMW and 0.81 with the q5 profitability factor $R_{\mathrm{ROE}}$. This is because VMG is derived from the earnings-to-price ratio, which is a comprehensive characteristic reflecting the value and expected earnings growth of a stock. The conventional value factor HML is not closely related to VMG. For the sentiment factor PMO, we find that few factors are highly correlated with it. The closest factor is HML, with a correlation of only −0.27. This indicates that other factors cannot subsume the premium of the turnover effect, and thus it is reasonable to add PMO to the model to price A-share stocks.

##### 3.2. Explaining Anomalies

In Table 4, we regress 104 anomaly returns on various factor models to investigate how well the models explain anomalies. Columns 1–3 report the mean return, standard deviation, and *t*-statistic for each anomaly over the period from July 2000 to December 2019. For these unadjusted raw returns, we find that 42 of 104 anomalies are significant (shown in bold). This indicates that mispricing is non-negligible in the A-share market and that strategies based on mispricing can earn significant profits. Next, columns 4–8 report *t*-statistics of alphas controlling for the various pricing factors, with Newey–West adjustments using 12 monthly lags. Surprisingly, the models proposed for the US market, displayed in columns 4–7, are nearly useless: there are still 49, 43, 55, and 39 alphas with *t*-statistics higher than 1.96, respectively. In other words, significant anomalies can hardly be eliminated by these four models. In comparison, LSY4 is highly effective in explaining these anomalies, leaving only six of 104 alphas significant: Dimson beta (**betad**), one measure of seasonality (**rn25**), two measures of idiosyncratic skewness per different models (**isff** and **isq**), cumulative abnormal returns around earnings announcement dates (**abr**), and 6-month residual momentum (**e6**).
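The alpha *t*-statistics described above come from time-series regressions of each anomaly return on a model's factors with Newey–West standard errors. The following is a minimal NumPy sketch of that computation; the function name is hypothetical, and the Bartlett kernel without a small-sample correction is an assumption, since the paper does not state the exact convention.

```python
import numpy as np

def alpha_newey_west(y, factors, lags=12):
    """OLS alpha and its Newey-West t-statistic (hypothetical helper).

    y: length-T anomaly return series; factors: T x K factor returns.
    """
    y = np.asarray(y, dtype=float)
    F = np.asarray(factors, dtype=float).reshape(len(y), -1)
    X = np.column_stack([np.ones(len(y)), F])      # intercept first
    beta = np.linalg.lstsq(X, y, rcond=None)[0]
    resid = y - X @ beta

    Xe = X * resid[:, None]                        # x_t * e_t
    S = Xe.T @ Xe                                  # lag-0 term
    for l in range(1, lags + 1):
        w = 1.0 - l / (lags + 1.0)                 # Bartlett kernel weight
        G = Xe[l:].T @ Xe[:-l]
        S += w * (G + G.T)

    XtX_inv = np.linalg.inv(X.T @ X)
    cov = XtX_inv @ S @ XtX_inv                    # sandwich covariance
    return beta[0], beta[0] / np.sqrt(cov[0, 0])
```

Setting `lags=3` instead of `lags=12` gives the shorter correction window used in spanning regressions of one factor on another.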

Table 5 further reports other statistics reflecting the explanatory power of the pricing models. Again, we use these 104 anomalies as test assets. Average alpha is the mean absolute value of the 104 alphas with respect to each model, and average *t* is the mean absolute value of the corresponding *t*-statistics. Following Shanken [23], pricing errors are computed as

$$\Delta = \alpha^{\top}\Sigma^{-1}\alpha, \tag{1}$$

where *α* is the vector of alphas and Σ is the covariance matrix of residuals for each factor model. We also perform the well-known GRS test proposed by Gibbons et al. [24] to test whether the alphas of the 104 anomalies are jointly zero. Overall, LSY4 has a clear advantage over its counterparts. It generates the smallest average absolute alpha of 0.31% as well as an average absolute *t*-statistic of only 1.00. Its pricing error is the second lowest at 3.65, slightly higher than that of the q-factor model (3.61). As for the GRS test, all models reject the null hypothesis that alphas are jointly zero. This is not surprising since, as Table 4 shows, no factor model can accommodate all anomalies. However, LSY4 yields the smallest GRS statistic.
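To make the joint test concrete, the sketch below computes the quadratic form in alphas and the residual covariance together with the GRS statistic, $\frac{T-N-K}{N}\,\frac{\alpha^{\top}\Sigma^{-1}\alpha}{1+\mu_f^{\top}\Omega^{-1}\mu_f}$, which follows an $F(N, T-N-K)$ distribution under normality. The function name and the use of maximum-likelihood (divide-by-$T$) covariance estimates are assumptions rather than details stated in the paper.

```python
import numpy as np

def grs_statistic(test_returns, factors):
    """Return (GRS statistic, alpha' Sigma^{-1} alpha) (hypothetical helper).

    test_returns: T x N test asset returns; factors: T x K factor returns.
    """
    R = np.asarray(test_returns, dtype=float)
    T, N = R.shape
    F = np.asarray(factors, dtype=float).reshape(T, -1)
    K = F.shape[1]

    X = np.column_stack([np.ones(T), F])
    B = np.linalg.lstsq(X, R, rcond=None)[0]       # (K+1) x N coefficients
    alpha = B[0]
    resid = R - X @ B
    sigma = resid.T @ resid / T                    # residual covariance (MLE)

    mu = F.mean(axis=0)
    omega = np.atleast_2d(np.cov(F, rowvar=False, bias=True))

    quad = alpha @ np.linalg.solve(sigma, alpha)   # alpha' Sigma^{-1} alpha
    sh2 = mu @ np.linalg.solve(omega, mu)          # squared Sharpe of factors
    grs = (T - N - K) / N * quad / (1.0 + sh2)
    return grs, quad
```

The statistic requires $T > N + K$, which holds for 104 test assets over the 234 months from July 2000 to December 2019.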

##### 3.3. Spanning Regressions

So far, we have shown the strong pricing power of the LSY4 model. Another key question is whether LSY4 can price the factors of other models, which matters for model comparison. In this section, we regress the FF6, q5, SY4, and DHS-3 factors on the LSY4 factors in turn, to show whether the LSY4 factors can explain their risk premiums. Table 6 displays the results. The Durbin–Watson statistics in the last column indicate that we cannot rule out autocorrelation in the residuals. Thus, we adjust the standard errors of the regression coefficients using the Newey and West [25] correction with three lags. We find that, except for PEAD and one other factor, the alphas are insignificant under the LSY4 specification, indicating that the corresponding factors are accommodated by the LSY4 factors. For example, the alpha of the powerful q-factor is only 0.07% (*t* = 0.48), and the investment premium is further reduced to −0.06% (*t* = −0.41). In contrast, PEAD survives the LSY4 model with a highly significant alpha of 0.71% (*t* = 3.35). This is consistent with Table 4, since, as illustrated previously, PEAD is constructed from the four-day cumulative abnormal return (**abr**), which cannot be explained by the LSY4 factors. We also report two GRS statistics. We can hardly reject the null that all alphas are jointly equal to zero (GRS = 2.23), but once PEAD is excluded, the GRS statistic declines to 1.47.

#### 4. Exploring New Factors

In this section, we explore whether any significant anomalies can serve as latent pricing factors to be added to the current LSY4 model. We rely on the double-selection LASSO approach proposed by Feng et al. [11] and evaluate the marginal contributions of a series of new factors beyond the existing factors. We start with a brief introduction to this methodology and then report our findings.

##### 4.1. Double-Selection LASSO

In asset pricing theory, the ability of a given factor to explain asset prices is reflected in its stochastic discount factor (SDF) loadings, as discussed by Cochrane [26] and Ferson [27]. Typically, we assume the stochastic discount factor has the following linear specification:

$$m_t = \gamma_0^{-1} - \gamma_0^{-1}\lambda_h^{\top}\left(h_t - \mathrm{E}[h_t]\right) - \gamma_0^{-1}\lambda_g^{\top}\left(g_t - \mathrm{E}[g_t]\right), \tag{2}$$

where $\gamma_0$ is the zero-beta rate, $h_t$ is a vector of benchmark factors, $g_t$ is a vector of new factors to be tested, and $\lambda_h$ and $\lambda_g$ are the so-called SDF loadings. Given a vector of test asset returns $r_t$, we may want to evaluate the explanatory power of $g_t$ while controlling for the benchmark factors $h_t$. By the definition of the SDF, formula (2) can be expressed as a relationship between expected returns and covariances with the factors $g_t$ and $h_t$:

$$\mathrm{E}[r_t] = \iota_n\gamma_0 + C_g\lambda_g + C_h\lambda_h, \tag{3}$$

where $\mathrm{E}[r_t]$ is a vector of expected returns, $\iota_n$ is a vector of ones, $C_g$ is the covariance matrix of $r_t$ and $g_t$, and $C_h$ is the covariance matrix of $r_t$ and $h_t$. Intuitively, the SDF loadings $\lambda_g$ and $\lambda_h$ measure how expected returns are related to covariances with the corresponding factors. However, since we mainly focus on the significance of $\lambda_g$, and considering the dependence between $g_t$ and $h_t$, it is worthwhile to project $C_g$ on $C_h$:

$$C_g = \iota_n\xi^{\top} + C_h\chi^{\top} + C_e, \tag{4}$$

where $\xi$ and $\chi$ are coefficient matrices and $C_e$ is the residual matrix.

Because there are many pricing factors in practice (Fama–French factors, Hou–Xue–Zhang q-factors, etc.), we need to determine the components of the benchmark factor vector $h_t$ before we test the new factors $g_t$. This requires us to select the factors with the most explanatory power to serve as the benchmark. The Least Absolute Shrinkage and Selection Operator (LASSO) regression is one approach that helps us reduce the number of benchmark factors. LASSO is a modification of the ridge regression method in which the constraint in the Lagrangian is on the absolute values of the parameters and the Lagrange multiplier governs the shrinkage (Ghysels [28]). Specifically, we run a cross-sectional LASSO regression of mean returns on the covariances between returns and all benchmark factors:

$$\min_{\gamma,\lambda}\left\{ n^{-1}\left\lVert \bar{r} - \iota_n\gamma - \widehat{C}_h\lambda \right\rVert^2 + \tau_0\, n^{-1}\lVert\lambda\rVert_1 \right\}, \tag{5}$$

where $n$ is the number of test assets, $\iota_n$ is a vector of ones, $\bar{r}$ is the time series mean of all test asset returns, $\widehat{C}_h$ is the sample covariance matrix of the returns $r_t$ and demeaned $h_t$, and $\tau_0$ is the LASSO penalty parameter. We usually rely on cross validation to choose $\tau_0$ and solve for the LASSO coefficients with the least angle algorithm [29] or the coordinate descent algorithm [30]. The LASSO regression shrinks some elements of $\lambda$ to zero and generates a sparse specification. We denote the indices of the remaining benchmark factors as the set $\widehat{I}_1$.

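However, in finite samples, the traditional LASSO regression does not have the so-called “oracle property,” that is, we cannot select the true model with probability one. This implies that the result of the first LASSO regression may suffer from severe omitted variable bias, so our inference about the loadings may be erroneous. Thus, a single LASSO step is not enough to ensure that we control for all useful benchmark factors when testing the new factors $g_t$. We need to conduct another LASSO regression to search for factors that are omitted in the first step but still play a role in explaining test asset returns. For each new factor $j = 1, \ldots, d$, we solve

$$\min_{\xi_j,\chi_j}\left\{ n^{-1}\left\lVert \widehat{C}_{g,j} - \iota_n\xi_j - \widehat{C}_h\chi_j \right\rVert^2 + \tau_j\, n^{-1}\lVert\chi_j\rVert_1 \right\}, \tag{6}$$

where $\widehat{C}_g$ is the sample covariance matrix of the returns $r_t$ and demeaned $g_t$, $\widehat{C}_{g,j}$ is its $j$th column, $\widehat{C}_h$ is the sample covariance matrix of $r_t$ and the demeaned benchmark factors $h_t$, $\iota_n$ is a vector of ones, and $\tau_j$ is the penalty parameter. Intuitively, for each new factor $g_j$, we identify the benchmark factors whose covariances with returns best explain the covariance between returns and $g_j$. Denote the union of all indices of benchmark factors selected in this step as the set $\widehat{I}_2$.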

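So far, we have performed two rounds of LASSO regressions to select sufficient benchmark factors. Now, we can evaluate the marginal importance of the new factors with the following cross-sectional regression. We are concerned with the SDF loadings $\lambda_g$ of the new factors, which tell us whether the corresponding factors are significant in pricing test asset returns:

$$\left(\widehat{\gamma}_0, \widehat{\lambda}_g, \widehat{\lambda}_h\right) = \arg\min_{\gamma_0,\lambda_g,\lambda_h}\left\{ \left\lVert \bar{r} - \iota_n\gamma_0 - \widehat{C}_g\lambda_g - \widehat{C}_h\lambda_h \right\rVert^2 : \lambda_{h,j} = 0 \ \ \forall j \notin \widehat{I} = \widehat{I}_1 \cup \widehat{I}_2 \right\}, \tag{7}$$

where $\bar{r}$ is the vector of mean test asset returns, $\iota_n$ is a vector of ones, $\widehat{C}_g$ and $\widehat{C}_h$ are the sample covariance matrices of returns with the new and benchmark factors, and $\widehat{I}_1$ and $\widehat{I}_2$ are the index sets of benchmark factors selected in the first and second LASSO steps.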
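The two selection steps and the final regression described in this subsection can be sketched as follows. This is a simplified illustration under stated assumptions, not the implementation of Feng et al. [11]: the function names are hypothetical, the LASSO is solved by a plain coordinate descent with fixed penalties `tau1` and `tau2` instead of cross validation, covariances are not standardized, and the standard errors needed for inference on the loadings are omitted.

```python
import numpy as np

def lasso_cd(X, y, tau, n_iter=200):
    """Coordinate-descent LASSO with an unpenalized intercept (hypothetical helper).

    Minimizes (1/2n)||y - a - Xb||^2 + tau * ||b||_1 and returns b.
    """
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    n, p = Xc.shape
    b = np.zeros(p)
    col_ss = (Xc ** 2).sum(axis=0) / n
    for _ in range(n_iter):
        for j in range(p):
            partial = yc - Xc @ b + Xc[:, j] * b[j]    # residual without column j
            rho = Xc[:, j] @ partial / n
            b[j] = np.sign(rho) * max(abs(rho) - tau, 0.0) / col_ss[j]
    return b

def double_selection(R, H, G, tau1, tau2):
    """Return (SDF loading estimates for G, indices of selected benchmark factors).

    R: T x n test asset returns; H: T x p benchmark factors; G: T x d new factors.
    """
    T, n = R.shape
    rbar = R.mean(axis=0)
    Rc = R - rbar
    Ch = Rc.T @ (H - H.mean(axis=0)) / T               # n x p covariances with H
    Cg = Rc.T @ (G - G.mean(axis=0)) / T               # n x d covariances with G

    # Step 1: benchmark factors that explain mean returns.
    keep = set(np.flatnonzero(lasso_cd(Ch, rbar, tau1)).tolist())
    # Step 2: benchmark factors that explain the covariances with each new factor.
    for j in range(Cg.shape[1]):
        keep |= set(np.flatnonzero(lasso_cd(Ch, Cg[:, j], tau2)).tolist())
    keep = sorted(keep)

    # Step 3: OLS of mean returns on Cg and the selected columns of Ch.
    X = np.column_stack([np.ones(n), Cg, Ch[:, keep]])
    coef = np.linalg.lstsq(X, rbar, rcond=None)[0]
    return coef[1:1 + Cg.shape[1]], keep
```

Because return covariances are small in magnitude, the penalties must be scaled accordingly; a very large penalty selects no benchmark factors, while a zero penalty keeps all of them.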

##### 4.2. Factors and Test Portfolios

We construct our factor library based on the previous discussion. Note that the aim of this section is to explore the explanatory power of new factors beyond a series of benchmark factors. Thus, the vector of benchmark factors $h_t$ includes all pricing factors appearing in Table 2, which come from mainstream factor models in the latest literature. In addition, we add two more factors. The first is QMJ, the Quality-Minus-Junk factor proposed by Asness et al. [31]. The other is a liquidity factor, AMI, which is constructed from Amihud’s measure. Both factors are confirmed to have significant return spreads in the A-share market; see Kang and Zhang [32] and Hu and Gu [33] for details. Our main empirical results are derived from these 18 benchmark factors in total. One may argue that our factor library is too small to evaluate the contributions of new latent factors, especially compared to the related literature on the US market. We respond to this concern in two ways. First, unlike in the US market, there is no clear timeline of publication dates for newly discovered factors in China. Studies using A-share market data mainly focus on testing the significance of factors discovered in developed countries. The purpose of our paper is to investigate whether we can add new factors to the baseline LSY4 model; hence, it is natural to treat current pricing factors as benchmark factors. Second, as a robustness check, we also expand the factor library to accommodate all the other anomalies in Table 1 except for the anomalies we are going to test. We find that our empirical results change little, indicating that current pricing factors are powerful enough to subsume the main part of cross-sectional test asset returns.

For the vector of new factors $g_t$, we first consider significant anomalies unexplained by the LSY4 model. Table 4 shows that only 6 anomalies yield significant LSY4 alphas. Among them, cumulative abnormal returns around earnings announcement dates (**abr**), Dimson beta (**betad**), and one measure of seasonality (**ra25**) have significant return spreads. We regard these three factors as proper candidates for latent new factors. In addition, we draw on the work of Qiao [8], who replicates 231 anomalies in the A-share market. We test significant anomalies found in that paper, including quarterly assets-to-market (**amq**), changes in return on assets (**droa**), changes in return on equity (**droe**), dollar trading volume (**dtv**), idiosyncratic volatility per the FF3 model (**ivff**), long-term reversal (**lrev**), price momentum with prior 24 month returns (**m24**), maximum daily return (**mdr**), R&D expense to market equity (**rdm**), quarterly sales growth (**sgq**), quarterly sales-to-price ratio (**spq**), short-term reversal (**srev**), total volatility (**tv**), variation of share dollar trading volume (**vdtv**), and variation of share turnover (**vturn**). Note that some of these anomalies have already been investigated in the previous section. Although they may generate insignificant return spreads according to Table 4, we still add them to the set of new factors $g_t$ since doing so contributes to the literature in this field. We reconstruct these anomalies following the standard procedure of Fama and French [18] so that they are comparable with the other factors. For each anomaly, we split stocks into 3 portfolios according to the 30th and 70th percentiles of the corresponding characteristic, where high rankings are associated with higher future average returns. We also split stocks into two groups according to median size. Factor returns are the spreads between the average of the two high-characteristic portfolios and the average of the two low-characteristic portfolios.
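The 2 × 3 construction described above can be sketched for one month as follows. The function and argument names are hypothetical, and two details are assumptions rather than statements from the paper: portfolio returns are value-weighted, and the size and characteristic sorts are independent with breakpoints taken over all stocks in the cross-section. The characteristic is assumed to be signed so that high values predict high returns.

```python
import numpy as np

def two_by_three_factor(char, size, next_ret, mktcap):
    """High-minus-low factor return from a 2 x 3 size/characteristic sort (hypothetical helper)."""
    char = np.asarray(char, dtype=float)
    size = np.asarray(size, dtype=float)
    next_ret = np.asarray(next_ret, dtype=float)
    mktcap = np.asarray(mktcap, dtype=float)

    big = size >= np.median(size)                  # median size split
    lo, hi = np.quantile(char, [0.3, 0.7])         # 30th and 70th percentiles
    high, low = char >= hi, char <= lo

    def vw(mask):
        # Value-weighted return of the stocks in one portfolio.
        return np.average(next_ret[mask], weights=mktcap[mask])

    high_leg = 0.5 * (vw(high & big) + vw(high & ~big))
    low_leg = 0.5 * (vw(low & big) + vw(low & ~big))
    return high_leg - low_leg
```

Averaging the big and small portfolios within each leg keeps the factor roughly neutral to size, which is the purpose of the double sort.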

We use 650 portfolios as test assets based on our factor library. Following the procedure of Fama and French [18], stocks are independently sorted into 5 × 5 portfolios by market value and one firm characteristic. To avoid imposing a specific factor structure on our test portfolios, we choose as many characteristics as possible, all of which are used to construct the benchmark factors $h_t$ and the new factors $g_t$. The sorting characteristics include cumulative abnormal returns around earnings announcement dates, turnover, Amihud’s liquidity measure, quarterly assets-to-market, Dimson beta, book-to-market ratio, changes in return on assets, changes in return on equity, dollar trading volume, expected investment growth, earnings-to-price ratio, investment growth, idiosyncratic volatility per the FF3 model, long-term reversal, price momentum with prior 24 month returns, maximum daily return, 11-month momentum, seasonality, R&D expense to market equity, return on equity, quarterly sales growth, quarterly sales-to-price ratio, short-term reversal, total volatility, variation of share dollar trading volume, and variation of share turnover. These 26 characteristics yield 26 × 25 = 650 portfolios. As a robustness check, we also form another 156 portfolios from 3 × 2 sorts.

##### 4.3. Empirical Results

Now, we apply the double-selection LASSO approach to the A-share market. We start by reporting summary statistics of the benchmark and new factors that were not introduced in the previous discussion. Table 7 displays means, *t*-statistics, annualized Sharpe ratios, skewness, kurtosis, and Jarque–Bera statistics of 20 factors in total. We find that all these factors have significant return spreads except for long-term reversal (**lrev**) and price momentum with prior 24 month returns (**m24**). This is slightly inconsistent with the findings of Qiao [8]; unlike that study, we exclude the smallest 30% of stocks when constructing factors. Nevertheless, Table 7 indicates that our selection of tested factors is reasonable, since most of them yield extremely high *t*-statistics. Similar to the pricing factors discussed before, these alternative factors are also leptokurtic with fat tails.
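For reference, the statistics of the kind reported in Table 7 can be computed from a factor return series as in the sketch below. Several choices here are assumptions rather than details taken from the paper: monthly data (`periods_per_year=12`), a plain *t*-statistic without autocorrelation adjustment, and raw kurtosis (equal to 3 under normality).

```python
import numpy as np

def factor_summary(returns, periods_per_year=12):
    """Summary statistics for one long-short factor return series."""
    r = np.asarray(returns, dtype=float)
    n = r.size
    mean = r.mean()
    std = r.std(ddof=1)                       # sample standard deviation
    t_stat = mean / (std / np.sqrt(n))        # H0: mean return is zero
    sharpe = np.sqrt(periods_per_year) * mean / std

    z = r - mean
    m2 = np.mean(z ** 2)                      # central moments (divide by n)
    skew = np.mean(z ** 3) / m2 ** 1.5
    kurt = np.mean(z ** 4) / m2 ** 2          # raw kurtosis, 3 if normal
    # Jarque-Bera: large values reject normality (chi-square, 2 d.o.f.)
    jb = n / 6.0 * (skew ** 2 + (kurt - 3.0) ** 2 / 4.0)
    return {"mean": mean, "t": t_stat, "sharpe": sharpe,
            "skew": skew, "kurt": kurt, "jb": jb}
```

A kurtosis well above 3 together with a large Jarque–Bera statistic is what the text refers to as leptokurtosis and fat tails.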

We are interested in whether factors with significant risk premia contribute to explaining cross-sectional returns. As emphasized by Feng et al. [11] and Cochrane [26], the key to answering this question is to compare the factors’ SDF loadings. A positive SDF loading indicates that a high value of the factor captures good states of the world, and vice versa. Columns 1-2 of Table 8 report our SDF loading estimates and *t*-statistics from the double LASSO approach. The two steps select 4 benchmark factors in total: expected growth, quality-minus-junk (QMJ), liquidity (AMI), and market excess return (MKT), where the first three come from the first step and the last from the second step. One may ask why these four factors, rather than the LSY4 factors, serve as the benchmark. It should be emphasized that the set of selected benchmark factors depends heavily on the test assets. For the returns of our 650 test portfolios, the factors above show greater explanatory ability than the LSY4 factors. This inference is drawn from Table 8. In columns 1-2, we find that 8 out of 18 tested factors have significant SDF loadings at the 0.05 significance level, most of which fall into the trading-frictions category. In comparison, when we use the LSY4 factors as controls, as shown in columns 3-4, the number of significant tested factors rises to 12. Because the benchmark factors in this case are not selected optimally, the SDF loading estimates suffer from omitted variable bias and are possibly imprecise. Furthermore, in columns 5-6, we add all 18 benchmark factors simultaneously into the regression to estimate SDF loadings. The result is similar to that obtained with the LSY4 factors, with 11 significant factors. Selecting proper benchmark factors is therefore important for evaluating the contributions of new factors. In fact, as pointed out by Feng et al. [11], controlling for all available factors is an unbiased but inefficient approach, and the inefficiency becomes especially severe when the dimension of the factor space is large. We will show this in Table 9.
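To make the two selection steps concrete, the sketch below outlines a double-selection procedure in the spirit of Feng et al. [11]: a first LASSO of average returns on the covariances between test assets and benchmark factors, a second LASSO of each tested factor's covariances on the same regressors, and a final OLS on the tested factors plus the union of selected benchmarks. It is a simplified illustration, not the estimator used for Table 8. The function names, the plain coordinate-descent LASSO, the unweighted penalties `lam1` and `lam2`, and the omission of standard errors are all assumptions of this example.

```python
import numpy as np

def lasso_support(X, y, lam, n_iter=500):
    """Indices with non-zero LASSO coefficients (plain coordinate descent).

    Minimizes (1 / 2n) * ||y - X b||^2 + lam * ||b||_1 on centered data.
    """
    Xc, yc = X - X.mean(axis=0), y - y.mean()
    n, p = Xc.shape
    beta = np.zeros(p)
    col_ss = (Xc ** 2).sum(axis=0) / n
    for _ in range(n_iter):
        for j in range(p):
            if col_ss[j] == 0:
                continue
            partial = yc - Xc @ beta + Xc[:, j] * beta[j]  # residual without j
            rho = Xc[:, j] @ partial / n
            beta[j] = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_ss[j]
    return {int(j) for j in np.flatnonzero(beta != 0.0)}

def double_selection(R, G, H, lam1, lam2):
    """R: T x N test-asset returns, G: T x d tested factors,
    H: T x p benchmark factors. Returns (SDF loadings of G, selected H)."""
    T = R.shape[0]
    r_bar = R.mean(axis=0)
    Rc = R - r_bar
    C_g = Rc.T @ (G - G.mean(axis=0)) / T     # N x d covariances with G
    C_h = Rc.T @ (H - H.mean(axis=0)) / T     # N x p covariances with H

    selected = lasso_support(C_h, r_bar, lam1)            # step 1
    for j in range(C_g.shape[1]):                         # step 2
        selected |= lasso_support(C_h, C_g[:, j], lam2)
    selected = sorted(selected)

    # step 3: OLS of average returns on tested and selected covariances
    X = np.column_stack([np.ones(len(r_bar)), C_g, C_h[:, selected]])
    coef, *_ = np.linalg.lstsq(X, r_bar, rcond=None)
    return coef[1:1 + C_g.shape[1]], selected
```

The second step is what guards against omitted variable bias: a benchmark factor that is only weakly related to average returns but strongly correlated with the tested factor is still brought into the final regression.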

We next examine the robustness of our results, since the double LASSO approach relies on two inputs: the returns of test portfolios and the set of benchmark factors. In Table 10, we re-estimate SDF loadings by changing the test portfolios and expanding the set of benchmark factors. First, we replace the 650 test portfolios with the smaller set of 156 portfolios. Then, we add extra benchmark factors, namely the anomalies presented in Table 4 excluding the ones we need to test. We follow the standard procedure to convert anomaly returns to factor returns; that is, we use spreads between bivariate portfolios sorted by size and a specific characteristic instead of long-short decile spreads. In this way, we obtain another 81 benchmark factors, and the total number increases to 99. Intuitively, we want to explore whether the tested factors can distinctively explain cross-sectional returns beyond the rest of the characteristic-based anomalies. For convenience, Table 10 reports our baseline results on the left. In general, the results derived from 99 benchmark factors are similar to those using only 18 benchmark factors. This indicates that the original 18 factors dominate the others in explaining the variation of asset returns. By contrast, SDF loadings are less significant when we use only 156 portfolios. As mentioned before, the selection of test assets influences the SDF loading estimates. However, some tested factors remain significant across the different cases, including cumulative abnormal returns around earnings announcement dates (**abr**), Dimson beta (**betad**), changes in return on assets (**droa**), changes in return on equity (**droe**), and idiosyncratic volatility per the FF3 model (**ivff**). We conclude that these factors indeed contribute to explaining cross-sectional returns.

Finally, we investigate whether the aforementioned factors can improve the performance of the LSY4 model. Besides the five newly found factors, we also consider quality-minus-junk (QMJ) and liquidity (AMI), both of which are selected in the first step of LASSO. In practice, we augment the LSY4 model to a 5-factor model by adding the new factors to the original model one at a time. As in the previous section, for each new factor model we mainly focus on its ability to explain anomalies and other pricing factors. Table 11 shows the results. The first two rows report the GRS statistics and average *t*-statistics in explaining 104 anomalies. The last two rows are derived from simplified spanning regressions that only report alphas of two pricing factors, and PEAD. Note that these two factors still have significant alphas even after controlling for the LSY4 factors. By comparison, we find that **abr**, **droa**, and **droe** can improve the performance of the LSY4 model. All of the three factors can reduce the value of GRS statistic as well as alphas of and PEAD. However, the improvement is limited. None of them can fully explain all anomalies or eliminate the significance of the factor PEAD. Thus, relying on the current factor zoo alone is unlikely to enhance the pricing power of the LSY4 model. New characteristics beyond our data library are needed. We hope our paper can inspire future research in this field.
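The GRS statistic used in this comparison tests whether the time-series alphas of a set of test assets are jointly zero under a given factor model. A minimal sketch of the textbook formula follows; the function name `grs_statistic` is hypothetical, and the *p*-value step is left out to keep the example dependent on NumPy only.

```python
import numpy as np

def grs_statistic(test_returns, factors):
    """GRS statistic and alphas for test assets under a factor model.

    test_returns: T x N returns of test assets (e.g. anomaly spreads).
    factors: T x K factor returns. Requires T > N + K.
    Under normality the statistic follows F(N, T - N - K) when all alphas are 0.
    """
    Y = np.asarray(test_returns, dtype=float)
    F = np.asarray(factors, dtype=float)
    T, N = Y.shape
    K = F.shape[1]

    X = np.column_stack([np.ones(T), F])
    coef, *_ = np.linalg.lstsq(X, Y, rcond=None)   # (K + 1) x N
    alpha = coef[0]                                # intercepts
    resid = Y - X @ coef
    sigma = resid.T @ resid / T                    # residual covariance (MLE)

    mu = F.mean(axis=0)
    Fc = F - mu
    omega = Fc.T @ Fc / T                          # factor covariance (MLE)

    quad_alpha = alpha @ np.linalg.solve(sigma, alpha)
    quad_mu = mu @ np.linalg.solve(omega, mu)
    stat = (T - N - K) / N * quad_alpha / (1.0 + quad_mu)
    return stat, alpha
```

Adding a candidate factor as an extra column of `factors` and recomputing the statistic mirrors the one-at-a-time augmentation described above: a lower value indicates that the augmented model leaves smaller joint pricing errors.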

#### 5. Conclusion

The “factor zoo” problem is a hot issue in empirical asset pricing. Given the short history and incomplete trading regulations of the Chinese A-share market, few studies have addressed this issue in China. Our study aims to fill this gap by testing the performance of pricing factors proposed in recent studies. Using A-share market data, we replicate a generation of pricing factors and construct a characteristic library. By explaining 104 anomalies and performing spanning regressions, we verify that the Liu–Stambaugh–Yuan four-factor model dominates its counterparts in the A-share market, including the Fama–French six-factor model, the Hou–Xue–Zhang q5-factor model, the Stambaugh–Yuan four-factor model, and the Daniel–Hirshleifer–Sun three-factor model. Next, to explore whether there exist latent factors with explanatory power beyond current factor models, we use a double-selection LASSO approach to estimate SDF loadings. Although we find that some anomalies play a significant role in pricing cross-sectional returns, the improvement is limited once we add them to the Liu–Stambaugh–Yuan four-factor model. We suggest that future research should not be restricted to the current factor zoo when proposing new factor models.

#### Data Availability

The trading and financial data were collected from the China Stock Market & Accounting Research Database (https://www.gtarsc.com/), from which we independently constructed the factors and anomalies in the A-share market. The data used to support the findings of the study are available from the corresponding author upon request.

#### Conflicts of Interest

The authors declare that they have no conflicts of interest.

#### Supplementary Materials

Details of the factor construction are provided in the Supplementary Appendix. *(Supplementary Materials)*