Abstract

Although socially responsible investment (SRI) has developed into an important investment style, only a small number of studies discuss SRI portfolio construction. In view of the overwhelming breakthrough of machine learning in prediction, this paper proposes SRI portfolio construction models by combining a double-screening mechanism considering machine learning prediction and an extended global minimum variance (GMV) model (or extended maximum Sharpe ratio (MSPR) model), which are, respectively, named double-screening socially responsible investment (DSSRI) portfolio models I and II. The proposed models consist of two stages, i.e., stock screening and asset allocation. First, this paper develops a novel double-screening mechanism incorporating environmental, social, and corporate governance (ESG) and return potential criteria to ensure that high-quality stocks with good ESG performance and high-return potential are input into the optimal portfolio. Specifically, to obtain accurate stock return predictions, an extreme learning machine model optimized by the genetic algorithm is employed to predict stock prices. Next, to trade off the financial and ESG objectives of SRI investors, an extended GMV model (or extended MSPR model) considering the ESG factor is introduced to determine the capital allocation proportion of the stocks. We take the A-share market of China as the sample to verify the effectiveness of the proposed models. The empirical results demonstrate that compared with alternative models, the proposed models can yield better annualized return and ESG score performance as well as competitive Sharpe ratio performance.

1. Introduction

Socially responsible investment (SRI) considers personal values and societal concerns in investment decisions [1, 2]. SRI investors consider investing as an extension of their lifestyle and thus are willing to incorporate their social beliefs and values into financial activities [3]. With the increasing salience of social and environmental issues and the gradual awakening of investors’ consciousness of environmental, social, and corporate governance (ESG) factors, SRI has presented rapid growth [4] and has become an important investment style. As of 2020, there were 3038 signatories in the global financial market agreeing to abide by the Principles for Responsible Investment supported by the United Nations, encompassing management assets of 103.4 trillion dollars. Compared with the levels in 2019, this number of signatories represented an increase of 20%, and the management asset volume exhibited an increase of 666 trillion dollars (see Principles for Responsible Investment at http://www.unpri.org).

In fact, SRI has attracted wide attention from academia over the past decades. Researchers have explored SRI related topics from various perspectives, including the motivation of SRI investors [5, 6], socially responsible measurement [7, 8], and the relationship between ESG rating and finance returns of the firm [9, 10]. For more references, see [11]. However, only a small number of studies focuses on SRI portfolios. Unlike the conventional portfolio construction, SRI portfolio construction requires a stock screening process to ensure that the stocks in the portfolio are in line with SRI investors’ social values. A few researchers have made valuable explorations of stock screening. For example, Auer [12] applied different screening methods to test the performance of SRI. The empirical results showed that compared with positive screening, which can damage investment performance, negative screenings with low cut-off rates are better. Kempf and Osthoff [13] compared the three SRI screening mechanisms of negative, positive, and best-in-class screening through an investment strategy of buying stocks with high social responsibility ratings and selling stocks with low social responsibility ratings. The best-in-class screening was proven to generally obtain the highest abnormal returns. Barnett and Salomon [14] evaluated the effect of the intensity of social screening on the financial performance of SRI funds. They concluded that the financial returns of the SRI funds decrease with increasing screening intensity, while a rebound in financial returns occurs once the screening intensity reaches a maximum. It is worth mentioning that the stock screening conditions in previous studies were based solely on ESG criteria. However, as discussed by Diouf et al. [15], MacKenzie and Lewis [16], and Rosen et al. [17], SRI investors are concerned with both ESG and financial performance at the same time. There is no conclusive evidence that stocks screened based on ESG criteria can deliver high returns [2, 1820]. Therefore, it is necessary to consider the return potential of candidate stocks in the screening process to ensure that the screened stocks meet both the ESG criteria and the return demand of SRI investors.

To screen the potential high-return stocks, reliable stock return prediction is critical. Machine learning methods with excellent learning capability have been widely used for stock return prediction. The representative machine learning models for stock prediction include support vector machine regression (SVR) [21, 22], artificial neural networks (ANNs) [23, 24], fuzzy logic system [25, 26], deep neural network (DNN) [27, 28], convolutional neural network [29, 30], long short-term memory model (LSTM) [3133], and ensemble learning [34, 35]. Recently, extreme learning machine (ELM), which is a novel machine learning model, has received attention due to its powerful performance. For instance, Lie et al. [36] exploited the high accuracy and quick learning advantages of ELM to predict the stock price and then generate trading signals. Dash and Dash [37] employed ELM to train a computational efficient functional link artificial neural network and obtained better prediction effects for stock index movements than those with the backpropagation (BP) algorithm. Wei et al. [38] effectively improved the accuracy of exchange rate forecasting using a hybrid model of kernel-based ELM. Jiang et al. [39] developed two hybrid models for stock price prediction that employed ELM as the predictor. The empirical results showed that the proposed models had high prediction accuracy. Although ELM is capable of achieving good prediction accuracy, its main drawback is the mechanism of randomly generating input weights and biases of the hidden layer, which may lead to inferior solutions and thus undermine its stability and performance [40, 41]. To overcome this drawback, an effective strategy is to introduce an intelligent algorithm to optimize the input weights and biases of the hidden layer of ELM [40, 42, 43]. The genetic algorithm (GA), a classic and excellent intelligent algorithm, has been utilized to optimize machine learning models, i.e., ELM [4446] and BP [47]. In particular, ELM optimized by the GA (GA-ELM) has been successfully applied in various prediction topics, such as emission [44], wind power [45], and gas concentration [46]. Therefore, this paper introduces the GA-ELM to predict stock prices and then screens stocks with high-return potential based on the prediction results.

Once the stock screening is completed, SRI investors next need to determine the optimal investment weight of screened stocks. The mean-variance (MV) model, as the basis of modern portfolio theory, is still a common choice for SRI portfolio optimization. On this topic, only a handful of scholars have done relevant research. For example, Oikonomou et al. [48] tried to explore optimal ways to build SRI portfolios. They applied six models (three simplistic and three sophisticated) to construct SRI portfolios and compared the performance of each. The results indicated that the SRI portfolios based on the sophisticated optimization models represented by MV had superior risk-return trade-offs relative to the simplistic models (e.g., those with naive diversification). Branch et al. [49] analysed the performance of quantitative methodologies in ESG portfolio construction. They concluded that quantitative methods are effective at minimizing risk but could lead to unwanted exposure. Although the aforementioned studies performed meaningful and in-depth analyses, they either did not take the ESG factor into account in the portfolio optimization process or took ESG as a constraint condition instead of an optimization objective, thus failing to cater to investors’ demand to maximize ESG. In [50], an extended MV was developed to construct an ESG portfolio, where minimizing the negative Sharpe ratio with the ESG score was taken as a novel variable in the objective function. The proposed model showed considerable superiority in ESG scores over the traditional MV model. However, this research lacked an effective screening process to guarantee that high-quality stocks with both high ESG and high-return potential were input into the SRI portfolio, leading to mediocre return performance. Similarly, several modified versions of MV models incorporating ESG objectives (see, for example, Utz et al. [51], Calvo et al. [52], Stephan et al. [53] and Bilbao et al. [54]) also fail to preselect high-quality stocks before SRI portfolio construction.

Based on the above analysis, this paper proposes two SRI portfolio construction models called double-screening socially responsible investment (DSSIR) portfolio models I and II (abbreviated as DSSIR-I and DSSIR-II), which combine a double-screening mechanism and portfolio optimization models. Specifically, the novel double-screening mechanism consists of ESG and return potential screening. First, in the ESG screening, stocks with inferior ESG scores are excluded from the investment universe to ensure that the screened stocks align with the social values of investors. Subsequently, in the return potential screening, to include stocks with high-return potential into the portfolio, the GA-ELM model is introduced to predict the price of the screened stock. Finally, the stocks with high ESG and high-return potential, as high-quality stocks, are input into the portfolio. After this stock screening, the portfolio optimization model is employed to weigh the screened high-quality stocks. In this work, to trade off the financial and ESG objectives of SRI investors, an extended global minimum variance (GMV) model (or extended maximum Sharpe ratio (MSPR) model) incorporating the ESG factor, which we call GMV-ESG (MSPR-ESG), is introduced as a portfolio optimization model. The main contributions of this paper are as follows:(1)Unlike the majority of studies on SRI portfolio construction, this paper incorporates stock prediction into the portfolio selection, thereby providing a new method for SRI portfolio construction.(2)This paper develops two novel SRI portfolio models called DSSRI-I and DSSRI-II, in which a double-screening mechanism is designed to ensure high-quality stocks (both high ESG and return potential) being the component of the SRI portfolio.(3)The proposed models are compared with alternative models, including SSRI-GMV (or SSRI-MSPR), DSSRI-I (ELM) (or DSSRI-II (ELM)), DSSRI-I (SVR) (or DSSRI-II (SVR)), DSSRI-I (DNN) (or DSSRI-II (DNN)), and DSSRI-I (LSTM) (or DSSRI-II (LSTM)). The comparisons demonstrate the feasibility and advantages of the proposed models.

The rest of this paper is as follows. In Section 2, we outline the related component models and introduce the proposed models. In Section 3, we describe datasets, parameter settings, and present and discuss the empirical results. In Section 4, we summarize the conclusions and prospects for future work.

2. Methodology

2.1. GA-ELM Model
2.1.1. Extreme Learning Machine

ELM, originally developed by Huang et al. [55], is an advanced algorithm for single-hidden layer feedforward neural network (SLFN), whose network topology consists of three layers, namely, the input layer, hidden layer, and output layer. The traditional algorithm for SLFN like BP algorithm determines the optimal weights through multiple iterations, while the ELM randomly generates input weights and biases of the hidden layer. The ELM can analytically calculate the output weights based on the minimum norm least squares solution. Therefore, compared with the BP algorithm, the ELM is capable of obtaining good generalized performance at an extremely fast speed. Given sample pairs , where , , and , the following equation can be obtained for the ELM owning neurons in the hidden layer [55]:where denotes the input weight, represents the output weight, is the activation function, and is the bias of the th neuron in the hidden layer. The matrix form of equation (1) is as follows [55]:where

is the hidden layer output matrix. Accordingly, the output weight of the ELM can be obtained using the minimum norm least squares solution [55] as follows:where represents the Moore–Penrose generalized inverse of . The ELM can be implemented in the following four steps [55, 56]: (1) determine the number of neurons in the hidden layer; (2) randomly generate the input weights and biases of the hidden layer, and ; (3) compute the hidden layer output matrix, ; and (4) estimate the output weight, .

2.1.2. Genetic Algorithm

The genetic algorithm is a classic intelligent algorithm first proposed by Holland in 1975 [57]. GA is regarded as a very efficient tool for optimization problems since it is computationally simple and powerful, as well as there are no restrictive assumptions about the solution space [58]. The core idea of the GA, inspired by the theory of ‘survival of the fittest’, is to imitate the biological evolution process of a population to obtain a globally optimal solution [59]. In solving a practical problem, the GA randomly generates a population of individuals (called chromosomes) as candidate solutions of a practical problem to evolve in successive generations, aiming to eliminate poor solutions within the population; thus, the remaining good solutions are further developed into better ones [60]. As a result, after finite iterations of evolution (until a predefined termination criterion is satisfied), an optimal solution from the population can be obtained. In the process of evolution, the chromosomes of each generation determine those in the next generation through three genetic operations, i.e., selection, crossover, and mutation.

(1) Selection Operation. This operation chooses chromosomes from the current population as parents to produce offspring input within the new population. A common selection technique is the route wheel method in which the probability of selection is proportionate to the fitness of each individual [61]. The fitness is the measure of the quality of the chromosome (solution), which is calculated by presetting the fitness function defined according to the practical problem. Given that is the fitness of the -th chromosome, the total fitness of the population of individuals can be expressed as follows [62]:

The accumulated probability of each chromosome is as follows [62]:

A random number is generated, if , individual is selected as the parent.

(2) Crossover Operation. A crossover operation that exchanges some parts of the two parents to produce a new pair of offspring is conducted to seek promising solutions from the search space [63]. Given a crossover probability , if generating a random number with a standard uniform distribution is less than , a locus is chosen between two neighbouring chromosomes, i.e., parents 1 and 2. The chromosomes after this locus are exchanged, thereby forming new offsprings 1 and 2 [64]. The crossover processing can be expressed as follows:

(3) Mutation Operation. The mutation operation ensures the diversity of individuals and prevents solutions from being too similar to avoid obtaining the local optimal solution [65]. Similar to the crossover operation, given a mutation probability , if a random number with a standard uniform distribution is generated and is less than , a locus is chosen randomly on the parent, and the value of the locus is reversed, thereby producing a new offspring [64]. The mutation process can be expressed as follows:

2.1.3. Optimization of the ELM Model Using the GA

Although the ELM has a fast learning speed and good generalization performance, the stability of the prediction results is weak because it randomly selects the input weights and the biases of the hidden neurons in training [40]. To improve the prediction performance of the ELM, the best input weights and the biases of the hidden layer should be determined to replace those from random selection. The most direct way to do so is by constructing a hybrid framework to combine an optimization algorithm and the ELM, thereby optimizing the input weights and the biases of the hidden layer randomly generated by the ELM [42]. Following previous studies [4446], considering the sound performance in global search, GA is utilized to optimize the input weights and the biases of the hidden layer to improve the performance of the ELM in stock prediction. A detailed procedure of the GA-ELM algorithm is given in Algorithm 1.

Input: initialize the chromosome populations
Output: the optimal input weights and biases of the hidden layer of the ELM
(1)Begin
(2)Set the parameters of the ELM
(3)Define the fitness function
(4)Evaluate the fitness of all chromosomes in population
(5) = the best chromosome
(6)
(7)while < maximum generation do
(8)fordo
(9)   Generate randomly in the interval
(10)  if< crossover probability then
(11)   Update the chromosome according to equation (7)
(12)  end if
(13)   Generate randomly in the interval
(14)  if< mutation probability then
(15)   Update the chromosome according to equation (8)
(16)  end if
(17)  Evaluate the chromosome fitness
(18)end for
(19) Update the chromosome populations according to equation (6)
(20) Rank the chromosome populations according to fitness
(21) = the best current chromosome
(22)
(23)end while
(24)Return the optimal input weights and biases of the hidden layer of the ELM according to
(25)End
2.2. Extended Portfolio Optimization Models

Markowitz’s mean-variance (MV) model, as a classic portfolio optimization method, has been popular in the practice of financial investment. Based on a set of different return or risk levels, MV can determine the efficient frontier, at which the portfolios can obtain the maximum return for the given risk level or achieve the minimum risk for the given return level. There are two commonly known portfolio models on the efficient frontier, i.e., the GMV model [66] and the MSPR model [50]. The former denotes the minimum risk portfolio of the efficient frontier, and the latter denotes portfolio that maximizes the Sharpe ratio [66]. This paper incorporates the ESG factor into the objective functions of the GMV and MSPR models to form the GMV-ESG and MSPR-ESG models, respectively. The GMV-ESG model can be defined as follows:where is the weight of stock , represents the covariance between stocks and , and is the standard deviation of the portfolio. Following the advice of Auer [12], considering that investors usually have only a vague understanding of ESG, a synthetic ESG score is used to quantify the ESG performance. and denote the expected ESG score of portfolio and stock , respectively. For the MSPR-ESG model, similarly to the Sharpe ratio, we trade-off ESG score maximization and risk minimization, thereby incorporating the ESG score under a unit of risk into the MSPR model. The MSPR-ESG model can be defined as follows:where and separately denote the expected return of stock and the portfolio, respectively, and denotes the risk-free return. In this study, the expected return , covariance , and ESG score are obtained by the historical data.

2.3. The Proposed Models

The input of high-quality stocks plays an important role in constructing an optimal portfolio [67]. Thus, it is necessary to exclude inferior stocks that cannot satisfy investor demands from the investment universe. For most SRI investors, high-quality stocks not only need to meet ESG expectations but also have high-return potential. In the proposed models, a novel double-screening mechanism considering ESG and return potential is implemented to guarantee that high-quality stocks are input into the optimal portfolio. To be specific, low-ESG stocks are first cleaned from the investment universe based on the ESG threshold given by the investor. Subsequently, the GA-ELM model is utilized to predict the price of the remaining stocks in the future period, thereby obtaining the corresponding return prediction. Finally, the predicted returns are sorted in a descending order, and stocks with top-ranked returns are regarded as the ultimate high-quality stocks obtained from double-screening.

Furthermore, as Oikonomou et al. [48] claimed, in addition to optimizing the investment universe through the screening process, SRI investors also carefully consider portfolio selection techniques. The conventional GMV and MSPR models cannot trade off the financial and ESG performance of SRI investors since they do not incorporate the ESG factor. Therefore, this paper includes the ESG score within the objective functions of the GMV and MSPR models to form extended portfolio optimization models, i.e., the GMV-ESG and MSPR-ESG models, to balance the ESG and financial objectives of SRI investors. In summary, the proposed models are divided into the two stages of stock screening and asset allocation, involving four steps. The flowchart of the proposed models is shown in Figure 1, and a more detailed description of the proposed models is given as follows:Step 1. Exclude low-ESG stocks. All sample stocks are ranked in descending order of ESG scores, i.e., , where denotes stock . Then, the ESG score threshold is set, and stocks with ESG scores lower than the threshold are excluded from to form a new stock pool .Step 2. Predict stock prices. Because SRIs are usually medium-to long-term investments, we predict stock prices over a relatively long interval period instead of the next. First, stock price data are divided into a training set and a testing set, which are transformed into supervised learning forms. Let denote the price of , denote the interval, and represent the time window size. Thus, the input matrix and output matrix of can be expressed as follows:We stack in one large feature metric V with dimension . Similarly, we can obtain one large target matrix O with dimension . Subsequently, the feature metric V and target matrix O are fed to the GA-ELM model for training. Then, the optimal GA-ELM model obtained is applied to predict the prices of all stocks for the next periods, i.e., .Step 3. Screen high-quality stocks. Converting predicted stock prices to predicted returns, . Then, the obtained predicted returns are sorted in descending order, and the top stocks are considered the ultimate high-quality stocks .Step 4. Compute the proportion of capital allocation. The GMV-ESG and MSPR-ESG models are employed to determine the proportion of capital allocated to the high-quality stocks , thereby obtaining the optimal SRI portfolios. For convenience in solving, a multiobjective model is often transformed into a single-objective model. Equations (9) and (10) can be expressed as follows:

3. Empirical Results and Analysis

3.1. Data Description

In this paper, we use the ESG score from Thomson Reuters ASSET 4, one of the most prominent professional ESG rating agencies. We obtain the monthly closing prices of stocks from January 1, 2010 to December 31, 2019 in Choice Database. At present, the Thomson Reuters ASSET 4 database reports the ESG scores of only a few Chinese listed companies. After excluding some companies whose ESG scores are much lower or whose stock price data are notably incomplete, we select 50 companies as the study samples, as shown in Table 1. The stock closing price time series is split into training and testing sets based on the yearly rolling window approach, thereby forming three study periods. Specifically, in the first study period, we apply the stock prices from 2010 to 2016 for the training set and from 2017 for the testing set. Then, we move to the next year in turn until the third study period, using the stock prices from 2012 to 2018 for the training set and the stock prices from 2019 for the testing set.

3.2. Parameter Settings

In our experiments, the interval is set to 3 and the time window size is set to 6; that is, the stock price from the first 6 months is used to predict the price 3 months ahead to achieve the quarterly prediction. The parameters of all relative models are presented in Table 2. The asset size of the portfolio is a critical factor to be considered by investors. Because inappropriate portfolio cardinality weakens the performance of the portfolio, it can either increase the fee or worsen the risk. Some studies provide us with valuable references on this subject; for example, Zulkifli et al. [68] argued that the inclusion of 15 stocks can diversify away the diversifiable risk. Tang [69] found that portfolios containing 20 stocks can eliminate 95% of the diversifiable risk. Considering the above conclusions and trading off the fee and risk, we set the portfolio cardinality C = 15, 16, 17, 18, 19, and 20. To guarantee the reliability of the prediction results, the ultimate prediction results of all models are obtained by computing the average value of 20 independent runs. The empirical study is implemented using Python 3.7 and TensorFlow 2.0 environments on a computer with 1.80 GHz CPU and 16.0 GB RAM.

3.3. Stock Double-Screening

In this section, double-screening, including ESG screening and return potential screening, is performed. First, considering the limited stock universe in this study, we censor the 20% of stocks with the lowest ESG scores to complete the ESG screening. Next, for the remaining stocks, we use the GA-ELM model to predict the stock prices, further screening out stocks based on their return potential. To demonstrate the performance of the GA-ELM in terms of stock screening, we compare it with four advanced machine learning models: ELM, SVR, DNN, and LSTM. For prediction evaluation, several statistical criteria, such as the mean square error, root mean square error, and mean absolute error, are commonly used to measure the error of the prediction model. However, considering the purpose of the quantitative investment, these statistical metrics may not truly meet the financial goals of the investor. As mentioned in [70], practitioners focus more on potential trading profitability than on the prediction accuracy of models. A straightforward fact is that the more high-return stocks a portfolio contains, the greater is its probability of yielding high returns. Therefore, in this paper, we screen high-return stocks to evaluate the prediction models. In other words, a good prediction model for quantitative investment can screen more high-return stocks than an inferior prediction model can. Obviously, there are two extreme cases: the best model can screen all high-return stocks, and the worst model cannot screen any.

The top stocks with the highest return among candidate stocks are regarded as high-return stocks. To verify the accuracy and generalizable performance of the GA-ELM model, we can give different values. Considering that the value of directly determines the asset size of the SRI portfolio constructed, we set N = 15, 16, 17, 18, 19, and 20. Table 3 shows the minimum, maximum, mean, and average accuracy (average accuracy = mean/N) of the screened number of high-return stocks from 2017Q1 to 2019Q4 using different models. We find that the GA-ELM outperforms the other models in nearly all cases in terms of minimum, maximum, mean, and average accuracy. Specifically, when the number of high-return stocks N = 15, 16, 17, 18, 19, and 20, the values of minimum, maximum, mean, and average accuracy obtained by the GA-ELM are almost uniformly the highest. For example, when N = 15, the GA-ELM can screen 6.25 high-return stocks on average, while the ELM screens 5.25. The average accuracy of the GA-ELM is 6.67% higher than that of the ELM. The comparison results imply that using GA to optimize ELM can effectively improve the performance of ELM. Furthermore, compared with conventional machine learning models, i.e., SVR, and deep learning methods, i.e., DNN and LSTM, the average number of high-return stocks obtained by GA-ELM separately increase by 0.58, 1.17, and 0.83, and the corresponding average accuracy also improves by 3.89%, 7.78%, and 5.56%. Accordingly, the superiority of the GA-ELM is further confirmed. When the number of high-return stocks equals other values, i.e., 16, 17, 18, 19, or 20, we obtain similar conclusions, which indicate that the GA-ELM model has good generalizability. Therefore, the GA-ELM is a promising tool for use in the practice of stock screening.

3.4. SRI Portfolio Construction

In this section, the screened high-quality stocks are input into the GMV-ESG and MSPR-ESG models to construct SRI portfolios. The proposed models are compared with several alternative models at different portfolio cardinalities along the dimensions of return (annualized return), risk-adjusted return (Sharpe ratio), and ESG (ESG score).

3.4.1. Outline of Alternative Models

To test the performance of the proposed models, two kinds of alternative models are introduced for comparison:

(1) Alternative Model I: SSRI-GMV (or SSRI-MSPR). These models represent conventional SRI strategies that do not consider the return potential of stocks in the screening stage and do not introduce the ESG factor into the portfolio optimization model. In these models, low-ESG stocks are first excluded using ESG screening. Next, to keep the portfolio cardinality consistent with that of the proposed models, the same number of stocks as in the proposed models are selected from the screened stocks. Then, the GMV model (or MSPR model) is utilized to construct the SRI portfolio.

(2) Alternative Model II: DSSRI-I (Based on Other Machine Learning Models) (or DSSRI-II (Based on Other Machine Learning Models)). The purpose of introducing these models is to prove that the screening results obtained by different machine learning prediction models can affect the performance of the SRI portfolio. Specifically, we first exclude some stocks with low-ESG scores from the investment universe. Subsequently, several advanced machine learning models, such as ELM, SVR, DNN, and LSTM, are employed separately to predict stock prices, thereby further screening stocks with high-return potential. Finally, as with the proposed models, the GMV-ESG model (or MSPR-ESG model) is utilized to construct the SRI portfolio.

3.4.2. Alternative Model I Comparison

(1) Comparison of the Proposed DSSRI-I and the SSRI-GMV. Table 4 reports the annualized return, Sharpe ratio, and ESG score of the proposed DSSRI-I and the SSRI-GMV at different portfolio cardinalities (C = 15, 16, 17, 18, 19, and 20). The performance comparison results of the two models are shown in Figure 2. It is obvious that the proposed DSSRI-I provides the highest values on the three evaluation metrics. If we take portfolio cardinality C = 15 as an example, the annualized return, Sharpe ratio, and ESG score of the SSRI-GMV are 0.0444, 0.3198, and 55.3456, respectively. Correspondingly, these metrics for the proposed DSSRI-I are 0.0982, 0.4930, and 61.9809, representing increases of 0.0538, 0.1733, and 6.6353. For other portfolio cardinalities (C = 16, 17, 18, 19, and 20), the proposed DSSRI-I still generate good performance on all evaluation metrics. Specifically, compared with the SSRI-GMV, the annualized return of the proposed DSSRI-I increases in the range of 0.0320–0.0428, the Sharpe ratio increases in the range of 0.0800–0.1510, and the ESG score increases in the range of 5.3184–6.3273. Furthermore, at the portfolio cardinality C = 17, the proposed DSSRI-I provides the highest annualized return and Sharpe ratio of 0.1113 and 0.5517, respectively. The highest ESG score is generated by the proposed model at 61.9809 at portfolio cardinality C = 15. In summary, the proposed DSSRI-I can yield better ESG and financial performance than the SSRI-GMV.

(2) Comparison of the Proposed DSSRI-II and the SSRI-MSPR. The annualized return, Sharpe ratio, and ESG score of the two models are shown in Table 5. Figure 3 compares the performance of the two models. Obviously, DSSRI-II has higher evaluation metrics than the SSRI-MSPR at different portfolio cardinalities (C = 15, 16, 17, 18, 19, and 20). For example, the annualized returns of the proposed DSSRI-II are 0.0882, 0.0780, 0.1057, 0.0856, 0.0905, and 0.0887 at portfolio cardinalities C = 15 to 20, representing increases of 63.02%, 67.76%, 94.95%, 82.01%, 62.13%, and 65.61% over those of the SSRI-MSPR. For the Sharpe ratio, the DSSRI-II performs similarly. Specifically, the best Sharpe ratio (0.5339) is obtained by the DSSRI-II with portfolio cardinality C = 17. In contrast, the best Sharpe ratio of the SSRI-MSPR is 0.3750. When the portfolio cardinality C = 15, the DSSRI-II generates the highest ESG score of 62.0703, which is 11.50% higher than the best score of the SSRI-MSPR, at 55.6684. In short, the proposed DSSRI-II outperforms the SSRI-MSPR in terms of annualized return, Sharpe ratio, and ESG score.

3.4.3. Alternative Model II Comparison

(1) Comparison of the Proposed DSSRI-I and the DSSRI-I (Based on Other Machine Learning Models). In this section, we contrast the proposed DSSRI-I with the DSSRI-I (ELM), DSSRI-I (SVR), DSSRI-I (DNN), and DSSRI-I (LSTM) to verify the superiority of return potential screening based on the GA-ELM. The annualized return, Sharpe ratio, and ESG score of all models at different portfolio cardinalities C = 15, 16, 17, 18, 19, and 20 are presented in Table 6. The comparison of the evaluation metrics for the different models is displayed in Figure 4. First, we compare the proposed DSSRI-I with the DSSRI-I (ELM). As shown in Table 6 and Figure 4, when the portfolio cardinality C = 15, the annualized return of the proposed DSSRI-I is 0.0708 higher than that of the DSSRI-I (ELM). The Sharpe ratio of the proposed DSSRI-I is 0.4930, which is 117.32% higher than that of the DSSRI-I (ELM). The highest ESG score (61.9809) is still obtained by the proposed DSSRI-I. For other portfolio cardinalities (C = 16, 17, 18, 19, and 20), similar conclusions are obtained. In general, the proposed model is superior to the DSSRI-I (ELM) in terms of annualized return, Sharpe ratio, and ESG score and thus proves again that GA optimization can improve the performance of the ELM. Subsequently, the proposed DSSRI-I is compared with SRI portfolio models based on double-screening considering other advanced machine learning models (SVR, DNN, and LSTM), i.e., DSSRI-I (SVR), DSSRI-I (DNN), and DSSRI-I (LSTM). From Table 6 and Figure 4, we can find that the highest annualized returns at almost all the given portfolio cardinality levels are obtained by the DSSRI-I. When the portfolio cardinalities are C = 17, 18, 19 and 20, the proposed DSSRI-I provides the highest Sharpe ratio at 0.5517, 0.4952, 0.5049, and 0.5222, respectively. The proposed DSSRI-I generates the second-highest Sharpe ratio with portfolio cardinalities C = 15 and 16, following the DSSRI-I (DNN). The DSSRI-I generates the highest ESG scores from portfolio cardinalities C = 15 to 20 with values of 61.9809, 61.8525, 61.9619, 61.5369, 61.2844, and 60.9385, respectively. In general, compared with other SRI portfolio models based on advanced machine learning models, the proposed DSSRI-I presents obvious advantages in annualized return and ESG performance and has a competitive Sharpe ratio performance. Therefore, the proposed DSSRI-I based on the GA-ELM is capable of improving the ESG and financial performance of SRI portfolios.

(2) Comparison of the Proposed DSSRI-II and the DSSRI-II (Based on Other Machine Learning Models). As shown in Table 7 and Figure 5, the proposed DSSRI-II performs well in terms of annualized return, Sharpe ratio, and ESG score. Specifically, for the annualized return, the proposed DSSRI-II obtains the highest values at almost all the given portfolio cardinalities. However, an exception is when the portfolio cardinality C = 16, in which case the DSSRI-II (DNN) yields the highest annualized return of 0.1024. The proposed DSSRI-II yields the second-highest annualized return with 0.0780. The proposed DSSRI-II generates the second-highest Sharpe ratio at portfolio cardinalities C = 15, 16, 18, and 19 and the highest Sharpe ratio at portfolio cardinalities C = 17 and 20. For the ESG score, with the portfolio cardinality C = 15 taken as an example, the ESG score of the proposed model is 62.0703, representing an improvement of 3.76%, 6.02%, 7.28%, and 4.87% over the DSSRI-II (ELM), DSSRI-II (SVR), DSSRI-II (DNN), and DSSRI-II (LSTM), respectively. At the other portfolio cardinality levels, the proposed DSSRI-II is consistently ahead of the comparison models by at least 2.30% on the ESG score. In summary, similarly to the DSSRI-I, the DSSRI-II can provide good annualized return and ESG performance as well as competitive Sharpe ratio performance in comparison with the performance of the DSSRI-II (based on other machine learning models).

4. Conclusion and Future Work

In this work, we propose two SRI portfolio construction models called DSSRI-I and DSSRI-II based on machine learning, intelligent algorithm, and portfolio theory. The models consist of two stages: stock screening and asset allocation. In the stock screening stage, a double-screening mechanism is developed to generate the input of the SRI portfolio. Specifically, low-ESG stocks are first excluded from the investment universe to guarantee that the assets are in line with the social values of SRI investors. Then, an ELM model with GA optimization is introduced to predict the prices of the remaining stocks and thus further screen high-return potential stocks. In the asset allocation stage, the extended GMV model (or extended MSPR model) considering the ESG factor is introduced to determine the investment proportion of the screened stocks from the first stage. To test the performance, we compared the proposed models with alternative models in terms of annualized return, Sharpe ratios, and ESG scores. The conclusions show the following: (1) compared with the conventional SSRI-GMV (or SSRI-MSPR), the proposed DSSRI-I (or DSSRI-II) performs better on both ESG and financial measures. (2) The proposed DSSRI-I (or DSSRI-II) can obtain a higher annualized return and ESG score than the alternative models based on advanced machine learning models (i.e., DSSRI-I (ELM) (or DSSRI-II (ELM)), DSSRI-I (SVR) (or DSSRI-II (SVR)), DSSRI-I (DNN) (or DSSRI-II (DNN)), and DSSRI-I (LSTM) (or DSSRI-II (LSTM)). (3) The proposed models can generate a competitive Sharpe ratio performance in comparison with that of the above models. Therefore, considering the superior performance of the proposed models, they can be regarded as effective tools to construct SRI portfolios.

In the future, our work may focus on the following three aspects. First, other stock markets in different political, economic, and social environments should be studied to test the generalizability of the proposed models. Second, the performance of the proposed models can be further analysed by introducing various intelligent algorithms to optimize the ELM. Finally, the proposed models can be extended to a general framework for portfolio construction based on the personal preference of investors.

Abbreviations

ANNs:Artificial neural networks
BP:Back propagation
DNN:Deep neural network
DSSRI:Double-screening socially responsible investment
ELM:Extreme learning machine
ESG:Environmental, social, and corporate governance
GA:Genetic algorithm
GMV:Global minimum variance
LSTM:Long short-term memory
MSRP:Maximum Sharpe ratio
MV:Mean-variance
SLFN:Single-hidden layer feedforward neural network
SRI:Socially responsible investment
SVR:Support vector regression.

Data Availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the Graduate Science and Technology Innovation Foundation of the Capital University of Economics and Business, China.