#### Abstract

We focus on the equity mutual funds offered by three Malaysian banks, namely Public Bank Berhad, CIMB, and Malayan Banking Berhad. The equity mutual funds or equity trust is grouped into four clusters based on their characteristics and categorized as inferior, stable, good performing, and aggressive funds based on their return rates, variance and treynor index. Based on the cluster analysis, the return rates and variance of clusters are represented as triangular fuzzy numbers in order to reflect the uncertainty of financial market. To find the optimal asset allocation in each cluster we develop a hybrid model of optimization and fuzzy based on return rates, variance. This was done by maximizing the fuzzy return for a tolerable fuzzy risk and minimizing the fuzzy risk for a desirable fuzzy return separately at different confidence levels.

#### 1. Introduction

The portfolio optimization is also known as a risk management, and how to obtain the optimal solution of portfolio allocation has atracted many researchers on portfolio decision-making in the recent years. Thus the objective for models, either the return model or the risk model, is to maximize the profit or to minimize the cost for portfolio selection based on mean-variance (MV) theorem that was proposed by Markowitz.

Nowadays, mutual funds have become an ideal form of investment for many people since they have the ability to separate risks to the smallest degree. Therefore, it is not surprising at all to have many banks offering equity mutual fund schemes since it is investors’ preference. However, selecting the best equity mutual fund is a difficult process. In the selection process, investors normally analyze the past performance of funds in order to evaluate the future performance. But sometimes, this method cannot be used for prediction of the future performance of funds due to the high volatility of market environment that is there is no assurance past trends will continue.

Thus Ammar and Khalifa [1] introduced the formulation of fuzzy portfolio optimization problem as a convex quadratic programming approach and gave an acceptable solution to such problems. They determined how much money should be allocated to each investment so that the total expected return would be greater than or equal to some lowest fuzzy return or the total fuzzy variance less than or equal to some greatest fuzzy variance.

In a recent study, by Chen and Huang [2], the uncertainty of future return rates and risk was considered and presented in triangular fuzzy numbers in contrast with the previous research whereby return rates were of trapezoidal form. They solved the optimal asset allocation by fuzzy optimization. This portfolio selection model was developed by using the Ammar and Khalifa [1] model and applied to Taiwan’s equity mutual fund.

The present paper therefore also focuses on equity mutual funds and proposes a basic portfolio selection model for three selected banks of Malaysia, where the future return rates and future risks are represented by also triangular fuzzy numbers. In this work we use the cluster analysis, and the equity funds are clustered into four groups and categorized based on the performance evaluation techniques by using rate of return, Treynor Index, and variance, respectively.

The aim of this study is also to help the investors by providing a more sensible basis for decision making since indefinite financial environment is taken into account. The uncertainty of future return rates and risks was considered and presented in triangular fuzzy numbers, where question of how much money to allocate for each investment asset is answered by employing fuzzy optimization in order to solve the optimal asset allocation; see [2] where the authors performed an empirical study to Taiwan Listed Equity Mutual Fund and analyzed the performance of equity mutual funds based on their rates of return, standard deviation, Treynor Index, and Turnover rate.

Thus in this study, we apply fuzzy optimization (see for details [3, 4]) to the equity mutual funds which are offered by three different Malaysian Banks, namely, Public Bank Berhad, CIMB, and Malayan Banking Berhad. Only equity funds from these banks were selected since they provide higher returns compared to equity funds that are offered by other institutions. The performance of these funds will be investigated and the ideal portion of money to be invested in order to obtain optimal return rates with bearable risk or minimum risk and further a desirable return will also be determined. Thus the objectives of this research can be summarized as follows.

(1)To investigate the performance of mutual funds in Malaysia by using cluster analysis and fuzzy method.(2)To determine the best allocation of money to equity mutual funds in order to gain optimal return rates with tolerable investment risk or minimize investment risk.In the progress of study we use the following terminologies.

*(1) Net Asset Value*

Net asset value is the current market value of all the fund’s assets, minus liabilities, divided by the outstanding number of shares; see [5].

*(2) Rate of Return*

Rate of return is calculated based on net asset value as follows
where, represents the number of funds, represents rate of return of portfolio (in percentage), represents net asset value of funds at current evaluation period, represents net asset value of funds at previous evaluation period.

*(3) Treynor Index*

The measure of a portfolio’s excess return per unit of risk calculated by
where, represents number of funds, represents Treynor’s portfolio performance measure of portfolio represents beta of portfolio over the evaluation period, represents average holding period returns on the portfolio over the evaluation period, represents average risk-free return over the evaluation period.

Note that Treynor index is similar to Sharpe’s ratio (see [6]) but it measures excess return of portfolio per unit of systematic risk whereas Sharpe’s ratio measures excess return of portfolio per unit of total risk.

*(4) Variance*

We have
where, = variance for portfolio represents rate of return of portfolio on the th day, represents average return rates of days,

According to Markowitz [7, 8] higher levels of return are accompanied by higher risk. He suggested the mean-variance methodology to solve the portfolio problem whereby the return is maximized for the same level of risk or risk is minimized for a given return. In this model, return is measured as mean and risk quantified as variance; however, complicated calculation is required in Markowitz model [7, 8].

#### 2. Methodology

The primary objective of this research is to optimize a portfolio of equity mutual funds. Therefore, we first have to choose the right funds based on its performance. Once, a desirable portfolio of equity mutual funds is selected, we decide how much money should be allocated to each investment. To simplify matters, this chapter is divided into two sections whereby the first section is devoted to portfolio selection and the second is about portfolio optimization.

##### 2.1. Research Scope

The data for this research consist of 38 equity funds both local and global, which are distributed and managed by CIMB, Public Mutual Berhad, and Malayan Banking Berhad. Although, the total number of equity funds offered by these banks are 58 in total, only 38 funds were selected for the purpose of this research. The other funds were eliminated due to incomplete data. In fact it might also be an open problem to study case for another 20 funds under incomplete data. Although, the time period of study is one year that is 365 days, the data available was for 216 days only minus weekends and public holidays because the counters were closed on these days. There were also few days where the reading was not taken due to certain problems. Other data’s regarding the risk-free rate were taken from the website Central Bank of Malaysia (Bank Negara Malaysia) http://www.bnm.gov.my/.

##### 2.2. Portfolio Selection

For the selection of portfolio of funds we use cluster analysis. Thus the objective of cluster analysis is to separate the characteristics of funds into groups to help investors in selection process which groups of equity funds to invest in. The cluster analysis is done in two parts. In the first part, funds are clustered using Ward’s method with correlation as similarity measure whereas in the second part, the number of clusters is determined.

##### 2.3. Performance Evaluation Techniques

Once the results of the cluster analysis obtained then each cluster is analysed based on the three performance evaluation techniques to categorize the cluster of funds with its performance level. For the evaluation techniques the rate of return, Treynor index, and variance were used. Actually, the Treynor index is only used to differentiate the evaluation of funds performance and was not adapted in the optimization model. Variance, on the other hand, was a measure of risk and higher risk might give higher returns.

##### 2.4. Portfolio Optimization

The objective of portfolio optimization is to determine the best allocation of money in the portfolio of mutual funds. This can be done by either maximizing the return for a given risk or minimizing risk for a given return. The rate of return and the variance were represented as fuzzy numbers in order to reflect the uncertainty at the evaluation stage. Further, the optimization method for fuzzy models is developed based on Ammar and Khalifa model [1]. In this study, given the past return of each cluster, we approximate values such as future expected return and future risk. As in Chen and Huang [2], the fuzzy return rates are denoted as a triangular fuzzy numbers, whose membership function is as follows:

The level confidence of in terms of interval values corresponding to the triangular fuzzy numbers as follows:

where and are the lower and upper bounds of the level confidence. Similarly, fuzzy variance is denoted by with the membership function:

The level confidence of in terms of interval values corresponding to the triangular fuzzy number is

where the lower and upper bounds of the -level confidence can also be obtained. The portfolio optimization problem can be developed in two ways by using (5) and (7). Firstly, to maximize the expected return subject to a given risk. is the crisp maximum of fuzzy variance and is the investment proportion in cluster

Secondly, to minimize future risk subject to an expected return. *L *is the crisp minimum of fuzzy return and is the investment proportion in cluster

Both of the problems could be solved by satisfying the Kuhn-Tucker conditions based on the lower and upper bounds, separately, at different a-level confidences. The optimal solutions at level will be in ; see [2]. In conclusion, investors can choose either one of these model based on their preferences to solve the optimization problem.

#### 3. Results and Discussion

This section is divided into three subsections based on the three methods that is cluster analysis, fuzzy method, and optimization. Each section is devoted to the gained results, their interpretation as well as discussion.

##### 3.1. Cluster Analysis

The cluster analysis is performed by using MINITAB 14.0 Software. The input needed for the cluster analysis is the daily rate of return for all the funds. The cluster analysis is done by following two stages. Firstly, Ward’s method is used to separate the funds into groups. Secondly, *K*-means of nonhierachical method is used to determine the number of clusters. The number of cluster is set to be four (*k* = 4) because dividing it into three clusters would make the funds with different characteristics forced into a group. Thus, there would be less diversity between cluster two and four. Therefore, the suitable number of clusters is four. The cluster analysis results can be represented in a tree diagram or dendogram which might help in interpreting the clustering process.

Next, we categorize cluster based on their performance characteristics, namely, inferior funds, stable funds, good performing funds, and aggressive funds. The rate of return and variance of each cluster represented by its midpoint. The category of clusters is displayed in Table 1.

By looking at Table 1it is observed that; The funds in Cluster 1 are categorized as stable since they have a rather standard return. It is the second lowest while Treynor Index is the highest whereas the variance the lowest which indicates a lower risk. Thus, we categorized Cluster 1 as stable.

Cluster 2 has high returns and low variance which shows low risk for a good return. Its Treynor index is the highest indicating that it is well diversified.

Cluster 3 has the highest return and highest variance which proves that higher return does come with higher risk. The Treynor index is in the second highest position which indicates that the systematic risk is small.

Cluster 4 has the lowest return but its variance is in the second lowest position instead of being the lowest since lowest return rate should have lowest risk. Thus, this cluster has inferior performance even though its Treynor index is higher than that of Cluster 1.

From the result of cluster analysis, investing in Cluster 2 and Cluster 3 would lead to higher profit compared to investing in the other clusters. Thus, investors are highly recommended to invest in these two clusters since expected return and the risk are two fundamental factors to consider. Thus the investors can select which funds to invest in from these clusters to form their portfolio.

##### 3.2. Fuzzy Method

Since, the financial market is highly volatile and thus, the return rate and risks of investing in cluster 2 and cluster 3 most probably would not be the same as calculated in the table. In order to reflect this uncertainty of financial market we use triangular fuzzy numbers instead of a crisp representation to represent the future return rates and risks. Hence, the spread of each side of the triangle fuzzy number is a set as the minimum and maximum value of each variable (return rate and risk). Their values are given in Table 2.

###### 3.2.1. Representation of Cluster 2 and Cluster 3 as Triangular Fuzzy Numbers

Return rates of Cluster 2 are denoted as a triangular fuzzy number, (0.035, 0.128, 0.191). The values of , and are taken from the Table 2 and are substituted into (4) and its membership function is as follows:

The -level confidence of in terms of interval values corresponding to the triangular fuzzy number, , is given by

This is found by substituting the values of in (5) where the lower and upper bounds of the -level confidence are obtained by substituting the values of .

Next, the variance of Cluster 2 is also represented as a triangular fuzzy number, = , , . The values of *l *= 0.708, *m* = 1.664 and *n *= 2.620 are taken from Table 2 and are substituted into (6) to obtain the following membership function:

The -level confidence of in terms of interval values corresponding to the triangular fuzzy number, = is given by

This is found by substituting the values of in (7) where the lower and upper bounds of the -level confidence are obtained by substituting the values of . Similarly, the rate of return and variance of Cluster 3 are represented as triangular fuzzy numbers.

##### 3.3. Optimization Model

The optimization problem is divided into two parts as follows.

###### 3.3.1. Maximize Fuzzy Return Subject to a Given Fuzzy Variance

This is done to find the optimum investment proportion in Cluster 2 and Cluster 3 in order to gain the maximum fuzzy return for a given upper bound of fuzzy variance that an investor can tolerate. The optimization problem is developed as follows by using (11), (13), and and for cluster 3 where and for cluster 3 can be found by using (4), (5), (6) and (7) which is similar to that in Section 3.2.1:

By using (14), the problem is solved by applying the upper and lower bounds separately at different -level confidence that is, , 0.25, 0.5, 0.75 and 1.0. For example for solving the problem with . If *M* is the upper bound of variance that can be tolerated

The optimal solutions for different -level confidence can be obtained, similarly using the Lingo 11.0 Software. The results obtained areas shown in Table 3.

In Table 3, “NFS” means there is no solutions for the corresponding conditions. For *M* greater than a certain variance for all the -levels, the optimal solutions remain the same, that is all investments are made in Cluster 3 only. For example, shows that when *M* equals to or greater than 44.915, the return interval and investment proportions are the same. For , the fuzzy return interval is the same regardless of the risk because of the different proportions of investment in Clusters 2 and 3. This shows that we have managed to maintain the same level of return interval by making different investment proportions and hence succeeded in reducing the risk as well. Thus, if investors want to maximize fuzzy return with , they should invest in the proportion with the lowest risk since the return interval is the same.

On the other hand if investors want to maximize fuzzy return with , 0.5, 0.75, and 1.0, they should choose the optimal investment proportion according to the return intervals they desire. However we note that the desirable return is accompanied with a fuzzy risk that should be tolerated with. Therefore, if investors want to desire a certain fuzzy return but they cannot tolerate its risk, then they should opt for return with tolerable risk and invest according to its optimal proportion. Besides that, the investment proportion in Cluster 3 is generally higher than that of Cluster 2 because it has higher return. Thus it is also expected to invest more in Cluster 3 in order to to gain higher return. It is also observed that as -level increases, the return intervals becomes smaller.

###### 3.3.2. Minimize Fuzzy Risk Subject to a Given Fuzzy Return

We have

This optimization problem is solved at different - level confidences and the results obtained are shown in Table 4.

In Table 4, “NFS” means that there is no feasible solution corresponding to the fuzzy return rates constraint. The results at are not displayed although, there are feasible solutions because it is not significant. Thus, in this case, investors are not advised to choose the investment proportions at . It is observed that as -level increases, the variance intervals become smaller. As the variance becomes smaller (risk decreases), the investment proportion is higher in Cluster 2. The simple reason for that, the return of Cluster 2 is lower than Cluster 3, hence, its variance is also lower. Thus, if investors want to reduce risk, then they should invest mostly in Cluster 2.

#### 4. Conclusion and Suggestions

In this paper, we used cluster analysis to aid investor in the portfolio selection. By using cluster analysis, we were able to categorize the funds in four groups according to their characteristics. This method is the fastest and a good way to categorize funds because it is not possible to classify a fund’s performance unless we have compared it with the performance of all the other funds in the market. From the results of cluster analysis, it is observed that performance of funds in Cluster 2 and Cluster 3 dominates the other clusters. Thus, only the return rates and variance of these clusters are incorporated into the optimization model.

To reflect the uncertainty of financial environment, we denoted the return rates and risk in triangular fuzzy number instead of crisp representation. The optimal investment proportion is found using a hybrid of fuzzy and optimization. The solutions are found using the Lingo 11.0 Software. We used few different -cuts to demonstrate that the interval of variance and risks decreases as the value -cuts increases.

There is only a crisp figure for rates of return, risk, and investment proportion at because the triangular fuzzy has reached its peak, thus only a crisp number will be obtained and not intervals. In general, the proportion of investment is higher in Cluster 3 compared to Cluster 2 for investors who pay more importance to return. On the other hand, investors who pay more attention to risk will invest more in Cluster 2. In this study, we found the optimal investment proportions by solving the optimization problem separately for the lower and upper bound.

#### Acknowledgment

The authors would like to express their sincere thanks to the referees for their very constructive comments and suggestions.