Abstract

Rubberized concrete (RC) has attracted more attention these years as it is an economical and environmental-friendly construction material. Normally, the uniaxial compressive strength (UCS) of RC needs to be evaluated before application. In this study, an evolutionary random forest model (BRF) combining random forest (RF) and beetle antennae search (BAS) algorithms was proposed, which can be used for establishing the relationship between UCS of RC and its key variables. A total number of 138 cases were collected from the literature to develop and validate the BRF model. The results showed that the BAS can tune the RF effectively, and therefore, the hyperparameters of RF were obtained. The proposed BRF model can accurately predict the UCS of RC with a high correlation coefficient (0.96). Furthermore, the variable importance was determined, and the results showed that the age of RC is the most significant variable, followed by water-cement ratio, fine rubber aggregate, coarse rubber aggregate, and coarse aggregate. This study provides a new method to access the strength of RC and can efficiently guide the design of RC in practice.

1. Introduction

Concrete has been the most widely used construction material in civil engineering, and its demand still increases quickly due to the rapid growth in urbanization and industrialization. Reducing costs and maximizing the strength and durability of concrete are quite challenging issues [1]. Hence, recycled aggregate concrete (RAC) containing some recycling materials, such as plastic materials [2, 3], construction and demolition (C&D) wastes [4, 5], and waste tire rubber [6], has been a hot research area. Given the rapid growth of tire rubber waste and its harmful effect on the environment, using rubber as a substitute in concrete not only contributes to economic growth but also benefits the environment [7]. Considering its ductility and strain behavior, rubber is normally utilized as fine aggregates (FAs) and coarse aggregates (CAs) in concrete, and therefore, the new cementitious composites, namely, rubberized concrete (RC), can be applied in civil engineering.

RC has some special advantages such as reducing CO2 emissions and decreasing construction costs, and therefore, it attracts many concerns of scholars recently [811]. To evaluate the applicability of RC, some mechanical properties including compressive strength and elastic modulus have been studied [1215]. Moreover, the uniaxial compressive strength (UCS) is the key indicator that has been commonly used for assessing the strength property of RC. Normally, with the increase in the content of rubber in RC, the compressive strength decreases. Some models considering the laboratory tests or compiled databases of previous studies have been proposed to predict the UCS of RC [1618]. However, due to the limitations of input parameters and the small number of obtained results, these models are not generalizable and not convenient for application. Consequently, systematical investigations are necessary to evaluate the compressive strength of RC by more economic and efficient techniques as per a compressive database including various input parameters.

Nowadays, machine learning methods are considerably used for the prediction of the mechanical properties of cementitious materials [1921]. The estimation of the mechanical strength of RAC has also been considered. For example, the multilinear and nonlinear regression models were proposed and used for evaluating the UCS of RAC [22]. The artificial neural network (ANN) methods were applied to assess the strength of RAC [2325]. Genetic programming (GA) methods were also introduced for nonlinear regression in predicting mechanical properties of RAC [26, 27]. However, to the authors’ knowledge, there were no reports of the prediction of RC by artificial intelligence approaches. In addition, though the mentioned machine learning algorithms (ANN, GA, etc.) were used for prediction in concrete, they still had some limitations such as lower efficiency, time-consuming, and indefinite structures. Therefore, more robust and simple machine learning models need to be proposed and utilized in predicting the compressive strength of RC. Nowadays, the random forest (RF) approach has been employed in predicting the mechanical parameters of concrete due to its excellent performance on nonlinear regression and classification [28, 29]. However, no relative studies to date used RF to predict the strength of RC. It should be pointed out that some hyperparameters still need to be optimized to reach its best predictive ability. In this paper, a high-efficiency global optimization algorithm called the beetle antennae search (BAS) algorithm was applied to obtain the best parameters of RF [30].

Therefore, a robust machine learning technique (evolved random forest, namely, BRF) was proposed for the evaluation of the UCS of RC. Several contributions to the literature are as follows:(1)The random forest (RF) and beetle antennae search (BAS) algorithms were combined to form the BRF model(2)The strength of rubberized concrete (RC) was, for the first time, predicted and analyzed considering 9 key influencing variables(3)The variable importance that affected the UCS of RC was first revealed

The proposed method can be a fast tool for estimating the strength of RC and efficiently guide the design and application of RC in practice.

2. Materials and Methods

2.1. Model of Evolved Random Forest (BRF)

As mentioned above, the BRF is the combination of RF and BAS, in which the RF was applied to obtain the nonlinear relationship in datasets, while the BAS was used for hyperparameter tuning of RF. In this part, the RF and BAS were introduced as follows.

2.1.1. Random Forest (RF)

Random forest is a classification algorithm that employs an ensemble of classification trees, each of which is established by applying a bootstrap sample of the data [31]. For tree building, the variables are selected randomly as the candidate set of variables at each split. The other way is to use bagging which can combine unstable learners successfully. The random forest has outstanding performance in classification tasks such as strong robustness in terms of large feature sets, incorporation of interactions among predictor variables, and high quality and free implementations [32]. The diagram of the structure of RF is shown in Figure 1. This method has been widely used for dealing with the questions of classification and regression in civil engineering.

2.1.2. Beetle Antennae Search (BAS)

BAS algorithm is proposed recently, which can be efficiently used for optimizing the global problems and has been used for selecting the optimum parameters of algorithms such as BPNN and SVM [33, 34]. It simulates the behavior of beetles that utilize two antennae to explore nearby areas randomly and turn to a higher concentration of odor. The flowchart of beetle antennae search algorithms is depicted in Figure 2.

2.2. Performance Validation and Evaluation Methods

In this paper, the proposed model was trained and validated on the 70% dataset and tested on the other 30% dataset [21]. During the training and testing processes, all data were split randomly. In addition, the predictive effect on the dataset was assessed by the correlation coefficient (R) and root-mean-square error (RMSE), which were widely used in the previous literature. The relevant equations are as follows:where N denotes the numbers in the collected dataset; and yi are the expected values and real values, respectively; and and indicate the mean predicted values and mean actual values, respectively.

Furthermore, to minimalize the bias, a 10-fold cross-validation method was introduced [35]. Specifically, the samples in the training dataset were divided into 10 subsets. Then one of the 10 subsets was selected for validating the overall performance of the proposed model, while the other 9 subsets were applied to train. This process would be repeated for 10 times.

2.3. Procedures of Hyperparameter Tuning

To obtain the best structure of RF, the hyperparameter tuning is necessary. In this paper, two key parameters (i.e., the number of the trees (tree_num) and the minimum required samples at a leaf node (min_sample_leaf) in RF) were tuned by BAS. By the described 10-folder cross-validation method, the 9 subsets as training sets were used for searching the ideal hyperparameters of RF by BAS for 50 times. The smallest RMSE in terms of the validation set can be chosen after 50 iterations, which represents the best RF model in this fold. Therefore, the best RF model and its corresponding optimum hyperparameters (tree_num and min_sample_leaf) were chosen after 10-folds. The performance of the RF model should be verified by evaluating the test set due to the possibility of overfitting problems. The flowchart of hyperparameter tuning of RF by BAS in training and testing is shown in Figure 3.

2.4. Dataset Description

The dataset of RC was collected from the literature [13, 15, 18, 3638], which was used for establishing and validating the BRF model for strength prediction. A total number of 138 valid samples with 9 key influencing variables were assembled in this study. Generally, depending on the different sizes of rubber, the crumb rubber is used for replacing the fine aggregate (FA) in concrete, while the tire chips are used for replacing coarse aggregate (CA). The compressive strength normally decreased with different rates by replacing various contents of rubber and different rubber types. Therefore, it is necessary to distinguish the rubber into two types in RC, i.e., fine rubber aggregate (FRA) and coarse rubber aggregate (CRA). Moreover, the influencing variables and their description in statistics are given in Table 1. The main objectives are to predict the UCS of RC that is determined by its influencing variables. The relative importance of variables is to be further analyzed. To improve the efficiency of the model, the collected data were normalized into [0, 1]. According to the percentage split of the dataset, 97 samples were randomly chosen as the training set and the remaining 41 samples were set as the test set.

3. Results and Discussion

3.1. Results of Hyperparameter Tuning

In this study, according to the RMSE obtained from 10-fold cross-validation, the hyperparameters were tuned on the testing set. The RMSE versus iterations during BAS tuning is shown Figure 4. As can be seen, the RMSE decreased considerably, revealing that the BAS can tune the RF effectively. Then, the RMSE became stable at 15 iterations. Here, only 20 iterations were shown. The final hyperparameters of RF are given in Table 2.

3.2. Assessment of the Established Model

The comparison of the predicted UCS of RC by the BRF model and actual UCS of RC in datasets is depicted in Figure 5. As can be seen, the predicted UCS values of RC were rather close to the actual UCS, indicating that the BRF can establish the nonlinear relationship between UCS of RC and input variables successfully, and therefore, the model can predict the strength accurately. In addition, the high R values for the training set and test set were 0.985 and 0.959, respectively. The low RMSE values of 2.24 in the training set and 3.90 in the testing set were observed. Overall, the above results showed that there is no underfitting or overfitting phenomena by the proposed BRF model.

3.3. Variable Importance of RC

Furthermore, the relative importance of the input variables is shown in Figure 6. As can be seen, the age of RC was the most important variable with an important score of 1.42, and this result was consistent with the strength development of cementitious materials reported in the previous studies [12, 39]. The water-cement ratio also played a crucial role in the strength of RC, and the superposition effect of water and cement in this study was similar to the age. This agrees well with some studies that the water-cement ratio affects the strength considerably [40, 41]. As can be seen, the FA ranked third with an importance score of 1.23, which was more sensitive than CA (importance score of 0.49) in RC. Correspondingly, the FRA had a relatively larger influence on the UCS of RC than CRA. It should be pointed out that both the FRA and CRA affect the strength of RC obviously, and therefore, more attention should be paid when adding rubber materials to RC in practice. The admixture of SP and SCMs had the least influence on the strength of RC with the importance score of 0.27 and 0.24, respectively. The obtained results can guide the design of RC effectively and select the proper parameters for optimizing RC. The results can guide the accurate design of RC and boost the application of RC.

4. Conclusions

This paper presented an evolved random forest algorithm, namely, BRF, for evaluating the UCS of RC. Based on the 138 samples collected from the previous literature and 9 key influencing variables, the compressive strength of RC can be determined by the independent variables by BRF. The hyperparameters of RF were tuned by using the BAS algorithm and validated by 10-fold cross-validation. In addition, the performance of optimized BRF was examined by R and RMSE. The variable importance was first revealed and discussed. The main results are as follows: BAS can efficiently tune the hyperparameters of RF and can be used in evolved RF to establish the BRF prediction model; the proposed BRF model can accurately predict the strength of RC, which can guide the design of RC; on the testing set, R and RMSE were 0.96 and 3.9, respectively, meaning that the proposed BRF model has a good prediction on the collected RC data; the age of RC is the most significant variable for the strength, followed by the cement ratio, FRA, CRA, and CA; the SP and SCMs have the least influence on the strength of RC.

It should be pointed out that the results were limited by the amount of the samples. If the larger dataset was obtained, the more accurate results would be derived. In the future, we would collect more samples and design a bigger dataset for analysis by machine learning methods, which can significantly promote the application of RC in the field.

Data Availability

The Microsoft Excel Worksheet data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest.

Acknowledgments

This study was financially supported by the National Key Research and Development Program (Grant no. 2016YFC0600901), National Natural Science Foundation of China (Grant nos. 51574224 and 51704277), and Natural Science Fundamental Research Program Enterprise United Fund of Shaanxi Province of China (Grant no. 2019JLZ-04). The authors are grateful to Huaibei Mining (Group) Co. Ltd. Special thanks are due to Dr Zuqi Wang for her encouragement and help.