Research Article  Open Access
Estimating Concrete Workability Based on Slump Test with Least Squares Support Vector Regression
Abstract
Concrete workability, quantified by concrete slump, is an important property of a concrete mixture. Concrete slump is generally known to affect the consistency, flowability, pumpability, compactibility, and harshness of a concrete mix. Hence, an accurate prediction of this property is a practical need of construction engineers. This research proposes a machine learning model for predicting concrete slump based on the Least Squares Support Vector Regression (LSSVR). LSSVR is employed to model the nonlinear mapping between the mix components and slump values. Since the learning process of the LSSVR necessitates two hyperparameters, the regularization and the kernel parameters, the grid search method is employed search for the most desirable set of hyperparameters. Furthermore, to construct the hybrid model, this research collected a dataset including actual concrete slump tests from a hydroelectric dam construction project in Vietnam. Experimental results show that the proposed model is capable of predicting concrete slump accurately.
1. Introduction
Concrete workability is defined as the effort required to manipulate a freshly mixed quantity of concrete with minimum loss of homogeneity [1]. This property of concrete is generally known to affect the consistency, flowability, pumpability, compactibility, and harshness of a concrete mix. Thus, concrete workability is a very crucial factor that must be considered in order to produce high quality concrete [2–4].
The slump test is the most common method for assessing the flow properties of fresh concrete; the slump provides a measure of workability [5]. Using this test, the slump can be derived by measuring the drop from the top of the slumped fresh concrete. In the task of concrete mixture design, the prediction of concrete flowability is critical for onsite construction. As the complexity of concrete construction escalates, there is an increasing pressure on material engineers to achieve high workability as well as to maintain the necessary mechanical properties to meet design specifications.
Concrete has been increasingly utilized in highrise building and infrastructure development projects and special ingredients are often employed to make the material satisfy a specific set of performance requirements [6]. Superplasticizers are often included to enhance the concrete workability [7–9]. This situation makes the concrete mixes to be highly complex materials and modeling their properties becomes a very challenging task. There are complex and nonlinear relationships between the characteristics and the components that constitute the concrete mixes [8, 10, 11].
Due to the importance of the research topic, various studies have been dedicated to concrete slump prediction. Traditional statistical models and machine learning are prevailing approaches to tackle the problem at hand. Öztaş et al. [2], Yeh [1, 3], Chine et al. [12], and Bilgil [13] employed the regression analysis and Artificial Neural Network (ANN) models to estimate concrete slump; the common finding is that ANN is an effective nonlinear modeling method and its results are more accurate than the models based on the traditional regression analysis approach.
Baykasoğlu et al. [14] utilized the gene expression programming (GEP) to model highstrength concrete slump. Chen et al. [15] constructed a parallel hypercubic GEP to forecast the slump of highperformance concrete; this research showed that the improved method is better than the GEP and similar to the performance of ANN. Chandwani et al. [16] proposed a Genetic Algorithm assisted ANN; the study showed that the integrated approach can enhance the convergence speed of ANN and its prediction accuracy.
Due to the popularity of concrete in the construction industry, better alternatives for concrete slump prediction are of practical need for construction engineers in concrete mix design. This research contributes to the body of knowledge by proposing a new approach for improving the accuracy of concrete slump prediction which is based on the Least Squares Support Vector Regression (LSSVR). LSSVR is an advanced machine learning method which is designed for nonlinear modeling [17]; the superiority of the approach has been illustrated in recent applications [18–22].
Furthermore, a dataset that contains slump test records, collected from a hydroelectric dam construction project in central Vietnam, is used to establish and verify the proposed approach. The rest of the article is organized as follows: the second section presents the research method. The proposed slump prediction model is described in the third section. The next section reports the experimental results. The conclusion of this study is stated in the final section.
2. Research Method
2.1. The Concrete Slump Test Dataset
This research recorded testing results of 95 concrete mixes during the construction progress of the Song Bung 2 hydroelectric dam construction project in central Vietnam (http://www.sb2.vn/). The test is in conformity with the Vietnamese standard (TCVN3106) for slump test which is equivalent to the ASTMC143. The equipment for the slump test includes a hollow frustum of a cone and a ruler as the measuring device (see Figure 1). The height of the cone is 30 cm. The diameter of the top and bottom of the cone is 10 cm and 20 cm, respectively. The cone is filled with fresh concrete and then lifted vertically. The height difference between the concrete and the cone is the slump value.
In this study, the concrete slump conditioning factors are selected based on reviewing previous works [1, 12, 16, 23] on slump flow modeling and the availability of measuring equipment. The amounts of cement (kg/m^{3}), natural sand (kg/m^{3}), crushed sand (kg/m^{3}), coarse aggregate (kg/m^{3}), water (liter/m^{3}), and superplasticizer (liter/m^{3}) are mix ingredients. For each mix design, the slump value obtained from the actual slump test experiment is recorded. Statistical descriptions of all specimens are shown in Table 1. The whole dataset is partially described in Table 2. It is noted that the amounts of cement (), natural sand (), crushed sand (), coarse aggregate (), water (), and superplasticizer () are used as input factors to predict the outputs which are the concrete slump ().


2.2. Least Squares Support Vector Regression (LSSVR)
LSSVR, proposed by Suykens et al. [17], is an advanced machine learning algorithm which is constructed on the principal of structural risk minimization. This approach has been proved to be very efficient in nonlinear modeling. Notably, the learning process of the LSSVR is very fast since it only requires solving a set of linear equations.
To construct the prediction model, it is needed to prepare a dataset of slump test record in the form: , Herein, denotes the th data sample and is the total number of data samples. It is noted that is a vector with six elements; , , , , , and denote the amount of cement, natural sand, crushed sand, coarse aggregate, water, and superplasticizer, respectively. Meanwhile, is the output of concrete slump of the th data sample.
We aim to establish a mapping function that derives the output of concrete slump based on the input vector that describes the concrete mix components. Since the functional mapping between concrete mix components () and slump value () is possibly nonlinear, LSSVR first maps the data from the original input space to a highdimensional feature space via a mapping function . Accordingly, linear regression analysis can be possibly performed in such highdimensional feature space. The operation of LSSVR in concrete slump modeling is illustrated in Figure 2.
In the training phase of LSSVR, the learning objective can be formulated as the following optimization problem [17, 24]: where are error variables; denotes a regularization constant.
In order to solve the above optimization problem, the Lagrangian function is formulated as [17]where are Lagrange multipliers.
The Karush–Kuhn–Tucker conditions for optimality are used by differentiating the Lagrangian function with the variables as follows [17]:
By solving linear system (3), the resulting LSSVR model is expressed as follows [17, 18]:where and are the solution to the linear system. and are the index and the total number of data points in the training set. and denote an input pattern in the training and testing set. It is worth reminding that and are both input vectors of concrete mix components with six elements. is the kernel function which maps the input data from the feature space into the highdimensional space. The radial basis kernel function is often employed [17, 19]:where represents the radial basis kernel function parameter.
3. The Proposed Model for Concrete Slump Prediction
This section of the article describes the concrete slump prediction using LSSVR (CSPLSSVR). The prediction model relies on LSSVR to discover the nonlinear mapping relationship between the concrete components and the slump. The flowchart of the CSPLSSVR is demonstrated in Figure 3.
Given the input data of concrete mix ingredients (the amounts of cement, natural sand, crushed sand, coarse aggregate, water, and superplasticizer), the first step of the model is to carry out the data normalization process within which the whole data is normalized into a (0, 1) range. This process can help prevent the circumstance in which inputs with greater magnitudes dominate those with smaller magnitudes. The function used for normalizing data is provided as follows:where is the normalized data. is the original data. and denote the maximum and minimum values of the data, respectively.
The dataset, featuring six input factors and the output variable of concrete slump, is then randomly divided into a training set and a testing set. The training dataset is employed to establish the LSSVR model. Since the LSSVR with radial basis kernel function is employed, the learning process requires hyperparameters, the regularization parameter and the kernel parameter , and the grid search method [17, 25] is employed search for the most desirable set of hyperparameters.
In the grid search for tuning parameters, various pairs of ( and ) are tried and the one with the best fivefold crossvalidation accuracy is chosen. Using exponential growing sequences of () and () is a common way to identify good parameters. The grid search approach is straightforward and easy to implement. After the hyperparameters have been determined appropriately and the training process is finished, the proposed CSPLSSVR can be used to predict the slump flow values of new concrete samples.
4. Experimental Results
When the training process finishes, the slump of concrete mix in the testing cases can be predicted by providing mixture components for the trained model. In the experiments, besides the proposed CSPLSSVR, the Artificial Neural Network (ANN) and the multiple linear regression (MLR) are utilized as benchmark methods. In order to measure model performance, this research employs Root Mean Squared Error (RMSE), Mean Absolute Percentage Error (MAPE), and Coefficient of Determination ().
The motivation for using these benchmark approaches is that the ANN is an effective tool for nonlinear modeling and has been successfully employed for predicting concrete slump [3, 12, 23]. The MLR model is a basic statistical predictive method and comparing its result with other machine learning models may reveal useful insights [26].
To construct an ANN, the user needs to specify the network structure and the learning rate. Such parameters of the ANN model are usually selected via a trialanderror process [26]. Based on experiments, the network configuration is set as follows: the number of hidden layers is set to be 1; the learning rate is 0.001; the number of neurons in the hidden layer is set to be 6. The LevenbergMarquardt algorithm [27] is employed to train the ANN model.
In the first experiment, the dataset is randomly divided into 2 sets: the training set that occupies 80% of the dataset and the testing set that includes 20% of the dataset. In detail, the training and testing sets consist of 76 and 19 mixes, respectively. The training and testing results of the CSPLSSVR are illustrated in Figures 4 and 5, respectively.
The MLR model for predicting concrete slump based on the collected dataset is established via the Least Squares Estimation method [28] and shown as follows:where the symbols of , and represent the amount of cement, natural sand, crushed sand, coarse aggregate, water, and superplasticizer within the concrete mix, respectively.
The ANN model structure, which contains the input, hidden, and output layers, is illustrated in Figure 6. It is noted that and are the weight matrices of the hidden layer and the output layer, respectively; denotes the number of neurons in the hidden layer; represents the bias vector of the hidden layer; denotes the bias vector of the output layer; is the output of the th neuron in the hidden layer; is the tansigmoid activation function which is commonly used in the hidden layer [29, 30]:where denotes an input for the function.
It is noted that the weight matrices ( and ) and the bias vectors ( and ) of the ANN model for concrete slump estimation are learnt via a training process with the error backpropagation algorithm [31]. After the training phase, results of the ANN parameters are shown as follows:
Table 3 provides the result comparison between the proposed method and other benchmark models. The result of the MLR in the testing process is very poor (RMSE = 0.28, MAPE = 12.08%, = 0.28); this indicates that the linear model is insufficient to explain the behavior of concrete slump.

The ANN and CSPLSSVR models achieve much better performances; both models have the values which are greater than 0.8. According to Smith [32], such high values of imply strong correlations between the predicted and measured concrete slumps. Furthermore, the CSPLSSVR has achieved the lowest prediction error (MAPE = 3.68% and RMSE = 0.54). Thus, benchmarked with the ANN, the new method has attained 38% and 49% reductions in terms of MAPE and RMSE.
Moreover, to avoid the randomness in selecting testing samples, the second experiment carries out a 10fold crossvalidation process. Using the crossvalidation process, the whole dataset is randomly divided into 10 data folds in which each fold in turn serves as a testing set; and the performance of the model can be assessed by averaging results of the 10 folds. Because all of the subsamples are mutually exclusive, this experiment can evaluate the CSPLSSVR more accurately.
Table 4 summarizes the result of the crossvalidation process. Observably, the proposed approach has attained the lowest prediction error in both training and testing processes. The average RMSE and MAPE for testing data of the CSPLSSVR are 0.50 and 2.81%, respectively. These prediction errors are significantly lower than the ANN (RMSE = 0.62 and 4.44%) and the MLR (RMSE = 1.36 and 10.64%). The proposed approach also yields the highest (0.90) when predicting the slump of testing concrete mixes. Hence, the experimental results have strongly demonstrated the superior predictive capability of the CSPLSSVR model.
 
Note: Avg. denotes the average result. 
5. Conclusion
This study has established a new method for predicting concrete workability quantified by the slump values. The research extends the body of knowledge by investigating the capability of LSSVR for concrete slump prediction. To establish the proposed CSPLSSVR, a dataset consisting of actual concrete slump tests has been collected. From the experiments, the proposed model has achieved the most accurate prediction results.
The average MAPE of the method obtained from the crossvalidation process is less than 3% which is very desirable because modeling concrete slump is known to be very complex and highly nonlinear. Since the tenfold crossvalidation process is a very reliable way for model performance evaluation [33], it is expected that the proposed CSPLSSVR can predict the flow of concrete based on the similar conditioning factors with the same accuracy. Accordingly, the newly established method can be a very useful tool to assist the engineers in the task of concrete mix design.
Nevertheless, in addition to the currently used six conditioning factors of concrete slump, other factors (e.g., the type, size, absorption, and the water amount of the fine and coarse aggregates) can be relevant and should be considered by the model. Furthermore, another limitation of the current study is that the employed dataset only consists of 95 data points. Thus, this dataset should be expanded in a future study to further enhance the generalization of the current model and better ensure the predictive accuracy of the model when dealing with new concrete mixes.
Competing Interests
The authors declare that there is no conflict of interests regarding the publication of this manuscript.
References
 I.C. Yeh, “Modeling slump flow of concrete using secondorder regressions and artificial neural networks,” Cement and Concrete Composites, vol. 29, no. 6, pp. 474–480, 2007. View at: Publisher Site  Google Scholar
 A. Öztaş, M. Pala, E. Özbay, E. Kanca, N. Çağlar, and M. A. Bhatti, “Predicting the compressive strength and slump of high strength concrete using neural network,” Construction and Building Materials, vol. 20, no. 9, pp. 769–775, 2006. View at: Publisher Site  Google Scholar
 I.C. Yeh, “Exploring concrete slump model using artificial neural networks,” Journal of Computing in Civil Engineering, vol. 20, no. 3, pp. 217–221, 2006. View at: Publisher Site  Google Scholar
 Y. Li, J. Wang, and Z. Xu, “Design optimization of a concrete face rockfill dam by using genetic algorithm,” Mathematical Problems in Engineering, vol. 2016, Article ID 4971048, 11 pages, 2016. View at: Publisher Site  Google Scholar
 P. K. Mehta and P. J. M. Monteiro, ConcreteStructure, Properties, and Materials, Prentice Hall Inc, Englewood Cliffs, NJ, USA, 1993.
 Y. Peng, H. Chu, and J. Pu, “Numerical simulation of recycled concrete using convex aggregate model and base force element method,” Advances in Materials Science and Engineering, vol. 2016, Article ID 5075109, 10 pages, 2016. View at: Publisher Site  Google Scholar
 J. Kasperkiewicz, J. Racz, and A. Dubrawski, “HPC strength prediction using artificial neural network,” Journal of Computing in Civil Engineering, vol. 9, no. 4, pp. 279–284, 1995. View at: Publisher Site  Google Scholar
 I.C. Yeh, “Modeling of strength of highperformance concrete using artificial neural networks,” Cement and Concrete Research, vol. 28, no. 12, pp. 1797–1808, 1998. View at: Publisher Site  Google Scholar
 S. U. Khan, M. F. Nuruddin, T. Ayub, and N. Shafiq, “Effects of different mineral admixtures on the properties of fresh concrete,” The Scientific World Journal, vol. 2014, Article ID 986567, 11 pages, 2014. View at: Publisher Site  Google Scholar
 J.S. Chou, C.F. Tsai, A.D. Pham, and Y.H. Lu, “Machine learning in concrete strength simulations: multination data analytics,” Construction and Building Materials, vol. 73, pp. 771–780, 2014. View at: Publisher Site  Google Scholar
 N. Hoang, A. Pham, Q. Nguyen, and Q. Pham, “Estimating compressive strength of high performance concrete with gaussian process regression model,” Advances in Civil Engineering, vol. 2016, Article ID 2861380, 8 pages, 2016. View at: Publisher Site  Google Scholar
 W.H. Chine, H.H. Hsu, L. Chen, T.S. Wang, and C.H. Chiu, “Modeling slump of concrete using the artificial neural networks,” in Proceedings of the International Conference on Artificial Intelligence and Computational Intelligence (AICI '10), pp. 236–239, Sanya, China, October 2010. View at: Publisher Site  Google Scholar
 A. Bilgil, “Estimation of slump value and Bingham parameters of fresh concrete mixture composition with artificial neural network modelling,” Scientific Research and Essays, vol. 6, no. 8, pp. 1753–1765, 2011. View at: Google Scholar
 A. Baykasoğlu, A. Öztaş, and E. Özbay, “Prediction and multiobjective optimization of highstrength concrete parameters via soft computing approaches,” Expert Systems with Applications, vol. 36, no. 3, pp. 6145–6155, 2009. View at: Publisher Site  Google Scholar
 L. Chen, C.H. Kou, and S.W. Ma, “Prediction of slump flow of highperformance concrete via parallel hypercubic geneexpression programming,” Engineering Applications of Artificial Intelligence, vol. 34, pp. 66–74, 2014. View at: Publisher Site  Google Scholar
 V. Chandwani, V. Agrawal, and R. Nagar, “Modeling slump of ready mix concrete using genetic algorithms assisted training of Artificial Neural Networks,” Expert Systems with Applications, vol. 42, no. 2, pp. 885–893, 2015. View at: Publisher Site  Google Scholar
 J. Suykens, J. V. Gestel, J. D. Brabanter, B. D. Moor, and J. Vandewalle, Least Square Support Vector Machines, World Scientific, Singapore, 2002.
 J. Ji, C. Zhang, Y. Gui, Q. Lü, and J. Kodikara, “New observations on the application of LSSVM in slope system reliability analysis,” Journal of Computing in Civil Engineering, 2016. View at: Publisher Site  Google Scholar
 D. Yao, J. Yang, X. Li, and C. Zhao, “A hybrid approach for fault diagnosis of railway rolling bearings using STWDEMDGALSSVM,” Mathematical Problems in Engineering, vol. 2016, Article ID 8702970, 7 pages, 2016. View at: Publisher Site  Google Scholar
 J.S. Chou and A.D. Pham, “Smart artificial firefly colony algorithmbased support vector regression for enhanced forecasting in civil engineering,” ComputerAided Civil and Infrastructure Engineering, vol. 30, no. 9, pp. 715–732, 2015. View at: Publisher Site  Google Scholar
 D. Tien Bui, B. T. Pham, Q. P. Nguyen, and N. Hoang, “Spatial prediction of rainfallinduced shallow landslides using hybrid integration approach of LeastSquares Support Vector Machines and differential evolution optimization: a case study in Central Vietnam,” International Journal of Digital Earth, vol. 9, no. 11, pp. 1077–1097, 2016. View at: Publisher Site  Google Scholar
 D.T. Vu and N.D. Hoang, “Punching shear capacity estimation of FRPreinforced concrete slabs using a hybrid machine learning approach,” Structure and Infrastructure Engineering, vol. 12, no. 9, pp. 1153–1161, 2016. View at: Publisher Site  Google Scholar
 V. Chandwani, V. Agrawal, and R. Nagar, “Modeling slump of ready mix concrete using genetically evolved artificial neural networks,” Advances in Artificial Neural Systems, vol. 2014, Article ID 629137, 9 pages, 2014. View at: Publisher Site  Google Scholar
 M.Y. Cheng and N.D. Hoang, “Interval estimation of construction cost at completion using least squares support vector machine,” Journal of Civil Engineering and Management, vol. 20, no. 2, pp. 223–236, 2014. View at: Publisher Site  Google Scholar
 C. W. Hsu, C. C. Chang, and C. J. Lin, “A practical guide to support vector classification,” Tech. Rep., Department of Computer Science, National Taiwan University, Taipei, Taiwan, 2010. View at: Google Scholar
 J.S. Chou, C.K. Chiu, M. Farfoura, and I. AlTaharwa, “Optimizing the prediction accuracy of concrete compressive strength based on a comparison of datamining techniques,” Journal of Computing in Civil Engineering, vol. 25, no. 3, pp. 242–253, 2011. View at: Publisher Site  Google Scholar
 M. T. Hagan and M. B. Menhaj, “Training feedforward networks with the Marquardt algorithm,” IEEE Transactions on Neural Networks, vol. 5, no. 6, pp. 989–993, 1994. View at: Publisher Site  Google Scholar
 N. R. Draper and H. Smith, Applied Regression Analysis, WileyInterscience, 1998. View at: Publisher Site  MathSciNet
 M. T. Hagan, H. B. Demuth, and M. H. Beale, Neural Network Design, PWS Publishing, Boston, Mass, USA, 1996.
 T. Tran and N. Hoang, “Predicting colonization growth of algae on mortar surface with artificial neural network,” Journal of Computing in Civil Engineering, vol. 30, no. 6, Article ID 04016030, 2016. View at: Publisher Site  Google Scholar
 D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning representations by backpropagating errors,” Nature, vol. 323, no. 6088, pp. 533–536, 1986. View at: Publisher Site  Google Scholar
 G. N. Smith, Probability and Statistics in Civil Engineering, Collins, London, UK, 1986.
 S. Arlot and A. Celisse, “A survey of crossvalidation procedures for model selection,” Statistics Surveys, vol. 4, pp. 40–79, 2010. View at: Publisher Site  Google Scholar  MathSciNet
Copyright
Copyright © 2016 NhatDuc Hoang and AnhDuc Pham. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.