Abstract

In order to build high-quality concrete, it is imperative to know the raw materials in advance. It is possible to accurately predict the quality of concrete and the amount of raw materials used using machine learning-enhanced methods. An automated process based on machine learning strategies is proposed in this paper for predicting the compressive strength of concrete. Fusion-learning-based optimization is used in the proposed approach to generate a strong learner by pooling support vector regression models. The SVR technique proposes an optimization method for finding the kernel radial basis function (RBF) parameters based on improving the innovative gunner algorithm (AIG). As a result of AIG's diverse solutions, local optima are effectively avoided. Therefore, the novelty of our research is that, in solving the uncertainty of predicted outputs based on integrated models, we use fusion-learning-based optimization to improve regression discrimination. We also collected a standard dataset to analyze the proposed algorithm, and subsequently, the dataset was designed from concrete laboratory tests on 244 samples, seven features, and three outputs. Different regression intensities are determined by correlation analysis of responses. Regression fusion is sufficiently accurate to estimate the number of desired outcomes examined based on the appropriate input data sample. The best quality concrete can be achieved with an error rate of less than 5%.

1. Introduction

The structural materials of concrete and the reaction between these materials play a critical and decisive role in explaining high-strength concrete's mechanical properties. Therefore, concrete designers and engineers seek to gain an accurate and thorough understanding of the relationship between the appropriate choice of type and amount of materials for concrete construction [1]. Concrete is a mixture of water, cement, and other materials, and from a chemical perspective, the strength of this material depends more than anything on cement and water [2]. High-strength concrete must exhibit resistance and flexibility against various forces and environmental factors. Hence, the proper mixture of raw materials can enhance the strength and quality of the concrete. The compressive strength of concrete depends on various factors, including mixing properties, mixing methods, mixing conditions, transport, and concrete [3]. Sometimes the concrete design engineer knows the initial composition and the estimated percentage of material mixing; however, the importance of an accurate estimate cannot be overestimated. Therefore, the concrete quality can only be assessed after making efficient concrete, and the elapse of a specific period, often a long interval (usually one month) using special tests [4]. Consequently, making quality concrete is conducted based on resembles a trial and error approach and may lead to extensive raw materials loss. By observing the falling slope in concrete, helpful information about the concrete performance can be obtained [5].

In addition, due to the great importance of concrete in structures and the significant growth of urbanization, the increase in demand for high-quality concrete is substantial [6]. In many developing countries, such as China, the ready-mixed concrete, prepared by mixing concrete, dominates the entire concrete market [7]. As an advanced industry, the ready-mixed concrete trade is of utmost importance in significant sectors of industry and construction, transportation, and after-sales service. However, the current concrete industry management is still at the fundamental information management level because the knowledge generated during the management process is too complex to be used [8]. A goal-oriented and efficient way to address this problem is knowledge management and the adoption of artificial intelligence technology [9].

In recent decades, due to the proliferation of construction globally, a significant body of research has attempted to optimize the quality of concrete. Therefore, artificial intelligence (AI) has received growing attention as an analyzer of information in this field. The main application of soft computing-based methods is to determine concrete strength [1018]. Most researchers focus on accurate estimation of raw materials based on artificial neural networks (ANNs) ability to estimate correct regression outputs [1921]. Some researchers have also used ANN and clustering methods such as adaptive neuro-fuzzy inference system (ANFIS) [22]. In other studies, rapid learning and its combination with Cascade-forward neural network (CfNN) have been highlighted, which has led to the decreased processing time of concrete ingredients and achievement of the desired output [23]. Some investigations have utilized a combination of neural networks and evolutionary algorithms or metaheuristic (MH) to improve the precision of obtaining desired outcome [24, 25]. Combination with evolutionary algorithms (EA) is typically intended to amend inherent defects of ANN or other regression-based estimation methods. These algorithms are extensively used in ANN training for accurate error estimation. These include the genetic algorithm (GA) [26, 27], particle swarm optimization (PSO) [28], or simulated annealing (SA) [29]. The genetic algorithm can find global and local optimums and may be even trapped in local optimums, but it has rapid convergence [26, 27]. A growing number of studies have recently adopted ANN [27, 28, 30] and SVR [29] as a model for regression network estimation. The utilization of methods such as deep learning to estimate the quality of concrete has drawn considerable attention, urging researchers to adopt different deep learning models to improve the quality of concrete [3133]. Sadrossadat and others. [34] have developed an evolutionary-based prediction model of the 28-day compressive strength of high-performance concrete containing cementitious materials.

Estimating the concrete performance is an essential process in ensuring the quality of various concrete components. Concrete testing is a method that measures near-performance parameters and provides valuable information about these parameters. The methods currently used in the world measure the stability of concrete in the laboratory or on-site. In some experiments, a kind of failure is estimated by measuring the reduction in the upper surface of freshly crushed concrete. Low accuracy, uncertainty, computational complexity, and the lack of a generalized method in the automatic determination of concrete raw materials in previous strategies have led us to look for more appropriate and accurate solutions.

Given that specific percentages of materials need to be combined to produce concrete of varying grades, this process is usually based on personal experience, and trial and error deems necessary to avoid wasting materials. For this to happen, an automated method is required for the analysis of raw materials. The importance of research becomes more apparent, knowing that a significant portion of its constituents include water, cement, and sand and are among finite resources that are rapidly depleted. Therefore, either one must possess exceptional skills and expertise, or a solution must be found to estimate the precise percentage of ingredients. Thus, the previous automated models are highly dependent on the optimized regression-learning model and the input components selection. Concrete design decisions on the component level must be correlated to operational cost and emissions on the supply chain level to evaluate commercial and environmental influence [35]. The main difficulty is finding a qualified approach to produce proper outputs on concrete designing in the real world. Fitting outputs are achieved by relying on appropriate learning methods. Recently, the relevant studies have considered optimization algorithms based on support vector models for high-quality concrete design [3640].

Although both SVR and ANN models can map input data to a higher dimensional space to determine the decision boundary, NNs require numerous data input than SVR to better training. In addition, SVRs need minimal or more negligible processing of input data, which saves a lot of time. Besides, the ANN model usually necessitates much more modification, cleaning, data processing, etc. Typically the ANNs involve batch conversion to numbers, feature scaling, etc.

In the current study, we present an effective prediction design of concrete structure based on the proper fusion model of Support Vector Regression (SVR) learner and innovative gunner algorithm (AIG). Also, the AIG algorithm is described by a significant search space exploration, owing to solution vectors, typical for swarming techniques. Thus, the AIG algorithm achieves diverse solutions, which provides it high efficiency in avoiding local optima [41]. Besides, it can be utilized successfully to determine target functions of different shapes and with multiple optima and multidimensional functions.

The proposed method can remarkably reduce the classification error in regression mode to design lightweight and high-performance concrete. Our model has produced desirable results for predicting 28-day compressive strength and helps save the raw materials for concrete production. The method has been applied to a set of laboratory data collected by the authors. Similarly, attempts have been made for innovation to generate actual data in the laboratory within three months and fabricate different concrete types with varying qualities.

The remainder of this paper is organized as follows: the method components are described in Section 3. Section 4 introduces the proposed model, including the used optimization algorithm and the regression learner. Section 5 presents the results and interpretations of the classification under various conditions. Finally, the paper is concluded by summarizing the key points in Section 6.

2. Contribution and Learning Structure

In this section, the components we utilized to optimize the concrete quality prediction based on learning process are described. As shown in Figure 1, the significant contributions of designing lightweight and high-performance concrete have been related to data development, automated analysis of outputs (i.e., effective design), and best learning to discriminate concrete quality.

2.1. Regression Learning

Assuming that one training data is available, if each input has D attributes (i.e., belongs to the D-dimensional space), and each point is assumed to have Y corresponding special, a function can be found that relates the input to the output [42]:

To obtain the function f, the values of and b in the following equation must be minimized:where C1 is a constant value, the value of which is set by the user. The C1 value is intended to create balance and change the weights of the penalty due to the omission of the variable ε and, at the same time, maximize the margin for discrimination. Accordingly, function Lε is introduced according to

The equation is rewritten as a maximum of the following equation [43]:where conditions are defined based on

By solving the above equations, the SVR function, i.e., f in (1), can be obtained using the kernel function:

2.2. Innovative Gunner Algorithm (AIG)

According to Newton's law, a projectile's motion can be directed vertically in a homogeneous gravitational field where the initial velocity in the horizontal direction is not zero [41]. A projectile is thrown with the initial velocity at an angle δ, and its horizontal surface is assumed to be perpendicular to the direction of gravity. It has a parabolic motion in a frame with dimensions d and h. This equation is defined according to (7), where denotes the acceleration caused by the gravitational force:

Computations will be more complicated if drag forces are involved. For the simplest model, where the tensile force is assumed to be proportional to the velocity of the projectile, (7) will be significantly complicated because [41]where k is defined as drag coefficient (unit 1/s), and its value depends on weight and air resistance. Several analytical and experimental methods have been proposed to ensure the accuracy of the projectiles equations based on ballistics. Figure 2 shows the bullet motion curve for three different angles.

However, it should be noted that the actual projectile curve can be expressed aswhere fh(ξ) is a function influenced by interfering factors, including changes in air resistance, temperature, wind, shape, and motion of the Earth. In this case, the angle δ can be assumed as a decision variable for the optimization process, whose objective function is as follows:

Given the complex form of equation (10) and the uncertainty of the function fh (ξ), a metaheuristic method could be used to calculate the value of angle and firing.

3. Proposed Method

Figure 3 shows the overall steps for the suggested implementation of concrete designing.

3.1. Preprocessing

First, as shown in Figure 4, we shuffled the data in the preprocessing step to prevent overtraining of the automatic classification procedure. Although the samples are randomly mixed, the label's position changes regularly according to the classes of each received sample. We are applying such a method to prevent overfitting, leading to a considerable increase in classification accuracy. Because each data's concrete is intrinsically highly distributed and uses numerous diverse samples in a wide range, they must be normalized in the next step of the preprocessing step. Normalization of samples reduces the processing cost and positively affects the optimization of concrete quality prediction [44].

Ynorm is considered the normalized value, and the variables Ysmin and are the minimum and maximum values, respectively. Besides, Ys is the current value of the concrete sample under study.

3.2. Fusion Regression

In the first step, data normalization is performed. This process is conducted using the Min-Max method to reduce excessive dispersion of data. The main goal of support vector machines is to find an optimal hyper-plane as a decision-making level that maximizes the margin between two classes. The data are moved to a considerably larger space by the Φ kernel function to classify highly complex data. The kernel function maps data from the input space on a space with higher dimensions, so that it is possible to separate the data in that space linearly. In the first part, the SVR regression learning pool as regression fusion model is used by applying training data. The general diagram is shown in Figure 5 and depicts the general learning framework. The initialization of γ and s for each regression classifier is initially randomized. The change interval for these two parameters varies from zero to 20. By applying the training data, the structure with the lowest mean square error (MSE) is selected, and its parameters ? and s are given to input to the AIG algorithm. In this way, convergence towards the optimal response is faster and more accurate. The regression fusion of pool structure is evaluated by five repetitions, averaging accuracy, and finding the best corresponding parameters. The best matching parameter is a grid with the highest accuracy rate. In addition, finding the best RBF kernel values facilitates the search for the global optimum in the modified AIG algorithm.

3.3. Improving the Best Model

In order to optimize the model parameters, various methods can be used. The space networked with RBF kernel was identified as the best kernel for the data [45, 46]. This kernel can be expressed for two variables and b:where the parameter γ corresponds to the square of the width of Gaussian kernel. By finding the initial values γ and C in the classification pool, the modified AIG algorithm is used by accounting for air resistant. This constraint is considered more than other constraints for fh(ξ) because other constraints do not always exist in real conditions, but air resistance is present under almost any condition.

The most typical case of air resistance, for Reynolds numbers above 1000, is Newton drag with a drag force proportional to the speed squared, Fair = -kv2. In air, which has a kinematic viscosity around 0.15 cm2/s, this means that the product of speed and diameter must be greater than about 0.015 m2/s. Unfortunately, the equations of motion cannot be easily solved analytically for this case. Therefore, a numerical solution will be examined.where FD, c, ρ and A are defined as drag force, drag coefficient, air density, and cross sectional area of the projectile, respectively, and μ is defined according to

In light of these limitations and initialization in the previous step, the fitness function is defined in the metaheuristic algorithm, according to MSE. The termination condition of the algorithm is based on calculating the minimum error of MSE. Suppose that the algorithm is not realized in a certain number of iterations. In that case, the parameters matching the best regression network are selected, which produce a more significant effect on regression learning than other SVR structures do. On the other hand, considering the drag conditions and air resistance, the shot angle widens, and the best angle is set between 30 and 80°. According to the calculations, the best shot angle is 45°, but in changing the initial velocity of the bullet, the optimal angle is altered considering air resistance.

4. Experimental Results

The proposed method was implemented by MATLAB R2019b in Windows 10 operating system. The hardware platform used for simulation was an Intel® Core ™ i5-8500 system with 8 GB of RAM, plus 16 GB of SSD RAM. Other complementary software such as SPSS was also used. The input data were incorporated into an integrated algorithm, and the inputs were normalized in the first step. Mean square error (MSE) and mean absolute percentage error (MAPE) are expressed in (15) and (16), respectively. They were tested using adaptive algorithms in input analysis.

In these equations, Ti, Pi, and N are the actual output values, the values ​​predicted by the algorithm, and all specimens. In addition to the MSE and MAPE calculation, the maximum and minimum errors were also estimated, and the results of the K-fold cross-validation were calculated for each test.

4.1. Dataset

Data were obtained from the concrete testing laboratory at Imam Khomeini University of Sabzevar, Iran, over six months. The data consists of seven inputs and three outputs, including cement, slag, fly ash, water, superplasticizers, coarse aggregate, and fine aggregate. All of these inputs are in component kg in one M3 concrete. Besides, the outputs of this dataset are SLUMP (cm), FLOW (cm), and 28-day Compressive Strength (Mpa). The number of constructed specimens is higher than the first data samples (243 specimens), and the number of feature inputs and outputs is equal. The necessary information of this dataset is shown in Table 1.

4.2. Assessments

As presented in Table 2, regression predictions for datasets are a combination of different algorithms at the time of testing by selection K value equal to 5. Changing the values ​​of RBF parameters and preventing overfitting in many repetitions, the SVRs pool as regression fusion was used to find the lowest amount of MSE among the five structures in each pool.

If the MSE is less than 5% of the total error in the sample data in this structure, the corresponding SVR is selected as the base network. Otherwise, the minimum MSE and the corresponding network structure should be chosen. Figure 6 shows the results of the RMSE convergence and corresponding Loss function of the algorithm set to reach the minimum value. When the algorithm tries to achieve the optimal value in the training step, more satisfactory outputs can be obtained in the experimental section by increasing the number of search alterations of the AIG algorithm or increasing the number of SVRs.

The change in evaluation criteria is not significant due to the change in K, and thus, Table 3 shows that a slight improvement in K accompanies the experiment.

However, although the training phase results are done offline, due to the numerous repetitions in finding the best SVR structure of the learning pool and the time-consuming AIG optimization algorithm, computational complexity is observed, especially in the training step. A similar implementation for random data in convergence to the optimal value for four validation data is shown in Figure 7. In these plots, a limited number of repetitions are seen, and the level of error originates from the training stage.

On average, for each run of the algorithm, the average value of correlation is greater than 0.9 and, in some cases, reaches as high as 0.98. Assuming that, in the prediction of concrete quality regression, yt and xt are static variables in the detection or estimation of parameters and T and F tests, the results were used by increasing the sample size, and sample variance to population variance was used to demonstrate minimal normal consistency. Estimating the MES for an estimate of the predicted time sequence with dynamic properties may suggest that the variance has not been well defined. Therefore, this value will not fluctuate around a specific mean. To further explain the variables yt and xt, it was assumed that the regression trend was defined as the quality prediction trend in a random step:where ɛ1t and ɛ2t had independent distribution functions, and the relationship between the variables yt and xt was not justifiable.

To perform a regression analysis, we first assumed a relationship between the two variables selected from the seven input variables. Based on the assumption that there is a linear relationship between the two variables, quantitative data was collected from both variables, and the data were plotted as points on a two-dimensional map. As shown in Figure 8, the difference between the main and predicted outputs is small, and the variances between them are negligible. Table 4 shows the calculated variances of ten experiments. Variances indicate great independence of the features selected by SPSS software. Therefore, using the output quality, a small portion of the low-impact features can be distinguished from the others.

As noted earlier, concrete strength is defined in terms of its ingredients, weight, and specific properties. Data were obtained under laboratory conditions based on real conditions. Other factors such as data collection, mixing, and other parameters affecting concrete strength are also included in this section. Although the laboratory data presented in some studies offer valuable information on this subject, there were essential details, the absence of which could significantly predict performance in many cases.

5. Discussion

In concrete preparation by learning-based methods, the most crucial design issue is the lack of significant differences between the predicted outputs and the actual output. Hence, the R-Squared is considered as one of the most specific criteria for comparison between methods, where y, , and are the main, mean, and predicted values, respectively. The R-Squared is expressed as (19):In these experiments, different test conditions are investigated:(1)When SVR is used alone in estimating outputs (Model 1).(2)When fusion between SVRs is used in estimation (Model 2).(3)When the fusion between SVRs is combined with the genetic algorithm in the analysis (Model 3).(4)When the fusion between SVRs is combined with the PSO algorithm in the estimation (Model 4).(5)When the fusion between SVRs is combined with the AIG algorithm in the estimate (Model 5).(6)When the fusion between SVRs is combined with the modified AIG algorithm in the estimation (Model 6).

Experimental representations for five different and random types of data are shown in Figure 9 separately for each of the six models. The p-value in the proposed algorithm indicated a significant relationship between the proposed algorithm’s output and regression values. For this reason, we rounded the outputs (p < 0.05), which manifest the significance of outputs. The comparison with other similar outputs and methods provides a valid justification for rejecting the null hypothesis (H0). This is because the test result was not in the acceptable range of H0 and thus H0 was not confirmed (α = 0.05) and therefore -Zα-1 is equal to -1.65. This meant that the confidence interval was greater than 95% and the outputs more accurately resembled reality, as indicated by the studied samples and K-equal validation. To test the claims regarding the correlation of concrete properties, aside from selection by the genetic algorithms, SPSS software was used to analyze the correlation factor between the measured properties. The results were significantly identical and except for the fifth feature (which had a lower correlation compared to other coefficients), features with 100% correlation were selected correctly.

However, as noted in Abrams' law, in real weather conditions, other factors may also affect the mixture of substances. This rule, which predicts concrete performance based on a combination of water-cement mixture, indicated a slight error for the coefficient of determination. This could be attributed to water interference in the hydration process of cement, which can cause changes in the molecular structure of cement material and thus affect the process of achieving optimal concrete performance.

We know that concreting at a temperature of less than 5 °C slows down the hydration process, and the process of obtaining concrete strength practically stops. As the temperature of concrete decreases, its hardening and achieving strength decreases, and at the temperature below freezing point, the chemical process of hardening of concrete stops. In general, at low temperatures, the rate of gain of concrete strength decreases. Experiments were performed after producing the concrete, which was considered for the built concretes according to the regional conditions and the occurrence of frost.

In the experiments, the decrease of temperature and change of each of the outputs is considered noise, and the algorithm's resistance to change of each of the concrete components is investigated. The effect of freezing is measured, and the temperature drop is considered as the fourth output. It was found that, among the primary materials, changes in some of them can significantly prevent concrete from freezing. In general, for a given degree of hydration, the higher the water-to-cement ratio, or for an offered water-to-cement rate, the lower the degree of hydration, and the larger the pore volume in the hydrated cement paste. Since freezing water settles quickly in large pores, it can be hypothesized that the amount of freezing water for water-to-cement ratios is higher and, in the early processing times at a given freezing temperature, will be more. By performing the above software tests on typical weight concretes exposed to freezing and thawing in wet conditions, the maximum water-to-cement ratio should be considered for tabulations, water ducts, and guardrails, or parts thereof equal to 0.45 and equal to 0.5 for the remaining pieces. It is clear that these water limits to cement ratio are based on the assumption of sufficient hydration of cement. In other words, the amount of water that can freeze in concrete with a specific rate of water to cement increases with decreasing temperature, and the amount of water that freezes at a particular temperature increases with an increasing ratio of water to cement. Based on this, it was concluded that as the temperature decreases, the water-to-cement rate should be considered, and the combination of some slags should be used more sensitively.

High error, computational complexity, and uncertainty challenges in algorithms are common problems in quality recognition optimization methods. Table 5 presents the analytical comparison between the former techniques and the proposed approach. They also tried to predict the compressive strength of lightweight structural concrete.

Future designs should incorporate deep learning models that are transferable, such as transfer learning structures [55, 56]. Optimization algorithms as well as deep learning based on neural networks have a major impact on the classification process. Except for the optimization method employed in the paper, some of the most representative computational intelligence algorithms can be utilized to solve the challenges of high-quality concrete prediction. Some recent optimization algorithms include monarch butterfly optimization (MBO) [57], earthworm optimization algorithm (EWA) [58], elephant herding optimization (EHO) [59], moth search (MS) algorithm [60], Slime mould algorithm (SMA) [61], and Harris hawks optimization (HHO) [62].

6. Conclusion

Predicting the quality of high-strength concrete is a critical issue in the concrete industry. Our investigations in this paper showed that using an improved model, including the fusion of multiple SVR networks and the modified AIG algorithm, can produce satisfactory outcomes in predicting 28-day compressive strength. Unlike previous methods, which only work in fabricating concrete compositions, the estimator model utilizes a fast yet efficient model and facilitates the regression classification process. The research results suggest that the degree of complexity in the sample data can be reduced to establish a correlation in a standard pattern between the features. Accordingly, it can be concluded that employing the proposed method as an automated method can facilitate the analysis of concrete data and, therefore, offers an accurate estimation of the quality of high-strength concrete. In the future, the authors will use several statistical criteria and various machine learning models for further and more extensive investigation. Furthermore, the use of fusion between neural networks (considering the challenge of uncertainty) and support vectors can be effective in mapping input data and extracting the appropriate pattern.

Data Availability

All the data and codes are available from the corresponding authors.

Ethical Approval

The research meets all applicable standards with regard to the ethics of experimentation and research integrity, and the following is being certified/declared true. As an expert scientist and along with co-authors of concerned field, the paper has been submitted with full responsibility, following due ethical procedure, and there is no duplicate publication, fraud, plagiarism, or concerns about animal or human experimentation.

Conflicts of Interest

The authors have no conflicts of interest to declare.

Authors’ Contributions

Gh. Rahchamani and S. M. Movahedifar conceived and planned the experiments. A. Honarbakhsh and Gh. Rahchamani planned and carried out the simulations. A. Honarbakhsh and Gh. Rahchamani contributed to the interpretation of the results. Gh. Rahchamani, S. M. Movahedifar, and A. Honarbakhsh took the lead in writing the manuscript. All authors provided critical feedback and helped shape the research, analysis, and manuscript.

Acknowledgments

The authors gratefully acknowledge the generous support of Imam Khomeini University of Sabzevar and Islamic Azad University of Neyshabur. The authors wish to thank experts from the New Materials Technology and Processing Research Center of Neyshabur for the realization of this project.