Article

Modeling of Compressive Strength of Self-Compacting Rubberized Concrete Using Machine Learning

by Miljan Kovačević 1,*, Silva Lozančić 2,*, Emmanuel Karlo Nyarko 3 and Marijana Hadzima-Nyarko 2

1 Faculty of Technical Sciences, University of Pristina, Knjaza Milosa 7, 38220 Kosovska Mitrovica, Serbia
2 Faculty of Civil Engineering, Josip Juraj Strossmayer University of Osijek, Vladimira Preloga 3, 31000 Osijek, Croatia
3 Faculty of Electrical Engineering, Computer Science and Information Technology, Josip Juraj Strossmayer University of Osijek, Kneza Trpimira 2B, 31000 Osijek, Croatia
* Authors to whom correspondence should be addressed.
Materials 2021, 14(15), 4346; https://doi.org/10.3390/ma14154346
Submission received: 9 July 2021 / Revised: 30 July 2021 / Accepted: 31 July 2021 / Published: 3 August 2021
(This article belongs to the Special Issue Artificial Intelligence for Cementitious Materials)

Abstract:
This paper gives a comprehensive overview of the state-of-the-art machine learning methods that can be used for estimating self-compacting rubberized concrete (SCRC) compressive strength, including multilayered perceptron artificial neural network (MLP-ANN), ensembles of MLP-ANNs, regression tree ensembles (random forests, boosted and bagged regression trees), support vector regression (SVR) and Gaussian process regression (GPR). As a basis for the development of the forecast model, a database was obtained from an experimental study containing a total of 166 samples of SCRC. Ensembles of MLP-ANNs showed the best performance in forecasting with a mean absolute error (MAE) of 2.81 MPa and Pearson’s linear correlation coefficient (R) of 0.96. The significantly simpler GPR model had almost the same accuracy criterion values as the most accurate model; furthermore, feature reduction is easy to combine with GPR using automatic relevance determination (ARD), leading to models with better performance and lower complexity.

1. Introduction

The European Union has prohibited the disposal of all types of waste tires since 2006 because the long process of tire deterioration affects the environment and wildlife. As natural resources become increasingly scarce in the concrete industry, more emphasis is being placed on the utilization of waste products from other industries, such as the partial replacement of natural aggregates with recycled rubber [1]. Waste rubber has a major impact on the properties of fresh concrete. The use of recycled rubber decreases the entry of aggressive substances into the material, making the concrete less permeable and thus more durable. It enhances impact resistance, wear resistance and durability, as well as other mechanical characteristics. Depending on the amount and size of the rubber fraction, it decreases the compressive strength, shrinkage and thermal conductivity coefficient, while increasing the freezing resistance and sound absorption coefficient.
Self-compacting concrete (SCC) is used to enhance productivity: it allows faster construction, reduces noise levels, and improves the surface finish, eliminating the need for patching. SCC is a special type of concrete that requires no compacting devices during casting [2,3]. Its self-compactability and self-leveling properties eliminate the need for expensive vibrating equipment, reduce construction time and cost as well as the number of workers on site, and increase site safety.
Unlike ordinary concrete, SCC contains a higher proportion of fine aggregate particles and has a lower water-to-binder ratio, which affects the force required for the concrete to flow. Such a composition reduces the flowability of the mixture, which is resolved by the addition of superplasticizers. Large amounts of recycled rubber particles in the material reduce the ability of the mixture to fill the formwork without additional vibration. In order to ensure that self-compacting rubberized concrete (SCRC) has the same or similar properties as the reference mixture, the proportion of admixtures must be modified and adjusted; in this way, the required fresh-state behavior of the rubberized concrete is ensured. Even when the content of rubber particles is limited to a maximum of 20–30%, the mechanical properties must be improved by using a higher cement content and a lower water-to-binder ratio, or by adding supplementary cementing materials such as slag, silica fume and/or fly ash.
Moreover, the compressive strength of SCRC, the key indicator commonly used for assessing strength, generally decreases as the rubber content increases. Building codes (for example, Eurocode 2 [4] or ACI Committee 209 [5]) contain no expressions for predicting the compressive strength of rubberized concrete, let alone of SCRC. The models from the literature for predicting the compressive strength of rubberized concrete are based on a reduction coefficient with respect to a reference mixture without recycled rubber particles, which means that a reference mixture without crumb rubber must always be produced. Therefore, in this article, an effort was made to model the compressive strength of SCRC by applying several machine learning methods. So far, metaheuristic methods, and especially neural networks, have been successfully applied in various fields, such as the control and optimization of processes, economics, medicine, and engineering [6,7,8,9,10]. They have also been used to model the properties of concrete in the fresh or hardened state [11,12,13,14,15], but much less often for concrete with added rubber [16,17,18].
Various researchers used different methods to model the compressive strength of rubberized concrete and SCRC. Some of the studies are summarized in Table 1. However, this field, especially when SCRC is in question, still requires further exploration.
Apart from ANN models, other machine learning methods, such as k-nearest neighbors (KNN) and RF, have also been applied to estimate the compressive strength of concrete. KNN predicts unknown values using the distances to the k nearest data points. RF, on the other hand, is an ensemble learning method that generates a large number of decision trees during training and predicts unknown values as the average prediction of the individual trees. For example, Ahmadi-Nedushan [30] developed a KNN model to predict the compressive strength of concrete with 104 experimental data points, while Chopra et al. [31] estimated the compressive strength of concrete using an RF model with 49 data points.
The outcomes of the above studies are certainly encouraging, especially considering the fact that applications of ML models to estimate the compressive strength of SCRC are still at an early stage.
The aim of this article is to estimate the compressive strength of SCRC specimens using multilayered perceptron artificial neural network (MLP-ANN), ensembles of MLP-ANNs, regression tree ensembles (random forests, boosted and bagged regression trees), support vector regression (SVR) and Gaussian process regression (GPR). To the best of the authors' knowledge, GPR with automatic relevance determination (ARD) has not previously been used for estimating SCRC compressive strength.

2. Methods

2.1. Multilayered Perceptron Artificial Neural Network (MLP-ANN)

Artificial neural networks are based on the parallel processing of various types of information similar to the human brain. They contain artificial neurons that are interconnected into a single parallel structure. A multilayer perceptron is a neural network with forward signal propagation that consists of at least three layers of neurons: input, hidden, and output layers.
In the general case, each neuron of one layer is connected to each neuron of the next layer, as shown in Figure 1 for the example of a three-layer MLP network with n inputs and one output. The properties of the network depend on the number of neurons and the type of activation function, so if the network is to be used as a universal approximator, it must use nonlinear activation functions in the hidden layer to be able to approximate nonlinear relationships between input and output variables [32]. A model with one hidden layer having neurons with a sigmoid activation function and output layer neurons with a linear activation function can approximate an arbitrary function when there is a sufficient number of neurons in the hidden layer [32].
As the number of input neurons is determined by the dimensions of the input vector and the number of output neurons by the dimension of the output vector, determining the network structure is reduced to determining the optimal number of hidden layer neurons if MLP architecture with the property of universal approximator is used.
A method for precisely and reliably determining the minimum required number of neurons has not yet been found. What can be determined to some extent is the upper limit, i.e., the maximum number of hidden layer neurons that can be used to model a system represented by a specific data set. It is proposed to take the smaller value of $N_H$ from inequalities (1) and (2), where $N_i$ denotes the number of neural network inputs and $N_s$ the number of training samples [33,34]:
$$N_H \le 2 \times N_i \quad (1)$$
$$N_H \le \frac{N_s}{N_i + 1} \quad (2)$$
In order to improve the generalization of the model when the data set is small and contains inherent noise, it is possible to train a larger number of neural networks and average their outputs. In this way, ensemble models are created; the individual models that make up the ensemble are called base models or submodels. The data set on which each base model is trained is formed by the bootstrap method [35], which draws a sample of the same size as the original data set.
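A minimal sketch of this procedure, assuming scikit-learn, is given below; it is not the authors' MATLAB implementation, and the function names and hyperparameter choices are illustrative assumptions.

```python
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.utils import resample

def train_mlp_ensemble(X, y, n_models=100, max_hidden=16, seed=0):
    """Train MLP base models, each on a bootstrap sample of (X, y)."""
    rng = np.random.RandomState(seed)
    models = []
    for _ in range(n_models):
        # Bootstrap: a sample of the same size as the original data set.
        Xb, yb = resample(X, y, replace=True, random_state=rng)
        # Each base model may have a different hidden layer size (1..16).
        n_hidden = rng.randint(1, max_hidden + 1)
        mlp = MLPRegressor(hidden_layer_sizes=(n_hidden,),
                           activation="logistic",  # sigmoid hidden layer
                           solver="lbfgs", max_iter=1000)
        models.append(mlp.fit(Xb, yb))
    return models

def predict_ensemble(models, X):
    # Ensemble output = mean of the base model outputs.
    return np.mean([m.predict(X) for m in models], axis=0)
```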

2.2. Regression Tree Ensembles

2.2.1. Bagging

Methods based on regression trees (Classification and Regression Trees, CART) segment the space of input variables into multidimensional rectangles, or so-called boxes, and then apply a model in which each rectangle is assigned an appropriate value [35,36,37]. The lines that segment the input space are of the form $X_i = t$, and binary segmentation of the space is applied. Depending on the values of the input variables, the regression model (Figure 2) assigns to each region $R_m$ a constant value $c_m$ equal to the mean value of the output variable for that region, i.e., in this case:
$$\hat{f}(X) = \sum_{m=1}^{8} c_m \, I\{(X_1, X_2) \in R_m\}. \quad (3)$$
With the above procedure of forming a regression tree model, there is a possibility that the formed tree performs well on the training set but generalizes poorly to the test set. The bootstrap aggregation (bagging) method solves this problem.
In order to practically carry out this procedure, multiple training sets are needed so that the variance can be reduced by averaging. The problem of generating a larger number of training sets is overcome by bootstrap sampling, i.e., by repeated sampling within the same training data set. The bagging method applies sampling with replacement [35]. If the model trained on the $b$-th bootstrap training set has the prediction function $\hat{f}^{*b}(x)$ at the point $x$, then by averaging all $B$ models, a model (Figure 3) is obtained whose prediction function is determined by the following expression:
$$\hat{f}_{bag}(x) = \frac{1}{B} \sum_{b=1}^{B} \hat{f}^{*b}(x). \quad (4)$$
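The bagging predictor of Equation (4) can be sketched as follows, assuming a recent scikit-learn version (where the base model argument is named `estimator`); the hyperparameter values are illustrative, not those of the paper.

```python
from sklearn.ensemble import BaggingRegressor
from sklearn.tree import DecisionTreeRegressor

# B regression trees, each grown on a bootstrap sample and averaged (Eq. (4)).
bagged_trees = BaggingRegressor(
    estimator=DecisionTreeRegressor(min_samples_leaf=2),
    n_estimators=500,   # number of bootstrap models B
    bootstrap=True,     # sampling with replacement
    oob_score=True,     # out-of-bag error estimate
    random_state=0,
)
# bagged_trees.fit(X_train, y_train)
# y_hat = bagged_trees.predict(X_test)  # mean of the B tree predictions
```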

2.2.2. Random Forests

The RF method differs from the Bagging method in that it does not use all the variables in generating the model. In the process of generating the ensemble, the method tries to form regression trees that are decorrelated, which leads to reduced variance when the ensemble results are averaged for all the trees within the ensemble [35].
Suppose that the training dataset D is composed of $l$ observations and $n$ features. First, a sample is drawn randomly with replacement from the training dataset to create a bootstrap set. Before each split, $m \le n$ features are randomly selected as candidates for splitting; a typical value for $m$ in regression is approximately $n/3$ [35,38]. The RF model is obtained by aggregating the individual tree models obtained in this way.
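A configuration consistent with this description, assuming the scikit-learn API, could look as follows; `max_features=1/3` encodes the m ≈ n/3 rule of thumb, and the other values are illustrative.

```python
from sklearn.ensemble import RandomForestRegressor

rf = RandomForestRegressor(
    n_estimators=500,
    max_features=1 / 3,   # fraction of the n features tried at each split
    min_samples_leaf=2,
    oob_score=True,
    random_state=0,
)
# rf.fit(X_train, y_train); print(rf.oob_score_)
```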

2.2.3. Boosting Trees

The Boosting method uses sequential model training, in which each new regression tree added to the ensemble improves the performance of the previous collection of trees. This section discusses the application of the Gradient Boosting method to regression trees [35,39,40,41].
In the case of a quadratic error function (Figure 4), a new submodel that best estimates the residuals of the previous model is added to the base model at each step. By iteratively adding models in this way, a final model is obtained that represents an ensemble of the previously obtained models.
Estimating the relative influence of the predictor variable in this method is based on the number of times a variable is selected for splitting, weighted by the squared improvement to the model as a result of each split, and averaged over all trees [42].
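A sketch of gradient-boosted regression trees with a quadratic (squared-error) loss is given below, assuming the scikit-learn API; the impurity-based importance attribute mirrors the relative-influence measure described above, and the hyperparameters are illustrative.

```python
from sklearn.ensemble import GradientBoostingRegressor

gbt = GradientBoostingRegressor(
    n_estimators=100,    # number of sequentially added trees
    learning_rate=0.1,   # shrinkage of each tree's contribution
    max_depth=6,         # limits the number of splits per tree
    loss="squared_error",
)
# gbt.fit(X_train, y_train)
# gbt.feature_importances_  # impurity-based relative influence of predictors
```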

2.3. Support Vector Regression (SVR)

Suppose a training dataset $\{(x_1, y_1), (x_2, y_2), \ldots, (x_l, y_l)\} \subset \mathbb{R}^n \times \mathbb{R}$ is given, where $x_i \in \mathbb{R}^n$ is the $n$-dimensional vector denoting the model's inputs and $y_i$ are the observed responses to these inputs.
The approximation function has the following form:
$$f(x) = \sum_{i=1}^{l} (\alpha_i^* - \alpha_i) K(x_i, x) + b. \quad (5)$$
In Equation (5), $K$ denotes the kernel function, and $\alpha_i$, $\alpha_i^*$ and $b$ are the parameters obtained by minimizing the error function.
In order for SVR regression to be applied, the empirical risk function is introduced:
$$R_{emp}^{\varepsilon}(w, b) = \frac{1}{l} \sum_{i=1}^{l} |y_i - f(x_i, w)|_{\varepsilon}. \quad (6)$$
With the SVR algorithm, the goal is to minimize the empirical risk $R_{emp}^{\varepsilon}$ and the value $\|w\|^2$ simultaneously. The so-called Vapnik linear loss function (Figure 5) with an ε-insensitivity zone is introduced, defined by the following expression [43,44]:
$$|y - f(x, w)|_{\varepsilon} = \begin{cases} 0 & \text{if } |y - f(x, w)| \le \varepsilon \\ |y - f(x, w)| - \varepsilon & \text{otherwise.} \end{cases} \quad (7)$$
Considering the above expression, the problem can be reduced to minimizing the following function:
$$R = \frac{1}{2} \|w\|^2 + C \sum_{i=1}^{l} |y_i - f(x_i, w)|_{\varepsilon}. \quad (8)$$
The constant $C$ balances the approximation error against the norm of the weight vector $w$. Minimizing $R$ is equivalent to minimizing:
$$R(w, \xi, \xi^*) = \frac{1}{2} \|w\|^2 + C \left( \sum_{i=1}^{l} \xi_i + \sum_{i=1}^{l} \xi_i^* \right), \quad (9)$$
where $\xi_i$ and $\xi_i^*$ are the slack variables shown in Figure 5.
The linear, RBF and sigmoid kernels used in this paper are defined as [45]:
$$K(x_i, x) = \langle x_i, x \rangle \quad (10)$$
$$K(x_i, x) = \exp(-\gamma \|x_i - x\|^2), \quad \gamma > 0 \quad (11)$$
$$K(x_i, x) = \tanh(\gamma \langle x_i, x \rangle + r), \quad \gamma > 0 \quad (12)$$
In this paper, the LIBSVM software with the SMO optimization algorithm was used [46,47], called from within MATLAB [47].
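scikit-learn's SVR also wraps LIBSVM, so the setup above can be sketched in Python as a grid search over the three kernels and the parameters C, ε and γ; the grids below are assumptions, not the grids used in the paper.

```python
from sklearn.svm import SVR
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MinMaxScaler

pipe = make_pipeline(MinMaxScaler(), SVR())  # inputs scaled to (0, 1)
param_grid = {
    "svr__kernel": ["linear", "rbf", "sigmoid"],
    "svr__C": [0.1, 1, 10, 100, 1000],
    "svr__epsilon": [0.01, 0.05, 0.1],
    "svr__gamma": [1e-3, 1e-2, 1e-1, 1],
}
search = GridSearchCV(pipe, param_grid, cv=10,
                      scoring="neg_root_mean_squared_error")
# search.fit(X, y); print(search.best_params_)
```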

2.4. Gaussian Process Regression

A Gaussian process model is a probability distribution over possible functions that fit a set of points. Consider a problem of nonlinear regression:
$$y = f(x) + \varepsilon, \quad \varepsilon \sim N(0, \sigma^2), \quad (13)$$
where the function $f(\cdot): \mathbb{R}^n \to \mathbb{R}$ is unknown and needs to be estimated, $y$ is the target variable, $x$ are the input variables and $\varepsilon$ is normally distributed additive noise. Gaussian process regression [48] assumes that $f(\cdot)$ follows a Gaussian distribution with mean function $\mu(\cdot)$ and covariance function $k(\cdot, \cdot)$. The $n$ observations in an arbitrary data set $y = (y_1, \ldots, y_n)$ can always be viewed as a sample from some multivariate ($n$-variate) Gaussian distribution:
$$(y_1, \ldots, y_n)^T \sim N(\mu, K), \quad (14)$$
where $\mu = (\mu(x_1), \ldots, \mu(x_n))^T$ is the mean vector and $K$ is the $n \times n$ covariance matrix whose $(i,j)$-th element is $K_{ij} = k(x_i, x_j) + \sigma^2 \delta_{ij}$, with $\delta_{ij}$ the Kronecker delta. Let $x_*$ be any test point and $y_*$ the corresponding response value. The joint distribution of $y_1, \ldots, y_n, y_*$ is an $(n+1)$-variate normal distribution $(y_1, \ldots, y_n, y_*) \sim N(\mu_*, \Sigma)$, where $\mu_* = (\mu(x_1), \ldots, \mu(x_n), \mu(x_*))^T$ and the covariance matrix is:
$$\Sigma = \begin{bmatrix} K_{11} & \cdots & K_{1n} & K_{1*} \\ \vdots & \ddots & \vdots & \vdots \\ K_{n1} & \cdots & K_{nn} & K_{n*} \\ K_{*1} & \cdots & K_{*n} & K_{**} \end{bmatrix} = \begin{bmatrix} K & K_* \\ K_*^T & K_{**} \end{bmatrix}, \quad (15)$$
where $K_* = (K(x_*, x_1), \ldots, K(x_*, x_n))^T$ and $K_{**} = K(x_*, x_*)$.
The conditional distribution of $y_*$, given $y = (y_1, \ldots, y_n)^T$, is then $N(\hat{y}_*, \hat{\sigma}_*^2)$ with
$$\hat{y}_* = \mu(x_*) + K_*^T K^{-1} (y - \mu), \quad (16)$$
$$\hat{\sigma}_*^2 = K_{**} + \sigma^2 - K_*^T K^{-1} K_*. \quad (17)$$
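Equations (16) and (17) translate directly into code; the NumPy sketch below assumes a zero mean function and is illustrative only.

```python
import numpy as np

def gp_posterior(K, K_star, K_starstar, y, noise_var):
    """K: n x n kernel matrix k(x_i, x_j); K_star: vector k(x*, x_i)."""
    Kn = K + noise_var * np.eye(len(y))   # K_ij = k(x_i, x_j) + sigma^2 delta_ij
    alpha = np.linalg.solve(Kn, y)        # K^{-1} y
    mean = K_star @ alpha                 # Eq. (16) with mu = 0
    v = np.linalg.solve(Kn, K_star)       # K^{-1} K_*
    var = K_starstar + noise_var - K_star @ v   # Eq. (17)
    return mean, var
```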
For some covariance functions, hyperparameters can be used to determine which inputs (variables) are more relevant than others, through automatic relevance determination (ARD). For example, consider the squared exponential covariance function with a different length scale parameter for each input (ARD SE):
$$k(x_p, x_q) = v^2 \exp\left( -\frac{1}{2} \sum_{i=1}^{n} \left( \frac{x_{pi} - x_{qi}}{r_i} \right)^2 \right), \quad (18)$$
where $r_i$ denotes the length scale of the covariance function along input dimension $i$. If $r_i$ is very large, the relative importance of the $i$-th input is small [48]. The hyperparameters $v, r_1, \ldots, r_n$ and the noise variance $\sigma^2$ can be estimated by the maximum likelihood method. The log-likelihood of the training data is given by:
$$L(v, r_1, \ldots, r_n, \sigma^2) = -\frac{1}{2} \log \det K - \frac{1}{2} y^T K^{-1} y - \frac{n}{2} \log 2\pi. \quad (19)$$
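As an illustration, the sketch below fits a GPR model with a per-dimension (ARD) Matern 3/2 kernel by maximizing the log marginal likelihood, using scikit-learn as a stand-in for the MATLAB implementation used in the paper; the kernel composition and the attribute path in the last comment are assumptions of this sketch.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import ConstantKernel, Matern, WhiteKernel

n_features = 10
kernel = (ConstantKernel(1.0)
          * Matern(length_scale=np.ones(n_features), nu=1.5)  # ARD Matern 3/2
          + WhiteKernel(noise_level=1.0))                     # noise sigma^2
gpr = GaussianProcessRegressor(kernel=kernel, normalize_y=True)
# gpr.fit(X_std, y)  # X_std: z-score standardized inputs
# Large fitted length scales indicate less relevant inputs:
# print(gpr.kernel_.k1.k2.length_scale)
```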

3. Evaluation and Performance Measures

The purpose of defining a model-forming procedure is to divide the whole process into a fixed number of steps (Figure 6), so that every model is formed by the same procedure and under the same conditions. The same accuracy criteria were defined and applied to all models.
The root mean square error (RMSE), mean absolute error (MAE), Pearson’s Linear Correlation Coefficient (R) and mean absolute percentage error (MAPE) were used to assess the quality of the model.
The RMSE criterion for evaluating the accuracy of a model is a measure of the general accuracy of the model and is expressed in the same units as the quantity to be modeled:
$$RMSE = \sqrt{\frac{1}{N} \sum_{k=1}^{N} (d_k - o_k)^2}, \quad (20)$$
where:
$d_k$ — actual value (target value),
$o_k$ — output or forecast given by the model,
$N$ — number of samples.
The MAE criterion is a measure of the absolute accuracy of the model and is used to represent the mean absolute error of the model:
$$MAE = \frac{1}{N} \sum_{k=1}^{N} |d_k - o_k|. \quad (21)$$
Pearson’s linear correlation coefficient R represents a relative criterion for evaluating the accuracy of the model:
$$R = \frac{\sum_{k=1}^{N} (d_k - \bar{d})(o_k - \bar{o})}{\sqrt{\sum_{k=1}^{N} (d_k - \bar{d})^2 \sum_{k=1}^{N} (o_k - \bar{o})^2}}, \quad (22)$$
where o ¯ represents the mean value of the prediction obtained by the corresponding model and d ¯ represents the mean target value. Correlation coefficient values greater than 0.75 indicate a good correlation between the variables [49].
The mean absolute percentage error (MAPE), defined by Equation (23), represents a relative criterion for evaluating the accuracy of the model,
$$MAPE = \frac{100}{N} \sum_{k=1}^{N} \left| \frac{d_k - o_k}{d_k} \right|. \quad (23)$$
Model evaluation in this paper was performed using ten-fold cross-validation.
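The four criteria (Equations (20)–(23)) and the ten-fold cross-validation loop can be summarized in a short Python sketch; the helper names are assumptions.

```python
import numpy as np
from sklearn.model_selection import KFold

def scores(d, o):
    """d: target values, o: model predictions (1-D arrays)."""
    rmse = np.sqrt(np.mean((d - o) ** 2))        # Eq. (20)
    mae = np.mean(np.abs(d - o))                 # Eq. (21)
    r = np.corrcoef(d, o)[0, 1]                  # Eq. (22)
    mape = 100 * np.mean(np.abs((d - o) / d))    # Eq. (23)
    return rmse, mae, r, mape

def cross_validate(make_model, X, y, folds=10, seed=0):
    """Collect out-of-fold predictions over ten folds, then score them."""
    preds = np.empty_like(y, dtype=float)
    for train, test in KFold(folds, shuffle=True, random_state=seed).split(X):
        preds[test] = make_model().fit(X[train], y[train]).predict(X[test])
    return scores(y, preds)
```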

4. Dataset

A systematic search for papers examining the properties of SCRC in fresh and solid state was performed in April 2020.
Based on published papers on modelling the compressive strength of self-compacting rubberized concrete, the input parameters were selected (Table 2), and all data that were incomplete or lacked any of the selected input parameters were removed from the database.
With the aim of improving the mechanical properties of SCRC, various supplementary cementing materials, such as slag, silica fume and/or fly ash, have been reported in the literature. Therefore, SCRC mixtures containing such supplementary cementing materials are included in the database. The data collected from research papers contain the results of 166 SCRC samples (Table 3). An overview of some of the data sources follows.
Emiroglu et al. [50] performed experiments aimed at investigating the bond performance between crumb rubber SCC and reinforcing bars. The authors prepared four different R-SCC mixtures in which natural aggregate was replaced by volume with crumb rubber at percentages of 15%, 30%, 45% and 60%. Only the mixtures with 15% crumb rubber replacement were included in the database.
In order to investigate durability properties, Yung et al. [51] replaced part of the fine aggregate with waste tire rubber powder in volume ratios of 5%, 10%, 15% and 20%. They concluded that the best level of replacement achieved was with the addition of 5% waste tire rubber powder (that had been passed through a #50 sieve).
The results obtained by Li et al. [52] indicated that an SCC with adequate workability can be successfully produced by partially replacing sand or coarse aggregate with rubber particles of the same volume. When the replacement rate of sand with rubber particles was 30%, the compressive strength of the SCC decreased by about 30%; this mixture was therefore not considered.
Khalil et al. [53] prepared SCC specimens with different ratios of crumb rubber (10%, 20%, 30% and 40% of volume replacement of sand), but the last two mixtures (with 30% and 40% of sand replacement) were not added in the database.
Yu [54] conducted a study on the effect of changing regularity of waste rubber on deformation performance of SCRC. The results showed that the rubber particles in a more uniform distribution reduced the maximum compressive strength.
Zaoiai et al. [55] compared the rheological and mechanical performance between different mixtures formulations in order to obtain the optimum dose for rubber particles. The results of experimental testing showed that the compressive strength of SCC slightly decreased by replacing natural aggregate with rubber granulates.
Ismail and Hassan [56] investigated the mechanical properties and impact resistance of SCRC mixtures in which steel fibers were added in order to reinforce SCRC. However, all mixtures with steel fibers were removed from the database. The results showed that the addition of crumb rubber to concrete improved impact energy absorption and ductility, while the mechanical properties decreased with increasing content of crumb rubber.

5. Results and Discussion

The MLP-ANN with one hidden layer was trained using the Levenberg-Marquardt algorithm [71]. Training was stopped when the maximum number of epochs (set to 1000), the minimum gradient magnitude (set to $10^{-5}$) or the target network performance (measured as the mean square error and set to 0) was reached. All input data were normalized to the range [−1, 1] prior to training. The following input variables were defined: water, cement, fine natural aggregate, coarse natural aggregate, fine rubber, coarse rubber, superplasticizer, slag, silica fume and fly ash. These determine the number of neurons in the input layer (the variable water corresponds to the first input neuron, cement to the second, etc.); since there are ten input variables, the input layer has ten neurons. In this regression problem, the output layer has one neuron, whose output is the predicted compressive strength of the concrete. A sigmoid activation function is used in the hidden layer and a linear activation function in the output layer.
The maximum number of neurons in the hidden layer was determined experimentally using Equations (1) and (2) and equals 16.
Figure 7a shows the obtained performance using RMSE and MAE as absolute measures, while Figure 7b presents results using R and MAPE as relative measures. By analyzing models with different numbers of neurons in the hidden layer, it was concluded that the configuration with eight neurons in the hidden layer was optimal considering all four criteria.
In order to improve the generalization of the model, ensemble models were created. Base neural network models with up to 16 hidden layer neurons were analyzed, where each base model in the ensemble could have a different number of hidden neurons. The optimal base model in each iteration was selected based on the minimum RMSE value among the 16 models generated in that iteration. The procedure continued until a total of 100 base models of the ensemble had been generated.
The bootstrap method formed a sample of the same size as the original sample. Since evaluation used ten-fold cross-validation, bootstrap sampling was performed within the nine training folds, and the remaining fold was used to test the ensemble. The procedure was repeated 10 times so that the whole data set was used for testing, and the ensemble prediction was obtained as the mean of the base model outputs.
Figure 8 shows no trend by which an ANN model with a certain number of hidden layer neurons stands out in terms of accuracy. It presents 100 ANN models generated on different samples obtained by bootstrap aggregation, where the optimal number of hidden layer neurons varies, as does the achieved RMSE, which is generally unsatisfactory for all individual models; it varies mainly between 4 and 8 MPa. By including these individual models in the ensemble, the accuracy in terms of the RMSE criterion improves significantly, although the complexity increases (the red circle in Figure 8a represents the RMSE value for the ensemble). There is also a significant increase in accuracy with regard to the other accuracy criteria.
It can be seen in Figure 9 and Figure 10 that the model of an ensemble composed of individual neural networks contributes to a significant improvement in generalization. Comparative values in terms of defined criteria for model evaluation for the optimal individual neural network model and ensemble model are shown in Table 4, and the regression plot of the modelled and target values for the ensemble model is shown in Figure 11.
The application of models based on decision trees was also analyzed. MSE values were used as a criterion in model training, and numerical values of MSE of the base models of the generated ensemble were presented cumulatively. The parameters of the optimal models were determined by the grid-search method.
The analysis was performed using the following methods:
  • Bagging method (TreeBagger),
  • RF method,
  • Boosted Trees method.
The data used for training (in-bag data) in the TreeBagger model are drawn from the entire data set by sampling with replacement. The data not drawn (out-of-bag data) serve as test data. During model building, the out-of-bag error is calculated as the difference between the out-of-bag samples and the predictions for those same samples, and it is stored. The procedure is repeated for all trees within the ensemble.
During the implementation of the Bagging method, different values of model parameters were analyzed, as follows:
  • Number of generated trees B. Within this analysis, the maximum number of generated trees was limited to 500.
  • The minimum number of data or samples assigned to the leaf (min leaf size) within the tree. Values from 2 to 15 samples with a step size of 1 per tree leaf were considered.
Among the analyzed models, the lowest MSE (Figure 12) was achieved by the model with a minimum of 2 samples per tree leaf, marked in darker blue. The learning curve saturates after 269 trees in the ensemble.
To determine the significance of the $j$-th variable, after training the model, the values of the $j$-th variable are permuted within the training data and the out-of-bag error is recalculated for the permuted data. The significance of the variable (Figure 13) is determined by calculating the mean difference of the errors before and after permutation over all trees within the ensemble; this value is then divided by the standard deviation of these differences. A variable with a higher value is ranked as more significant than variables with smaller values [36].
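scikit-learn provides an analogous permutation-importance computation, averaged over repeated permutations on held-out data rather than over the trees of the ensemble; the sketch below is illustrative.

```python
from sklearn.inspection import permutation_importance

def variable_significance(model, X_test, y_test, n_repeats=10):
    """Mean error increase after permutation, divided by its std, per variable."""
    result = permutation_importance(model, X_test, y_test,
                                    n_repeats=n_repeats, random_state=0)
    return result.importances_mean / result.importances_std
```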
During the implementation of the RF method, different values of the adaptive parameters of the model were analyzed, as follows:
  • Number of generated trees B. Within this analysis, the maximum number of generated trees was limited to 500.
  • The number of variables used for splitting in the tree. In the paper Random Forests by L. Breiman [38], it is recommended that the subset of m variables considered for splitting be approximately p/3 of the total number of predictors. In this paper, values of m from 2 to 9 (Figure 14) are examined.
  • The minimum number of data or samples assigned to a leaf (min leaf size) within a tree. Values from 2 to 10 samples per tree leaf were considered.
Using a narrowed set of splitting variables did not yield improved results in this case. Figure 14 shows the accuracy criteria of ensembles of 500 base models, from which it can be seen that increasing the number of variables considered for splitting increases the accuracy of the model in terms of all defined criteria. When 2 variables considered for potential splitting are randomly selected from the set of 10 input variables, the least accurate model is obtained (RMSE = 11.16, MAE = 8.4493, MAPE/100 = 0.2732, R = 0.5769). On the other hand, randomly selecting a larger number of variables for potential splitting increases the accuracy of the model, which is greatest when 9 variables are randomly selected from the set of 10 input variables (RMSE = 7.7321, MAE = 5.7174, MAPE/100 = 0.1785, R = 0.8425). The analysis that considers all 10 input variables when growing the ensemble trees was already performed with the TreeBagger model.
The analysis showed greater accuracy for tree models in which the number of samples per leaf equals 2. The relevance of the individual variables for the RF model is shown in Figure 13.
With Boosting Trees method, the following model parameters were considered:
  • Number of generated trees B. With the Gradient Boosting method, there is a possibility of overtraining the model when forming too many trees. Due to the large number of analyzed models in the research, the number of base models within the ensemble was limited to a maximum of 100.
  • Learning rate λ. This parameter determines the training speed of the model. The following values were investigated: 0.001, 0.01, 0.1, 0.25, 0.5, 0.75 and 1.0.
  • Number of splits in the tree d. Tree models with a maximum number of splits of $2^0 = 1, 2^1, 2^2, 2^3, 2^4, 2^5, 2^6, 2^7 = 128$ were generated.
The optimal model obtained (marked in yellow in Figure 15) had 100 generated trees, a learning rate of 0.10 and a maximum of 64 splits. As in the other tree-based models, the relevance of the individual model variables was determined and is shown in Figure 13. A comparison of all tree-based models (Bagging, RF and Boosted Trees) is given in Table 5.
In order to obtain a good regression model using the support vector method, it is necessary to select the appropriate kernel function. For the selected kernel functions, it is necessary to determine their parameters, as well as the value of the penalty parameter C (Figure 16).
In this paper, therefore, several kernel functions were investigated in order to find the best one: SVR models with linear, RBF and sigmoid kernels were analyzed. Before training and testing, all input data were normalized to the range (0, 1). The optimal model was determined using the grid search algorithm for each kernel (C = 4.22195, ε = 0.105765 for the linear kernel; C = 7.67645, ε = 0.0230915, γ = 1.89915 for the RBF kernel; C = 1113.70875, ε = 0.0920525, γ = 0.000945345 for the sigmoid kernel).
Comparative analysis of the different SVR models shows that their accuracy depends on the kernel function and the adopted criteria. The models with linear and sigmoid kernels have similar accuracy according to the different criteria, while the model with the RBF kernel function (Table 6) has significantly higher accuracy with respect to all criterion functions.
During the development of the Gaussian process models, covariance functions with a single length scale parameter for all input variables (exponential, squared exponential, Matern 3/2, Matern 5/2 and rational quadratic) were considered, as well as their equivalent ARD covariance functions with a separate length scale for each input variable. Standardization was performed using the Z-score, i.e., the data were transformed to have zero mean and unit variance. Models with constant basis functions were analyzed.
The parameter values were determined (Table 7 and Table 8) by maximizing the log marginal likelihood. By using ARD covariance functions, the relevance of the individual variables (predictors) in the model can be assessed (automatic relevance determination, ARD); higher length scale values for ARD covariance functions indicate lower relevance of the corresponding variable.
According to the three defined criteria, the RMSE, MAE and R, the model with ARD Matern 3/2 covariance function (Table 9) can be considered optimal, while according to the MAPE criterion it is second in accuracy with a difference of 0.0011 compared to the first ranked model according to that criterion.
The relevance of the variables is analyzed based on the covariance function parameters of the model with the ARD Matern 3/2 function, as the most accurate model. The values of the length scale parameters are shown on a logarithmic scale (base 10) in Figure 17.
It can be seen (Figure 17) that in the optimal model with ARD Matern 3/2 function the input variables 2 (cement) and 10 (fly ash) have the greatest relevance and the greatest impact on the model’s accuracy. Variables 1 (water), 3 (fine natural aggregate), 4 (coarse natural aggregate), 5 (fine rubber), 6 (coarse rubber), 7 (superplasticizer) and 9 (silica fume) have similar relevance. Input variable 8 (slag) has the least relevance and the most negligible impact on the model’s accuracy.
In further analysis, models that used only the most relevant variables were considered, due to the possibility that the presence of irrelevant variables may reduce the accuracy of the model. By using a narrowed set of variables, it is in certain cases possible to obtain a model (Figure 18 and Figure 19) of the same or higher accuracy. Additionally, the complexity of the model is thereby reduced and the model training process is accelerated.
In the further analysis, a comparison was made (Table 10) of the two models, as follows:
  • Model where variable 8 is excluded as less relevant (slag),
  • A model that includes all variables.
The application of individual neural network models gave unsatisfactory accuracy in terms of all criteria. For this reason, the use of neural network ensembles was considered. A limit was defined for the maximum number of hidden layer neurons, while the number of inputs and outputs defines the number of input and output layer neurons. The analysis showed that ensembles give models of significantly higher accuracy than the optimal individual neural network with the 10-8-1 architecture. The ensemble correlation coefficient increased to 0.9610, which is a satisfactory value, and the RMSE, MAE and MAPE values are approximately halved relative to the individual optimal neural network model. With an ensemble of 40 base models, the defined criteria converge, and a further increase in model complexity does not increase accuracy.
A comparative analysis of all tree-based models shows that the Boosted Trees model has the best accuracy, while models based on the Bagging and Random Forests methods showed similar, lower values in terms of all criteria. All regression tree-based models can determine the relevance of the individual input variables, and all of them identified slag as the least relevant input variable. If the database were significantly expanded, the computational complexity of the methods would become relevant: Bagging and Random Forests models can be trained in parallel, which is not the case with Boosted Trees, where models within the ensemble are trained sequentially. In addition, a significant number of base models is needed for the learning curve to saturate.
Comparative analysis of the different SVR models shows that their accuracy depends on the kernel function and the adopted criteria. The RBF kernel function gave satisfactory results in this case, while the linear and sigmoid kernels gave significantly worse results: models with linear and sigmoid kernel functions have values that are almost twice as poor in terms of the three criteria RMSE, MAE and MAPE, and their correlation coefficient R is significantly lower than that of the RBF model. Training models with the RBF function is relatively simple, but these models do not provide direct insight into the relevance of the individual variables.
GPR models that do not use separate length scale parameters for the individual input variables in most cases have worse criterion values for the considered problem than models that do (ARD models). The best model is the one with the ARD Matern 3/2 covariance function. The values of its length scale parameters can be used to assess the relevance of the predictors in the model. This model singles out fly ash and cement as the most significant variables. Slightly less but similarly relevant are the variables representing coarse natural aggregate, silica fume and fine rubber. The variables coarse rubber and superplasticizer form the next group, with similar but lower relevance. The least relevant are the variables fine natural aggregate and slag.
In further analysis, GPR models that used only the most relevant variables were considered, due to the possibility that the presence of irrelevant variables may reduce the accuracy of the model. It was shown that eliminating the least significant variable, in this case slag, yields a model of higher accuracy. Additionally, the complexity of the model is thereby reduced and the model training process is accelerated.

6. Conclusions

This paper gives a comprehensive overview of machine learning methods that can be used for estimating SCRC compressive strength, including MLP-ANN, ensembles of MLP-ANNs, regression tree ensembles (random forests, boosted and bagged regression trees), SVR and GPR, with different covariance functions.
As a basis for the development of the forecast model, a database containing a total of 166 samples of SCRC was obtained from various experimental studies.
Ensembles of neural networks and GPR models with ARD covariance function stood out as the models of the highest accuracy. Other analyzed models are more complex, and optimization of their parameters requires significant search efforts, but they are still less accurate.
The following values of accuracy criteria were obtained for the ensemble of neural networks: RMSE = 3.6888, MAE = 2.8099, MAPE = 0.0854 and R = 0.9610, while the GPR model with Matern 3/2 covariance function had values of RMSE = 4.3934, MAE = 3.0583, MAPE = 0.0942 and R = 0.9482.
The neural network ensemble has the greatest complexity: satisfactory accuracy is achieved only by forming an ensemble with a significant number of base models. Neural network ensembles also do not allow direct insight into the relevance of individual input variables.
The use of the GPR method gives models of satisfactory accuracy and, at the same time, significantly less complexity. The proposed model with the ARD Matern 3/2 covariance function enables the ranking of the influence of individual variables on the accuracy of the model. The complexity of this model is reduced, and the model training process is accelerated.
Based on the proposed neural network ensemble and GPR models, a model application was developed in MATLAB and deposited on GitHub.

Author Contributions

Conceptualization, M.K., S.L., E.K.N. and M.H.-N.; methodology, M.K., S.L., E.K.N. and M.H.-N.; software, M.K. and E.K.N.; validation, M.K., S.L., E.K.N. and M.H.-N.; formal analysis, M.K. and E.K.N.; investigation, S.L. and M.H.-N.; resources, S.L. and M.H.-N.; data curation, M.K., S.L. and E.K.N.; writing—original draft preparation, M.K., S.L., E.K.N. and M.H.-N.; writing—review and editing, M.K., S.L., E.K.N. and M.H.-N.; funding acquisition, S.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The research described in this paper has been fully supported by Croatian Ministry of Science and Education and Serbian Ministry of Education, Science and Technological Development under scientific research project entitled “Microstructural and mechanical characteristics of concrete with recycled materials” 2019–2021.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yildirim, S.T.; Duygun, N.P. Mechanical and physical performance of concrete including waste electrical cable rubber. IOP Conf. Ser. Mater. Sci. Eng. 2017, 245, 022054. [Google Scholar] [CrossRef]
  2. Hadzima-Nyarko, M.; Nyarko, E.K.; Djikanovic, D.; Brankovic, G. Microstructural and mechanical characteristics of self-compacting concrete with waste rubber. Struct. Eng. Mech. 2021, 78, 175–186. [Google Scholar] [CrossRef]
  3. Alaloul, W.S.; Musarat, M.A.; Haruna, S.; Law, K.; Tayeh, B.A.; Rafiq, W.; Ayub, S. Mechanical properties of silica fume modified high-volume fly ash rubberized self-compacting concrete. Sustainability 2021, 13, 5571. [Google Scholar] [CrossRef]
  4. Eurocode 2: Design of Concrete Structures—Part 1-1: General Rules and Rules for Buildings; EN 1992-1-1:2004 1 AC:2010; British Standards Institution: London, UK, 1992.
  5. ACI Committee 209. Creep Shrinkage Temperature in Concrete Structures; American Concrete Institute: Detroit, MI, USA, 2008; pp. 258–269. [Google Scholar]
  6. Harirchian, E.; Kumari, V.; Jadhav, K.; Raj Das, R.; Rasulzade, S.; Lahmer, T. A machine learning framework for assessing seismic hazard safety of reinforced concrete buildings. Appl. Sci. 2020, 10, 7153. [Google Scholar] [CrossRef]
  7. Martínez-Álvarez, F.; Schmutz, A.; Asencio-Cortés, G.; Jacques, J. A novel hybrid algorithm to forecast functional time series based on pattern sequence similarity with application to electricity demand. Energies 2019, 12, 94. [Google Scholar] [CrossRef] [Green Version]
  8. Ahmad, M.; Hu, J.-L.; Hadzima-Nyarko, M.; Ahmad, F.; Tang, X.-W.; Rahman, Z.U.; Nawaz, A.; Abrar, M. Rockburst hazard prediction in underground projects using two intelligent classification techniques: A comparative study. Symmetry 2021, 13, 632. [Google Scholar] [CrossRef]
  9. Zhu, S.; Lu, H.; Ptak, M.; Dai, J.; Ji, Q. Lake water-level fluctuation forecasting using machine learning models: A systematic review. Environ. Sci. Pollut. Res. 2020, 27, 44807–44819. [Google Scholar] [CrossRef] [PubMed]
  10. Naderpour, H.; Mirrashid, M. Proposed soft computing models for moment capacity prediction of reinforced concrete columns. Soft Comput. 2020, 24, 11715–11729. [Google Scholar] [CrossRef]
  11. Lin, C.-J.; Wu, N.-J. An ANN model for predicting the compressive strength of concrete. Appl. Sci. 2021, 11, 3798. [Google Scholar] [CrossRef]
  12. Ahmad, M.; Hu, J.-L.; Ahmad, F.; Tang, X.-W.; Amjad, M.; Iqbal, M.J.; Asim, M.; Farooq, A. Supervised learning methods for modeling concrete compressive strength prediction at high temperature. Materials 2021, 14, 1983. [Google Scholar] [CrossRef]
  13. Aalimahmoody, N.; Bedon, C.; Hasanzadeh-Inanlou, N.; Hasanzade-Inallu, A.; Nikoo, M. BAT algorithm-based ANN to predict the compressive strength of concrete—A comparative study. Infrastructures 2021, 6, 80. [Google Scholar] [CrossRef]
  14. Sadowski, L.; Piechowka-Mielnik, M.; Widziszowski, T.; Gardynik, A.; Mackiewicz, S. Hybrid ultrasonic-neural prediction of the compressive strength of environmentally friendly concrete screeds with high volume of waste quartz mineral dust. J. Clean. Prod. 2019, 212, 727–740. [Google Scholar] [CrossRef]
  15. Nikoo, M.; Moghadam, F.T.; Sadowski, L. Prediction of concrete compressive strength by evolutionary artificial neural networks. Adv. Mater. Sci. Eng. 2015, 2015, 849126. [Google Scholar] [CrossRef]
  16. Hadzima-Nyarko, M.; Nyarko, E.K.; Ademović, N.; Miličević, I.; Kalman Šipoš, T. Modelling the influence of waste rubber on compressive strength of concrete by artificial neural networks. Materials 2019, 12, 561. [Google Scholar] [CrossRef] [Green Version]
  17. Diaconescu, R.-M.; Barbuta, M.; Harja, M. Prediction of properties of polymer concrete composite with tire rubber using neural networks. Mater. Sci. Eng. B 2013, 178, 1259–1267. [Google Scholar] [CrossRef]
  18. Abdollahzadeh, A.; Masoudnia, R.; Aghababaei, S. Predict strength of rubberized concrete using artificial neural network. WSEAS Trans. Comput. 2011, 2, 31–40. [Google Scholar]
  19. Topcu, I.B.; Sarıdemir, M. Prediction of rubberized concrete properties using artificial neural network and fuzzy logic. Constr. Build. Mater. 2008, 22, 532–540. [Google Scholar] [CrossRef] [Green Version]
  20. Gesoglu, M.; Guneyisi, E.; Ozturan, T.; Ozbay, E. Modeling the mechanical properties of rubberized concretes by neural network and genetic programming. Mater. Struct. 2010, 43, 31–45. [Google Scholar] [CrossRef]
  21. El-Khoja, A.M.N.; Ashour, A.F.; Abdalhmid, J.; Dai, X.; Khan, A. Prediction of rubberised concrete strength by using artificial neural networks (version 10009743). Int. J. Struct. Constr. Eng. 2018, 12. [Google Scholar] [CrossRef]
  22. Hadzima-Nyarko, M.; Nyarko, E.K.; Lu, H.; Zhu, S. Machine learning approaches for estimation of compressive strength of concrete. Eur. Phys. J. Plus 2020, 135, 682. [Google Scholar] [CrossRef]
  23. Gregori, A.; Castoro, C.; Venkiteela, G. Predicting the compressive strength of rubberized concrete using artificial intelligence methods. Sustainability 2021, 13, 7729. [Google Scholar] [CrossRef]
  24. Dat, L.T.M.; Dmitrieva, T.L.; Duong, V.N.; Canh, D.T.N. An artificial intelligence approach for predicting compressive strength of eco-friendly concrete containing waste tire rubber. IOP Conf. Ser. Earth Environ. Sci. 2020, 612, 012029. [Google Scholar] [CrossRef]
  25. Huang, X.; Zhang, J.; Sresakoolchai, J.; Kaewunruen, S. Machine learning aided design and prediction of environmentally friendly rubberised concrete. Sustainability 2021, 13, 1691. [Google Scholar] [CrossRef]
  26. Sun, Y.; Li, G.; Zhang, J.; Qian, D. Prediction of the strength of rubberized concrete by an evolved random forest model. Adv. Civ. Eng. 2019, 2019, 5198583. [Google Scholar] [CrossRef] [Green Version]
  27. Bachir, R.; Mohammed, A.M.S.; Habibm, T. Using artificial neural networks approach to estimate compressive strength for rubberized concrete. Period. Polytech. Civ. Eng. 2018, 62, 858–865. [Google Scholar] [CrossRef] [Green Version]
  28. Cheng, M.-Y.; Hoang, N.-D. A self-adaptive fuzzy inference model based on least squares svm for estimating compressive strength of rubberized concrete. Int. J. Inf. Technol. Decis. Mak. 2016, 15, 603–619. [Google Scholar] [CrossRef]
  29. Zhang, J.; Ma, G.; Huang, Y.; Sun, J.; Aslani, F.; Nener, B. Modelling uniaxial compressive strength of lightweight self-compacting concrete using random forest regression. Constr. Build. Mater. 2019, 210, 713–719. [Google Scholar] [CrossRef]
  30. Ahmadi-Nedushan, B. An optimized instance based learning algorithm for estimation of compressive strength of concrete. Eng. Appl. Artif. Intell. 2012, 25, 1073–1081. [Google Scholar] [CrossRef]
  31. Chopra, R.; Sharma, K.; Kumar, M.; Chopra, T. Comparison of machine learning techniques for the prediction of compressive strength of concrete. Adv. Civ. Eng. 2018, 2018, 5481705. [Google Scholar] [CrossRef] [Green Version]
  32. Beale, M.H.; Hagan, M.T.; Demuth, H.B. Neural Network Toolbox; The Mathworks, Inc.: Natick, MA, USA, 2010. [Google Scholar]
  33. Kovačević, M.; Ivanišević, N.; Dašić, T.; Marković, L. Application of artificial neural networks for hydrological modelling in Karst. Gradjevinar 2018, 70, 1–10. [Google Scholar] [CrossRef] [Green Version]
  34. Kovačević, M.; Ivanišević, N.; Petronijević, P.; Despotović, V. Construction cost estimation of reinforced and prestressed concrete bridges using machine learning. Građevinar 2021, 73, 727. [Google Scholar] [CrossRef]
  35. Hastie, T.; Tibsirani, R.; Friedman, J. The Elements of Statistical Learning; Springer: Berlin/Heidelberg, Germany, 2009. [Google Scholar]
  36. Breiman, L.; Friedman, H.; Olsen, R.; Stone, C.J. Classification and Regression Trees; Chapman and Hall/CRC: Wadsworth, OH, USA, 1984. [Google Scholar]
  37. Breiman, L. Bagging predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef] [Green Version]
  38. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  39. Freund, Y.; Schapire, R.E. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 1997, 55, 119–139. [Google Scholar] [CrossRef] [Green Version]
  40. Elith, J.; Leathwick, J.R.; Hastie, T. A working guide to boosted regression trees. J. Anim. Ecol. 2008, 77, 802–813. [Google Scholar] [CrossRef]
  41. Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
  42. Friedman, J.H.; Meulman, J.J. Multiple additive regression trees with application in epidemiology. Stat. Med. 2003, 22, 1365–1381. [Google Scholar] [CrossRef]
  43. Vapnik, V. The Nature of Statistical Learning Theory; Springer: New York, NY, USA, 1995. [Google Scholar]
  44. Kecman, V. Learning and Soft Computing: Support. In Vector Machines, Neural Networks, and Fuzzy Logic Models; MIT Press: Cambridge, MA, USA, 2001. [Google Scholar]
  45. Smola, A.J.; Sholkopf, B. A tutorial on support vector regression. Stat. Comput. 2004, 14, 199–222. [Google Scholar] [CrossRef] [Green Version]
  46. Chang, C.C.; Lin, C.J. LIBSVM: A library for support vector machines. ACM Trans. Intell. Syst. Technol. 2011, 2. [Google Scholar] [CrossRef]
  47. LIBSVM—A Library for Support Vector Machines. Available online: https://www.csie.ntu.edu.tw/~cjlin/libsvm/ (accessed on 21 February 2021.).
  48. Rasmussen, C.E.; Williams, C.K. Gaussian Processes for Machine Learning; The MIT Press: Cambridge, MA, USA, 2006. [Google Scholar]
  49. Matić, P. Kratkoročno Predviđanje Hidrološkog Dotoka Pomoću Umjetne Neuronske Mreže. Ph.D. Thesis, University of Split, Split, Croatia, 2014. [Google Scholar]
  50. Emiroğlu, M.; Yildiz, S.; Keleştemur, O.; Keleştemur, M.H. Bond performance of rubber particles in the self-compacting concrete. In Proceedings of the 4th International Symposium Bond in Concrete 2012—Bond, Anchorage, Detailing, Brescia, Italy, 17–20 June 2012; pp. 779–785. [Google Scholar]
  51. Yung, W.H.; Yung, L.C.; Hua, L.H. A study of the durability properties of waste tire rubber applied to self-compacting concrete. Constr. Build. Mater. 2013, 41, 665–672. [Google Scholar] [CrossRef]
  52. Li, N.; Long, G.; Zhang, S. Properties of self-compacting concrete incorporating rubber and expanded clay aggregates. Key Eng. Mater. 2014, 629, 417–424.
  53. Khalil, E.; Abd-Elmohsen, M.; Anwar, A.M. Impact resistance of rubberized self-compacting concrete. Water Sci. 2015, 29, 45–53.
  54. Yu, J. Research on the mechanical properties of self-compacting waste rubberised aggregate concrete. In Proceedings of the 2016 International Conference on Civil, Transportation and Environment (ICCTE 2016), Guangzhou, China, 30–31 January 2016.
  55. Zaoiai, S.; Makani, A.; Tafraoui, A.; Benmerioul, F. Optimization and mechanical characterization of self-compacting concrete incorporating rubber aggregates. Asian J. Civ. Eng. (BHRC) 2016, 17, 817–829.
  56. Ismail, M.K.; Hassan, A.A.A. Impact resistance and mechanical properties of self-consolidating rubberized concrete reinforced with steel fibers. J. Mater. Civ. Eng. 2017, 29, 04016193.
  57. Turatsinze, A.; Garros, M. On the modulus of elasticity and strain capacity of self-compacting concrete incorporating rubber aggregates. Resour. Conserv. Recycl. 2008, 52, 1209–1215.
  58. Guneyisi, E. Fresh properties of self-compacting rubberized concrete incorporated with fly ash. Mater. Struct. 2010, 43, 1037–1048.
  59. Long, G.; Ma, K.; Li, Z.; Xie, Y. Self-compacting concrete reinforced by waste tyre rubber particle and emulsified asphalt. In Proceedings of the Second International Conference on Sustainable Construction Materials: Design, Performance, and Application, Wuhan, China, 18–22 October 2012.
  60. Ganesan, N.; Bharati Raj, J.; Shashikala, A.P. Flexural fatigue behavior of self-compacting rubberized concrete. Constr. Build. Mater. 2013, 44, 7–14.
  61. Ismail, M.K.; De Grazia, M.T.; Hassan, A.A.A. Mechanical properties of self-consolidating rubberized concrete with different supplementary cementing materials. In Proceedings of the International Conference on Transportation and Civil Engineering (ICTCE'15), London, UK, 21–22 March 2015.
  62. Mishra, M.; Panda, K.C. An experimental study on fresh and hardened properties of self-compacting rubberized concrete. Indian J. Sci. Technol. 2015, 8, 1–10.
  63. Guneyisi, E.; Gesoglu, M.; Naji, N.; Ipek, S. Evaluation of the rheological behaviour of fresh self-compacting rubberized concrete by using the Herschel–Bulkley and modified Bingham models. Arch. Civ. Mech. Eng. 2016, 16, 9–19.
  64. Padhi, S.; Panda, K.C. Fresh and hardened properties of rubberized concrete using fine rubber and silpozz. Adv. Concr. Constr. 2016, 4, 49–69.
  65. Bideci, A.; Öztürk, H.; Bideci, O.S.; Emiroglu, M. Fracture energy and mechanical characteristics of self-compacting concretes including waste bladder tyre. Constr. Build. Mater. 2017, 149, 669–678.
  66. AbdelAleem, B.H.; Hassan, A.A.A. Development of self-consolidating rubberized concrete incorporating silica fume. Constr. Build. Mater. 2018, 161, 389–397.
  67. Aslani, F.; Ma, G.; Wan, D.L.Y.; Le, V.X.T. Experimental investigation into rubber granules and their effects on the fresh and hardened properties of self-compacting concrete. J. Clean. Prod. 2018, 172, 1835–1847.
  68. Hamza, B.; Belkacem, M.; Said, K.; Walid, Y. Performance of self-compacting rubberized concrete. MATEC Web Conf. 2018, 149, 01070.
  69. Yang, G.; Chen, X.; Guo, S.; Xuan, W. Dynamic mechanical performance of self-compacting concrete containing crumb rubber under high strain rates. KSCE J. Civ. Eng. 2019, 23, 3669–3681.
  70. Bušić, R.; Benšić, M.; Miličević, I.; Strukar, K. Prediction models for the mechanical properties of self-compacting concrete with recycled rubber and silica fume. Materials 2020, 13, 1821.
  71. Hagan, M.T.; Demuth, H.B.; Beale, M.H.; De Jesús, O. Neural Network Design; Oklahoma State University: Stillwater, OK, USA, 2014.
Figure 1. (a) Multilayer perceptron artificial neural network; (b) an ensemble of neural networks formed by the bootstrap aggregating (bagging) approach [33,34].
Figure 2. Segmentation of the input space into regions and the corresponding 3D regression surface of a regression tree.
Figure 3. Bootstrap aggregation (bagging) in regression tree ensembles [34].
Figure 4. Gradient boosting in regression tree ensembles [34].
Figure 5. Nonlinear SVR with ε-insensitivity zone.
Figure 6. Systematic approach to forming prediction models.
Figure 7. Comparison of performance measures using MLP-ANNs with different configurations: (a) RMSE and MAE; (b) R and MAPE.
Figure 8. (a) RMSE value for each iteration and the corresponding architecture; (b) the optimal number of neurons in the hidden layer in each iteration.
Figure 9. Comparison of performance measures using ensembles of MLP-ANNs with different numbers of base models: (a) RMSE and MAE; (b) R and MAPE.
Figure 10. Predictions of individual neural networks (yellow), the ensemble model prediction (dark blue) and target values (red) of compressive strength.
Figure 11. Regression plot of modelled and target values for the optimal ensemble model.
Figure 12. MSE vs. number of trees in the ensemble for different minimum leaf sizes, using regression tree ensembles realized with bootstrap aggregation (bagging).
Figure 13. Significance of individual variables in the ensemble model when applying the bagging method.
Figure 14. Influence of the number of variables considered at each split on model accuracy in the RF method.
Figure 15. Dependence of MSE on the learning rate and the number of base models in the Boosted Trees method.
Figure 16. RMSE vs. hyperparameters C and γ for ε = 2⁻⁶ using SVR with an RBF kernel in a coarse grid search.
Figure 17. Variable selection using the ARD Matern 3/2 covariance function.
Figure 18. Modelled and target values for the reduced GPR ARD Matern 3/2 model.
Figure 19. Regression plot of modelled and target values of compressive strength for the reduced GPR ARD Matern 3/2 model.
Table 1. Algorithms used in modelling the compressive strength of rubberized concrete and SCRC.

| Type of Concrete | Algorithm | Data Points | Authors | Reference |
|---|---|---|---|---|
| Rubberized concrete | ANN, fuzzy logic (FL) | 36 | Topçu et al. | [19] |
| Rubberized concrete | ANN, gene-expression programming (GEP) | 70 | Gesoglu et al. | [20] |
| Rubberized concrete | ANN | 287 | El-Khoja et al. | [21] |
| Rubberized concrete | ANN, k-nearest neighbor (KNN), regression trees (RT) and random forests (RF) | 457 | Hadzima-Nyarko et al. | [22] |
| Rubberized concrete | GPR, SVM | 89 | Gregori et al. | [23] |
| Rubberized concrete | ANN | 129 | Dat et al. | [24] |
| Rubberized concrete | ANN | 353 | Huang et al. | [25] |
| Rubberized concrete | RF | 138 | Sun et al. | [26] |
| Rubberized concrete | ANN | 122 | Bachir et al. | [27] |
| Rubberized concrete | Self-adaptive fuzzy least squares support vector machines inference model (SFLSIM) | 70 | Cheng and Hoang | [28] |
| SCRC | Gaussian process regression (GPR) | 144 | Hadzima-Nyarko et al. | [2] |
| SCRC | Beetle antennae search (BAS)-algorithm-based random forest (RF) | 131 | Zhang et al. | [29] |
Table 2. Average, minimum and maximum values of input and output variables used for modelling.

| Variable | Average Value | Minimum Value | Maximum Value |
|---|---|---|---|
| Water (kg/m³) | 197.15 | 170.00 | 246.00 |
| Cement (kg/m³) | 402.39 | 180.00 | 550.00 |
| Fine nat. aggregate (kg/m³) | 764.32 | 375.20 | 1192.00 |
| Coarse nat. aggregate (kg/m³) | 744.45 | 364.00 | 898.00 |
| Fine rubber (kg/m³) | 41.33 | 0 | 198.73 |
| Coarse rubber (kg/m³) | 18.20 | 0 | 355.80 |
| Superplasticizer (kg/m³) | 4.71 | 1.06 | 22.00 |
| Slag (kg/m³) | 23.08 | 0 | 175.00 |
| Silica fume (kg/m³) | 9.97 | 0 | 60.00 |
| Fly ash (kg/m³) | 74.26 | 0 | 330.00 |
| SCRC compr. strength (MPa) | 74.26 | 0 | 330.00 |
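For readers assembling a comparable database, summary statistics of this kind can be reproduced directly from the raw data. A minimal sketch follows; the file name and the assumption that each variable is one column are illustrative, not artifacts of the original study:

```python
# Minimal sketch: Table 2-style summary statistics from a raw database.
# "scrc_database.csv" and its column layout are hypothetical placeholders.
import pandas as pd

df = pd.read_csv("scrc_database.csv")        # 166 samples, one column per variable
summary = df.agg(["mean", "min", "max"]).T   # average, minimum, maximum per variable
summary.columns = ["Average", "Minimum", "Maximum"]
print(summary.round(2))
```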
Table 3. Statistical analysis of the properties of the database of SCRC specimens.

| Year | Author(s) | Ref. | Type of Aggregate Replaced by Rubber | No. of Specimens | SCM ¹ | No. of SCM |
|---|---|---|---|---|---|---|
| 2008 | Turatsinze and Garros | [57] | Coarse | 5 | – | – |
| 2010 | Guneyisi | [58] | Coarse | 16 | FA ² | 12 |
| 2012 | Emiroglu et al. | [50] | Coarse | 3 | S ³ | 3 |
| 2012 | Long et al. | [59] | Fine | 6 | SF ⁴ + FA | 6 |
| 2013 | Ganesan et al. | [60] | Fine | 3 | FA | 3 |
| 2013 | Yung et al. | [51] | Fine | 5 | S + FA | 5 |
| 2014 | Li et al. | [52] | Fine | 7 | SF + FA | 7 |
| 2015 | Ismail et al. | [61] | Fine | 5 | – | – |
| 2015 | Khalil et al. | [53] | Fine | 5 | – | – |
| 2015 | Mishra and Panda | [62] | Coarse | 5 | – | – |
| 2016 | Guneyisi et al. | [63] | Fine and Coarse | 21 | FA | 21 |
| 2017 | Ismail and Hassan | [56] | Fine | 16 | FA; S | 3; 3 |
| 2016 | Padhi and Panda | [64] | Fine | 4 | – | – |
| 2016 | Yu | [54] | Fine | 6 | FA | 6 |
| 2016 | Zaoiai et al. | [55] | Fine and Coarse | 5 | – | – |
| 2017 | Bideci et al. | [65] | Coarse | 4 | S | 4 |
| 2018 | AbdelAleem and Hassan | [66] | Fine | 12 | S; FA; SF | 1; 1; 10 |
| 2018 | Aslani et al. | [67] | Fine and Coarse | 13 | S + FA + SF | 13 |
| 2018 | Hamza et al. | [68] | Fine | 4 | – | – |
| 2019 | Yang et al. | [69] | Fine | 4 | SF + FA | 4 |
| 2020 | Bušić et al. | [70] | Fine | 17 | SF | 10 |
| – | Total | – | – | 166 | – | – |

¹ Supplementary cementing material; ² FA—Fly Ash; ³ S—Slag; ⁴ SF—Silica Fume.
Table 4. Comparison of the optimal individual neural network and the ensemble model.

| Model | RMSE | MAE | MAPE/100 | R |
|---|---|---|---|---|
| NN-10-8-1 * | 7.4424 | 5.5434 | 0.1768 | 0.8481 |
| Ensemble | 3.6888 | 2.8099 | 0.0854 | 0.9610 |

* NN-10-8-1 is the artificial neural network model that is optimal according to the four defined criteria RMSE, MAE, MAPE and R (Figure 7); it has 10 neurons in the input layer, 8 neurons in the hidden layer and 1 neuron in the output layer.
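The ensemble in Table 4 averages the outputs of independently trained networks fitted to bootstrap replicates of the training set. Below is a hedged sketch of this scheme; scikit-learn's MLPRegressor stands in for the original implementation, the single 8-neuron hidden layer mirrors NN-10-8-1 only by assumption, and the number of base models is a placeholder:

```python
# Sketch of a bagged MLP-ANN ensemble: each base network is trained on a
# bootstrap replicate; the ensemble prediction is the mean of the outputs.
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.utils import resample

def train_bagged_mlps(X, y, n_models=20, seed=0):
    rng = np.random.RandomState(seed)
    models = []
    for _ in range(n_models):
        Xb, yb = resample(X, y, random_state=rng)   # bootstrap replicate
        mlp = MLPRegressor(hidden_layer_sizes=(8,), max_iter=2000,
                           random_state=rng.randint(1 << 30))
        models.append(mlp.fit(Xb, yb))
    return models

def ensemble_predict(models, X):
    # Average the individual network predictions
    return np.mean([m.predict(X) for m in models], axis=0)
```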
Table 5. Comparative analysis of results for the Bagging, RF and Boosted Trees methods.

| Method | RMSE | MAE | MAPE/100 | R |
|---|---|---|---|---|
| TreeBagger | 8.1890 | 6.0546 | 0.1881 | 0.8214 |
| RF | 7.7321 | 5.7174 | 0.1785 | 0.8425 |
| Boosted Trees | 7.4821 | 5.4248 | 0.1573 | 0.8432 |
Table 6. Comparative analysis of results using linear, RBF and sigmoid kernels in the SVR method.

| Model | RMSE | MAE | MAPE/100 | R |
|---|---|---|---|---|
| Lin. kernel | 8.7154 | 6.6468 | 0.2105 | 0.7751 |
| RBF kernel | 4.9646 | 3.5352 | 0.1171 | 0.9332 |
| Sig. kernel | 8.7104 | 6.6094 | 0.2073 | 0.7718 |
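The RBF-kernel result follows from a grid search over C and γ at fixed ε = 2⁻⁶ (cf. Figure 16). A minimal sketch of such a search, where the grid bounds and data are illustrative assumptions:

```python
# Sketch of a coarse grid search over C and gamma for SVR with an RBF kernel.
from sklearn.datasets import make_regression
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVR

X, y = make_regression(n_samples=166, n_features=10, noise=10.0, random_state=0)  # stand-in data

param_grid = {
    "C":     [2.0**k for k in range(-5, 16, 2)],   # assumed grid bounds
    "gamma": [2.0**k for k in range(-15, 4, 2)],
}
search = GridSearchCV(SVR(kernel="rbf", epsilon=2.0**-6), param_grid,
                      scoring="neg_root_mean_squared_error", cv=10)
search.fit(X, y)
print(search.best_params_, "RMSE:", -search.best_score_)
```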
Table 7. Parameters of GPR model covariance functions.

| GP Model Covariance Function | Covariance Function | Parameters |
|---|---|---|
| Exponential | $k(x_i, x_j \mid \Theta) = \sigma_f^2 \exp\left(-\frac{1}{2}\frac{r}{\sigma_l^2}\right)$ | $\sigma_l = 51.7642$, $\sigma_f = 44.6486$ |
| Squared Exponential | $k(x_i, x_j \mid \Theta) = \sigma_f^2 \exp\left(-\frac{1}{2}\frac{(x_i - x_j)^T (x_i - x_j)}{\sigma_l^2}\right)$ | $\sigma_l = 1.9621$, $\sigma_f = 21.4682$ |
| Matern 3/2 | $k(x_i, x_j \mid \Theta) = \sigma_f^2 \left(1 + \frac{\sqrt{3}\,r}{\sigma_l}\right) \exp\left(-\frac{\sqrt{3}\,r}{\sigma_l}\right)$ | $\sigma_l = 4.4183$, $\sigma_f = 27.5201$ |
| Matern 5/2 | $k(x_i, x_j \mid \Theta) = \sigma_f^2 \left(1 + \frac{\sqrt{5}\,r}{\sigma_l} + \frac{5r^2}{3\sigma_l^2}\right) \exp\left(-\frac{\sqrt{5}\,r}{\sigma_l}\right)$ | $\sigma_l = 2.8760$, $\sigma_f = 23.1271$ |
| Rational Quadratic | $k(x_i, x_j \mid \Theta) = \sigma_f^2 \left(1 + \frac{r^2}{2\alpha\sigma_l^2}\right)^{-\alpha}$ | $\sigma_l = 2.8568$, $\alpha = 0.3520$, $\sigma_f = 28.5379$ |

where $r = \sqrt{(x_i - x_j)^T (x_i - x_j)}$.
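To make the notation concrete, the sketch below evaluates the isotropic Matern 3/2 covariance from Table 7 between two input vectors, using the reported hyperparameters (a minimal numpy illustration, not the original toolchain):

```python
# Sketch: isotropic Matern 3/2 covariance with the Table 7 hyperparameters.
import numpy as np

def matern32(xi, xj, sigma_l=4.4183, sigma_f=27.5201):
    r = np.linalg.norm(xi - xj)                  # Euclidean distance r
    s = np.sqrt(3.0) * r / sigma_l
    return sigma_f**2 * (1.0 + s) * np.exp(-s)   # Matern 3/2 form

# Covariance decays smoothly with distance between inputs
print(matern32(np.zeros(10), 0.5 * np.ones(10)))
```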
Table 8. Parameters of GPR ARD model covariance functions.
Table 8. Parameters of GPR ARD model covariance functions.
Covariance Function Parameters
σ 1 σ 2 σ 3 σ 4 σ 5 σ 6 σ 7 σ 8 σ 9 σ 10
ARD Exponential:
k x i , x j | Θ = σ f 2 exp r ;   σ F =61.7884; r = m = 1 d x i m x j m 2   σ m 2
136.592034.7143149.671987.1631160.3637137.0799174.2748380.221284.573942.4424
ARD Squared exponential:
k x i , x j | Θ = σ f 2 e x p 1 2 m = 1 d x i m x j m 2   σ m 2 ; σ f =24.1382
2.76110.88424.39443.16842.37922.69521.88524125.13865.68920.7528
ARD Matern 3/2:
k x i , x j | Θ = σ f 2 1 + 3 r e x p 3 r ; σ f =32.6244
6.58692.10738.87345.44836.18356.97317.16785445.68556.09252.0265
ARD Matern 5/2:
k x i , x j | Θ = σ f 2 1 + 5 r + 5 r 2 3 e x p 5 r ; σ f =26.5499
3.84961.27386.95244.55003.59083.92902.65962276.24355.80611.3211
ARD Rational quadratic:
k x i , x j | Θ = σ f 2 1 + 1 2 α m = 1 d x i m x j m 2   σ m 2 α ;   α =0.3332;   σ f =61.7884
3.68791.31667.37254.66523.70473.95962.61384383.71595.46611.4843
where r = m = 1 d x i m x j m 2   σ m 2 .
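In the ARD kernels each input dimension has its own length scale σm, and a very large fitted σm (e.g., σ8 ≈ 5445 for ARD Matern 3/2) marks a near-irrelevant variable; this is what underlies the variable selection in Figure 17 and the reduced model in Table 10. A minimal numpy sketch with the Table 8 values:

```python
# Sketch: ARD Matern 3/2 covariance with the per-variable length scales
# from Table 8; the huge sigma_8 makes x8 effectively irrelevant.
import numpy as np

sigma_m = np.array([6.5869, 2.1073, 8.8734, 5.4483, 6.1835,
                    6.9731, 7.1678, 5445.6855, 6.0925, 2.0265])
sigma_f = 32.6244

def ard_matern32(xi, xj):
    r = np.sqrt(np.sum(((xi - xj) / sigma_m) ** 2))  # length-scale-weighted distance
    s = np.sqrt(3.0) * r
    return sigma_f**2 * (1.0 + s) * np.exp(-s)

relevance = 1.0 / sigma_m   # small length scale -> high relevance (cf. Figure 17)
print(relevance.round(4))
```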
Table 9. Comparative analysis of results of GPR with various covariance functions.

| GP Model Covariance Function | RMSE | MAE | MAPE/100 | R |
|---|---|---|---|---|
| Exponential | 5.0574 | 3.5064 | 0.1038 | 0.9316 |
| ARD Exponential | 4.6120 | 3.1634 | 0.0947 | 0.9427 |
| Squared Exponential | 5.0447 | 3.4686 | 0.1133 | 0.9300 |
| ARD Squared Exponential | 4.9670 | 3.4076 | 0.1101 | 0.9334 |
| Matern 3/2 | 4.7244 | 3.2487 | 0.1021 | 0.9386 |
| ARD Matern 3/2 | 4.4341 | 3.1022 | 0.0958 | 0.9474 |
| Matern 5/2 | 4.8275 | 3.3133 | 0.1061 | 0.9360 |
| ARD Matern 5/2 | 4.6527 | 3.2691 | 0.1037 | 0.9424 |
| Rational Quadratic | 4.6467 | 3.2022 | 0.0997 | 0.9407 |
| ARD Rational Quadratic | 4.5937 | 3.1894 | 0.1006 | 0.9435 |
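For completeness, the four accuracy criteria reported in Tables 4–9 can be computed as follows (a minimal sketch; y_true and y_pred denote target and modelled compressive strengths):

```python
# Sketch of the four accuracy criteria: RMSE, MAE, MAPE/100 and Pearson's R.
import numpy as np

def criteria(y_true, y_pred):
    e = y_true - y_pred
    rmse = np.sqrt(np.mean(e ** 2))
    mae  = np.mean(np.abs(e))
    mape = np.mean(np.abs(e / y_true))         # reported above as MAPE/100
    r    = np.corrcoef(y_true, y_pred)[0, 1]   # Pearson's linear correlation
    return rmse, mae, mape, r
```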
Table 10. Comparative analysis of GPR models with different sets of input variables.

| Model | x1 | x2 | x3 | x4 | x5 | x6 | x7 | x8 | x9 | x10 | RMSE | MAE | MAPE/100 | R |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 4.3934 | 3.0583 | 0.0942 | 0.9482 |
| 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 4.4341 | 3.1022 | 0.0958 | 0.9474 |
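Model 1 in Table 10 refits the GPR after dropping the eighth input variable flagged as irrelevant by ARD. A hedged sketch of that step using scikit-learn's Matern kernel as a stand-in for the original toolchain (nu = 1.5 corresponds to Matern 3/2; the data are illustrative):

```python
# Sketch: refit a GPR with an ARD Matern 3/2 kernel after removing x8
# (column index 7). scikit-learn replaces the original implementation.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import ConstantKernel, Matern

X, y = make_regression(n_samples=166, n_features=10, noise=10.0, random_state=0)  # stand-in data

X_red = np.delete(X, 7, axis=1)                       # drop the eighth variable
kernel = ConstantKernel() * Matern(nu=1.5, length_scale=np.ones(X_red.shape[1]))  # ARD: one scale per input
gpr = GaussianProcessRegressor(kernel=kernel, normalize_y=True).fit(X_red, y)
print(gpr.kernel_)                                    # optimized ARD length scales
```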
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kovačević, M.; Lozančić, S.; Nyarko, E.K.; Hadzima-Nyarko, M. Modeling of Compressive Strength of Self-Compacting Rubberized Concrete Using Machine Learning. Materials 2021, 14, 4346. https://doi.org/10.3390/ma14154346

AMA Style

Kovačević M, Lozančić S, Nyarko EK, Hadzima-Nyarko M. Modeling of Compressive Strength of Self-Compacting Rubberized Concrete Using Machine Learning. Materials. 2021; 14(15):4346. https://doi.org/10.3390/ma14154346

Chicago/Turabian Style

Kovačević, Miljan, Silva Lozančić, Emmanuel Karlo Nyarko, and Marijana Hadzima-Nyarko. 2021. "Modeling of Compressive Strength of Self-Compacting Rubberized Concrete Using Machine Learning" Materials 14, no. 15: 4346. https://doi.org/10.3390/ma14154346

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop