Artificial Neural Network Model For Strength Prediction of Ultra-High-Performance Concrete
Artificial Neural Network Model For Strength Prediction of Ultra-High-Performance Concrete
Ultra-high-performance concrete (UHPC) is the outcome of the specimens without nano-CaCO3. The results of the partial
mixture of several constituents, leading to a highly complex mate- substitution of silica fume with fine glass powder (FGP),
rial, which makes it more difficult to understand the effect of each with a mean particle size (d50) of 3.8 µm, in UHPC were
component and their interactions on compressive strength. This analyzed.17 They showed that compressive strength of 235
research goal is developing an artificial neural network (ANN)
and 220 MPa could be achieved when using steam curing
approach to predict the compressive strength of UHPC, being able
and replacing 30% and 50% of SF with FGP, respectively.
to incorporate supplementary cementitious materials (SCMs), and
even different situations in relation to the aggregate: from pastes In another study, recycled glass powder was used to replace
to incorporation of coarse aggregate. The one-hidden-layer ANN quartz sand, cement, and quartz powder particles.18,19 RHA
model was trained with 927 data by using the R-code language. was also analyzed as an SCM in UHPC with successful
The data was produced by collecting data from 210 experiments results in strength and durability due to the internal curing
combined with 717 dosages from previous research. The Olden process of the concrete.20 These investigations demonstrated
algorithm was used to analyze the relationships between the the possibility of partial replacement of cement and silica
UHPC’s components and strength. The results indicated that the fume in UHPC by using other SCMs, while maintaining its
ANN is an efficient model for predicting the compressive strength of outstanding mechanical and durability features.
UHPC, regardless of the SCM used or maximum size of aggregate Nevertheless, the key properties of UHPC with several
considered.
SCMs need to be checked experimentally because of the
Keywords: activation function; artificial neural network (ANN); compres- unclear combination effects of different materials and
sive strength; resilient backpropagation algorithm (Rprop); supplementary dosages used in the mixture. Laboratory tests are often
cementitious material (SCM); ultra-high-performance concrete (UHPC). costly, time-consuming, and labor-intensive.3 To reduce the
number of experimental works and mixture designs, proba-
INTRODUCTION bilistic models can be successfully employed to forecast the
Ultra-high-performance concrete (UHPC) is a new compressive strength of concrete. However, these approaches
high-tech type of concrete characterized by its ultra-high cannot be applied when the modeling problem involves too
compressive strength (over 150 MPa), low permeability, many independent variables and the interactions among
and improved durability.1-7 However, due to the absence of them are either unknown or too complex to perform.3,4,21 In
a coarse aggregate, together with its high packing density, the case of the UHPC, due to its large number of compo-
UHPC contents of cement, quartz sand, quartz powder, and nents, the use of traditional techniques of regression fails to
silica fume are relatively higher in relation to current concrete, achieve the expected precision and predictability.21
increasing the cost of UHPC and resulting in a significant Developments in artificial neural networks (ANNs) have
environmental impact.8-10 Hence, considering those disad- resulted in an increasingly wider use of those methods for
vantages that restrict its wider use, some industrial by-prod- modeling real-world problems because of its immense
ucts such as fly ash (FA), ground-granulated blast-furnace ability to capture inter-relationships among input and output
slag (GBFS), recycled glass powder (GP), rice husk ash data pairs that are unknown, nonlinear, or too difficult to
(RHA), and fluid catalytic cracking residue (FCC), as well formulate, demonstrating its effectiveness in solving diffi-
as other supplementary cementitious materials (SCMs) and cult and complex engineering problems.22 Among others,
mineral fillers such limestone powder (LP) and metakaolin ANN models have been employed in applications such as
(MK), have been used as replacements for cement (partial), water resources engineering, traffic engineering, detection
quartz powder (total), and silica fume (partial/total).11,12 of structural damage, structural system identification, mate-
Among others, studies of UHPC containing FA, SF, and rial behavior modeling, concrete mixture proportioning, and
FCC—a by-product of the crude oil industry—were carried concrete strength prediction.23,24 Furthermore, one of the
out.13,14 Their conclusion showed the possibility of a partial most fertile fields in the scientific literature of civil engi-
replacement of cement and silica fume. Blending limestone neering production is the application of ANN to predict
powder in a UHPC binder improved the hydration process
at the early age, producing denser particle packing and ACI Materials Journal, V. 118, No. 4, July 2021.
MS No. M-2019-494.R2, doi: 10.14359/51732710, received December 15, 2020,
improving mechanical properties.15 The use of nano-CaCO3 and reviewed under Institute publication policies. Copyright © 2021, American
Concrete Institute. All rights reserved, including the making of copies unless
as a component of binder was also studied.16 Compressive permission is obtained from the copyright proprietors. Pertinent discussion including
strength increased 17% compared to the UHPC control author’s closure, if any, will be published ten months from this journal’s date if the
discussion is received within four months of the paper’s print publication.
x − xmin
xnorm − sigmoid = (1)
xmax − xmin
x − xmin
xnorm − tanh = 2 −1 (2)
xmax − xmin
where xnorm is the normalized value of the variable x; and xmax
and xmin are the minimum and maximum values of variable
x, respectively.
Table 2 shows the range of variation of all variables
Fig. 2—Histogram of 28-day compressive strength reached considered in this research.
by dosages in database.
Fig. 3—Pie chart from data: (a) number of components of binder; (b) ASTM cement types reported; (c) maximum size of aggre-
gate; and (d) type of coarse aggregate, when used.
EXPERIMENTAL INVESTIGATION ball mill for 90 minutes. After treatment, RHA had a d50 of
Cube specimens with 50 mm sides (1.968 in.) of 210 7.5 μm (0.0030 in.) and a specific gravity of 2.16. FCC, a
separate UHPC dosages were used to measure the 28-day by-product of the crude industry, with a specific gravity of
compressive strength following ASTM C109.43 2.76 and d50 of 9.5 μm (0.0037 in.), was also used in some
experimental works. GP and GF were obtained by grounding
Materials locally available recycled glass with a jet mill to different
The materials used to elaborate the UHPC were locally degrees of fineness, by applying different grinding speeds.
available in Colombia. ASTM Type III HE cement was used. The GF with d50 of 28 μm (0.0011 in.) and GP with d50 of
The HE cement had a specific gravity of 3.15 and a mean 7 μm (0.0003 in.), both with a specific density of 2.55, were
particle diameter (d50) of 8 μm (0.0003 in.). The SF used taken into account. Commercially available MK employed
in the mixture proportioning complied with ASTM C1240 in some of the dosages had a specific gravity of 2.66 and d50
specifications and had a specific gravity of 2.20 and d50 of of 10 μm (0.0004 in.). Finally, two sizes of commercialized
0.15 μm (0.0006 in.). The UHPC was also manufactured limestone powder with a specific gravity of 2.73 were used.
with a silica sand of a specific gravity of 2.65, maximum LP with d50 of 15 μm (0.0006 in.) and MLP with d50 of 2 μm
particle size (dmax) of 600 μm (0.0240 in.), and d50 of (0.0001 in.) were considered. Table 3 shows the chemical
165 μm (0.0065 in.). Besides cement, silica fume, and silica composition of the materials used in this study. Figure 5
sand, the following SCMs were used as components of the provides the particle size distribution (PSD) of the cement,
binder: locally available FA, GBFS, GP, recycled glass flour SF, FA, RHA, GBFS, GP, GF, FCC, MK, LP, MLP, and SS.
(GF), RHA, FCC, MK, LP, and micro limestone powder A polycarboxylate (PCE)-based HRWRA with a specific
(MLP). Locally available FA with a specific gravity of 2.32 gravity of 1.07 and solid content of 40% was used.
and d50 of 30 μm (0.0012 in.) was considered as part of The supplementary cementitious materials used were
the binder for some mixtures. The GBFS considered in the analyzed by scanning electron microscopy (SEM). Some
experimental campaign had a specific gravity of 2.95 and of these analyses are depicted in Fig. 6. Results signaled
d50 of 2.4 μm (0.0009 in.). RHA used in this research was the lack of porosity of the two products related to recycled
produced by combustion with a peak temperature of approx- glass, the high porosity of the RHA particle, and the spher-
imately 700°C. The ash obtained was ground in a vibrating ical shape of the FA and SF particles, besides the small size
of the latter.
Fig. 5—Particle size distribution of cement, sand, and supplementary cementitious materials.
Specimens Items of investigation
The experimentally run dosages correspond to 15 groups Once the dosage was designed, a 5 L mortar mixer
of component combinations. For each of these combinations, was used to produce the concrete. After the mixing was
a three-factor design of experiment (DoE) with 14 runs each completed, the UHPC was cast in molds and compacted on
was proposed. The first factor corresponded to the amount of a vibrating table. The prisms were demolded approximately
cement, the second to the amount of water, and the third to 24 hours after casting and then cured in a moisture room at
the amount of HRWRA. The other components were ruled 20°C until the day of the test, without any heat treatment
by the A&Amod curve44 according to Eq. (3) applied. For the determination of the compressive strength,
50 mm (1.968 in.) cubes were tested. A compression testing
P ( D) =
(D q q
− � Dmin ) � (3)
machine with a capacity of 3000 kN was used, following
ASTM C109.43 Three samples were tested for determining
(D q
max −D q
min ) the compressive strength at 28 days.
where D is the particle size; P(D) is the weight fraction of
total solids that are smaller than D; Dmax and Dmin are the ANALYTICAL INVESTIGATION
maximum and minimum particle sizes, respectively; and q is Artificial neural networks
the Fuller exponent. The optimal value of q was determined ANN models are composed of many highly intercon-
in previous research.2 nected processing elements (neurons) working together.23
weights and receives an input signal from neurons linked v j = ∑wij xi + b (4)
i =1
to it. The successive layers of nodes receive input from the
previous layers; the outputs of nodes in each layer are inputs where vj is the net input of the j-th neuron for the signals
to nodes in the next layer. The simplest form of ANN archi- received from the preceding layer with n neurons; wij is the
tecture is the perceptron, devised by Rosenblatt45 in 1958, weight between the j-th neuron and the i-th neuron in the
which consists of one neuron with two inputs and one output. preceding layer; xi is the output of the i-th neuron in the
The schematic diagram of the artificial neuron model is preceding layer; and b is a constant also known as bias.47
composed of some interconnected sets of artificial neurons, The activation function processes the input obtained from
which are closely linked to each other the by connection the sum function and determines the neuron output. Some
weights and obtained input signs from neurons that linked to activation functions examples include step, linear, ramp,
it, as illustrated in Fig. 7, where the vector xi = (x1, x2,...,xn) tanh (hyperbolic tangent), sigmoid functions, and relu (recti-
represents an input signal applied to the neuron. Besides, a fied linear unit function).48 In this work, the performance of
bias is also added to the neuron along with inputs. On the hyperbolic tangent and sigmoid functions as activation func-
other hand, the vector wi = (w1, w2,...,wn) represents the tions were evaluated. Hyperbolic tangent function maps the
weights for each input. The weighted sum function calcu- inputs between –1 and +1,21 while the sigmoid function takes
lates the net input (vj) that approaches to a neuron,46 as the input, which can have any value between minus and plus
depicted in Eq. (4) infinite, and squashes the output in the range 0 to 1.49 The latter
has an implication that the way in which the normalization
100 n ai − â i
MAPE = ∑ (5)
n i =1 â i Fig. 10—Proposed ANN architecture.
wherein the prediction is the mean of all samples.61 R2 statis-
∑ i =1 ( ai − â i )
n
RMSE = (6) tics are dependent on the linear relationships between the
n observed and predicted values and may sometimes give
biased results when this relationship is not linear or when the
values contain many outliers. A R2 value close to unity indi-
∑ i =1 ( ai − â i )
1 n
cates perfect association between the observed and predicted
NMBE ( %) = n × 100 (7) values. A combined use of the performance metrics narrated
ai
previously can provide an unbiased estimate for prediction
ability of the neural network models.
∑ i =1 ( ai − â i )
n 2
ANN architecture
where a is the target or experimental value; ā represents the Selecting the model architecture is the first important step
mean observed target; â is the predicted value or model’s in developing an ANN model that best suits the problem at
output; and n is the total number of observations in the hand. Figure 9 depicts the effect of neuron number in hidden
current data. layer and activation function on the performance of different
MAPE is a normalized statistic that provides an effective architectures during the training using k-fold validation. It is
way of comparing the residual error for each data point with important to highlight that Fig. 9 only shows the best perfor-
respect to the observed or target value.21 A lower value of mance initial weight allocation of the neurons for each archi-
MAPE indicates good prediction performance of the model; tecture and activation function combination. According to
RMSE is one of the commonly used error-index statistics.59 this plot, k-fold validation RMSE stopped improving signifi-
RMSE compares the observed values to the predicted values cantly after four neurons in the hidden layer when dealing
and computes the square root of the average residual error, with the hyperbolic tangent activation function, reaching a
indicating error in the units (or squared units) of the constit- minimum RMSE value of 10.82 MPa. On the other hand, the
uent of interest, which aids in analysis of the results. RMSE hidden layer with seven neurons gave the minimum k-fold
values of zero indicate a perfect fit. However, RMSE gives validation RMSE value when using the sigmoid activation
more weightage to large errors21; the NMBE provides infor- function, reaching a value of 10.69 MPa. Consequently,
mation on the mean bias in the predictions from a model. the network architecture shown in Fig. 10, which uses the
A positive NMBE indicates under-prediction and a negative sigmoid activation function, was selected.
NMBE indicates over-prediction of the model.60 The coef- The weights that define the ANN model are presented in
ficient of determination (R2) compares the accuracy of the Table 4. It is important to note that to use the network in
model with the accuracy of a superficial benchmark model future research, it is necessary to normalize the data entering
the network according to Eq. (1) by using the range of values be the fact that these researchers worked only with their
described in Table 2. own experimental works. The high amount of data used in
this research, together with the use of k-fold validation, led
Performance evaluation of trained model to a lower R2 value of 0.840. Nevertheless, the proposed
The results in terms of the performance statistics of the model allows one to predict the compressive strength of a
selected model are presented in Table 5. It is observed in wider spectrum of UHPC: form pastes to coarse aggregate
Table 5 that performance evaluation indicators have a concrete, and consider combinations of several SCMs.
similar value in train and test set. The latter indicates the Furthermore, Fig. 11 clearly exhibits the strong coherence
proper carrying out of the k-fold validation used as a proce- between the predicted compressive strength values and the
dure to avoid the overfitting of the ANN model48 during the real ones.
training. Besides, the proposed ANN model predicted the Some of the factors that introduced statistical noise into
28-day compressive strength on the test data set with RMSE, the system, preventing a more precise model, could be:
MAE, and AE values of 9.106, 8.054, and 3.977 MPa, 1) the use of different types of cement; 2) in the data obtained
respectively—that is, the differences between predicted and from scientific literature, the lack of knowledge of whether
experimental compressive strength values were relatively the specimens have been vibrated or not after pouring into
small for UHPC. E and RSR values of 0.808 and 0.438 also the molds; 3) the different nature of aggregates used in
indicated the accuracy of the proposed model. MAPE shows the database; and 4) the different technology of HRWRAs
that the predicted compressive strength deviated on average considered in the database.
by 5.854% from target. NMBE statistics of 2.835% pointed
out a little under-prediction of the model, which also could Connection Weight Approach (CWA)
be observed in Fig. 11. For the other index values (MAE), ANNs have been usually known as “black box” because
on the other hand, it can be noticed that the coefficients of they are believed to present little explanatory insight into the
multiple determination presented (R2) were less than those contributions of the input signals in the prediction process.
obtained by other authors consulted, who obtained correla- However, Olden et al.62 presented the Connection Weight
tion coefficients between 0.96 and 0.98 when predicting Approach (CWA) to assess the importance of the variables
UHPC’s properties using ANN.3,4 One of the reasons could of each model. This method estimates importance as the
CONCLUSIONS
This study provides an accurate ANN model for predic-
tion of the 28-day compressive strength of UHPC using
different combinations of supplementary cementitious
materials and fillers, such as silica fume, fly ash, ground-
granulated blast-furnace slag, glass powder, rice husk ash,
fluid catalytic cracking residue, metakaolin, limestone
Fig. 11—Regression plot for UHPC pastes.