Birhane Gemeda, Shear Capacity Prediction
Birhane Gemeda, Shear Capacity Prediction
Birhane Gemeda, Shear Capacity Prediction
A Thesis
Submitted in Partial Fulfillment of the Requirements for the Degree of Master of Science
i
ii
UNDERTAKING
I certify that research work titled “Application of Machine learning methods for shear
capacity of RC beams” is my own work. The work has not been presented elsewhere for
assessment. Where material has been used from other sources it has been properly
acknowledged / referred.
Signature of Student
iii
ABSTRACT
Accurate determination of the capacity of reinforced concrete (RC) beams in shear remains
a demanding problem due to its complex failure mechanism and the nonlinear relationship
between different factors influencing the shear capacity. This research employs different
types of single and ensemble machine learning (ML) based techniques; namely, decision
tree, support vector machine, extremely randomized trees, gradient boosting, random
forest, and extreme gradient boosting (xgBoost) to correctly predict the shear capacity of
reinforced concrete beams. To this end, a dataset of experimental test results of RC beam
with and without stirrups comprised of various beam geometry, concrete strength,
reinforcing steel strength, longitudinal and shear reinforcement ratios, and shear span-to-
effective depth ratio is used to develop the models.
The proposed models were calibrated for different values of hyperparameters to achieve
optimized ML models. The results of the analysis evidenced that the xgBoost model can
be effectively utilized to predict the shear capacity of RC beams. The comparison of the
predictions of the proposed and existing models evidenced that the efficiency of the
proposed model is superior to the existing models and guidelines in terms of accuracy,
safety, and economic aspects with significantly lowest bias and variability.
A solid correlation exists between the shear capacities predicted using the proposed model
and the corresponding experimental values as evidenced by the value of 𝑅 2 (𝑅 2 = 0.99)
for RC beams without stirrup and (𝑅 2 = 0.995) for RC beams with stirrup.
Keywords
Shear Capacity Prediction; RC Beams; Machine Learning
iv
ACKNOWLEDGMENTS
First and foremost, praises and thanks to the Almighty God, for His blessings throughout
my research work to complete the research successfully.
I would like to express my deep sincere gratitude to my thesis advisor Dr. Bedilu Habte
for his guidance and support
I am extremely grateful to my brother Dr. Tadesse Gemeda for his support and guidance.
He has taught me the methodology to carry out the research and to present the research
works as clearly as possible.
I am very much thankful to my mother for her love, prayers, caring and sacrifices for
educating me. Also I express my thanks to my husband for his support to complete this
research work.
v
TABLE OF CONTENTS
ABSTRACT.................................................................................................................... IV
ACKNOWLEDGMENTS ............................................................................................... V
2.2.3 Depth........................................................................................................... 13
vi
CHAPTER 3 EXPERIMENTAL DATABASE...................................................... 24
vii
CHAPTER 6 CONCLUSIONS AND RECOMMENDATIONS .......................... 85
viii
LIST OF TABLES
ix
LIST OF FIGURES
Figure 2.1: Internal forces along diagonal shear crack [43]. ............................................ 10
Figure 2.2: Components of shear resistance at cracked concreate section for RC beams
without stirrup [46]. .......................................................................................................... 12
Figure 2.3: Concrete and stirrups shear resisting mechanisms (a) and graphical illustration
of their contribution (b) [59]. ............................................................................................ 18
Figure 2.4: MCFT equations [46] ..................................................................................... 22
Figure 3.1: Geometry of RC beams in the database. ....................................................... 26
Figure 3.2: Schematic of the loading types ...................................................................... 26
Figure 3.3: Distribution of RC beams in terms of the loading type. ................................ 27
Figure 3.4: Distribution of the input variables in the database. ........................................ 31
Figure 3.5: Relationship between the input parameters and the shear capacity. .............. 35
Figure 3.6: Beam type (slender beams/deep beams) ........................................................ 37
Figure 3.7: Distribution of the input variables in the database. ........................................ 41
Figure 3.8: Relationship between the input parameters and the shear capacity of RC beams
with stirrups ...................................................................................................................... 44
Figure 4.1: 10-fold cross-validation. ................................................................................ 49
Figure 4.2: model development ........................................................................................ 50
Figure 4.3: Decision tree flow chart. ................................................................................ 52
Figure 4.4: Conceptual schematic of random forest. ........................................................ 54
Figure 4.5: Conceptual schematic of gradient boosting. .................................................. 56
Figure 5.1: Comparisons of shear capacity predictions provided by the single ML models
to the experimental shear capacity for RC beams without stirrup. ................................... 61
Figure 5.2: Comparisons of shear capacity predictions provided by the ensemble ML
models to the experimental shear capacity for RC beams without stirrup. ...................... 63
Figure 5.3: Residual plots for RC beams without stirrup. ................................................ 66
Figure 5.4: Predictions of ACI and proposed model for RC beams without stirrup. ....... 68
Figure 5.5: Histogram of predicted to experimental shear capacity ratio for RC beams
without stirrup. .................................................................................................................. 69
Figure 5.6 : Experimental versus predicted shear capacities for RC beams without stirrup
based on the proposed xgBoost model and existing code equations. ............................... 70
Figure 5.7: Properties of dataset for RC beams with stirrups. .......................................... 73
x
Figure 5.8: Comparisons of shear capacity predictions provided by single ML models to
the experimental shear capacity for RC beam with stirrup............................................... 75
Figure 5.9: Comparisons of shear capacity predictions provided by ensemble ML models
to the experimental shear capacity for RC beam with stirrup. ......................................... 77
Figure 5.10: Predictions of ACI, Eurocode and proposed model for RC beams with stirrup.
.......................................................................................................................................... 80
Figure 5.11: Histogram of predicted to experimental shear capacity for RC beams with
stirrup ................................................................................................................................ 81
Figure 5.12 : Experimental versus predicted shear capacities for RC beams with stirrup
based on the proposed xgBoost model and existing code equations. ............................... 82
xi
Application of Machine learning methods for shear capacity of RC beams
CHAPTER 1 INTRODUCTION
1.1 Background
Shear failure is sudden and often catastrophic as it occurs without warning. On the
contrary, flexural failure involves the yielding of reinforcement bars which is ductile as it
provides enough warning. Thus, reinforced concrete (RC) beams should be designed with
adequate shear capacity in order to avoid such catastrophic and sudden shear failure and
assure ductile flexural failure. It is crucial to understand the shear behavior of RC beams
for a safe, economical, accurate design of the beams. Despite the various research efforts,
fully assessing the shear behavior of RC beams remains a challenging task when compared
to assessing its flexural behavior that can be predicted with an acceptable accuracy [1,2].
beams. Most design codes and guidelines including ACI 318 [3] and Eurocode 2 [4] for
the shear strength of RC elements are empirically derived using a limited number of
important factors. Hence, most of the available models results in unsafe and/or
uneconomical design. As a result, a more rigorous prediction model is required for better
Machine learning (ML) has recently gained immense attention owing to its ability to
effectively determine the relationship between the input features and the response
solve different civil engineering problems [5–15]. This is attributed to their ability to
estimate the relationship between the factors and the response parameter (s) [16]. In
contrast to the case of most empirical models, ML techniques does not require the prior
assumption or knowledge of the underlying mathematical and physical models [16]. Some
columns and walls [11,17,18], and seismic damage assessment of RC buildings and
bridges [19–21]. Artificial neural network (ANN) is among the most popular ML
techniques [9]. It has been successfully employed for different structural engineering
studies; some of which include the prediction of mechanical properties of concrete [6,8–
10], damage assessment of bridges [22–24] and buildings [25], ultimate deformation
capacity of RC columns [11], shear strength of circular RC columns [26], and compressive
strength [13–15] and the stress-strain relationship [12] of concrete confined with FRP. A
review of the application of different artificial intelligent techniques in the field of civil
technique that has been applied to structural engineering problems [27–29]. Solhmirzaei
et al. [29] used three different ML techniques, namely, SVM, k-nearest neighbor (kN), and
ANN to classify the failure modes of Ultra High Performance Concrete (UHPC) beams.
The results showed that ANN outperformed the other two methods in predicting the failure
modes of UHPC beams with an overall accuracy of 89%. In this study, several ML
techniques have been explored to propose the best predictive model for shear capacity of
beams. Ensemble models are another type of emerging ML technique with high
learners known as base learners (e.g. kernel regression, support vector machine, and
decision tree) to enhance the efficiency, robustness, and stability of the base learners [30].
The basic idea behind the ensemble model is to combine multiple base learners in
computing the final response rather than relying on an individual model. Ensemble
MSc Thesis Page 2
Application of Machine learning methods for shear capacity of RC beams
learners can be formed in sequential or parallel styles with the aim to exploit the
dependence and independence between the base models, respectively. Bagging (short for
Bootstrap Aggregating) [31] is one of the ensemble models in which multiple base learners
are independently trained in parallel using different bootstrap sample. Each bootstrap
training dataset contains an average of 63.2% of the original training set. The final
prediction is then taken as the mean of the predictions from the base learners [31]. As a
result a better prediction with reduced variance is obtained [31]. Random forest (RF) is a
particularly, decision trees are used. An ensemble model can also be formed from multiple
base learners of different types or heterogeneous base learners [30]. Stacking ensemble is
one of the heterogeneous ensemble models in which different types of base learners are
combined via a meta-model with the objective of enhancing the accuracy of the individual
model [32,33].
In the previous study, different researchers have attempted to examine the efficacy of ML
models to estimate the capacity of RC beams in shear [34–39]. Sanad and Saka [34] studied
the application of ANN in predicting the capacity of deep RC beams in shear using the test
results of only 111 RC beams considering 10 input variables. The results of the ANN-
based model were compared against that of the ACI equation. The authors concluded that
the shear capacity predictions from the ACI code equations are very conservative and thus
uneconomical. The proposed ANN model outperformed the ACI equation in predicting
the capacity of the beams in shear. However, it is worth mentioning here that a small
number of datasets used to train ML algorithms will lead to overfitting and a false sense
of high predictive performance. As per Friedman et al. [40], a sufficient training and testing
dataset should be at least 10 times the number of independent parameters. Chou et al. used
support vector regression (SVR) for the prediction of the shear strength of RC deep beams
Cladera and Mari [35] investigated the application of ANN to predict the shear capacity
of RC beams without stirrups based on the dataset of 177 experiments on RC beams. Five
parameters; namely, effective beam depth, longitudinal reinforcement ratio, width of the
beam, concrete compressive strength, and aspect ratio were considered as input
parameters. The authors concluded that ANN can be applied to predict the shear capacity
of RC beams without stirrups. In another study, the same authors applied ANN to predict
the shear capacity of RC beams reinforced with stirrups based on the test results of 123
Chou et al. [38] used different ML algorithms to estimate the shear strength of RC beams
using the experimental dataset of RC beams compiled by Collins et al. [41] and Zhang et
al. [42]. For RC beams reinforced without stirrups, the dataset comprises 1849 RC beam
test results. However, the database used for RC beams with stirrups comprised only 194
test results of RC beams. In this study, different single models including multi-layer
perceptron ANN, decision tree, support vector regression, linear regression, ensemble
beams in shear, all the above studies generally failed to propose a user-friendly and
practical shear design approach based on the results of the trained ML models. As it is well
by others. Thus, practical implementation and simplification of the ML-based models are
essential. Moreover, most of the models are developed using a limited number of
In lieu of the above-mentioned limitations, the current study investigated the application
of several ML models (both single and ensemble models) to propose the best predictive
model based on a large database of slender and deep RC beams with/without stirrups.
Moreover, the best model among the developed models is deployed into a user-friendly
web-based application, which can be used by practitioners and researchers in the field of
developed web-based application facilitates a rapid and accurate prediction of the capacity
of RC beams in shear.
This is a pioneer research work leveraging the power of different machine learning models
to produce accurate and robust model for shear design of reinforced concrete (RC) beams.
statistical and empirical models; however, the accuracy of machine learning models can
drastically vary from one type of problem to other. One of the reasons for the variation in
the accuracy of the ML models with the problem type is attributed to the complex
relationship of input and output. In this context, among different ML models, ensemble
learners combine different base learners to produce a powerful model. Thus, this research
is aimed to produce a robust ensemble machine learning model to accurately estimate the
capacity of RC beams in shear considering key design factors that influence their shear
capacity. Moreover, a web-based application is developed using the best ML model. The
the field of Civil Engineering to accurately determine the capacity of RC beams in shear.
Develop a novel ML-based model for estimating the shear capacity of RC beams
with/without stirrups.
Compare the prediction accuracy of the proposed ensemble model with other
Identify the main factors that influence the shear capacity of RC beams based on a
and select the best predictive model using different statistical performance indices.
Compare the predictive capability of the proposed model with that of the existing
models.
practitioners and researchers in the field of civil engineering without the need of
1.3 Scope
The scope of this thesis is to propose novel machine learning-based shear design models
for RC beams with/without stirrups. Various ML-based are developed in order to propose
the best shear capacity predictive models. Finally, the predictive performance of the
There are different approaches and guidelines to determine the shear capacity of RC
beams. Numerical, analytical, and mechanical models are widely-used models. All these
models are based on assumptions hence fail short of producing accurate predictions.
Besides, there are large discrepancies and uncertainty in the existing shear design models.
the shear capacity of RC beams. Furthermore, the existing empirical models are generally
based on a simple truss analogy without considering the interaction between different
factors and the variation of the shear crack angle. In contrast, a machine learning model
learns the relationship between the input features and the response variable from data
models. In nature, machine learning models work better on large data to solve
accurate and reliable shear design model using different ML techniques considering all
collected. The collected data comprised various beam geometries, material strengths for
both concrete and steel, longitudinal and transverse reinforcement ratios, and shear span-
to-effective depth ratios. The performance of the ML model is greatly influenced by the
value of its hyperparameters, which are parameters that control the learning process of a
given ML model. The values of the hyperparameters for each ML model are tuned to
obtain the best model. Finally, the prediction performance of the proposed ML models is
compared with that of the existing models and code equations. A comparative
investigation of the proposed and existing models revealed the superior prediction
capability of the proposed model in terms of accuracy, safety, and economical aspects.
Hence, this research is significant in assessing the best model in determining the shear
Chapter 1 presents the background information of the study, objective, significance, scope,
and methodology of the study. Chapter 2 presents a literature review on the shear strength
of RC beam focused on parameters influencing the shear strength of RC beam and existing
shear design models and code equations. Chapter 3 presents the experimental database
used in this study. Different machine learning techniques are introduced in Chapter 4.
Moreover, Chapter 4 discusses different single and ensemble ML models. Six ML models
(two single models and four ensemble models) are analyzed in this chapter. Chapter 5
describes the results from the ML model and comparative study with existing models and
code equations. The performance of each model is evaluated using different performance
indices. Chapter 6 discuss the conclusion from this study and recommendations for future
research.
1.6 Methodology
and slender RC beams with/without stirrups critical in shear will be compiled form
the literature.
ML models. The input variables that affect the capacity of RC beams will be
preprocessing of the collected database, which includes but is not limited to the
given ML model. The hyperparameters for each model will be tuned carefully to
achieve superior prediction accuracy. In order to select the best predictive model,
Model training, validation, and testing: the database will be randomly split into test
and train datasets. The training database will be used for the model training and
development, while the testing dataset will be used to finally appraise the
Comparative study: the efficacy of the proposed models will be finally compared
with that of the existing models and code equations including ACI 318 and
Eurocode.
The formation of diagonal tension crack occurring at an angle with respect to the beam
axis causes the shear failures. Shear failure is sudden and occurs without prior warning,
thus it is a brittle failure, unlike flexural failure. Hence, it is vital to make sure that the
shear supply is greater than the shear demand in order to avoid such undesirable shear
failure. The primary goal of shear design is to prevent such failure and ensure flexural
failure. However, shear failure involves a complex failure mechanism, and as such difficult
to develop simple models like that used to characterize flexural behavior, where the
assumption of plane sections remain plane is valid. Several studies have been conducted
to propose shear strength models based on the principles of mechanics. Shear behavior is
the sum of various mechanisms, thus difficult to establish a mechanics-based shear design
model. The free-body diagram in Figure 2.1 shows the internal forces in RC beam after
In the above figure, C denotes uncracked compressive force in uncracked concrete, 𝑉𝑐𝑧 is
the shear in the compression zone, 𝑉𝑠 is the shear capacity provided by transverse
reinforcement, 𝑉𝑖 is the shear capacity provided by the aggregate interlock, 𝑇 is the tension
reinforcement bars.
reinforcement (stirrups), aggregate interlock, uncracked concrete, and dowel action of the
flexural bars [44,45]. For instance, Figure 2.2 illustrates the relative contributions of
As can be seen in this figure, concrete aggregate interlock provides the highest shear
contribution in the compression zone. Various factors influence the shear capacity of RC
beams. The following section briefly discusses the influence of different key variables on
Figure 2.2: Components of shear resistance at cracked concreate section for RC beams
without stirrup [46].
strength, yield strength and reinforcement ratio of stirrups, cross-sectional depth of the
beam, 𝑎/𝑑 ratio, and longitudinal reinforcement ratio, as will be discussed below.
formulations for the shear strength provided by concrete. For instance, according to ACI
318-14 [47], the concrete contribution to shear capacity has been assumed to be 2√𝑓𝑐, 𝑏𝑑.
However, in Eurocode 2 [4], the shear strength is correlated with the cubic root of 𝑓𝑐, , 𝑉 ∝
3 ,
√𝑓𝑐 . Shear forces are transmitted through concrete cracks by virtue of roughness of the
concrete interfaces.
shear strength of RC beams which in turn increases their ductility. In addition to providing
additional shear strength, stirrups are used to hold the main reinforcements and link
together the flexural compression and tension sides of a member and ensure that the two
sides act as a unit. Breakdown of those linkages may cause shear failure. Besides, shear
reinforcement limits the development of inclined cracks and prevents the cracks from
widening. The strain in shear reinforcement prior to the onset of crack is equal to that in
concrete. However, with the formation of the inclined cracks shear reinforcement
contributes substantially to shear resistance as diagonal tension cracks form at very small
strains. The contribution of stirrups to the capacity of RC beams in shear depends on the
2.2.3 Depth
Increasing the effective depth of RC beams intuitively increases the shear capacity of the
beams. In most of the models and code equations, the shear capacity of RC beams is
directly proportional to the effective depth of the beam. The average shear strength at the
The longitudinal reinforcement provides shear resistance through the dowel action. In
reinforcement controls the propagation of flexural cracks and leads to an increase in the
neutral axis depth, which in turn increases the depth of uncracked concrete in compression.
The increase in the depth of uncracked concrete in compression, in turn, enhances the shear
capacity of the beam. Thus, an increase in the longitudinal reinforcement ratio enhances
the shear strength of the beam owing to an increase in dowel action [48–50]. However,
there is a discrepancy in international codes and standards for the shear design of RC
beams in accounting for the shear strength provided by longitudinal reinforcement. For
instance, in Eurocode2 [4], the shear strength of RC beams has a direct relation with the
cubic root of the longitudinal reinforcement ratio (𝜌𝑠𝑥 ). However, this effect has not been
The shear span-to-depth (𝑎/𝑑) ratio is another important factor that influences the shear
mechanisms. The concrete contribution to the shear capacity decreases as 𝑎/𝑑 ratio
increases [51]. However, the shear strength provided by transverse reinforcement increases
with an increase in 𝑎/𝑑 ratio [51]. In addition, shear resistance provided by first diagonal
crack is significantly larger than shear resistance provided by concrete when the shear
span-to-depth ratio is large but much smaller than shear resistance provided by concrete
when 𝑎/𝑑 ratio is small. Generally, an increase in 𝑎/𝑑 ratio cause reduction of ultimate
There exist different standards and models for shear capacity determination of RC beams.
Collins et al. [52] provided a review on the shear models proposed in the early 20th century
(1948 to 2008). In a traditional truss analogy of Ritter and Morsch, the shear stress was
assumed to be resisted by stirrups and diagonal compressive concrete struts inclined at 45°
to the beam height [53]. According to this model, the ultimate shear capacity of the beam
corresponds to the yielding of the stirrups, while neglecting the tensile stress in a cracked
concrete and thus gives a conservative prediction [54]. Moreover, the model ignores the
shear resisted by uncracked concrete section, aggregate interlock along the cracks, and
dowel action of the longitudinal reinforcement [53]. The truss model has provided a basis
for different codes of practice including ACI 318-14 [55] and Eurocode-2 (2004) [4]
standards. ACI 318-14 standard assumes a constant strut angle of 45° while considering
the contribution of the concrete in the tension. On the other hand, Eurocode-2 (2004) is
based on a variable angle truss model but ignores concrete contribution to the shear
strength for beams reinforced with ITSR. The most widely used models and code equations
200
𝑘 =1+√ ≤ 2.0 (𝑑 in mm)
𝑑
𝜌𝑠𝑥 ≤ 0.02
0.18
𝐶𝑅𝑑,𝑐 = , 𝛾𝑐 = 1.5
𝛾𝑐
AASHTO-
𝑉𝑛 = 𝛽𝜆√𝑓𝑐 ′ + 𝜌𝑠𝑣 𝑓𝑠𝑣 𝑏𝑤 𝑑 cot 𝜃 (𝑓𝑐 ′ is in MPa)
LRFD [56]
where,
𝛽
0.4
, for section with minimum amount of ISR
1 + 750𝜀𝑥
=
0.4 1300
, esle
{1 + 750𝜀𝑥 1000 + 𝑆𝑥𝑒
𝜃 = 29 + 3500𝜀𝑠𝑥
ACI 318 [47] 𝑉𝑛 = 𝑉𝑐 + 𝑉𝑠
√𝑓𝑐
𝑉𝑐 = ( ) 𝑏𝑤 𝑑, 𝑓𝑐 < 70 MPa
6
𝐴𝑠𝑤
𝑉𝑠 = 𝑑𝑓𝑦𝑡
𝑆
260
𝑜𝑟 𝑉𝑐 = √𝑓 ′ 𝑏 𝑑
1000 + 𝑑 𝑐 𝑤
0.06√𝑓𝑐′ 𝑏𝑤 𝑠
𝑤ℎ𝑒𝑛, 𝐴𝑠𝑤 ≤ 𝑜𝑟 𝑑 ≥ 300 𝑚𝑚
𝑓𝑦
𝐴𝑠𝑤
𝑉𝑠 = 𝑑𝑓𝑦𝑡 where, 𝑉𝑠 ≤ 0.8√𝑓𝑐′ 𝑏𝑤 𝑑
𝑆
superposition of the capacity provided by concrete (𝑉𝑐 ) and stirrups (𝑉𝑠 ). Figure 2.3 depicts
the shear contributions provided by stirrups and concrete. Thus, in ACI 318, the nominal
𝑉𝑛 = 𝑉𝑐 + 𝑉𝑠 (1)
√𝑓𝑐
𝑉𝑐 = ( ) 𝑏𝑤 𝑑, 𝑓𝑐 < 70 MPa (2)
6
𝐴𝑠𝑤
𝑉𝑠 = 𝑑𝑓𝑦𝑡 (3)
𝑆
where,
𝑉𝑐 = Shear capacity provided by concrete
(a)
(b)
Figure 2.3: Concrete and stirrups shear resisting mechanisms (a) and graphical illustration
of their contribution (b) [59].
According to Eurocode (EC2) [4], different models are used to determine the capacity of
1⁄
𝑉𝑛 = 𝑉𝑅𝑑,𝑐 = 𝐶𝑅𝑑,𝑐𝑘 (100𝜌𝑠𝑥 𝑓𝑐, ) 3 𝑏𝑤 𝑑 (4)
200
𝑘=1+√ ≤ 2.0
𝑑
(5)
𝜌𝑠𝑥 ≤ 0.02
0.18
𝐶 𝑅𝑑,𝑐 = , 𝛾𝑐 = 1.5
𝛾𝑐 (6)
For beams internally reinforced with stirrups, the shear capacity is given by [4]:
𝑉𝑛 = 𝑉𝑅𝑑,𝑐 = 𝜌𝑠𝑡 𝑓𝑠𝑡𝑑 𝑏𝑤 𝑍𝑐𝑜𝑠𝜃 ≤ 𝛼𝑐𝑤 𝑏𝑤 𝑍𝑣1 𝑓𝑐𝑑 /(𝑐𝑜𝑠𝜃 + 𝑡𝑎𝑛𝜃) (7)
𝑍 = 0.9𝑑 (9)
where,
𝜃 = shear crack angle relative to the longitudinal axis of the beam, and
The Japanese JSCE code (2007) [57] considers the effect of effective depth of member,
the reinforcement ratio of longitudinal bars, and the concrete compressive strength in its
3 ′
𝑓𝑣𝑐𝑑 = 0.20 √𝑓𝑐𝑑 ≤ 0.72 (MPa) (13)
1⁄
100 4
(14)
𝛽𝑑 = ( ) ≤ 1.5, (𝑑 𝑖𝑛 mm)
𝑑
1⁄
𝛽𝑝 = (100𝜌𝑠𝑡 ) 3 (15)
2𝑀0
0 ≤ 𝛽𝑛 = 1 + ≤ 2.0, 𝑓𝑜𝑟 𝑁𝑑′ ≥ 0 (16)
𝑀𝑢𝑑
where,
𝛽𝑑 = Depth factor,
𝑀0 = Flexural moment required to balance the stress due to axial load at the extreme
tension fiber,
𝑀𝑢𝑑 = Pure flexural capacity neglecting the effect of axial force, and
The simplified method of Canadian Standard [58] is based on the 45-degree truss model.
0.06√𝑓𝑐′ 𝑏𝑤 𝑠
𝑉𝑐 = 0.2√𝑓𝑐′ 𝑏𝑤 𝑑 𝑤ℎ𝑒𝑛 𝐴𝑠𝑤 ≥ 𝑜𝑟 𝑑 ≤ 300 𝑚𝑚
𝑓𝑦
(17)
260 0.06√𝑓𝑐′ 𝑏𝑤 𝑠
𝑉𝑐 = √𝑓 ′ 𝑏 𝑑 𝐴𝑠𝑤 ≤ 𝑜𝑟 𝑑 ≥ 300 𝑚𝑚
1000 + 𝑑 𝑐 𝑤 𝑓𝑦
𝐴𝑠𝑤
𝑉𝑠 = 𝑑𝑓𝑦𝑡 ≤ 0.8√𝑓𝑐′ 𝑏𝑤 𝑑 (18)
𝑆
The modified compression field theory (MCFT) [60] used the principles of equilibrium,
shear model. Unlike the traditional truss-based models (e.g. ACI 318 [47]), the MCFT
accounts for the contribution of the concrete tensile stresses of a cracked section.
Moreover, a variable angle of diagonal struts was considered compared to the fixed value
relationship of cracked concrete section of MCFT are shown in the figure below.
The MCFT entails solving a large number of equations iteratively [61,62]. To ease this
procedure, Collins et al. [61] proposed a general method of shear design based on the
MCFT where the shear capacity is determined as a function of tensile stress factor (𝛽) and
inclination of the principal compressive stress (𝜃). The authors provided a table for
approximating the values of 𝛽 and 𝜃, for members with as well as without stirrups, for a
limited range of longitudinal strain values (𝜀𝑠𝑥 ≤ 0.2%). On the other hand, a simplified
compression field theory (SCFT) was developed by Bentz et al. [62] in which the values
of 𝛽 and 𝜃 are determined as a function of the longitudinal tensile strain (ɛ𝑥 ) and crack
spacing (𝑆𝑥𝑒 ). In the SCFT, simple expressions for 𝛽 and 𝜃 are used, reducing the required
number of parameters and iterations, while predicting the shear capacity of the concrete
0.4 1300
𝛽= (19)
1 + 1500𝜀𝑥 100 + 𝑠𝑥𝑒
1
𝑆𝑥𝑒 =
sin 𝜃 cos 𝜃 (20)
𝑠𝑚𝑥 + 𝑠𝑚𝑦
For beams not reinforced with stirrup the crack spacing is given by as follows as per.
35𝑆𝑥
𝑆𝑥𝑒 = ≥ 0.85𝑆𝑥 (21)
𝑎𝑔 + 16
𝑆𝑥𝑒
𝜃 = (29 + 7000𝜀𝑥 ) × (0.88 + ) ≤ 75° (23)
2500
𝑓𝑠𝑥 = 𝜀𝑥 𝐸𝑠 ≤ 𝑓𝑦𝑥
where,
This chapter presents the experimental database of shear critical RC beams (with and
results, data preprocessing, and identification of input and output variables is a critical step
in the development of a machine learning model. Thus, in the following sections, the
details on the experimental database of RC beams with or without stirrups are provided.
Moreover, the distribution of the database for both RC beams with and without stirrups is
provided. The development of an accurate shear model requires the incorporation of all
parameters affecting the shear capacity of RC beams. Thus, the input features (key design
parameters) are identified and the effects of these factors on the response variable (shear
capacity in this case) are discussed based on the collected database for both RC beams
3.1.1 Introduction
al. [41] is used in this study. The authors collected the test results of RC beams tested over
60 years of research between 1948 and 2006. A total of 1849 RC beams are contained in
the database. The experimental database covers a wide range of beam geometries, material
reinforcement steel bars, and shear 𝑎/𝑑 ratios. The following general criteria were used to
Material strength: RC beams with no limit on the material strength, i.e., no limit
on the 𝑓𝑐′ and yield strength of longitudinal reinforcement bars. Details on the
Beam shape: both rectangular and T-section beams are included in the database.
Load type: point load or uniform load (details will be provided in the following
subsection).
As mentioned above, both RC rectangular and T-section beams are contained in the
database. As shown in Figure 3.1, among the total of 1849 RC beams included in the
database, 1599 beams were rectangular beams representing 86% of the beams, while 250
The beams were tested under three types of loading conditions: (a) point load (Type P),
(b) uniform load with one concentrated load (Type U1), and (c) uniform load with two-
point concentrated loads (Type U2). Figure 3.2 shows the details of the three loading
types.
beams, 1701 beams were tested under three-point load (Type P), which represents 92% of
the beams. Thus, the majority of the beams were tested under Type P loading condition.
For the uniform loads, 11% were tested under Type U1, while the remaining 3% were
As discussed earlier, the development of efficient machine learning models requires proper
identification of the input features or key design parameters that influence the response
variables; namely, the shear capacity of the beams in this case. In this study, a total of eight
important factors are identified based on a thorough literature review. These factors
include the width of the web (𝑏𝑤 ), the width of the flange (b), cross-sectional depth (ℎ),
effective depth of the section (𝑑), shear span-to-effective depth ratio (𝑎/𝑑), concrete
compressive strength (𝑓𝑐′ ), longitudinal reinforcement ratio (𝜌𝑠 ), and yield strength of steel
bars (𝑓𝑦 ). The details on the distribution of these factors and their effect on the shear
A wide range of input parameters is comprised in the database. Figure 3.4 shows the
distribution of each factor. Moreover, the distribution of the input features in terms of the
average (mean) value, standard deviation (STD), minimum, first quartile (Q1), second
quartile (Q2), third quartile (Q3), and maximum values are listed in Table 3.1. As can be
observed in Figure 3.4 and Table 3.1, an extensive range of the factors is included in the
database. For instance, the values of 𝑓𝑐′ ranged between 6.1 MPa to 127.5 MPa with a mean
strength of 34.86 MPa and a standard deviation of 18.34 MPa. As shown in Figure 3.4
and Table 3.1, 75% of the beams were constructed with concrete having a compressive
strength of 24.1 MPa or higher strength. The compressive strength of concrete in most of
the beams ranged between 24.1 MPa and 50 MPa, as shown in Figure 3.4. With regard to
the beam section, the width of the beam web and flange ranged between 21 mm to 3000
mm, the height of the beam section was in the range of 51 mm–3140 mm, and the effective
depth of the beam ranged from 41 mm to 3000 mm, as listed in Table 3.1. Moreover, the
𝑎/𝑑 ratio is in the range of 0.25 to 15.06. Thus, both slender and deep beams are included
in the database. As per the ACI- ASCE Committee 445 [53], RC beams with 𝑎/𝑑 ratio of
less than 2.50 are characterized as deep beams, while slender beams have 𝑎/𝑑 ratio of
Table 3.1: Geometry and material characteristics of the beams included in the database
Parameters Mean STD Minimum Q1 Q2 Q3 Maximum
As discussed earlier, the flexural reinforcement bars had a deformed shape. The yield
strength of flexural reinforcement bars ranged between 172.14 MPa to 1779 MPa, as listed
in Table 3.1; however, most of the beams were reinforcement with flexural bars with yield
strength in the range of 379 MPa–550 MPa, as shown in Figure 3.4. Finally, the
longitudinal reinforcement ratio ranged between 0.1% and 9.5% with a mean value of
2.24% and standard deviation of 1.52%. It is worth mentioning here that the concrete
compressive strength and yield strength of longitudinal reinforcement bars are limited to
100 MPa and 1000 MPa, respectively, in the final database used in the development of the
the input parameters on the shear capacity of RC beams is briefly discussed in the
following section.
In Section 2.2, the effect of different key design factors on the shear capacity is discussed
based on a thorough literature review. In this section, the collected database is used to
investigate the influence of the factors on the shear capacity of RC beams without stirrups.
Figure 3.5 depicts the variation of the shear capacity (𝑉𝑢 ) with the change in each factor
Figure 3.5: Relationship between the input parameters and the shear capacity.
As can be observed in Figure 3.5, each parameter significantly influences the shear
capacity of RC beams. Moreover, there exists a nonlinear relationship between the factors
and the shear capacity of RC beams. Thus, a simple linear model cannot be used to
determine the shear capacity of the beams. Generally, the shear capacity of the beam
increased with an increase in the width of the web, width of flange, height of the beam
section, and effective depth of the beam, as shown in Figure 3.5. However, an increase in
the shear span-to-effective depth showed a negative effect on the shear capacity, thus, the
shear capacity decreased with an increase in 𝑎/𝑑 ratio. The concrete compressive strength,
yield strength of longitudinal bars, and reinforcement ratio of longitudinal bars showed a
complex trend, which shows the complex nonlinear relationship between the factors and
the shear capacity of the beams and interaction between the factors.
3.2.1 Introduction
A database of RC beams with stirrups, which is collected by Zhang et al. [42], Jung et al.
[63], and Cladera et al. [64] is used in this study. The authors collected the experimental
results of a total of 348 RC beams. The experimental database covers a wide range of beam
longitudinal and transversal reinforcement steel bars, and shear span-to-effective depth
ratios. Both rectangular and T-section beams are included in the database. And also both
slender beam and deep beam are included in the database. As shown in Figure 3.6, among
the total of 348 RC beams included in the database, 116 beams were slender beams
representing 33.33% of the beams, while 232 RC beams had deep beam, which denotes
As discussed in section 4.1, the development of efficient machine learning models requires
proper identification of the input features or key design parameters that influence the
response variables; namely, the shear capacity of the beams in this case. In this study, a
total of seven important factors (input features) are identified based on a thorough literature
review. These factors include the width of the flange (b), effective depth of the section (𝑑),
shear span-to- depth ratio (𝑎/𝑑), compressive strength of concrete (𝑓𝑐′ ), reinforcement ratio
of longitudinal bars (𝜌𝑠𝑦 ), transversal reinforcement ratio (𝜌𝑠𝑦 ), and yield strength of steel
bars (𝑓𝑦 ). The details on the distribution of these factors and their influence on the shear
Figure 3.7 illustrates the distribution of each input parameters. Moreover, the distribution
of the input features are listed in Table 3.2. As can be observed in Figure 3.7 and Table
3.2, an extensive range of the factors is included in the database. The values of 𝑓𝑐′ ranged
between 12.8 MPa to 125 MPa with an average strength of 41.21 MPa and a standard
deviation of 20.67 MPa, the yield strength of flexural reinforcement bars ranged between
0 MPa to 844 MPa, the longitudinal reinforcement ratio ranged between 0.76 MPa and 5.8
MPa with a mean value of 2.86 MPa, and standard deviation of 0.97, and the transversal
reinforcement ratio range between 0 MPA and 1.90 MPa, with a mean value of 0.31 MPa,
and standard deviation of 0.23 MPa, With regard to the beam section, the width of the
beam ranged between 76 mm to 457 mm, and the effective depth of the beam ranged from
95 mm to 1200 mm, and the shear span-to-effective depth ratio is in the range of 1 to 5.98,
both slender and deep beams are included in the database, as listed in Table 3.2.
Table 3.2: Geometry and material characteristics of the beams included in the database
Parameters Mean STD Minimum Q1 Q2 Q3 Maximum
STD: standard deviation; Q1: first quartile; Q2: second quartile; Q3: third quartile
In the final database used in the development of the machine learning-based models, as
will be discussed further in Chapter 6. The effect of the input parameters on the shear
In this section, the collected database is used to investigate the influence of the factors on
the shear capacity of RC beams with stirrups. Figure 3.7 shows the variation of the shear
capacity (𝑉𝑢 ) with the change in each factor based on the experimental results of RC beams
Figure 3.8: Relationship between the input parameters and the shear capacity of RC beams
with stirrups
As observed from Figure 3.8, all parameter significantly influences the shear capacity of
RC beams and there is the complex nonlinear relationship between the factors and the
Machine learning has recently gained immense attention owing to its ability to effectively
determine the relationship between the input features and the response variable (s) in a
complex system. In contrast to the case of most empirical models, ML techniques does not
require the prior assumption or knowledge of the underlying mathematical and physical
models [16]. Artificial neural network (ANN) is one of the most widely utilized ML
techniques [9]. It has been successfully employed for different structural engineering
studies; some of which include the prediction of mechanical properties of concrete [6,8–
10], damage assessment of bridges [22–24] and buildings [25], ultimate deformation
capacity of RC columns [11], shear strength of circular RC columns [26], and compressive
strength [13–15] and the stress-strain relationship [12] of concrete confined with FRP.
ANN has also proved to be able to predict the shear capacity of pristine [35,39,65] and
Support vector machine (SVM) is another powerful ML technique that has been applied
to structural engineering problems [27–29]. Solhmirzaei et al. [29] used three different ML
techniques, namely, SVM, k-nearest neighbor (kN), and ANN to predict the failure modes
of Ultra High Performance Concrete (UHPC) beams. The results showed that ANN
outperformed the other two methods in predicting the failure modes of UHPC beams with
Ensemble models are another type of emerging ML technique in the field of structural
engineering with high performance. These algorithms combine two or more learners
known as base learners (e.g. kernel regression, support vector machine, and decision tree)
in order to improve the prediction performance, robustness, and stability of the base
learners [30]. Ensemble learners can be formed in sequential or parallel styles with the aim
to exploit the dependence and independence between the base models, respectively.
Bagging (short for Bootstrap Aggregating) [31] is one of the ensemble models in which
multiple base learners are independently trained in parallel using different bootstrap
sample. Each bootstrap training dataset contains an average of 63.2% of the original
training set. The final prediction is then taken as the mean of the predictions from the base
learners [31]. As a result a better prediction with reduced variance is obtained [31].
homogeneous base learners, particularly, decision trees are used. An ensemble model can
also be formed from multiple base learners of different types or heterogeneous base
learners [30]. Stacking ensemble is one of the heterogeneous ensemble models in which
different types of base learners are combined via a meta-model with the objective of
In the previous study, different researchers have attempted to examine the efficacy of ML
limited number of experimental databases were used in the previous study. Moreover,
beams in shear, all previous studies generally failed to propose a user-friendly and practical
shear design approach based on the results of the trained ML models. As it is well
by others. Thus, practical implementation and simplification of the ML-based models are
essential. Moreover, most of the models are developed using a limited number of
In lieu of the above-mentioned limitations, the current study investigated the application
of several ML models (both single and ensemble models) to propose the best predictive
model based on a large database of slender and deep RC beams with/without stirrups. Four
types of homogeneous ensemble models that are based on decision trees; namely, random
forest, gradient boosting, extremely randomized trees, and extreme gradient boosting
(xgBoost) are considered. The xgBoost model, which is a comparatively new algorithm,
uses regularization parameters in order to overcome the overfitting problem [68]. It has
Thus, this study adopted different advanced ensemble ML techniques, arguably for the
first time, to predict the shear capacity of RC beams. Furthermore, the prediction capability
of the ensemble models is compared with that of the base learners including support vector
regression and decision tree. The proposed model has been compared against the existing
models and code equations. The results of the analysis evidenced that the proposed model
is superior to other existing models and guidelines in predicting the RC beam shear
capacity. Finally, the best model among the developed models is deployed into a user-
friendly web-based application, which can be used by practitioners and researchers in the
field of civil engineering to accurately determine the capacity of RC beams in shear. The
developed web-based application facilitates a rapid and accurate prediction of the capacity
of RC beams in shear.
To overcome the problems related to low rate of learning at the extreme values, the dataset
𝑥𝑜𝑟𝑖𝑔 − 𝑥𝑚𝑖𝑛
𝑥𝑛 = (25)
𝑥𝑚𝑎𝑥 − 𝑥𝑚𝑖𝑛
where 𝑥𝑜𝑟𝑖𝑔 is the original value of the variable, while 𝑥𝑛 is its corresponding normalized
value and 𝑥𝑚𝑖𝑛 and 𝑥𝑚𝑎𝑥 are the minimum and maximum values of the variable,
respectively.
Different statistical indices are commonly used to evaluate the performance of ML-based
models. The following four indices are used in this study; namely, root mean squared error
(RMSE), mean absolute percentage error (MAPE), mean absolute error (MAE), and
1
𝑀𝐴𝐸 = 𝑛 ∑𝑛𝑖=1|𝑦𝑖 − 𝑦̂𝑖 | (26a)
1 (26c)
𝑅𝑀𝑆𝐸 = √𝑛 ∑𝑛𝑖=1(𝑦𝑖 − 𝑦̂𝑖 )2
2
∑𝑛𝑖=1(𝑦𝑖 − 𝑦̂𝑖 )2 (26d)
𝑅 =1− 𝑛
∑𝑖=1(𝑦𝑖 − 𝑦̅)2
where 𝑦 is the target response, 𝑦̂ is the predicted response, and 𝑦̅ is the mean of the
responses.
The performance of a given machine learning model depends on the values of its
hyperparameters, which are parameters that control the learning process of ML models. In
this study, the optimal hyperparameters are determined using grid search and K-fold cross
validation. Following the normalization of the dataset and identification of the input and
response vectors, the dataset is randomly divided into the training set and test set including
80% and 20% of the complete dataset, respectively, as shown in Figure 4.1.
The ML model is trained on the training set, while the test dataset is used to finally appraise
model, where K denotes the number of partitions. Here, the data is randomly split into K
parts of equal sizes, and then the model is fitted on the 𝐾 − 1 parts, while the remaining
one part is used to validate the model, as shown in Figure 4.1 (for 𝐾 = 10). Therefore,
each fold is used as a validation set and the cross-validation is repeated K times. The
performance of the model is then determined as the average of the results from the K data
folds, as illustrated in Figure 4.1. This study adopted a ten-fold (𝐾 = 10) cross-validation,
as shown in Figure 4.1. The training of each model is performed using scikit learn [77],
primarily applied for classification problems using structural risk minimization principle
[78]. The SVM-based classifications are based on the optimal separation of classes. The
algorithm in the SVM finds a hyperplane or decision surface that distinctly classifies the
data points. It can also be used to efficiently perform both linear and non-linear regression
by indirectly mapping the original input vectors into a very high-dimensional feature space
in which they become separable, using kernel functions [78]. Different kernel functions
can be used for SVM with the radial basis function (RBF) kernel being the most popular
type. The RBF kernel, 𝐾𝑅𝐵𝐹 (𝑋1 , 𝑋1 ), on two vectors 𝑋1 and 𝑋2 is defined as follows:
The other two types of nonlinear kernels used in SVM include sigmoid and polynomial
The above three nonlinear kernel functions; namely, polynomial, RBF, sigmoid kernels
are considered in this study. The other two hyperparameters that greatly affect the SVR
predictive capacity are the regularization parameter C and 𝜀-insensitive zone [79]. These
three types of nodes; the root node, interior node, leaf node (terminal node) as showed in
Figure 4.3. The CART method splits the feature space into multiple smaller disjoint
regions with similar response values using a set of rules to predict a class label (in
classification) and value (in regression) of the response variable. Each internal node in
CART specifies a test on an attribute of the data, while each branch represents the test
output. The root node, which is the topmost node in CART denotes the most relevant
feature, while the leaf node or terminal node provides the predicted response variable.
including the maximum depth of the tree, minimum samples number needed to be at a
leaf/terminal node, and minimum number of samples required to split an internal node.
Decision tree algorithm is easy to visualize and interpret; however, it has less
generalization ability, high bias, and high variance. Ensemble models can be used to
overcome the problems associated with a single decision tree as discussed below.
Ensemble learners are supervised machine learning paradigm that combine multiple single
learners (a.k.a. weak learners or base learners) into one predictive model to decrease
variance error, bias, and produce a strong model with enhanced generalization capability
and superior performance [80]. The basic idea behind the ensemble model is to combine
multiple base learners in computing the final response rather than relying on an individual
model. The ensemble models can be formed in either parallel or sequential manner.
Moreover, ensemble models can be formed from homogeneous base learners (same type
of base learner algorithms) or heterogeneous base learners (different types of base learner
algorithms).
The most popular type of meta-algorithms that combines base learners are bootstrap
aggregation (bagging) [31] and boosting [81] ensembles. In bagging ensemble (e.g.,
random forest), multiple base learners are independently trained in parallel mode on a
different bootstrap sample, while in boosting ensemble (e.g., gradient boosting) the base
Random forest (RF) is a forest of randomly created CART models. It is the most used type
decision [82]. However, it uses a bootstrap sample to train a series of decision trees and
make the final prediction unlike a single decision tree that is construct using the complete
training dataset [83]. It has emerged as a versatile and highly accurate methodology to
solve both classification as well as regression problems and with a capability to handle
large features with a small sample size and reduce the variance of prediction while keeping
low bias [82,84,85]. In this study, RF is combined with 10-fold cross-validation which
Figure 4.4 shows the training process involved in random forest regression. Given training
parameter. The RF creates 𝑁 numbers of bootstrap sample (𝑆) and trains 𝑁 number of
decision trees each bootstrap sample, as can be observed in Figure 4.4. The final prediction
is then obtained as the average of the predictions from each decision tree, Figure 4.4.
was proposed by Geurts et al. [86]. It can be applied to solve classification as well as
regression based problems. Extremely randomized trees differs from random forest in two
major features. Firstly, each decision tree in ERT is trained using the entire dataset in
contrast to random forest algorithm that uses a bootstrap sample. Secondly, ERT adds
randomization in selecting the split points of each node [86]. Similar type of
However, the hyperparameters are independently tuned for both random forest and
enhanced compared to the base learners [87]. Similar to other boosting algorithms,
gradient boosting regression (GBR) trains multiple learners sequentially. Given a training
examples 𝑋 = {(𝑥1 , 𝑦1 ), (𝑥2 , 𝑦2 ), … , (𝑥𝑛 , 𝑦𝑛 )}, the GBR fit the model 𝐹𝑚 (𝑥) using 𝑀 trees
[88]:
where ℎ𝑚 is the 𝑚𝑡ℎ tree and 𝐹 represents set of all possible trees.
The objective of a newly added decision tree at each iteration is to reduce the loss given
by:
In 2016, Chen and Guestrin [68] proposed an enhanced form of gradient boosting
algorithm know as extreme gradient boosting (xgBoost). The main difference of the
xgBoost from the gradient boosting is in its objective function. The xgBoost algorithm
adds a regularization parameter to reduce complexity of the model and produce a better
generalization ability [68]. Its objective function (𝐽) is given as follows in which the
𝐽 = ∑ 𝐿(𝑦̂,𝑖 𝑦𝑖 ) + ∑ 𝛺(𝑓𝑚 )
𝑖 𝑚
(33)
1
𝛺(𝑓𝑚 ) = 𝛾𝑀 + 𝜆‖𝜔‖2 (34)
2
where 𝑀 is the number of decision trees, 𝛾 and 𝜆 are the regularization coefficients, and
The first step in the development of machine learning model is data collection and
dataset is randomly partitioned into the train and test sets, which comprises 80% of the
dataset for training and 20% of the dataset for testing. The training set is used to train and
develop the models, while the test set is used to finally appraise the models based on
each model are tuned to find the most powerful and best model. The results of
Table 5.1: Optimized hyperparameters for each model for RC beams without stirrup
Models Optimum parameters
7, subsample = 0.3
The models are train in train dataset and tested in test dataset, as described in previous
chapter six models (two single models and three ensemble models) namely; support vector
machine, decision tree, random forest, extremely randomized tree, gradient boosting and
extremely gradient boosting are included in the study. The result from each model for both
train and test dataset in terms of performance indices; MAE, MAPE, RMSE, R2 for RC
All ensemble models model showed higher prediction capability on the test dataset
compared to the base model (CART and SVM), as listed in Table 5.2. The evaluation of
the performance of the selected model showed a very small difference between the
experimental shear capacity and the predictions of the proposed xgBoost model in the
testing phases as indicated by the RMSE of 26.522 kN for the test dataset. For the GBR
model, this value was 31.603 kN. The RMSE for the other models on the test dataset
ranged between 41.848 kN to 57.416 kN. A similar trend was observed for other
performance indices, as listed in Table 5.2. This observation illustrates the excellent
prediction capacity of the proposed xgBoost model among all models, as listed in Table
5.2. Compared to all models, the CART model revealed the smallest performance with the
maximum root mean square error and least 𝑅 2 on both train and test dataset, as presented
in Table 5.2.
Figure 5.1: Comparisons of shear capacity predictions provided by the single ML models to
the experimental shear capacity for RC beams without stirrup.
Figures 5.1a–5.1b and Figures 5.2a–5.2d compare the shear capacity predicted by the
ML models against the corresponding experimental values. As can be seen in these figures,
the predictions provided by most of the models are in excellent agreement with the
experimental values. It can also be observed from the same figures that the xgBoost model
provided the best prediction for the load capacity of the beams, with the highest correlation
between the experimental and predicted shear capacity for both the training and test
datasets. A great correlation exists among the experimental and predicted shear capacities
Figures 5.3a and 5.3f further compare the prediction capability of the models in terms of
the residuals of the predicted shear capacity of the beams, which is the difference between
the predicted shear capacity and the corresponding experimental value on the normalized
training and test datasets. The figures also provide the correlation coefficient for both the
training and test datasets. The residuals for all models are distributed around zero, as can
be seen in Figs. 5.3a–5.3f. In addition, the proposed models generally resulted in a good
correlation between the predicted and experimental load capaciies as can be evidenced
Compared to all models, xgBoost model showed the strongest correlation between the
experimental shear capacity and its predicted value, as can be observed in Fig. 5.3f.
(a)
(b)
(c)
(d)
(e)
(f)
Figure 5.3: Residual plots for RC beams without stirrup.
5.1.1 Comparison with existing design guide line for RC beams without stirrup
The accuracy of the proposed ML model; particularly, xgBoost model is compared with
that of the ACI 318 equation [55] and Eurocode 2. According to ACI 318 [55] and
𝑉𝑛 = 𝑉𝑐 + 𝑉𝑠 (35)
Thus, for RC beams without stirrups, the nominal shear capacity (𝑉𝑛 ) is given by the shear
capacity provided by concrete (𝑉𝑐 ). It is determined by equation (2) and equation (4) for
Table 5.3: Descriptive statics for 𝑽𝒑𝒓𝒆𝒅 /𝑽𝒆𝒙𝒑 for RC beams without stirrup
Model mean std min 25% 50% 75% max
ACI-
Figure 5.4 compares the predictive performance of the proposed xgBoost model with that
of the ACI 318 equation [55] and Eurocode 2. As can be observed from this figure, the
ACI and Eurocode 2 equation is inaccurate in predicting the shear capacity of the beams,
as opposed to the proposed model that showed high accuracy in predicting the shear
capacity of the beam. It can also be observed in Fig. 5.4 that the ACI 318 and Eurocode 2
provision significantly underestimates the shear capacity for most of the beams.
Figure 5.4: Predictions of ACI and proposed model for RC beams without stirrup.
Furthermore, the histogram in Figures. 5.5a –5.5c show the distribution of the predicted
to experimental shear capacity ratio using the proposed xgBoost model, ACI 318 code
equation and Eurocode 2. The proposed xgBoost model resulted in the most accurate and
stable predictions, as can be seen in Figures. 5.5a – 5.5c and table 5.3. The average (μ)
of the predicted to experimental shear capacity ratio was 1.03 for the proposed xgBoost
model compared with an average of 0.63 and 0.53 for ACI 318 code equation and
Eurocode 2 equation respectively. Furthermore, the standard deviation (σ) of 0.23 for
proposed model and 0.36 and 0.22 for the ACI 318 code equation and EC-2 respectively.
Figure 5.5: Histogram of predicted to experimental shear capacity ratio for RC beams
without stirrup.
Moreover, as showed in figure 5.13 and table 5.4 the root mean square error between
experimental and predicted shear capacity is 15.54 for xgBoost model when it is 138.23
for ACI-318, and 139.47 for EC-2. The coefficient of determination (R2) between
experimental and predicted shear capacity is 99% for proposed xgBooost model, when it
is 15% for ACI 138, and 18% Eurocode 2 equation. Compared to ACI-318 and EC-2 the
proposed xgBoost model showed high performance with the lowest RMSE and highest
correlation. Generally, it is concluded that the developed xgBoost model can reasonably
Figure 5.6 : Experimental versus predicted shear capacities for RC beams without stirrup
based on the proposed xgBoost model and existing code equations.
In chapter four the dataset is prepared, the total of 348 RC beam with stirrup are included
in the study. This section discusses the result from each models and comparative study
MSc Thesis Page 70
Application of Machine learning methods for shear capacity of RC beams
between proposed model, ACI 318 and EC-2. Similar to previous section, for RC beam
with stirrup the hyperparameters for single and ensemble ML models are optimized by
Table 5.5: Optimized hyperparameters of each models for RC beams with stirrup
Models Optimized hyperparameters
SVR Kernel type = radial basis function (rbf), ε = 0.1, C = 195
CART Maximum depth = 8, maximum features = 5, minimum sample leaf
= 1, minimum sample split = 2
RFR Number of estimators = 34, depth_max = 15, features_max = 5,
minimum sample split = 2
ERT Number of estimators = 29, depth_max = 10, maximum features = 7
Figure 5.7 shows a 7 7 matrix in which the diagonal of the matrix shows the histogram
for the distribution of each variable, whereas the lower and upper triangular matrices show
the scatter plot and Pearson correlation coefficient (𝑟) between the input variables,
respectively. As can be observed in this figure, there is complex relationship between the
The total of 348 RC beams with stirrup are included in dataset, the dataset is randomly
split into 80% and 20% of test and train dataset respectively. The performance indices
regarding to the test and train dataset for each model (SVM, CART, RFR, ERT, GBR,
xgBoost) are evaluated and listed in Table 5.6. Compering the base models and ensemble
models, the ensemble models showed the highest performance to predict the shear capacity
of RC beam with RMSE of between 42.873 kN and 35.743 kN for test dataset, when it is
range between 112.416 kN and 55.851 kN for single or base models. From all models the
selected model showed a very small difference between the experimental shear capacity
and the predictions of the proposed xgBoost model in the testing phases as indicated by
the RMSE and 𝑅 2 of 35.743 kN and 0.995 respectively. A similar trend was observed for
other performance indices, as listed in Table 5.6. This observation illustrates the superior
prediction capability of the proposed xgBoost model among all models, as listed in Table
5.6. Compared to all models, the SVR model showed the lowest performance with the
highest root mean square error and least 𝑅 2 on both the test and train dataset, as presented
in Table 5.6.
Figure 5.8: Comparisons of shear capacity predictions provided by single ML models to the
experimental shear capacity for RC beam with stirrup.
For RC beams with stirrup Figures 5.9a–5.9b compare the shear capacity predicted by
single ML models and the corresponding experimental values. From single model CART
models showed highest shear capacity predictions with R2 of 0.981. However Figures
provided by ensemble ML models are in excellent agreement with the experimental values.
It can also be observed from the same figures that the xgBoost model provided the best
prediction for the shear capacity of the beams, with the highest correlation between the
experimental and predicted shear capacity for both the test and train datasets. A solid
correlation exists between the experimental and predicted shear capacities based on the
5.2.1 Comparison with existing design guide line for RC beams with stirrup
The performance of the proposed xgBoost model ML model is compared with that of the
ACI 318 equation [55] Eurocode equation. According to ACI 318 [55] and Eurocode 2
design guideline equation, the nominal shear capacity (𝑉𝑛 ) of RC beam with stirrups is
transverse reinforcement (𝑉𝑠 ). It is evaluated by equation (1) and equation (7) respectively.
Table 5.7: Descriptive statics for 𝑽𝒑𝒓𝒆𝒅 /𝑽𝒆𝒙𝒑 for RC beams with stirrup
Model mean std min 25% 50% 75% max
ACI-
Figure 5.11 compares the predictive performance of the proposed xgBoost model with
that of the ACI 318 equation [55] and Eurocode 2. As can be observed from this figure,
the ACI equation and EC-2 equation are inaccurate in predicting the shear capacity of the
RC beam with stirrup, as opposed to the proposed model that showed high accuracy in
predicting the shear capacity of the beam. It can also be observed in Fig. 5.11 that the ACI
318 and Eurocode 2 provision significantly underestimates the shear capacity for most of
the beams.
Figure 5.10: Predictions of ACI, Eurocode and proposed model for RC beams with stirrup.
Moreover, the histogram in Figures. 5.12a –5.12c show the distribution of the predicted
to experimental shear capacity ratio using the proposed xgBoost model, ACI 318 code
equation and Eurocode 2. The proposed xgBoost model resulted in the most accurate and
stable predictions, as can be seen in Figures. 5.12a – 5.12c and table 5.7. The average (μ)
of the predicted to experimental shear capacity ratio was 1.01 for the proposed xgBoost
model compared with an average of 0.74 and 0.27 for ACI 318 code equation and
Eurocode 2 equation respectively. Furthermore, the standard deviation (σ) of 0.102 for
proposed model and 0.45 and 0.26 for the ACI 318 and EC-2 code equation respectively.
Figure 5.11: Histogram of predicted to experimental shear capacity for RC beams with
stirrup
Furthermore, as showed in figure 5.13 and table 5.8 the root mean square error between
experimental and predicted shear capacity is 19.20 for xgBoost model when it is 210.90
for ACI-318, and 323.57 for EC-2. The value of R2 between experimental and predicted
shear capacity is 99.5% for proposed xgBooost model. Compared to ACI-318 and EC-2
the proposed xgBoost model showed high performance with the lowest RMSE and highest
R2. Generally, it can be concluded that the proposed xgBoost model can accurately predict
Figure 5.12 : Experimental versus predicted shear capacities for RC beams with stirrup
based on the proposed xgBoost model and existing code equations.
The deployed model is under development. It can be used for the reliable and accurate
shear design of RC beams. Particularly, it is of great interest to practitioners and designers
as it is user-friendly and superior in terms of its prediction capability compared to other
available models and code equations. The developed web-based application does not
require knowledge of the machine learning algorithms. This makes it very attractive to the
practitioners and researchers in the field of civil engineering. It can also be used for the
teaching purpose as a quick and accurate determination of the capacity of RC beams in
shear. The screenshot of the deployed model is shown below.
As can be seen in the screen shoot above, the input parameters are listed on the left hand
side. The user should define the input variables by scrolling under each input parameter.
Once the definition for the input variables are complete, the summary of the values of the
input parameters can be seen on the right hand side under “user defined variables”. The
capacity of RC beams in shear based on the defined variables can then be read on the same
screen. The developed application can be used in any device including mobile phones,
tablets, and computers. Hence, it is user-friendly to be used by any interested individual
without prior knowledge in the machine learning algorithms and also the theory behind
the shear mechanisms.
6.1 Conclusions
reinforced concrete (RC) beams, accurate prediction of their shear capacity remains a
challenge. To this end, data-driven ensemble ML-based models to predict the shear
capacity of RC beams are presented in this paper. This study has shown that the ML model
can be used as effective tools to predict the shear capacity of RC beam. The proposed
models account for several input parameters that characterize the beam geometry, concrete
capability of the ML-based models is compared with that of the ACI 318 equation and
Eurocode 2.
The developed ML-based models are shown to be effective in predicting the shear
The ensemble models provided higher accuracy compared to the single models.
Among ML models, xgBoost showed the highest prediction capability with the
lowest RMSE and highest correlation for the test dataset. The experimental shear
capacity and predicted values based on the xgBoost model showed the strongest
The comparisons of the proposed models with the existing code equation
confirmed the superiority of the xgBoost model. The ACI 318 and Eurocode 2
without stirrups.
With an aim to develop an accurate and user-friendly shear design model, the
https://shearcapaity.herokuapp.com/.
based models.
REFERENCES
[1] Baghi H, Barros JAO. New Approach to Predict Shear Capacity of Reinforced
J 2016;114. https://doi.org/10.14359/51689433.
[3] ACI Committee 318. Building code requirements for structural concrete (ACI 318-
rules and rules for buildings. vol. 1. Brussels, Belgium: European Committee for
Standardization; 2004.
https://doi.org/10.1016/j.engstruct.2018.05.084.
[6] Demir F. Prediction of elastic modulus of normal and high strength concrete by
https://doi.org/10.1016/j.conbuildmat.2007.04.004.
https://doi.org/10.1016/j.engstruct.2007.01.003.
[8] Lee S. Prediction of concrete strength using artificial neural networks. Eng Struct
2003;25:849–57. https://doi.org/10.1016/S0141-0296(03)00004-X.
https://doi.org/10.1016/j.conbuildmat.2020.119889.
[10] Trocoli A, Dantas A, Leite MB, Nagahama KDJ. Prediction of compressive strength
https://doi.org/10.1016/j.conbuildmat.2012.09.026.
https://doi.org/10.1016/j.engstruct.2006.05.001.
[12] Jiang K, Han Q, Bai Y, Du X. Data-driven ultimate conditions prediction and stress-
https://doi.org/10.1016/j.compstruct.2020.112094.
29. https://doi.org/10.1016/j.compstruct.2010.04.008.
cylinders confined with CFRP composites using artificial neural networks. Compos
https://doi.org/10.1016/j.compositesb.2012.05.044.
[15] Elsanadedy HM, Al-Salloum YA, Abbas H, Alsayed SH. Prediction of strength
https://doi.org/10.1016/j.compositesb.2011.08.043.
[16] Flood I. Towards the next generation of artificial neural networks for civil
https://doi.org/10.1016/j.aei.2007.07.001.
seismic failure mode identification of reinforced concrete shear walls. Eng Struct
2020;208:110331. https://doi.org/10.1016/j.engstruct.2020.110331.
[18] Keshtegar B, Nehdi ML, Trung N-T, Kolahchi R. Predicting load capacity of shear
https://doi.org/10.1016/j.asoc.2021.107739.
[19] Hwang SH, Mangalathu S, Shin J, Jeon JS. Machine learning-based approaches for
seismic demand and collapse of ductile reinforced concrete building frames. J Build
[20] Mangalathu S, Hwang SH, Choi E, Jeon JS. Rapid seismic damage evaluation of
https://doi.org/10.1016/j.engstruct.2019.109785.
https://doi.org/10.1177/8755293019878137.
2019;193:91–9. https://doi.org/10.1016/j.engstruct.2019.05.028.
[23] Weinstein JC, Sanayei M, Asce M, Brenner BR, Asce F. Bridge Damage
https://doi.org/10.1061/(ASCE)BE.1943-5592.0001302.
road bridge decks based upon arti fi cial neural networks. Eng Struct 2018;171:602–
MSc Thesis Page 89
Application of Machine learning methods for shear capacity of RC beams
16. https://doi.org/10.1016/j.engstruct.2018.05.122.
[25] Morfidis K, Kostinakis K. Approaches to the rapid seismic damage prediction of r/c
https://doi.org/10.1016/j.engstruct.2018.03.028.
[26] Caglar N. Neural network based approach for determining the shear strength of
https://doi.org/10.1016/j.conbuildmat.2009.06.002.
[27] Jalal M, Arabali P, Grasley Z, Bullard JW, Jalal H. Behavior assessment, regression
analysis and support vector machine (SVM) modeling of waste tire rubberized
https://doi.org/10.1016/j.jclepro.2020.122960.
[28] Pal M, Deswal S. Support vector regression based shear strength modelling of deep
https://doi.org/10.1016/j.compstruc.2011.03.005.
[29] Solhmirzaei R, Salehi H, Kodur V, Naser MZ. Machine learning framework for
predicting failure mode and shear capacity of ultra high performance concrete
https://doi.org/10.1016/j.engstruct.2020.111221.
https://doi.org/10.1007/978-3-540-35488-8_8.
https://doi.org/10.3390/risks8030083.
[33] Zhou H, Huang G Bin, Lin Z, Wang H, Soh YC. Stacked extreme learning
MSc Thesis Page 90
Application of Machine learning methods for shear capacity of RC beams
https://doi.org/10.1109/TCYB.2014.2363492.
[34] Sanad A, Saka MP. Prediction of ultimate shear strenggth of reinforced concrete
[35] Cladera A, Mari AR. Shear design procedure for reinforced normal and high-
strength concrete beams using artificial neural networks. Part I: beams without
https://doi.org/10.1016/j.engstruct.2004.02.010.
[36] Chou J, Ngo N, Pham A. Shear Strength Prediction in Reinforced Concrete Deep
9. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000466.
https://doi.org/10.24200/sci.2018.50308.1624.
[38] Chou JS, Pham TPT, Nguyen TK, Pham AD, Ngo NT. Shear strength prediction of
04103-2.
[39] Cladera A, Mari AR. Shear design procedure for reinforced normal and high-
strength concrete beams using artificial neural networks. Part II: beams with
https://doi.org/10.1016/j.engstruct.2004.02.011.
[40] Friedman JH, Hastie T, Tibshirani R. The Elements of Statistical Learning. 2001.
https://doi.org/10.1111/j.1467-985x.2004.298_11.x.
MSc Thesis Page 91
Application of Machine learning methods for shear capacity of RC beams
[41] Collins MP, Bentz EC, Sherwood EG. Where is shear reinforcement required?
https://doi.org/10.14359/19942.
[42] Zhang T, Visintin P, Oehlers DJ. Shear strength of RC beams with steel stirrups. J
541X.0001404.
https://doi.org/10.1016/j.engstruct.2015.05.007.
[45] El-Ariss B. Behavior of beams with dowel action. Eng Struct 2007;29:899–903.
https://doi.org/10.1016/j.engstruct.2006.07.008.
[46] Ormberg G. Evaluating Shear Capacity of Concrete Members with Deficient Shear
Reinforcement. 2010.
[47] ACI Committee 318. Building Code Requirements for Structural Concrete and
[48] Tompos EJ, Frosch RJ. Influence of beam size, longitudinal reinforcement, and
[49] Bentz EC. Empirical modeling of reinforced concrete shear strength size effect for
https://doi.org/10.14359/14274.
[50] Zararis PD, Zararis IP. Shear Strength of Reinforced Concrete Beams under
https://doi.org/10.1016/j.engstruct.2018.05.017.
[52] Collins MP, Bentz EC, Sherwood EG. Where is shear reinforcement require?
[53] Joint ACI-ASCE Committee 445. Recent approaches to shear design of structural
9445(1998)124:12(1375).
[54] Baghi H, Barros JAO. Design approach to determine shear capacity of reinforced
2017;143:4017061. https://doi.org/10.1061/(ASCE)ST.1943-541X.0001793.
[55] ACI Committee 318. Building Code Requirements for Structural Concrete and
USA.; 2014.
[56] AASHTO. LRFD Bridge Design Specifications. 8th ed. Washington, DC:
[57] Japan Society of Civil Engineers. Standard specifications for concrete structures
[58] CSA Committee A23.3. Design of Concrete Structures (CSA A23.3-14). Can Stand
[59] Telkamp GJ. Shear Behavior of Reinforced Concrete Beams and Prestressed
MSc Thesis Page 93
Application of Machine learning methods for shear capacity of RC beams
[60] Vecchio FJ, Collins MP. The modified compression-field theory for reinforced
https://doi.org/10.14359/10416.
[61] Collins MP, Mitchell D, Adebar P, Vecchio FJ. A general shear design method. ACI
[62] Bentz EC, Vecchio FJ, Collins MP. Simplified Compression Field Theory for
2006;103:614–24. https://doi.org/10.14359/16438.
[63] Lee J-Y, Choi I-J, Kim S-W. Shear behavior of reinforced concrete beams with
[64] Cladera A, Marí AR. Shear strength in the new Eurocode 2. A step forward? Struct
[65] Mansour MY, Dicleli M, Lee JY, Zhang J. Predicting the shear strength of
2004;26:781–99. https://doi.org/10.1016/j.engstruct.2004.01.011.
[66] Abuodeh OR, Abdalla JA, Hawileh RA. Prediction of shear strength and behavior
https://doi.org/10.1016/j.compstruct.2019.111698.
members without stirrups using artificial neural networks. Eng Struct 2014;61:99–
112. https://doi.org/10.1016/j.engstruct.2014.01.001.
[68] Chen T, Guestrin C. Xgboost: A scalable tree boosting system. In 22nd SIGKDD
MSc Thesis Page 94
Application of Machine learning methods for shear capacity of RC beams
[69] Rahman J, Ahmed KS, Khan NI, Islam K, Mangalathu S. Data-driven shear strength
https://doi.org/10.1016/j.engstruct.2021.111979.
[71] Nguyen-Sy T, Wakim J, To QD, Vu MN, Nguyen TD, Nguyen TT. Predicting the
compressive strength of concrete from its compositions and age using the extreme
https://doi.org/10.1016/j.conbuildmat.2020.119757.
[72] Nguyen HD, Truong GT, Shin M. Development of extreme gradient boosting model
for prediction of punching shear resistance of r/c interior slabs. Eng Struct
2021;235. https://doi.org/10.1016/j.engstruct.2021.112067.
[73] Mangalathu S, Shin H, Choi E, Jeon JS. Explainable machine learning models for
[74] Machine learning-based prediction of CFST columns using gradient tree boosting
algorithm n.d.
[75] Marani A, Nehdi ML. Machine learning prediction of compressive strength for
2020;265. https://doi.org/10.1016/j.conbuildmat.2020.120286.
[76] Wong LS, Marani A, Nehdi ML. Gradient Boosting Coupled with Oversampling
MSc Thesis Page 95
Application of Machine learning methods for shear capacity of RC beams
Model for Prediction of Concrete Pipe-Joint Infiltration Using Designwise Data Set.
1204.0000557.
https://doi.org/10.1109/64.163674.
https://doi.org/10.1016/S0893-6080(03)00169-2.
[80] Sutton CD. Classification and Regression Trees, Bagging, and Boosting. Handb Stat
2005;24:303–29. https://doi.org/10.1016/S0169-7161(04)24011-1.
https://doi.org/10.1006/jcss.1997.1504.
[82] Fawagreh K, Gaber MM, Elyan E. Random forests: From early developments to
https://doi.org/10.1080/21642583.2014.956265.
[83] Svetnik V, Liaw A, Tong C, Christopher Culberson J, Sheridan RP, Feuston BP.
https://doi.org/10.1021/ci034160g.
[85] Segal M, Xiao Y. Multivariate random forests. WIREs DATA Min Knowl Discov
MSc Thesis Page 96
Application of Machine learning methods for shear capacity of RC beams
2011;1:80–7. https://doi.org/10.1002/widm.12.
2006;63:3–42. https://doi.org/10.1007/s10994-006-6226-1.
[87] Mining D. The Elements of Statistical learning - Springer Series in Statistics. Math
Intell 2009;27:83–85.
[88] Friedman JH. Greedy Function Approximation: A Gradient Boosting Machine. Ann