Seminar
Seminar
Seminar
The science (and art) of training computers to learn from data is called machine learning.
Computers may now learn from experience, just like people and animals do, thanks to machine
learning. Instead of using a preset equation as a model, machine learning algorithms "learn"
information directly from data using computational techniques. As the amount of samples
available for learning grows, the algorithms perform better and better in an adaptive manner.
that lets computers learn from data, forecast future events, and gradually get better at what they
models and algorithms that can automatically learn from data and make precise judgments or
Applications for machine learning are many and span a wide range of industries, such as
predictive analytics, and autonomous cars. Machine learning techniques, for instance, are
utilized by systems for object detection in photos and natural language processing in systems
Your spam filter is a machine learning software that learns to identify spam by showing
it samples of legitimate emails that haven't been marked by users and instances of spam emails
that have. The training set is the collection of examples that the system utilizes to learn. A
Daily life example of how these algorithms are trained, which we all have encountered,
is shown in Fig. 1.1. If you have previously engaged with this system in an attempt to reach
your desired website, then congrats! You have labeled some data for a Google machine-
1
learning model. Within the depths of Google's reCAPTCHA websites, the company states the
The human labor required to solve CAPTCHAs is also put to good use by reCAPTCHA,
which uses the answers to transcribe text, annotate photos, and create datasets for machine
learning. Thus, books are preserved, maps are enhanced, and challenging AI issues are
resolved.
A model is a part of a machine learning system that is capable of learning and making
predictions. Examples of models are random forests and neural networks. Machine learning
algorithms produce predictions or choices based on patterns found in data by using statistical
• Issues for which the current solutions need either extensive lists of rules or a great deal
of fine tweaking (Compared to the conventional method, a machine learning model may
2
• Complex issues for which there is no appropriate answer when utilizing a typical
approach (Maybe the most advanced machine learning methods can figure something
out).
• Machine learning algorithms are well-suited for applications that need real-time or
• Many operations that would ordinarily require human interaction, such as natural
language processing, picture and audio recognition, and predictive analytics, may be
• Because machine learning algorithms are fast and efficient at processing massive
volumes of data, they are well-suited for applications where decisions need to be made
Labeled data, which has known input feature values and associated output values, is used to
train supervised learning systems. The purpose of supervised learning is to develop a mapping
between the input characteristics and the output values so that the algorithm can predict new,
unknown data with accuracy. Using a dataset of photos of cats and dogs, for instance, where
each image is identified as either a cat or a dog, an algorithm for supervised learning may be
taught. In order to categorize fresh photos, the algorithm would first learn to identify the
3
Classification - Bayesian Classifier, K-nearest Neighbours, Regression (Linear, multivariate
Conversely, unsupervised learning algorithms are taught using unlabeled data, meaning that
while the input characteristics are known, the output values are not. Finding patterns and
structures in the data, such as clusters or collections of related data points, is the aim of
unsupervised learning. For instance, without any prior information on which consumers belong
to which segments, an unsupervised learning algorithm might be used to cluster clients based
algorithm.
Reinforcement learning studies how an agent may be trained to operate in a way that maximizes
a cumulative reward or makes the best judgments possible in a given situation. It draws
inspiration from the way both animals and people pick things up via trial and error.
Through interaction with the environment, an agent gains the capability to make decisions
based on input in the form of rewards or penalties in reinforcement learning. The agent acts in
the environment, and it gets feedback in the form of reward signals for its efforts. Learning a
policy, a mapping from states to actions, that maximizes the cumulative reward over time is
Applications for reinforcement learning may be found in a number of fields, such as resource
demonstrated success in handling challenging activities like playing chess and going at a
superhuman level.
4
Classification - Q-Learning algorithm
• Insufficient Quantity of Training Data: For better accuracy of the model, the quantity
of data for which the model has been trained should be sufficient. Insufficiency of it
• Data quality: Large volumes of high-quality data are needed for machine learning
algorithms to learn from. However, the quality and dependability of the generated
models might be impacted by noisy, incomplete, or biased data. The process of cleaning
• Irrelevant Features: The model output will be inaccurate if the training data set contains
• Non-Representative Data: A dataset that does not adequately reflect the features or
non-representative data. Stated differently, the available data does not offer a thorough
• Overfitting: When a machine learning model learns the training set too thoroughly, that
is, to the extent that it picks up noise or random fluctuations in the data that don't
accurately reflect the underlying patterns, this is known as overfitting. Because of this,
an overfitted model may perform splendidly on training data but badly on fresh,
untested data.
or fails to recognize the training data's underlying patterns. Because of this, the
underfitted model could have significant errors on both fresh, untried data and training
data.
• Data security and privacy: With the increasing use of machine learning algorithms,
5
worries over the security and privacy of sensitive data are growing. Inadvertent
judgments or predictions since they can be complicated and harder to interpret. This is
especially crucial in fields like banking and healthcare, where making the wrong choice
generalize successfully to new, unseen data. On the other hand, overfitting can happen
when an algorithm performs badly on fresh data due to overlearning the training set.
rising datasets and manage high data volumes. This can be difficult, particularly for
• Fairness and prejudice: Machine learning models can exhibit bias in a number of ways,
prejudice.
memory and processing power due to their computational intensity. This may restrict
• Algorithm selection: There are several machine learning methods and algorithms, each
having pros and cons of its own. Selecting the right algorithm for a particular job may
learning.
6
2 MACHINE LEARNING ALGORITHMS
In machine learning, a wide variety of algorithms are employed, each having unique advantages
and disadvantages based on the particular problem being resolved and the datatype. These are
A supervised learning approach called linear regression is used to describe the connection
between one or more independent variables and a dependent variable. The objective is to
choose the line that best fits the data and minimizes the total squared errors between the
expected and actual values. For forecasting and prediction purposes, such as projecting stock
When solving classification issues, the objective of supervised learning algorithms like logistic
regression is to predict a binary result (e.g., yes or no). It operates by using one or more
independent variables to describe the likelihood of the dependent variable. When predicting
whether a patient will get a certain illness or if a customer will make a purchase, logistic
7
regression is frequently used in marketing, finance, and healthcare.
For both classification and regression issues, supervised learning techniques like decision trees
are employed. Recursive partitioning of the data into smaller subsets on the basis of the most
informative attributes is what they represent. The tree is split by each partition, and each leaf
node denotes a final judgment or forecast. In finance, marketing, and healthcare, decision trees
are frequently employed for tasks like anticipating client attrition and detecting medical
disorders.
8
2.4 RANDOM FOREST:
This is a supervised learning method that improves accuracy and reduces overfitting by using
subsets of data and then merging their predictions. Random forests are frequently used within
image classification and natural language processing, like recognizing objects in photos and
text categorization.
Using supervised learning, SVM determines the best hyperplane to divide data into distinct
groups and is applied to classification challenges. Maximizing the margin between the nearest
data points and the hyperplane is how it operates. SVMs are often employed in text and picture
classification, for example, in the identification of spam emails and the categorization of
photos.
9
2.6 K-NEAREST NEIGHBORS (KNN):
KNN is a supervised learning technique that locates the k data points that are closest to a new
input and predicts values or labels for those points. It is used for classification and regression
issues. The method determines the k closest points by measuring the distance between each
data point and the new input. KNN is frequently utilized in pattern recognition and
recommendation systems.
An unsupervised learning technique called K-Means clustering divides data into k groups
according to how similar or dissimilar they are. Each data point is assigned randomly to one of
the k-clusters, and the cluster centroids are updated repeatedly until convergence. Image
compression and market segmentation are two typical applications of K-Means clustering.
10
Figure 2-6 K-Means Clustering
Source: anderfernandez.com
Inspired by the human brain's function and structure, neural networks are supervised learning
algorithms that are capable of discovering intricate patterns and correlations within data. They
are made up of several tiers of networked nodes that process data and provide predictions.
Neural networks find widespread use in speech recognition, picture recognition, and natural
language processing, including text transcription from audio and image identification.
11
3 APPLICATIONS OF ML IN CONSTRUCTION
MANAGEMENT
There are several applications of machine learning in construction project management, such
as cost planning, human resource management, forecasting cost overruns, predicting the
outcome of litigation, controlling budget, optimizing project duration, risk prediction, etc. Let
Within the construction planning domain, machine learning techniques can be employed
to analyze past project data, recognize influential factors, and create predictive models. These
algorithms can then be used to make predictions on project costs, timelines, and satisfaction.
Additionally, they can be used to assess the constructability of a project by examining technical
In supervised ML, algorithms are trained using labeled data sets, which contain input
variables (e.g., project attributes) and associated output variables (i.e., project performance
indicators). These datasets are then taught to the algorithms, and the algorithms can then use
hidden patterns, and create relational models. In summary, machine learning techniques allow
for the analysis of large volumes of data and the detection of intricate relationships, which can
(Shayboun et al. 2019)used WEKA software for ML analysis and modeling of project
performance factors. Features of time, cost, and satisfaction of contractors and clients were
selected for prediction. The number of input variables was reduced, and the most distinctive
predictive capability attributes were identified. An analysis was performed to find a correlation
12
between features and project performance using the root mean square error. The error estimate
In summary, machine learning allows us to process great volumes of data and find
complex relationships between them, which helps us to make better predictions and make better
decisions in the construction planning process. In the future, machine learning might be
required in planning because it has the capability to enhance the accuracy and effectiveness of
Traditional planning methods often involve manual data analysis and the subjective
decision-making process. This process can be time-consuming and prone to mistakes. On the
other side, ML algorithms can process large volumes of data quickly and find patterns and
relationships that humans may not be able to see. This can help us to predict project outcomes,
optimize resource allocation, and identify potential risks. By including machine learning in the
planning process, construction companies will be able to make better decisions, lower costs,
However, it’s important to remember that machine learning should not replace human
expertise and judgment but rather be a tool that supports human decision-making. Human
thinking and experience are still necessary to interpret and apply the insights produced by
ML is really useful in HRM. (Garg et al. 2022) found that HRM adoption of ML is highest in
Recruitment and Performance Management. ML algorithms, like decision trees and text mining
algorithms, are used to classify everything in HRM. But for more complicated processes, ML
is still in its early stages. To make HRM more ML-friendly, HR pros and ML pros can work
together. HR pros need to learn how to analyze data and make decisions based on evidence,
while ML pros can give technical advice on how to use ML algorithms. This can help solve
13
problems like data collection, storage, security, and taking into account multiple factors at once.
(El-Sawalhi and Shehatto 2014) Developed a neural network model to provide an accurate
estimate of the initial design construction cost for building projects. This model was based on
data collected from the construction organizations in the Gaza Strip and used to identify eleven
important parameters that influence the project cost. These parameters were then inputted into
the model as input variables, allowing the model to accurately predict the cost of the building
projects without requiring detailed drawings. An average percentage error rate of less than 6%
was observed in the tested dataset. Sensitivity analysis revealed that the area and number of
floors of the typical floor were the most significant parameters in the estimation of the building
cost.
(Mahde Abd et al. 2019) Provides an overview of a prediction model developed using
multiple linear regression techniques to estimate construction project costs in Iraq. The
methodology was based on historical cost data, and 25 variables were identified to contribute
to the estimation of the cost. An MLR model was developed with weighted least squares (WLS)
to facilitate the development of the model. The results demonstrate that the MLR model with
WLS has a high correlation coefficient of 95.8% and a high degree of accuracy of 98.97%. The
outcome of the study is that MLR, with WLS, is a promising approach to cost estimation in
construction projects in the early stages. This eliminates the need for manual estimation and
dependence on intuition, resulting in a more accurate and objective cost forecast technique.
Because it is simple and straightforward to use, the created model may be utilized by
Machine learning algorithms are capable of analyzing historical data and recognizing
patterns and correlations between different parameters and project expenses. This enables more
precise and cost-effective cost estimates, even when detailed drawings are not available at the
14
beginning of a project. Machine learning has a bright future in the construction cost estimation
industry, as it is capable of adapting to changing conditions and learning from experience, thus
increasing the reliability of cost estimates in the long run. In summary, machine learning can
revolutionize the cost estimation process in construction projects and can help to improve
(Li and Li 2022) The Dynamic Distributed Clustering (DDC) method optimises the
management of construction cost budgets by assessing project cost data. This method assists
in the formulation of the Iterative Control Rule and the Iterative Performance Index Function,
By utilizing this method, the Building Project Cost Budget Control System can increase
its reliability. The future of this algorithm lies in its potential to be used in real-world
construction projects, where its application may lead to more accurate and efficient cost
control.
LEARNING
The utilization of machine learning for construction project duration estimation can have
a range of advantages for project management effectiveness. Machine learning models are able
to provide more precise predictions of project duration than traditional estimation methods,
allowing project managers to better plan and schedule resources, thus increasing the efficiency
of project execution.
Additionally, models can be trained using past project data, allowing managers to take
advantage of past experiences and learn from them. This can result in improved estimation
15
accuracy and improved decision-making for future projects, helping project managers to
identify critical factors and make informed decisions to reduce risks and delays.
Al-saadi et al. 2017 paper examines the potential of Artificial Neural Networks (ANNs)
to provide an estimation of the length of a road project in Iraq. It draws on the historical data
of 99 road projects, as well as six variables, such as the Length of the Road, the Number of
Standard Lanes, the Volume of Earthworks, the Number of Intersections, the Types of Paving
(Flip or Rigid), and the Road Furnishing Level. The results demonstrate a robust relationship
between the actual length and the predicted length, with a particularly high level of precision.
The article emphasizes the significance of precise duration estimation in the management of
road projects, and the advantages of incorporating ANNs into the process.
CONSTRUCTION LITIGATIONS
especially when it comes to AI. IPM, or integrated prediction model, is a great way to use
machine learning to predict the outcome of a lawsuit (Arditi and Pulket 2010). It can make
predictions faster and more accurate than other methods, like neural networks and boosted
decision trees, which can save people a lot of time and money. So, it looks like machine
This study (Arditi and Pulket 2010) introduces an integrated prediction model (IPM) that
saving time and money for all parties involved in the construction process.
16
3.7 MACHINE LEARNING TO PREDICT QUANTITY OF
MATERIAL
The complexity and variation of construction projects means that machine learning is
needed for quantity prediction. This is important because it helps to predict material quantities
more accurately, which is important for managing resources in construction projects. Plus,
machine learning allows the model to adjust to different project conditions and give contractors
real-time info, so they can make better decisions about the feasibility of the project and how to
allocate resources.
Machine learning is helpful for quantity prediction because it can create predictive
models that can look at past data and make predictions about the future quantity of construction
materials. For example, the QPM (Quantities Predictor Model) based on AI for GSC Building
Contractors uses machine learning, specifically ANNs, to predict the quantity of key
construction materials like cement, reinforced steel and aggregate (Haddad 2016). QPM uses
machine learning to look at big data sets and spot trends in how building materials have been
Construction managers can benefit greatly from the use of Machine Learning (ML) models to
predict earned value indexes. By taking into account historical data and recognizing the
elements that influence earned value, these models can provide precise estimates and forecasts
for Schedule Performance Index (SPI), Cost Performance Index (CPI) and Total Cost
Performance Index (TCPI). These data are essential for construction managers to evaluate the
17
issues or budget deviations in advance and take proactive steps to ensure the success of the
(Aidan et al. 2020) This study looked at how an ANN model can be used to predict earned
value indexes for residential complexes. It was found that it was better than traditional linear
methods when it came to estimating and forecasting the earned value. It used things like the
budget when the project was finished, the actual cost, the actual percentage, the earned value,
It was able to predict the SPI, the CPI and the TCPI with a correlation of 90% and a
typical accuracy of 89%. The researchers think that ANN models could be a great replacement
for traditional methods when it comes to estimating and predicting earned value. All in all, it's
a great tool for construction managers to make better decisions based on data and to make
(Thanh Nguyen et al. 2022) This paper analyzes the use of AI models, in particular
Support vector machines (SVM), and Radial basis function neural network (RBFN) to predict
the Construction price index (CPI), and the results indicate that both models perform well in
The goal of predicting the construction price is to predict the fluctuation of the
construction price over time, which is important for the various parties involved in the
construction sector, such as the organization, individual, management agency, investor, and
construction contractor. By predicting the construction price, these parties are able to plan and
manage their construction projects by accurately estimating and evaluating their costs. This
helps to reduce the time, effort, and procedures needed to calculate and adjust the total
In addition, forecasting the construction price allows the identification of the construction
market trends, moves, and directions, which helps to make better decisions when it comes to
18
budgeting, contract management, and cost control. All in all, forecasting of the construction
price serves as an effective tool for controlling the construction investment costs, mitigating
The overall conclusion is that the ML models are a dependable resource for estimating
construction project costs, enabling budgeting, bidding and project management, enabling
stakeholders to make informed choices, optimize resources and reduce financial risk in the
construction sector.
19
4 BENEFITS AND CHALLENGES OF ADOPTING
MANAGEMENT
CONSTRUCTION MANAGEMENT
• Improved Project Cost Estimation and Prediction: ML algorithms can more accurately
assess previous project data, find patterns, and forecast future expenses. This can assist
construction firms in making better informed judgments during project bidding and
• Enhanced Project Schedule Management: Machine learning can assess project plans,
identify possible bottlenecks, and provide timetable recommendations. This can assist
projects on schedule.
sensor data from construction equipment to detect early symptoms of wear and tear,
allowing for proactive maintenance and decreasing downtime. This can result in
20
• Automated Safety Monitoring and Compliance: ML can analyse construction site video
footage to identify possible safety issues, such as workers not wearing correct PPE or
using machines without proper training. This can assist to prevent accidents and ensure
automate repetitive processes like document processing and data input, allowing
construction experts to focus on more sophisticated and strategic duties. This has the
create design options that optimise building layouts, material utilisation, and energy
construction projects.
• Improved Risk Management and Mitigation: Machine learning can examine project
data to identify possible hazards like supply chain interruptions, weather disasters, or
21
• Data-Driven Decision-Making: Machine learning may extract insights from massive
can lead to more informed project planning, resource allocation, and risk management
decisions.
CONSTRUCTION MANAGEMENT
Despite the numerous benefits, there are also challenges associated with adopting ML in
• Data Quality and Availability: For training and operation, ML systems rely on high-
quality, well-structured data. The construction sector frequently problems with data
silos, fragmented data, and inconsistent data formats, making data collection and
ML/data science. There is a scarcity of experts with both skill sets, making it difficult
22
• Cultural Barriers and Resistance to Change: The construction sector has a history of
being risk-averse and may be difficult to accept new technologies such as ML.
may be sophisticated and opaque, it can be difficult to grasp how they make judgments.
construction industry, where choices can have severe financial and safety
consequences.
processes and creating extra work silos, ML solutions must be easily integrated with
applications must adhere to relevant rules and ethical principles, notably in terms of
data protection, fairness, and openness. Addressing these concerns is critical for
23
5 FUTURE SCOPE
Machine learning in construction management has the potential to change the industry and
create major gains in efficiency, safety, sustainability, and overall project results. We should
expect to see significantly greater use and more inventive applications across the whole
Here are some key areas where ML is expected to have a future scope of construction
management:
With some accuracy, machine learning can already estimate project costs, timelines,
and quality. However, there is much space for development in this area. Researchers
may, for example, create more accurate models that take into account additional
aspects, such as historical data from prior projects, meteorological data, and economic
data.
building sites for safety issues. However, these systems are not always trustworthy, and
further study is needed in this field. Researchers may, for example, create systems that
can better identify and track safety dangers, as well as automatically inform safety
Machine learning may be used to automate repetitive processes such as data input and
document processing. However, there is much space for development in this area.
Researchers may, for example, create systems that can automate more complicated
24
• Construction planning: Construction designs may be optimised using machine learning.
However, this is a very young area of study, and there is much possibility for
advancement. Researchers, for example, may create systems that optimise plans for
25
6 CONCLUSION
improve project management processes, optimize resource utilization, and achieve outstanding
results. ML offers a range of advantages that can have a direct effect on the successful
completion of construction projects. For example, predictive analytics can provide precise cost
and schedule projections, allowing for more informed decisions and reducing the likelihood of
costly over- or under-budgeting. Additionally, automation and compliance systems can detect
potential risks and enforce safety protocols, resulting in a safer and more efficient work
environment.
improve their performance, enhance project outcomes, and gain a competitive advantage in a
rapidly changing industry. By taking advantage of ML and addressing the challenges it entails,
construction managers can help their companies lead their organizations into a future that is
26