Untitled
Untitled
Untitled
ad hoc query data exploration (Capistrano, ○ 2010 - popularisation of big data &
1.2. Origin of Business Analytics 1.3. Big Data and Business Analytics
From ancient times, people have had this We have already defined Business
need to predict the future. They have used Analytics and Business Intelligence. This
crude methods like stones and sticks to time, let us learn another concept related
project how much crops they were going to to BA which is the Big Data.
harvest or how much they were going to
yield. In this section, we shall trace the So, what is big data?
development of business analytics from
the ancient to the modern time SAS defines big data as a term that
describes the large volume of structured
1. Prehistoric and unstructured data which can be
● The earliest humans would use sticks and analyzed for insights needed for better
stones to help predict sales trends of new decisions and strategic business moves.
inventions.
● First forms of business analytics may be IBM, on the other hand, refers to it as data
statistics whose uses can be traced back at least sets whose size or type is beyond the
to the biblical times in ancient Egypt, Babylon, ability of traditional relational databases to
and Rome. Attributed to its usefulness for capture, manage, and process the data
helping the policy maker (including ancient
with low-latency.
rulers or kings) make a better decision
2. Industrial (1800s)
● Frederick Winslow Taylor develops the concept
What is the difference?
of business management as a scientific
discipline Business Analytics
● 1908 - Henry Ford would apply this principle - focuses on financial and
and measured the time it took for each operational analytics of the
component of his Ford T Model to assemble. business
- relies on the person looking at the
3. Information age (mid-20th century) data and drawing inferences from
○ Digital Age / 1970 - computers it.
played a huge role in the
advancement of business analytics Big Data
with the introduction of Decision
- helps to analyze a broader range of
Support Systems
Information Age / 1980 - Data
data coming in from all sources
○
warehouses became popular to store
- involves machine automation to
and organize large amounts of data,
analyze data.
the first iteration of today’s server
farms. The importance of big data is not on how
○ 1990 - Introduction of Enterprise much data you have, but what you do with
Resource Planning by Gartner those data.
4. Internet age (21st century)
○ The internet, big data and the cloud
came along, with the massive
There are four aspects that define big data 2. Data warehousing. This is where the
which are volume, variety, velocity and data is cleansed, curated, organized, and
veracity. ready for analysis.
1. Volume is about how huge the data sets IT automatically joins data in a
are. “normalized form” in preparation for the
2. Variety includes how many pieces of business analyst
data from diff sources we gather together
from social media data, government data, 3. Extract, Transform, or Load
financial data, banking data, all sorts of Processes (ETL). This is the process of
transactions all combined together to moving data from source systems to data
make one or more profiles for your warehouse to an analytical tool.
customers.
3. Velocity is the speed of data. How data becomes business value:
4. Veracity means that there is a lot of It has to be extracted, curated and
uncertainty, meaning, there is all of these cleansed, joined in a data warehouse in
different data coming together, but the which point is now prepared for
problem is we don’t know what to do with analytics.
them.
2.2 Types of Analytics
MODULE 2: BUSINESS ANALYTICS Now that we have collected the
FRAMEWORK right data and processed them into usable
form, we need to find at what level are we
Introduction going to dig deep into the data to get the
fact-based insight we need to find out so
we can come up with better business
Information, in its raw form, is not much decisions.
useful for business decision making. It has
to be collected, analyzed, and present in a There are three types of analytics that can
way that is useful for decision making. help businesses make the most out of the
big data that they have and find the
solutions to problems that they have:
2.1 Framework for Business Analytics descriptive, predictive and prescriptive.
The process of turning raw data into 1. Descriptive Analytics. This is the data
business action is the framework for that is used to benchmark or to profile.
Business Analytics. Concerns itself with presenting data as it
is.
There are 3 steps in turning data into
analytics which are Data Extraction, Data Most Common Approaches / Methodologies;
Warehousing and the Extract, Transform,
or Load Processes (ETL). 1. Cross Tabulation
-Is getting granular bits of data. Summarizing and
1. Data extraction. This is the first step in aggregating it for better analysis.
turning data into analytics. There are at -It is the most basic methodology , various analytics build
on cross-tabulation
least 3 sources of data which are the
source systems, raw transactions, and 2. Hypothesis and Inferencing
from documents and forms. Once data is transformed we can draw early insights and
conclusions.
● Correlation - type of inference, determine which It is the foundation of predictive analytics. Types of model
factors move together and which factors don’t. and algorithms used depend on what kind one is
● T-test - type of inference , determines the modelling.
similarities or difference between two groups.
Regression - attempt to model independent variables
3. Data Visualization (predictors) to predict a dependent variable (outcome)
After tabulation and inference, best way to show insights ● Linear regression - for continuous variables (a
is through visualizations such as charts: person’s salary).
● Time-Series (Trend) - how value changes ● Logistic regression - for probability or likelihood
overtime of an event (likelihood a person will earn 50k a
● Stacked Area (Composition) - how it changes, month)
intuitive feel of where values are
● Heat Map (Relative Weight) - columns, high and Other predictive analytics model: clustering, decision
low values trees, support vector machines. Neural networks -
● Histogram (Distribution) - to visualize 2 machine learning methodology
population
● Scatterplot (Relationship) - shows relationship To perform predictive analytics, first understand what
between 2 variables. This chart is more relevant kind of outcome you’re modelling and what are the
with predictive analytics. predictors for that outcome.
2. Predictive Analytics. This is used to Once the relationship is determined then the model can be
determine relationships between two tested for accuracy. Ex: trend line is tested using
R-squared with a residual graph to see how far or close
different types of data and make the outcome is to the historical data that was trained on.
predictions about future data.
2. Outputting Model Components
Concerns itself with modelling The model components ARE the relationships generated
relationships between predictions and from the statistical model. These components are later
outcomes. These relationships are tested used for predictions. Often more useful to understand the
for accuracy on an initial sample of data. nature of the relationship itself.
Once an acceptable model is generated,
predictions are generated by feeding the
model a new set of prediction data.
Common approaches / methodologies:
1. Statistical Analysis
The coefficients are the model components from the essence of prescriptive analysis: knowing
regression line. the costs and benefits of a decision.
3. Outputting Predictions
MODULE 3: DATA AND DATABASE
With model components determined, predictions can now
be created by inputting new values of the variable into the
MANAGEMENT
model. Predictions are generated by substituting new
values for the test data.
Introduction
Good business decisions rely on access to
accurate data. To facilitate this,
organisations need a system for recording,
updating, and tracking data on a regular
basis. Once collected, data can be
retrieved and presented in formats useable
to managers and employees of the
organization through computerised
databases. Various database elements can
Tools for Predictive Analytics be linked through database management
systems.
1. Spreadsheets
Excel can do simple modelling via regression and solver.
1. Data, Database and Database
Measures in excel can be generated using data analysis
function called Solver
Management Systems
What is a database?
3. Prescriptive Analytics. This is used to
create recommendations through 1. A database is a collection of logically
simulation and optimization models. It related data and it is typically visualize as
involves providing advice on what actions tables; composed of cells matched with
can be taken given the predictions about several columns and rows (Recario, 2018).
the future and what drives the future. The
2. In computer terms, database is a
collection of data, not necessarily always
stored in a computer, such as records
stored in a filing cabinet, in a notebook
(Crisanto, 2018).
What is a Database Management System update, and delete) our data within
(DBMS)? the DBMS.
● Database definition. Metadata
describes the contents of your
1. A DBMS is a collection of interrelated database. This is a data that
data using the software and hardware describes another data in terms of
used to access the data in a useful manner its constraints, size, and the data
such as database or set of databases type of the tables or the databases
stored in a disk or other media, a that are contained within your
computer, or set of computers where the DBMS.
software to access the database can run ● Data Processing. This allows the
(Crisanto, 2018). user to perform different types of
operations such as string
2. A DBMS is a software package or manipulation, date operations and
software that allows you to store, retrieve, so on. You can also manipulate the
package your database (Recario, 2018). data based on an event or what
they call ‘triggers’ which is based
on the event or the conditions of
2. Functions of a Database Management the event.
System What are the functions of a ● Data sharing. This means that
DBMS? within a database, privileges are
provided to different types of users.
What are the advantages of using a Different types of users can access
DBMS? Here are some of the functions of the same database but the same
a DBMS (Crisanto, 2018, Recario, 2018 ): users can have different privileges.
● The DBMS can provide a
convenient and efficient interface 3. Components
for storing, retrieving, and updating of a Database
data, and for extracting useful System
information from the database, and
a clear and logical view of the
process that manipulates the data. A database
● Data Independence: DBMS system has the
maintains the segregation between following
the program and the data components
● Concurrency Control. This refers (Crisanto, 2018):
to a process to ensure that data is
updated correctly and
appropriately when more than one
person is accessing the data 1. Users
● Recovery Services. This refers to
mechanisms for recovery of data The users of a database system are
that is apparently lost due to the database administrator, system
system crashes developer, and end user.
● Utility Services: An initialization ● A database administrator (DBA) is
and maintenance operations on a the person responsible for all the
database. data resources in an organization.
● Manipulation of data. It tells us The DBA uses tools that come with
that we can CRUD (create, retrieve, a DBMS to improve the
productivity and performance of The final component of a database
database planning and design. system is the database itself. This is the
● The system developers are those space in the disk or computer where the
groups of people who create the data are actually stored.
application programs that cater to
the user requirements. And, they
use their own tools to write 3.4. Components of a Database
programs that communicate with Management System
the DBMS.
● Finally, the end-users in an
organization are the ones that can
add, update, and delete data in a
database through application
programs or directly through a
DBMS. They use the application
program to accomplish their
day-to-day tasks.
2. Database Application A database system has the following
The next component of the components (Recario, 2018):
database system is the database
application. This is a computer program 1. Interface
that allows users to manipulate the data in
a DBMS through a user-friendly interface. The interface is responsible for
It can be divided into four broad interacting with the user. It is also called
categories: the “middleman” as it communicates
● Personal which is restricted to a between two different parties: the end
single user user, which is you, and the database
● Departmental which is referenced management systems.
by hundreds of users over a shared
system or network The interface allows the end user to input
● Enterprise which extensions of values that will be understood by the
departmental applications involving database management system. It is also
thousands of users responsible for the receiving and
● Internet which is the largest form outputting, and providing the results of
of information sharing where the queries or the commands that the end
billions of users are involved. user issued.
2. Parser
3. Database Management System
(DBMS) The parser is responsible for checking the
The DBMS decouples application correctness of the syntax that was
programs from data. The databases store provided by the end user through the
all its data in one location, thereby limiting interface. So it also checks the tables and
data duplication. other objects in the query.
Examples of DBMS include Access, Oracle, 3. Query Optimizer
IBM’s DB2, and SQL Server.
If the parsing is correct and proper, then it
4. Database will proceed to what we call the query
optimizer.
The query optimizer comes from the root
word ‘optimize’, which means it simplifies Three types of analytics.
the query form or the input into a simpler 1. Descriptive : data-driven
type of input that can be digested and can benchmarking
be understood by the database
management system. This allows the This is data that is used to
DBMS to save time and save space when benchmark and to profile to be able to
performing operations. establish historical trends. It is the
commonly used analytics by companies.
4. Execution Engine Not making any predictions nor looking for
any relationships yet - just looking at data
The next one is the execution engine. Its as it is.
main responsibility is to execute the
output of the query optimizer. Once the 2. Predictive : data driven prediction
query has been optimized, then it will pass
the data or the optimized query to the We now then look at the
execution engine whose sole purpose is to relationships of the data. Once we know
execute the command. the relationships inside the data, we can
make predictions and forecasts by feeding
5. Storage new data into the predictive model.
Storage is the physical location of your
database, which can be the secondary or 3. Prescriptive: data driven
the primary memory. recommendation
The third type of analytics is the
prescriptive analytics. It involves providing
MODULE 4: APPLICATIONS OF advice on what actions can be taken given
BUSINESS ANALYTICS the predictions about the future and what
drives the future. The essence of
Introduction prescriptive analysis: knowing the costs
and benefits of a decision.
We learned that raw data needs to be
extracted from its sources, cleansed,
curated, and warehoused to prepare it for Most common applications of Business
analysis. These data can be analyzed in Analytics in the Industry
different ways depending on the problem
or situation at hand. They can be ● Credit Card Companies - Credit
leveraged in various ways to optimize and debit cards are an everyday
business operations. part of consumer spending.
● Customer Relationship
Three types of commonly used analytics -- Management (CRM) - Excellent
descriptive, predictive and prescriptive. customer relations is critical for
Applications for business analytics any company that wants to retain
especially in the three functions which are customer
mostly present in the industry: Finance, ● Finance - Corporations turn to
Human Resource, and Marketing. business analysts to optimize
budgeting, banking, financial
planning,
● Human Resources - Great
1. Types of Analytics businesses are composed of a great
staff, and it’s HR’s job to not only Analytics plays an important role in
find the ideal candidates but keep human resources.
them on board.
● Manufacturing - Business analysts ● Suppose you are an HR manager
work with data to help and you want to understand the
stakeholders understand the things demographics of the employees in
that affect operations and the your company. You may need
bottom line. information on the diversity of the
● Marketing - Business analysts employees’ location, address,
help answer these questions and so school graduated, etc. This is
many more, by measuring where descriptive analysis can be
marketing and advertising metrics, used to manage human resources
better.
2. Applications of Business Analytics in ● As an HR manager, you may also
Finance want to determine which employees
are likely to resign, because
Applications of business analytics in recruiting new people will mean
finance industry. additional cost for the company.
You may use predictive analysis
● An example of descriptive analysis in this case by looking at the
in finance is using analytics in historical patterns of resignations
reporting financial results, from to determine the likely causes of
gathering financial inputs from resignations and the number of
different sources, cleansing it, to employees that are likely to resign
reporting it. in the future.
● Predictive analysis, on the other ● Finally, as an HR manager, you
hand, is when we want to predict may also want to determine the
the trend of sales for the next two drivers that make employees stay
months using historical patterns of in the company. Employee
seasonality, and examining engagement, such as looking at
whether investing a lot in sales what makes them content, happy
people might also drive the sales and stay in the company (ex. party,
trend. bonus, free training, etc) is where
prescriptive analysis can be used.
● Prescriptive analysis is applied
when we want to determine the
feasibility of the project, say the 4. Applications of Business Analytics
likelihood that the project will Marketing
falter, or overshoot the budget, or
fail. This is with the assumption Analytics is very useful especially if you
that we have looked into are in the marketing industry.
consideration the various inputs
such as cost of materials, location ● Let’s say you are in charge of
of project, etc monitoring the results of an online
marketing and sales campaign in
your company. You can use
3. Applications of Business Analytics in descriptive analytics to determine
Human Resources (HR) how many people clicked the ads,
how many people bought the
product, how many people paid
cash-on-delivery, or by credit card.
1. Ethical and Legal Considerations in
● If you want to understand how Business Analytics
factors (ex. price, marketing mix
and attributing the effect, Some examples on ethical and legal
channels, mode of payment, etc) considerations in business analytics such
contribute to the performance to as danger of discrimination and bias,
predict the future performance problem when the integrity of the data has
(success or failure) of a campaign, been compromised, and problems on the
or to achieve targets, you can use lack of transparency. Data privacy law and
predictive analytics. some examples on addressing data privacy
Concerns.
● Prescriptive analytics in
marketing can be seen through the
recommendation engines which are 1.1 Discrimination and Bias
found to be successful in driving
more sales. These are the When we make decisions based on the
recommendations that you can see results of business analysis involving big
whenever you visit an online data, there is a high probability that it will
shopping website, say to buy a affect thousands and even millions of
book. This recommendation will go people.
something like “those who bought
that book also bought this book”. Discrimination and bias happens when
This recommendation is based on our judgement and assumptions are
the prescriptive algorithms which influenced by our own biases. (Bañez,
look at the patterns of people who 2018).
buy the same product or profiles Suppose you used business analytics to
that look similar or items that are screen candidates for employment, and
purchased together. you decided that we are going to eliminate
those who are not employed one year prior
application, since you may feel that this
MODULE 5: ETHICS ISSUES IN seems objective and rational. You want to
BUSINESS ANALYTICS hire people who have continuity in working
and the one-year gap may be a source of
Introduction suspicion. However, this decision may be
biased to certain group of people such as
Ethics is a moral principle that somehow mothers who have to take a one-year leave
guides a person on what is bad and what to take care of their children after giving
is good. We need to be aware of our ethical birth. You may be biased against these
obligations as we practice and apply mothers who will not be qualified for the
business analytics in your profession. The job, not because they are not fit for the
results of business analysis when used as job, but because they chose to take care of
basis to make important decisions can their children after giving birth. These
affect people and the organization, thus, biases will involve serious consequences
there is a need to learn about the ethical and certain people will be unfairly denied
and legal considerations and implications resources.
of business analytics. As business
analytics practitioners, we also need to In this example, your bias against women
know some of the general information may cause a portion of women ending up
about the data privacy being rejected from consideration even
though they are just as good, or even Suppose you think that your company
better, than other candidates. made a wrong decision in not promoting
you. So how do you appeal to this if you
Bias filters out people and limits their resources, think that their method is based on bias,
this limitation to resources then results in risk and thus faulty? To appeal, you need to
factors such as lack of jobs or education. These know the process to determine if you are
people who are devoid of jobs or education are then right, and they are wrong, and decided
filtered out by the bias which creates a based on bias. You also need to know that
self-fulfilling loop. Bias - No Resources - Risk the data sets they used to decide is
Factors - accurate. But chances are, none of these
would be available to you because of
Tools should be validated if it reflects the real
non-disclosure, intellectual property, and
world. Tools are only a means to an end not the end
other institutional agreements you have
with the company. How can you now
itself
question that decision and make your
appeal if you don’t know how the decision
arrived at?
1.2 Integrity of Data Analytics So here, you can see that we need
Another ethical issue concerns the transparency to provide us access as our
integrity of data analytics. As we use the means to understand how systems work
results of analytical analysis to make and affect our lives.
certain decisions, we should also take These analytics can be interpreted as a “black box”
consideration the assumptions used to where information is put, processes it and provides
ensure its validity. Using wrong results without giving information of the internal
assumptions, even if we use the right
workings or what happened in between.
analytical tools will bring us to wrong data.
The right tool is useless if we have
corrupted, incomplete, outdated, or When people are introduced to the inner workings
irrelevant data. of the “black box”, they would most likely encounter
mathematical jargons and get overwhelmed and
An example of this is when a credit card then give up - this term is called “the authority of
company used a data on a client’s inscrutable”
purchases which he never made in the
first place, or mistakenly used on a This system is indeed convenient but opaque.
different client with the same name. It can
also be using correct data but interpreting Principle of transparency requires not just access
it the wrong way. These are just some of but to provide meaningful access - giving means to
the examples that show how integrity understand the essence of how it works.
of data can be compromised.
Practitioners require to be responsible for the 2. Ethical Implications of Business
integrity of their data, methods and interpretations. Analytics
Disclosure of customer data is prevalent in
1.3 Lack of Transparency day-to-day business, especially in the
internet via social media and searches,
Transparency, is another ethical issue and most individuals are unaware that
which is related to the first two issues that whenever you perform a search or post
we have discussed earlier. something in social media, you already
leave a digital trace in the world wide web
and these traces can be used to profile you Not only does the person have the right to
without your consent (Ligot, 2018). You be informed, he/she also has the right to
should know by now that this is true and access the data that a company may have
possible through the use of technology. about him/her--not only the content but
Our information are being collected and the sources of the information as well. You
analyzed, each time we visit a website, can go to the companies that maintain
click an ad, shop online, etc, and most your information and ask them exactly
people are not aware of it. what they keep. The right to information
and the right to access can help address
our earlier concerns about bias and
Data Privacy Law transparency.
With the information you can obtain, you
We leave digital traces whenever we go can determine whether or not the
online, even if we are not disclosing our information was processed fairly or if they
identity. And these traces are used by use correct or updated information. Since
marketing companies to collect and you know the information that they have
process data such as which websites we about you, you can determine whether or
visit, which products we buy, which ads not that information is correct or updated.
we watch, and so on. With the growing
concern that people are losing their ● The Right to Rectify
information from companies who collects
data without their knowledge, has You also have the right to rectify. If the
prompted many countries, including the information they have about you is
Philippines, to pass data privacy laws. inaccurate our outdated, under the data
privacy act, you have the right to have it
In the Philippines, the Data Privacy Act corrected. You have the right to complain if
was made to protect people and bring back the company fails to rectify the data or if
control of their own information. the company fails to respect your other
rights, you may complain before the
As a business analytics practitioner, you National Privacy Commission or you can
need to know some of the general go to courts.
information about the data privacy act
(Bañez, 2018). ● The Right to Complain
Once it is established that your rights are
• Consent Required and the Right to be violated, the company which violated your
Informed rights can be made to pay a fine, and pay
you directly, damages. In some cases, the
Before you can collect and process one’s company’s officers may become criminally
information, you need to get his/her liable – they can go to jail. The company
consent. The consent has to be recorded that processes your data will also have the
and informed, meaning the person has to responsibilities to make sure that your
understand the purpose which his/her data is protected from unauthorized
data is being collected and processed. He access. So, they can’t just leave your data
or she has the right to know how his/her out there in the open – they have to take
data will be processed and whom data will reasonable steps to make sure that your
be shared. data is secure. This responsibility, along
with the right to access and right to rectify
● The Right to Access your information, can help address our
earlier concerns about integrity.
1.2 Impacts and Challenges
1. Introduction to Analytics Benefits
◦ Reduced costs, better risk management,
Analytics is the use of: faster decisions, better productivity and
● data enhanced bottom-line performance such
● information technology, as profitability and customer satisfaction.
● statistical analysis,
● quantitative methods, and Challenges
● mathematical or computer-based ◦ Lack of understanding of how to use
models analytics, competing business priorities,
to help managers gain improved insight insufficient analytical skills, difficulty in
about their business operations and make getting good data and sharing information,
better, fact-based decisions. and not understanding the benefits versus
perceived costs of analytics studies.
1.1 Applications of Business Analytics
Pricing 1.3 Scope of Business Analytics
◦ setting prices for consumer and
industrial goods, government contracts, Descriptive analytics: the use of data to
and maintenance contracts understand past and current business
performance and make informed decisions
Customer segmentation
◦ identifying and targeting key customer Predictive analytics: predict the future by
groups in retail, insurance, and credit card examining historical data, detecting
industries patterns or relationships in these data,
and then extrapolating these relationships
Merchandising forward in time.
◦ determining brands to buy, quantities,
and allocations Prescriptive analytics: identify the best
alternatives to minimize or maximize some
Location objective
◦ finding the best location for bank
branches and ATMs, or where to service Example:
industrial equipment Most department stores clear seasonal
inventory by reducing prices. Key
Social Media question: When to reduce the price and by
◦ understand trends and customer how much to maximize revenue?
perceptions; assist marketing managers
and product designers Potential applications of analytics:
Descriptive analytics: examine historical
data for similar products (prices, units
sold, advertising, ...)
Predictive analytics: predict sales based
on price
Prescriptive analytics: find the best sets
of pricing and advertising to maximize
sales revenue
2. Tools Example of Data Sources and Uses
● Database queries and analysis Internal
● Spreadsheets ● Annual reports
● Data visualization ● Accounting audits
● Dashboards to report key ● Financial profitability analysis
performance measures ● Operations management
● Data and Statistical methods performance
● Data Mining basics (predictive ● Human resource measurements
models)
External
● Economic trends
2.1 Software Support ● Marketing research
3. Mathematical model:
S = aebect Non-Linear Demand Prediction Model
where Assumes price elasticity is constant
• S is sales, (constant ratio of % change in demand to
• t is time, % change in price)
• e is the base of natural logarithms, and
• a, b and c are constants that need to be
estimated.