HRA Merged
HRA Merged
HRA Merged
Introduction to HR
Analytics
1
"Information is the fuel of the 21st century, and data analytics
is the combustion engine"
-Peter Sondergaard
Vice President of Gartner
Motivation for studying analytics
1. Rising popularity: The global HR analytics market is expected to exhibit a
10.4% growth rate during the forecast period of 2019-2025 (DUBLIN, Jan. 9,
2020 /PRNewswire/).
3. Assure bright career: It was predicted that there will be a shortage of more
than 1.5 million managers with adequate training in analytics in the United
States alone by 2018.
3
Answer the following questions
1. How does the bank assess the riskiness of the loan it might make
to you?
5
Introduction
• Three developments spurred recent explosive growth in the use
of analytical methods in business applications:
• First development:
• Technological advances, Internet social networks, and data
generated from personal electronic devices, produce incredible
amounts of data for businesses.
6
Introduction
• Three developments spurred recent explosive growth in the use
of analytical methods in business applications: (contd.)
• Second development:
• Advances in computational approaches to effectively
handle and explore massive amounts of data
• Faster algorithms for optimization and simulation, and
• More effective approaches for visualizing data.
7
Introduction
• Three developments spurred recent explosive growth in the use
of analytical methods in business applications:(contd.)
• Third development:
• Computing power and storage capability.
• Better computing hardware, parallel computing, and cloud
computing have enabled businesses to solve big problems faster
and more accurately than ever before.
8
Figure 1 - Google Trends Graph of Searches on
the term Analytics
9
Decision making
1. Tradition
2. Intuition
3. Rule of thumb
10
A Categorization of Analytical Methods
and Models
11
A Categorization of Analytical Methods and
Models
• Descriptive analytics: It encompasses the set of techniques that
describes what has happened in the past.
13
Data Visualization Techniques and Tools
•Interactive dashboards are very easily constructed with Excel 2010
and higher versions
•Qlik
•Spotfire (Tibco)
•Workday
15
Descriptive Analytics –A HR Dashboard
16
A Categorization of Analytical Methods and
Models
• Predictive analytics: It consists of techniques that use models
constructed from past data to predict the future or ascertain the
impact of one variable on another.
• Eg: Survey data and past purchase behavior may be used to help
predict the market share of a new product.
17
Basic Predictive Analytics Tools
Cross-tabulations, frequency distributions, means and SDs
Correlation
Regression
ANOVA/ t test
18
Advanced Predictive Analytics Tools
Classification
-Supervised learning
- Decision Tree
- Unsupervised learning
- Clustering
- Neural Network
Prediction
- Logistic Regression
- Exponential smoothing
- Neural Network
- Decision Tree
19
Predictive Analytics Software
•Excel, Excel Miner
•SPSS
•SAS
•R
•Python
20
A Categorization of Analytical Methods and
Models
• Prescriptive Analytics: It indicates a best course of action to take
• Models used in prescriptive analytics:
Optimization models
• Models that give the best decision subject to constraints of the situation.
Simulation optimization
• Combines the use of probability and statistics to model uncertainty with
optimization techniques to find good decisions in highly complex and highly
uncertain settings.
Decision analysis
• Used to develop an optimal strategy when a decision maker is faced with
several decision alternatives and an uncertain set of future events.
• It also employs utility theory, which assigns values to outcomes based on
the decision maker’s attitude toward risk, loss, and other factors.
21
Predictive Analytics- A Decision Tree
22
Prescriptive Analytics -Tools
Job allocation
23
A Categorization of Analytical Models for
Prescriptive analytics
• Optimization models
Model Field Purpose
Portfolio models Finance Use historical investment return data to
determine the mix of investments that yield the
highest expected return while controlling or
limiting exposure to risk.
Supply network Operations Provide the cost-minimizing plant and distribution
design models center locations subject to meeting the customer
service requirements.
Price markdown Retailing Uses historical data to yield revenue-maximizing
models discount levels and the timing of discount offers
when goods have not sold as planned.
Zeffanne’s HR Incorporates age distribution, mobility rate,
Multimatrix model productivity ratio, absenteeism, attrition rate etc.
24
Figure 2: The Spectrum of Business Analytics
25
“No one in finance, supply chain, marketing, etc. would ever propose
a solution in their area without a plethora of charts, graphs, and
data to support it, but HR is known to all too frequently rely instead
on trust and relationships. People costs often approach 60 percent of
corporate variable costs, so it makes sense to manage such a large
cost item analytically”…. Google co-counder and CEO Larry Page
26
Figure 3- Google Trends for Marketing, Financial,
and Human Resource Analytics, 2004–2012
27
Definition of HR Analytics
• According to Lauri, Bassi, McBassi & Company, HR analytics is
defined as the application of a methodology and the integrated
process for improving the quality of people related decisions for
the purpose of improving individual and/or organizational
performance.
29
Evolution of HRA
31
HR Analytics in Practice
• Example for Human Resource (HR) Analytics:
• Sears Holding Corporation (SHC), owners of retailers Kmart and Sears, Roebuck and
Company, has created an HR analytics team inside its corporate HR function. The
team uses descriptive and predictive analytics to support employee hiring and to
track and influence retention.
32
33
Thank you
34
HR Metrics
“Numbers have an important story to tell. They rely on
you to give them a clear and convincing voice.”
“Without data we have only opinion”
2
HUMAN RESOURCE ACCOUNTING IN INDIA
• Though Human Resources Accounting was introduced way back in the 1980s, it
started gaining popularity in India after it was adopted and popularized by NLC.
• Even though the situation prevails, yet, a growing trend towards the
measurement and reporting of human resources particularly in public sector is
noticeable during the past few years.
• CCI, ONGC, Engineers India Ltd., National Thermal Corporation, Minerals and
Metals Trading Corporation, Madras Refineries, Oil India Ltd., Associated
Cement Companies, SPIC, Metallurgical and Engineering consultants India
Limited, Cochin Refineries Ltd. Etc. are some of the organizations, which have
started disclosing some valuable information regarding human resources in their
financial statements.
• Itis needless to mention here that, the importance of human
resources in business organization as productive resources was by
and large ignored by the accountants until two decades ago.
• There are different benchmarks to assess these ratios and the one by
Saratoga is considered the best. It was developed jointly by Saratoga
and SHRM (earlier ASPA).
• Over 40% fortune 500 companies are its clients. The Human capital
measurement and benchmarking business function of Price
Waterhouse Coopers
6
Classification of HR Measures
• By Saratoga
• Organization and operations
• HR staff and structure
• Compensation and benefits
• Staffing and hiring
• Retention and separations
8
9
Recruiting efficiency
• Interview time
• Hire rate
• Offer acceptance rate
• Quality of Hire
10
Recruiting efficiency
• Interview time
• Hire rate
• Offer acceptance rate
• Quality of Hire
11
Recruiting efficiency
• Interview time
• Hire rate
• Offer acceptance rate
• Quality of Hire
12
Recruiting efficiency
• Interview time
• Hire rate
• Offer acceptance rate
• Quality of Hire
13
Recruiter effectiveness = RT + TTF + CPH + OAR + QH/N
14
Training and development metrics
• Training cost factor= [CC+TR+S+RC+TL+TS+PS+OH]/PT
Where CC= Consultant cost
TR= Training facility rent
S = Supplies/ Stationery
RC = Refreshment cost
TL = Travel and lodging
TS= Trainer’s salary/ fees
PS = Participant’s salary
OH = Overhead of training dept./ agency
PT = No. of people trained
•Training cost per hour = TC/ TH
15
Calculating Training ROI
• % Training ROI = (Benefits/ Cost) X100
19
Cost and benefit analysis
• Employee Cost factor = Total compensation/FTE
• Revenue Factor = Total Organizational Revenue/FTE
• Income factor = Total Organizational revenue-Total operating
expenses/FTE
20
The Indian Context
• No set guidelines for measuring HR’s effectiveness
Add-ins
• Additional features are available using add-ins. Several are provided with
Excel, including:
• Analysis Tool Pak: Provides data analysis tools for statistical and
engineering analysis (includes analysis of variance and regression
analysis)
• Euro Currency Tools: Conversion and formatting for euro currency
• Solver Add-In: Tools for optimization and equation solving
Few shortcut keys
• Ctrl+end (move to the last cell in the workbook)
• Ctrl+home (to move to the first cell)
• Ctrl+right arrow (to go to the right edge of the active
range)
• Ctrl+left arrow (to go to left edge)
• Ctrl+up arrow (to go to the top edge of the active
range)
• Ctrl+s (save)
• Ctrl+c (copy)
• Ctrl+x (cut)
• Ctrl+z (undo)
Using data types to populate a worksheet
• Three type of data can be entered into excel: Text, numbers and
formulas
• Entering dates
• Dates are often used in worksheets to track data over a specified period of
time.
• Excel interpret two digit years from 00 to 29 as the years 2000 to 2029; two
digit years from 30 to 99 are interpreted as 1930 to 1999.
• When you enter a date into a cell in particular format, the cell is automatically
formatted even if you delete the entry.
• Regardless of the date format displayed in the cell, the formula bar displays
the date in month/day/four digit year format because that is the format required
for calculation and analyses.
Enter a date or time in a cell
• Sort Range: It sorts the data in a range of cells, which can be helpful when working
with a sheet that contains several tables. Sorting a range will not affect other content
on the worksheet.
Custom sorting
• Sometimes we may find that the default sorting options can't sort data in the order we
need. Excel allows us to create a custom list to define our own sorting order.
Filtering Data
• Filters can be used to narrow down the data in the worksheet and hide
parts of it from view.
• When a conditional format is applied on a cell it means the formatting of the cell is
based on a condition.
• As the name suggests, you can setup conditions based on which Excel will know
what formatting to apply for different values.
• Let’s try a very simple conditional format. Click on a cell and go the Home Tab >
Conditional Formatting. Then select the first option, Highlight Cell Rules > Greater
Than.
• In the next dialogue box, put a number, like 100 and press OK.
Now, if you put a number below 100, nothing will happen. But as soon as you put a number that is greater
than 100, the cell will turn red. Again, if you change the number to one which is less than or equal to 100,
Conditional formatting thus allows you to visually analyze your data, based on a large number of condition
types,
• Top / Bottom 10
• Duplicates / Uniques
• Text containing
Apart from these in-built ones, you can create ‘n’ number of conditions
Using Basic Formulas
UNDERSTANDING AND DISPLAYING FORMULAS
Types of referencing
1. A relative cell reference is one that adjusts the cell identifier
automatically if you insert or delete columns or rows, or if you copy the
formula to another cell.
2. An absolute cell reference refers to a specific cell or range of cells
regardless of where the formula is located in the worksheet. Absolute
cell references include two dollar signs in the formula, preceding the
column letter and row number. The absolute cell reference $B$3, for
example, always refers to column (B) and row (3).
3. A mixed cell reference is a cell reference that uses an absolute column or
row reference, but not both.
4. An external reference refers to a cell or range in a worksheet in another
Excel workbook, or to a defined name in another workbook.
Naming a Range
• After selecting a range of cells, you can name the range using three
different methods:
• By typing a name in the Name Box next to the formula bar
• By using the New Name dialog box
• By using the Create Names from Selection dialog box
Rules and guidelines for naming ranges
include the following:
• Range names can be up to 255 characters in length.
• Range names may begin with a letter, the underscore character (_), or
a backslash (\). The rest of the name may include letters, numbers,
periods, and underscore characters, but not a backslash.
• Range names may not consist solely of the letters “C”, “c”, “R”, or “r”,
which are used as shortcuts for selecting columns and rows.
• Range names may not include spaces. Microsoft recommends you use
the underscore character (_) or period (.) to separate words, such as
Fruit_List and Personal.Budget.
• Range names cannot be the same as a cell reference, such as A7 or
$B$3.
Using Functions
Using the COUNT, COUNTA, and
COUNTBLANK Functions
• Use the COUNT function when you want to determine how many
cells in a range contain a number.
• There are other variations of the COUNT function.
• The COUNTA function counts all nonblank entries in a range, whether
they include text or numbers.
• The COUNTBLANK function counts the number of blank cells in a
range.
• Using the MIN Function: The MIN function allows you to determine
the minimum value in a range of cells.
• Using the MAX Function: The MAX function returns the largest value
in a set of values.
• AutoSum makes that task even easier by calculating (by default) the
total from the adjacent cell up to the first nonnumeric cell, using the
SUM function in its formula.
Using the IF Logical Function
• The result of a conditional formula is determined by the state of a
specific condition or the answer to a logical question.
• An IF function sets up a conditional statement to test data. An IF
formula returns one value if the condition you specify is true and
another value if it is false.
• The IF function requires the following syntax:
• =IF(Logical_test,Value_if_true,Value_if_false)
Using the SUMIFS, COUNTIFS, and AVERAGEIFS
Functions
• The SUMIFS function adds cells in a range that meet multiple criteria. The syntax
for the SUMIFS function is:
• =SUMIFS(Sum_range,Criteria_range1,Criteria1,Criteria_range2,Criteria2,…)
• The COUNTIFS function counts the number of cells within a range that meet
multiple criteria. You can create up to 127 ranges and criteria. The syntax for the
COUNTIFS function is:
• =COUNTIFS(Criteria_range1,Criteria1,Criteria_range2,Criteria2,…)
• The AVERAGEIFS function returns the average (arithmetic mean) of all cells that
meet multiple criteria. You learn to apply the AVERAGEIFS formula in the
following exercise to find the average of a set of numbers where two criteria are
met. The syntax for the AVERAGEIFS function is:
• =AVERAGEIFS(Average_range,Criteria_range1,Criteria1,Criteria_range2,Criteria2,
…).
Looking Up Data Using the VLOOKUP Function
• When worksheets contain long and sometimes cumbersome lists of
data, you need a way to quickly find specific information within these
lists. This is where Excel’s lookup functions come in handy.
• The “V” in VLOOKUP stands for vertical. This formula is used when
the comparison value is in the first column of a table. Excel goes down
the first column until a match is found and then looks in one of the
columns to the right to find the value in the same row. The VLOOKUP
function syntax is:
• =VLOOKUP(Lookup_value,Table_array,Col_index_num,Range_lookup)
Looking Up Data Using the HLOOKUP
Function
• The “H” in HLOOKUP stands for horizontal. HLOOKUP searches
horizontally for a value in the top row of a table or an array and then
returns a value in the same column from a row you specify in the
table or array.
• The HLOOKUP function syntax is similar to that of the VLOOKUP
function, with the exception of the Row_index_num argument that
replaces the Col_index_num argument from VLOOKUP. The HLOOKUP
syntax is:
• =HLOOKUP(Lookup_value,Table_array,Row_index_num,Range_lookup
)
Using the MATCH and INDEX Functions
• The MATCH and INDEX functions are lookup functions that can help
you locate a specific item or the position of an item in a specified
range.
• Use the MATCH function when you need to know the relative position
of an item in a range you specify, rather than the item itself.
• Use the INDEX function to return a value or the reference to a value in
a specified range.
Application for Tableau in HR
data Visualization
Introduction
• Tableau Software is an interactive data
visualization software company founded on January
2003 by Christian Chabot, Pat Hanrahan and Chris
Stolte, in Mountain View, California.
• If r = 1 = perfect correlation.
MS Excel Data Analysis ToolPak
• An Excel Add-In which includes several
statistical analysis techniques
5
Installing Analysis Toolpack
1. Open a blank Excel spreadsheet.
2. Click on the windows icon (pre 2010) or the file tab
(2010+).
3. Choose Excel Options (pre 2010) or just options (2010+).
4. Choose add-ins.
5. In manage (bottom of window), choose Excel Add-ins and
click Go.
6. Check the box that says Analysis ToolPack and click OK.
7. After loading the Analysis ToolPack, the Data Analysis
command is available under the Data tab. It should be the
far right option.
Analysis Toolpack input interface
• The Input Range refers to the location of
the data set.
7
Excel Correlation Output
Excel Correlation Output
Tools / data analysis / correlation…
Correlation between
Tree Height and Trunk Diameter
SPSS – STATISTICAL PACKAGE FOR
SOCIAL SCIENCES
• Versions- 16, 20, 21
The coefficients of the multiple regression model are estimated using sample data
ANOVA df SS MS F Significance F
Regression 2 29460.027 14730.013 6.53861 0.01201
Residual 12 27033.306 2252.776
Total 14 56493.333
Group Statistics
Std. Error
anx_cat N Mean Std. Deviation Mean
Overall GPA Low 10 3.3800 .45959 .14533
High 10 2.9900 .58205 .18406
The group statistics box provides the mean, standard deviation, and
number of participants for each group (level of the IV).
Interpreting the Output
Independent Samples Test
Levene’s test is designed to compare the equality of the error variance of the dependent
variable between groups. We do not want this result to be significant. If it is significant,
we must refer to the bottom t-test value.
Analysis of variance (ANOVA)
• Technique to compare difference in mean
output/ attitude/ performance (continuous
variable) between three or more groups of
employees/ product lines/ sales teams/ students
• When there is only one dependent variable
• Not meant for categorical dependent variable
• Assumption: Groups should be homogeneous
(indicated by Levene’s test; should be
insignificant)
Types of ANOVA
• One way – the groups are different in terms of
only one level/factor
• Eg: comparing marks of MBA, BBA, & LAW
students
• Two way - the groups are different in terms of
two levels/ factors
• Eg: comparing marks of MBA(Boys & Girls),
BBA (Boys & Girls), LAW (Boys & Girls)
One-Way Analysis of Variance
(ANOVA)
• Evaluate the difference among the means of three or
more groups
Examples: Accident rates for 1st, 2nd, and 3rd shift
Expected mileage for five brands of tires
• Assumptions
– Populations are normally distributed
– Populations have equal variances
– Samples are randomly and independently drawn
Conducting ANOVA in Excel
• Step 1: On the Data tab, click Data Analysis.
• Step 2: Select ANOVA: Single Factor and click
OK
• Step 3: Click in the Input Range box and select
the range for the variables for comparison.
• Step 4: Click in the Output Range box and
select any suitable cell.
•
Excel ANOVA Output
ANOVA
Source of Variation SS df MS F P-value F crit
Between Groups 5.127333 2 2.563667 10.21575 0.008394 4.737416
Within Groups 1.756667 7 0.250952
Total 6.884 9
Homogeneity Test
Levene’s test. This significant result means
the assumption of equal group variances
has not been met.
SPSS One way ANOVA OUTPUT
ANOVA
VAR00001
Sum of
Squares df Mean Square F Sig.
Between Groups .152 2 .076 .147 .865
Within Groups 4.635 9 .515
Total 4.787 11
What Is SAS Enterprise Guide?
SAS Enterprise Guide is an easy-to-use Windows
client application that provides these features:
56
Import from an External File
• The Import Data wizard enables you to create
SAS data sets from text, HTML, or PC-based
database files (including Microsoft Excel,
Microsoft Access, and other popular formats).
When you use the Import Data wizard, you
can specify import options for each file that
you import.
57
Import Data
• File >> Import Data
58
The Task List
You can use tasks to do
everything from
manipulating data, to
running specific analytical
procedures, to creating
reports.
59
Correlation in SAS Enterprise Guide
• Tasks >> Multivariate >> Correlations
60
Correlation Analysis
• Since p-values are less than 0.05, there are
significant (positive) relationships between Q6
(Overall satisfaction on Advisor) and Q1, Q2,
Q3, Q4, Q5.
61
Linear Regression
• Tasks >> Regression >> Linear Regression
62
Regression Analysis
These are the F Value and
p-value, respectively, testing
the null hypothesis that the
Model does not explain the
variance of the response
variable.
63
Regression Analysis
These are the t Value and
p-value, respectively, testing
the null hypothesis that the
coefficients are significantly
equal to 0.
64
One-Sample t-Test
65
T-Test Output
67
One-Way ANOVA results
A Few Examples:
Consumer chooses brand (1) or not (0);
A quality defect occurs (1) or not (0);
A person is hired (1) or not (0);
Evacuate home during hurricane (1) or not (0);
A person quits (1) or stays (0)
The linear vs. logistic regression
equations
The Logistic Regression Model
Next
Binomial Logistic Regression:
•Point to Regression
…and click.
Selecting Variables
…and click.
Entering Categorical Variables
To designate
gender as a
categorical
variable, click
the button
labeled
Categorical.
Contd…
Click in the box labeled
Covariates,
then click the arrow to move
it to the box labeled
Categorical Covariates.
Binomial Logistic Regression:
Next
Binomial Logistic Regression Output
Variable Coding
Binomial Logistic Regression Output (Enter Method)
Model Summary
Binomial Logistic Regression Output (Enter Method)
Classification Table
Binomial Logistic Regression Output (Enter Method)
Rotation of Factors
Interpretation of Factors
Calculation of
Factor Scores
Hierarchical clustering
A set of nested clusters organized as a hierarchical tree
Hierarchical Clustering
p1
p3 p4
p2
p1 p2 p3 p4
Traditional Hierarchical Clustering Traditional Dendrogram
p1
p3 p4
p2
p1 p2 p3 p4
Non-traditional Hierarchical Clustering Non-traditional Dendrogram
K-means Clustering
4
0.2 3 4
2
5
0.15
2
0.1
1
3 1
0.05
0
1 3 2 5 4 6
Strengths of Hierarchical Clustering
Introduction
Decision tree analysis helps identify characteristics of groups, looks at relationships between independent
variables regarding the dependent variable and displays this information in a non-technical way. The
process can also be used to identify classification rules for future events e.g. identifying people who are
likely to belong to a particular group.
Basic model
The following example uses records from the Titanic on passengers. The tree will look at what factors
affected chances of survival.
Independent variables:
Gender
Class (1st, 2nd, 3rd)
Child under 13 (Under 13, adult)
Travelling alone/ travelling with
others.
Growing method: The most commonly used growing methods are CHAID (Chi-squared automatic
interaction detection) and CRT (Classification and regression).
Summary of differences:
Treatment of missing values. CRT uses surrogates (classification via other independent variables
with a high association with the independent variable with a missing value) whereas CHAID treats
all missing values within an independent variable as one category.
CHAID uses Pearson’s Chi-squared to decide on variable splits and CRT uses Gini
CRT only produces binary splits. If all independent variables are binary, the resulting tree from CRT
and using the Pearson’s Chi-squared option within CHAID will produce the same tree.
CRT has a pruning ability so that extra nodes which do not increase the risk (wrong classification) by
much can be automatically removed to leave a simpler tree.
Decision tree analysis in SPSS
Maths and Statistics Help Centre
The risk represents the proportion of cases misclassified by the proposed classification. The
classification table summarises the percentages classified correctly. The model classified 95.1% of
those dying correctly, but only 52% of those who survived.
Decision tree analysis in SPSS
Maths and Statistics Help Centre
/* Node 1 */.
IF (((Gender = "male") OR (Gender != "female") AND (Number of accompanying siblings
or spouses != "1")))
THEN
Node = 1
Prediction = 0
Probability = 0.809015
/* Node 5 */.
IF (((Gender = "female") OR (Gender != "male") AND (Number of accompanying siblings
or spouses = "1"))) AND (((Class = "1st" OR Class = "2nd") OR (Class != "3rd") AND
((Age NOT MISSING AND (Age > 23.5)) OR Age IS MISSING AND (Number of accompanying
siblings or spouses != "3 or more")))) AND (((Class = "1st") OR (Class != "2nd") AND
(Age IS MISSING OR (Age > 34.5))))
THEN
Node = 5
Prediction = 1
Probability = 0.965278
/* Node 6 */.
IF (((Gender = "female") OR (Gender != "male") AND (Number of accompanying siblings
or spouses = "1"))) AND (((Class = "1st" OR Class = "2nd") OR (Class != "3rd") AND
((Age NOT MISSING AND (Age > 23.5)) OR Age IS MISSING AND (Number of accompanying
siblings or spouses != "3 or more")))) AND (((Class = "2nd") OR (Class != "1st") AND
(Age NOT MISSING AND (Age <= 34.5))))
THEN
Node = 6
Prediction = 1
Probability = 0.886792
/* Node 4 */.
IF (((Gender = "female") OR (Gender != "male") AND (Number of accompanying siblings
or spouses = "1"))) AND (((Class = "3rd") OR (Class != "1st" AND Class != "2nd")
AND ((Age NOT MISSING AND (Age <= 23.5)) OR Age IS MISSING AND (Number of
accompanying siblings or spouses = "3 or more"))))
THEN
Node = 4
Prediction = 0
Probability = 0.509259
Decision tree analysis in SPSS
Maths and Statistics Help Centre
Forecasting
Introduction
• All operational systems operate in an environment of uncertainty
• Information is an asset - managing it is a necessity
• “Reliable” information about the future is a source of competitive edge
• Forecasting supplements the intuitive feelings of managers and
decision makers.
Forecasting « Laws »
• Forecasts are always wrong
• Forecasts always change
• The further into the future, the less reliable the forecast.
• Forecasts for group statistics tend to be more accurate than forecasts for
individuals (risk/uncertainty pooling concept)
• Learning not predicting
Forecasting techniques
Types of Forecasting
Approaches & Methods
Unpredictable Quantitative
Delphi Technique
Expert Opinion
Factor Listing Method
Time Series Patterns
© Wiley 2007
Overview of forecasting models
• Time series
• Moving average
• weighting past value
• The decomposition method
• Associative models
• Correlation analysis
• Regression analysis
Forecasting Software
• Spreadsheets
• Microsoft Excel, Quattro Pro, Lotus 1-2-3
• Limited statistical analysis of forecast data
• Statistical packages
• SPSS, SAS, NCSS, Minitab
• Forecasting plus statistical and graphics
• Specialty forecasting packages
• Forecast Master, Forecast Pro, Autobox, SCA
© Wiley 2007
Neural Network
Biological inspirations
• Some numbers…
• The human brain contains about 10 billion nerve cells (neurons)
• Each neuron is connected to the others through 10000 synapses
nucleus
cell body
dendrites
• A neuron has
• A branching input (dendrites)
• A branching output (the axon)
• The information circulates from the dendrites to the axon via the
cell body
• Axon connects to dendrites via synapses
• Synapses vary in strength
Neural Networks
• A mathematical model to solve engineering problems
• ANN are black box-type learning algorithms which are designed to elucidate latent
relationships and patterns from a set of input data, typically non-linear and not
following any assumptions of normal distribution, and then map them onto a set of
stochastic outputs for the end user.
• Components of ANN:
• The input layer: It is the receiver of information from external environment
• The hidden layer (processing element): It is the black box which contains the
artificial neurons for processing of the input data and extraction of hidden
pattern from the data.
• The output layer: this present actual outputs from the analysis
• Weightage factor: Each input is assigned a particular weight which signifies the
impact that the input will have on the hidden layer during data processing
• Summation function: it is the aggregate of weights associated with all the inputs.
Processing Information
in an Artificial Neuron (Figure 15.3)
Inputs Weights
x1 w1j
Output
Neuron j Yj
x2 w2j
wij xi
5
Popular Neural Network Architectures
1. Single layer Feed Forward Architecture
• Multiple layer feed forward architecture
Recurrent Neural Networks
Backpropagation
9
Objective of Neural network
• Compute Outputs
10
Learning
• The procedure that consists in estimating the parameters of neurons so that the
whole network can perform a specific task
• 2 types of learning
• The supervised learning : Analyst decides the adjustments in weights
• The unsupervised learning: ANN organizes the outputs on its own and
develop learning insights using some internal criteria.
Stages of data Analysis with ANNs