Managing The Business Risk of Fraud
Managing The Business Risk of Fraud
Managing The Business Risk of Fraud
Mike Blakley
Presented to:
Fall 2009
Managing the business risk of fraud using sampling and data mining EZ-R Stats, LLC
Sharp rise in accounting fraud over the past 12 months Accounting fraud had grown to 38 percent of the economic crimes in 2009 Employees face increased pressures to :
Survey findings
Greater risk of fraud due to increased incentives or pressures More opportunities to commit fraud, partially due to reductions in internal finance staff While companies are expecting more fraud, they have not done much People who look for fraud are more likely to find it
Managing the business risk of fraud EZ-R Stats, LLC
Session objectives
Understand the framework for managing the business risk of fraud Plan, perform and explain statistical sampling in audits Reduce audit costs using data mining, sequential sampling and other sampling techniques Apply SAS 56, the new SAS suite and the revised (2007) Yellow Book. Run, hands-on, the most productive analytic technique (regression analysis). Use data mining to introduce greater efficiency into the audit process, without losing effectiveness.
Session agenda - 1
Introduction and the Process for Managing the Business Risk of Fraud Introductions All Around Course Objectives Framework of risk management for fraud Fundamentals of data mining Data mining: The Engine That Drives analysis
SAS 56 IIA Practice Advisory 2320 The Yellow Book (2007 revision) The Guide Managing the Business Risk of Fraud
Managing the business risk of fraud EZ-R Stats, LLC
Random Numbers Determining Sample Size Case Study Attribute sampling Variable Sampling Case study Stratified Sampling Obtaining and Interpreting the results
Other Sampling Approaches DCAA Audit Package Sequential Sampling Overview of the process Attribute Sampling Variable Sampling
Regression Analysis Overview Terms Statistical basis Charting Regression Seeing Is Believing Plotting Data
Inserting a Trend line Confidence Intervals Prediction Intervals Calculation of Statistical Confidence Bounds
Statistical Intervals
Overview Statistical Basis Data Conversion and Extraction Data mining objectives
Handout (CD)
Overview
Fraud patterns detectable with digital analysis Basis for digital analysis approach Usage examples Continuous monitoring Business analytics
Objective 1
Three brief examples ACFE/IIA/AICPA Guidance Paper Practice Advisory 2320-1 Auditors Top 10 Process Overview Who, What, Why, When & Where
Objective 1a
Supplier Kickback School Bus parts $5 million Jail sentences Period of years
Objective 1a
Understaffed internal audit Software not used Data on multiple platforms Transaction volumes large
Objective 1a
Preventable
Need structured, objective approach Let the data talk to you Need efficient and effective approach
Objective 1
Regression Analysis
Intervals
Objective 1
Data outliers
Sometimes an out and out Liar But how do you detect it?
Objective 1
Data Outliers
Example 2
Objective 1a
150 100 50 0 NDC1 NDC2 NDC3 NDC4 NDC5 NDC6 Drug Type
Objective 1
CMS Report for 2005 South Florida - $2.2 Billion Rest of the country combined $.1 Billion
Objective 1
Pareto Chart
Medicare HIV Infusion Costs - 2005 ($Billions) data source: HHS CMS
120.0% 100.0%
Annual Medicare Costs
County
Example 2
Objective 1a
50.0 40.0 30.0 20.0 10.0 0.0 Prov 1 Prov 2 Prov 3 Prov 4 Prov 5 Prov 6 Prescriber
Example 2
Objective 1a
Prescriptions by Dr. X
Dr. X compared with Total Population
350 300 250 200 150 100 50 0 NDC1 NDC2 NDC3 NDC4 NDC5 NDC6 Drug Type
Dollar Amount
Population Dr. X
Example 2
Off-label use
Objective 1a
Serostim
Treat wasting syndrome, side effect of AIDS, OR Used by body builders for recreational purposes One physician prescribed $11.5 million worth (12% of the entire state)
Example 3
Revenue trends
Objective 1a
1.15 1.1 1.05 1 0.95 0.9 2001 2002 Calendar Year 2003 Overall Linear (Overall)
Example 3
Dental Billings
Objective 1a
Objective 1b
Guidance Paper
A proposed implementation approach Managing the Business Risk of Fraud: A Practical Guide http://tinyurl.com/3ldfza Five Principles Fraud Detection Coordinated Investigation Approach
Objective 1b
ACFE, IIA and AICPA Exposure draft issued 11/2007, final 5/2008 Section 4 Fraud Detection
Guidance Paper
Five Sections
Fraud Risk Governance Fraud Risk Assessment Fraud Prevention Fraud Detection Fraud Investigation and corrective action
Risk Governance
Fraud risk management program Written policy managements expectations regarding managing fraud risk
Risk Assessment
Periodic review and assessment of potential schemes and events Need to mitigate risk
Fraud Prevention
Fraud Detection
Establish detection techniques for fraud Back stop where preventive measures fail, or Unmitigated risks are realized
Reporting process to solicit input on fraud Coordinated approach to investigation Use of corrective action
Fraud Risk Governance one grand wink-wink, nod-nod Fraud Risk Assessment - categorically false Fraud Prevention my husband passed away Fraud Detection - We didn't know? Never saw one. Fraud Investigation and corrective action - Pick-APayment losses $36 billion
Objective 1b
Detective Controls Process Controls Anonymous Reporting Internal Auditing Proactive Fraud Detection
Objective 1b
Operate in the background Not evident in everyday business environment These techniques usually
Occur in ordinary course of business Corroboration using external information Automatically communicate deficiencies Use results to enhance other controls
Managing the business risk of fraud EZ-R Stats, LLC
Whistleblower hot-lines (DHHS and OSA have them) Process controls (Medicaid audits and edits) Proactive fraud detection procedures
Objective 1b
Journal entries suspicious transactions Identification of relationships Benfords Law Continuous monitoring
Objective 1b
Identify hidden relationships Identify suspicious transactions Assess effectiveness of internal controls Monitor fraud threats Analyze millions of transactions
Organization should develop ongoing monitoring and measurements Establish measurement criteria (and communicate to Board) Measurable criteria include:
fraud allegations fraud investigations resolved Employees attending annual ethics course Whistle blower allegations Messages supporting ethical behavior delivered by executives Vendors signing ethical behavior standards
Managing the business risk of fraud EZ-R Stats, LLC
Evaluate effectiveness of technique regularly Adjust technique as required Document adjustments Report modifications needed for techniques which become less effective
International standards for the professional practice of Internal Auditing Analytical audit procedures
May include Study of relationships Comparison of amounts with similar information in the organization Comparison of amounts with similar information in the industry
EZ-R Stats, LLC
Performed using monetary amounts, physical quantities, ratios or percentages Ratio, trend and regression analysis Period to period comparisons Auditors should use analytical audit procedures in planning the engagement
Factors to consider
Significance of the area being audited Assessment of risk Adequacy of system of internal control Availability and reliability of information Extent to which procedures provide support for engagement results
Objective 1c
Objective 1d
Gaps
Univariate
Holiday
Objective 1e
Objective 1e
Traditionally, IT specialists With appropriate tools, audit generalists (CAATs) Growing trend of business analytics Essential component of continuous monitoring
Objective 1e
Objective 1e
Basis is quantification Software can do the leg work Statistical measures of difference Chi square Kolmogorov-Smirnov D-statistic Specific approaches
EZ-R Stats, LLC
Objective 1e
Why - Advantages
Automated process Handle large data populations Objective, quantifiable metrics Can be part of continuous monitoring Can produce useful business analytics 100% testing is possible Quantify risk Repeatable process
Objective 1e
Why - Disadvantages
Costly (time and software costs) Learning curve Requires specialized knowledge
Objective 1e
Traditional intermittent (one off) Trend is to use it as often as possible Continuous monitoring Scheduled processing
Objective 1e
Where Is It Applicable?
Volumes are large Data structures are complex Potential for fraud exists
Cost
Software Training Skills not widely available Development costs Testing resources
Time consuming
Objective 1
Objective 1 Summarized
Three brief examples CFE Guidance Paper Top 10 Metrics Process Overview Who, What, Why, When & Where
Objective 1 - Summarized
Understand the framework for managing the business risk of fraud Plan, perform and explain statistical sampling in audits Reduce audit costs using data mining, sequential sampling and other sampling techniques Apply SAS 56, the new SAS suite and the revised (2007) Yellow Book. Run, hands-on, the most productive analytic technique (regression analysis). Use data mining to introduce greater efficiency into the audit process, without losing effectiveness.
Statistical Sampling
Brief History / Timeline Overview Attribute Sampling Compliance Variable Sampling Numeric Estimates
History of Sampling
Basis is two laws/theorems of probability Law of Large Numbers Central Limit Theorem
Value
Indian mathematician Bramagupta 600 AD Italian mathematician Cardon 1500s Statement without proof that empirical statistics improve with more trials
Jacob Bernoulli first to prove in 1713 Foundation for central limit theorem
See it in action today Any population Large number of samples Average is normally distributed
French mathematician Abraham de Moivre 1733 approximate distribution from tossing coin (heads/tails) Ho hum reaction French Mathematician LaPlace expanded it Ho hum reaction
Managing the business risk of fraud EZ-R Stats, LLC
Industrial revolution
Manufacturing
Engineering
Excitement!
Students T
William Gosset - 1908 Guinness Brewery
SAS 39
Attribute sampling
Overview
Sample size calculations Attribute sampling Variable sampling Random number generators
Its a guess Every package different answer Need to know the population But thats why youre taking a sample!
Attribute Sampling
Using RAT-STATS
Unrestricted populations
Managing the business risk of fraud using sampling and data mining
Session Objectives
1.
2. 3. 4.
Understand what is attribute sampling and when to use it Understand unrestricted populations Overview of the process using RAT-STATS Understand the formula behind the computations
Attribute sampling
Attribute Compliance
Statistical approach
Recommended Economical Efficient Requires
Overview of process
Confidence Precision
Determine required sample size Identify samples to be selected based upon random numbers Pull the sample and test Compute the sampling results (i.e. estimate of range)
Managing the business risk of fraud EZ-R Stats, LLC
The sampling parameters are first developed by the auditor RAT-STATS is used to compute sample size RAT-STATS used to generate random numbers Pull the sample and test Enter results in RAT-STATS to compute estimates
Managing the business risk of fraud EZ-R Stats, LLC
Each random number selected corresponds with an item Put the selected item on a separate schedule
Thats It!
Now well see an actual demo using the RAT-STATS software Excel population of 5,000 invoices Results of test of attributes stored in the worksheet
Variable Sampling
Using RAT-STATS
Unrestricted populations
Managing the business risk of fraud using sampling and data mining
Session Objectives
1.
2.
3.
4.
Understand what variable sampling is and when to use it Understand unrestricted populations Overview of the process using RATSTATS Understand the formula behind the computations
Managing the business risk of fraud EZ-R Stats, LLC
Variable sampling
Variable Estimating
Statistical approach
Recommended Economical Efficient Requires
Overview of process
Confidence Precision
Determine required sample size Identify samples to be selected based upon random numbers Pull the sample and test Compute the sampling results (i.e. estimate of range)
Managing the business risk of fraud EZ-R Stats, LLC
The sampling parameters are first developed by the auditor RAT-STATS is used to compute sample size RAT-STATS used to generate random numbers Pull the sample and test Enter results in RAT-STATS to compute estimates
Managing the business risk of fraud EZ-R Stats, LLC
1. 2. 3.
1. 2. 3.
Step 2 Obtain the random numbers Done by entering info into RAT-STATS Output can be a variety of sources: Text File Excel Microsoft Access Print File
random number selected corresponds with an item Put the selected item on a separate schedule
requires reviewing documents Example data contains both examined and audited value.
Thats It!
Now
well see an actual demo using the RAT-STATS software Excel population of 5,000 invoices Audited values stored in the worksheet
Attribute Sampling
Using RAT-STATS
Stratified populations
Managing the business risk of fraud using sampling and data mining
Session Objectives
1.
2.
Understand what is stratification and when to use it Overview of the process using RAT-STATS
Stratified sampling
Strata Homogenous More
Overview of process
Confidence Precision
Determine required sample size Identify samples to be selected based upon random numbers Pull the sample and test Compute the sampling results (i.e. estimate of range)
The sampling parameters are first developed by the auditor RAT-STATS is used to compute sample size RAT-STATS used to generate random numbers Pull the sample and test Enter results in RAT-STATS to compute estimates
Managing the business risk of fraud EZ-R Stats, LLC
Each random number selected corresponds with an item Put the selected item on a separate schedule
Thats It!
Now well see an actual demo using the RAT-STATS software Excel population of 5,000 invoices Results of test of attributes stored in the worksheet
Variable Sampling
Using RAT-STATS
Stratified populations
Managing the business risk of fraud using sampling and data mining
Session Objectives
1.
2.
3.
4.
Understand what stratified sampling is and when to use it Populations benefiting from stratified sampling Overview of the process using RATSTATS Understand the formula behind the computationsof fraud Managing the business risk EZ-R Stats, LLC
Overview of process
Confidence Precision
Determine required sample size Identify samples to be selected based upon random numbers Pull the sample and test Compute the sampling results (i.e. estimate of range)
Managing the business risk of fraud EZ-R Stats, LLC
The sampling parameters are first developed by the auditor RAT-STATS is used to compute sample size RAT-STATS used to generate random numbers Pull the sample and test Enter results in RAT-STATS to compute estimates
Managing the business risk of fraud EZ-R Stats, LLC
1. 2. 3.
1. 2. 3.
4.
Done by entering info into RAT-STATS Multi-stage random numbers Output can be a variety of sources: Text File Excel Microsoft Access Print File
Managing the business risk of fraud EZ-R Stats, LLC
random number selected corresponds with an item in a strata Put the selected item on a separate schedule
requires reviewing documents Example data contains both examined and audited value.
Thats It!
Now
well see an actual demo using the RAT-STATS software Excel population of 5,000 invoices Divided into three strata Audited values stored in the worksheet
Managing the business risk of fraud EZ-R Stats, LLC
Objective 2 - Summarized
Understand the framework for managing the business risk of fraud Plan, perform and explain statistical sampling in audits Reduce audit costs using data mining, sequential sampling and other sampling techniques Apply SAS 56, the new SAS suite and the revised (2007) Yellow Book. Run, hands-on, the most productive analytic technique (regression analysis). Use data mining to introduce greater efficiency into the audit process, without losing effectiveness.
Optimize sample size (most bang for the buck) Skip sampling review 100% of transactions using computer assisted audit techniques (CAATs)
Sample optimization
Sequential sampling
University of Hawaii
Banana aphids
Sequential sampling
Banana aphids
Provides complete coverage Best practice Basis for continuous monitoring Repeatable process
Objective 3 - Summarized
Understand the framework for managing the business risk of fraud Plan, perform and explain statistical sampling in audits Reduce audit costs using data mining, sequential sampling and other sampling techniques Apply SAS 56, the new SAS suite and the revised (2007) Yellow Book. Run, hands-on, the most productive analytic technique (regression analysis). Use data mining to introduce greater efficiency into the audit process, without losing effectiveness.
General standards
The staff assigned to conduct an audit or attestation engagement under GAGAS must collectively possess the technical knowledge, skills, and experience necessary to be competent for the type of work being performed before beginning work on that assignment. The staff assigned to a GAGAS audit or attestation engagement should collectively possess:
Managing the business risk of fraud EZ-R Stats, LLC
(1) statistical sampling if the work involves use of statistical sampling; (2) information technology
Managing the business risk of fraud EZ-R Stats, LLC
Requires use of analytic review procedures for: Audit planning Overall review stages
Objective 4 - Summarized
Understand the framework for managing the business risk of fraud Plan, perform and explain statistical sampling in audits Reduce audit costs using data mining, sequential sampling and other sampling techniques Apply SAS 56, the new SAS suite and the revised (2007) Yellow Book. Run, hands-on, the most productive analytic technique (regression analysis). Use data mining to introduce greater efficiency into the audit process, without losing effectiveness.
Next Metric
1. 2. 3. 4. 5. 6. 7. 8. 9. 10.
Outliers Stratification Day of Week Round Numbers Made Up Numbers Market basket Trends Gaps Duplicates Dates
EZ-R Stats, LLC
7 - Trends
Trend Busters
Does the pattern make sense?
ACME Technology
30,000 25,000 20,000 15,000 10,000 5,000 0
-0 7 M ay -0 7 Ju l-0 Se 7 p07 N ov -0 7 Ja n08 M ar -0 M 8 ay -0 8 n0 7 ar
Amount
Ja
Date
7 Trends
Trend Busters
Linear regression Sales are up, but cost of goods sold is down Spikes
7 Trends
Identify trend lines, slopes, etc. Correlate trends Identify anomalies Key punch errors where amount is order of magnitude
EZ-R Stats, LLC
7 Trends
Linear Regression
Test
relationships (e.g. invoice amount and sales tax) Perform multi-variable analysis
Managing the business risk of fraud EZ-R Stats, LLC
7 Trends
How is it done?
Estimate linear trends using best fit Measure variability (standard errors) Measure slope Sort descending by slope, variability, etc.
EZ-R Stats, LLC
N 18 17 27 32 23 56
Generally the trend is gently sloping up, but two accounts (43870 and 54630) are different.
Managing the business risk of fraud EZ-R Stats, LLC
Objective 5 - Summarized
Understand the framework for managing the business risk of fraud Plan, perform and explain statistical sampling in audits Reduce audit costs using data mining, sequential sampling and other sampling techniques Apply SAS 56, the new SAS suite and the revised (2007) Yellow Book. Run, hands-on, the most productive analytic technique (regression analysis). Use data mining to introduce greater efficiency into the audit process, without losing effectiveness.
Objective 6
Analytical review Isolate the significant few Detection of errors Quantified approach
Objective 2
Quantified Approach Population vs. Groups Measuring the Difference Stat 101 Counts, Totals, Chi Square and K-S The metrics used
Objective 2a
Quantified Approach
Based on measureable differences Population vs. Group Shotgun technique
Objective 2a
Objective 2b
Fraud patterns
Common theme something is different Groups Group pattern is different than overall population
Objective 2c
Measurement Basis
Transaction
Objective 2d
Detailed knowledge of statistics not necessary Software packages do the numbercrunching Statistics used only to highlight potential errors/frauds Not used for quantification
Objective 2d
Comparison of group with population as a whole Can be based on either counts or amounts Difference is measured Groups can then be ranked using a selected measure High difference = possible error/fraud
Based roughly on the Wake County Transportation fraud Illustrates how the process works, using Excel
Objective 2d
Histograms
Objective 2d
Population
700 600 500 400 300 200 100 0 Jan- Feb- Mar- Apr- May- Jun- Jul- Aug- Sep- Oct- Nov- Dec07 07 07 07 07 07 07 07 07 07 07 07
Jan- Feb- Mar- Apr- May- Jun- Jul- Aug- Sep- Oct- Nov- Dec07 07 07 07 07 07 07 07 07 07 07 07
Objective 2d
Count by Month
80 120.0% 70 60 50
Count
Cum Pct
100.0% 80.0%
40 30 20 10 0
Ja n0 Fe 7 bM 07 ar -0 Ap 7 r-0 M 7 ay -0 Ju 7 n0 Ju 7 lAu 07 g0 Se 7 p0 O 7 ct0 No 7 v0 De 7 c07
60.0%
40.0% 20.0%
0.0%
Ja n07 Ju l-0 7 07 Se p07 M ay M ar ov -0 7 N 07
Month
Objective 2d
Two statistical measures of difference Chi Squared (counts) K-S (distribution) Both yield a difference metric
Objective 2d
Chi Squared
Classic test on data in a table Answers the question are the rows/columns different Some limitations on when it can be applied
Objective 2d
Chi Squared
Table of Counts Degrees of Freedom Chi Squared Value P-statistic Computationally intensive
Objective 2d
Kolmogorov-Smirnov
Objective 2d
2.
3.
4.
For each cluster element determine percentage Then calculate cumulative percentage Compare the differences in cumulative percentages Identify the largest difference
EZ-R Stats, LLC
Objective 2d - KS
Kolmogorov-Smirnov
Objective 2e
Classification by metrics
Stratification Day of week Happens on holiday Round numbers Variability Benfords Law Trend lines Relationships (market basket) Gaps Duplicates
Objective e
7.
8. 9. 10.
Outliers / Variability Stratification Day of Week Round Numbers Made Up Numbers Market basket Trends Gaps Duplicates Dates
Objective 2
Quantified Approach Population vs. Groups Measuring the Difference Stat 101 Counts, Totals, Chi Square and K-S The metrics used
Objective 2 - Summarized
1. 2. 3. 4. 5.
Understand why and how Understand statistical basis for quantifying differences Identify ten general tools and techniques Understand examples done using Excel How pattern detection fits in
Session Break!
Objective 3
Overview Explain Each Metric Examples of what it can detect How to assess results
Objective 3
Trapping anomalies
Objective 3
Gaps
Univariate
Holiday
1 - Outliers
Outliers / Variability
Outliers are amounts which are significantly different from the rest of the population
Managing the business risk of fraud EZ-R Stats, LLC
1 - Outliers
Outliers / Variability
Charting (visual) Software to analyze z-scores Top and Bottom 10, 20 etc. High and low variability (coefficient of variation)
1 - Outliers
Basic statistics Minimum, maximum and average Variability Sort by statistic of interest Variability (coefficient of variation) Maximum, etc.
EZ-R Stats, LLC
1 - Outliers
Example Results
Provider
3478421 2356721 3546789 5463122
N
3,243 4,536 3,421 2,311
Coeff Var
342.23 87.23 23.25 18.54
Two providers (3478421 and 2356721) had significantly more variability in the amounts of their claims than all the rest.
Managing the business risk of fraud EZ-R Stats, LLC
Next Metric
1. 2. 3.
4.
5. 6. 7.
8.
9. 10.
Outliers Stratification Day of Week Round Numbers Made Up Numbers Market basket Trends Gaps Duplicates Dates
EZ-R Stats, LLC
2 - Stratification
2 - Stratification
Stratification - How
Charting (visual) Chi Squared Kolmogorov-Smirnov By groups
2 Stratification
Transactions out of the ordinary Up-coding insurance claims Skewed groupings Based on either count or amount
2 Stratification
The process?
1. 2. 3. 4. 5.
Stratify the entire population into bins specified by auditor Same stratification on each group (e.g. vendor) Compare the group tested to the population Obtain measure of difference for each group Sort descending on difference measure
EZ-R Stats, LLC
2 Stratification
4562134
4321089 4237869
8,913
3,410 2,503
5,234
342 298
0.7453
0.5231 0.4632
Two providers (2735211 and 4562134) are shown to be much different from the overall population (as measured by Chi Square).
Managing the business risk of fraud EZ-R Stats, LLC
Next Metric
1. 2. 3. 4. 5. 6. 7. 8. 9. 10.
Outliers Stratification Day of Week Round Numbers Made Up Numbers Market basket Trends Gaps Duplicates Dates
EZ-R Stats, LLC
3 Day of Week
Day of Week
3 Day of Week
Identify unusually high/low activity on one or more days of week Dentist who only handled Medicaid on Tuesday Office is empty on Friday
How it is done?
Programmatically check entire population Obtain counts and sums by day of week (1-7) Prepare histogram For each group do the same procedure Compare the two histograms Sort descending by metric (chi square/dstat)
3 Day of Week
Provider 2735211 only provided service for Medicaid on Tuesdays. Provider 4562134 was closed on Thursdays and Fridays.
Managing the business risk of fraud EZ-R Stats, LLC
Next Metric
1.
2. 3. 4. 5. 6. 7. 8. 9. 10.
Outliers Stratification Day of Week Round Numbers Made Up Numbers Market basket Trends Gaps Duplicates Dates
EZ-R Stats, LLC
4 Round Numbers
Round Numbers
Its about.
Estimates!
4 Round Numbers
Isolate estimates Highlight account numbers in journal entries with round numbers Split purchases (under the radar) Which groups have the most estimates
4 Round Numbers
Round numbers
Classify population amounts $1,375.23 is not round $5,000 is a round number type 3 (3 zeros) $10,200 is a round number type 2 (2 zeros) Quantify expected vs. actual (d-statistic) Generally represents an estimate Journal entries
4 Round Numbers
N
4,136 833 8,318 9,549
Chi Sq
54,637 35,324 768 546
D-stat
0.9802 0.97023 0.321 0.2189
Two accounts, 2735211 and 4562134 have significantly more round number postings than any other posting account in the journal entries.
Managing the business risk of fraud EZ-R Stats, LLC
Next Metric
1.
2. 3. 4.
5.
6. 7. 8.
9.
10.
Outliers Stratification Day of Week Round Numbers Made Up Numbers Market basket Trends Gaps Duplicates Dates
EZ-R Stats, LLC
5 Made up numbers
Made up Numbers
5 Made Up Numbers
5 Made Up Numbers
Basic formula is =log(1+(1/N)) Workbook with formulae available at http://tinyurl.com/4vmcfs Obtain leading digits using Left function, e.g. left(Cell,1)
5 Made Up Numbers
Made up numbers
Benfords Law Check Chi Square and d-statistic First 1,2,3 digits Last 1,2 digits Second digit Sources for more info
5 Made Up Numbers
How is it done?
Decide type of test (first 1-3 digits, last 1-2 digit etc) For each group, count number of observations for each digit pattern Prepare histogram Based on total count, compute expected values For the group, compute Chi Square and d-stat Sort descending by metric (chi square/dstat)
EZ-R Stats, LLC
5 Made Up Numbers Invoice Amounts tested with Benfords law - Example Results
Store 324 563 432 Hi Digit 79 89 23 Chi Sq 5,234 4,735 476 D-stat 0.9802 0.97023 0.321
217
74
312
0.2189
During tests of invoices by store, two stores, 324 and 563 have significantly more differences than any other store as measured by Benfords Law.
Next Metric
1. 2. 3. 4. 5. 6. 7. 8. 9. 10.
Outliers Stratification Day of Week Round Numbers Made Up Numbers Market basket Trends Gaps Duplicates Dates
EZ-R Stats, LLC
6 Market Basket
Market Basket
Medical Ping ponging Pattern associations Apriori program References at end of slides Apriori Latin a (from) priori (former) Deduction from the known
EZ-R Stats, LLC
6 Market basket
Unexpected patterns and associations Based on market basket concept Unusual combinations of diagnosis code on medical insurance claim
6 Market basket
Market Basket
JE Accounts JE Approvals Credit card fraud in Japan taxi and ATM
6 Market basket
How is it done?
First, identify groups, e.g. all medical providers for a patient Next, for each provider, assign a unique integer value Create a text file containing the values Run apriori analysis
6 Market basket
Apriori outputs
For each unique value, probability of other values If you see Dr. Jones, you will also see Dr. Smith (80% probability) If you see a JE to account ABC, there will also an entry to account XYZ (30%)
Next Metric
1. 2. 3. 4. 5. 6. 7. 8. 9. 10.
Outliers Stratification Day of Week Round Numbers Made Up Numbers Market basket Trends Gaps Duplicates Dates
EZ-R Stats, LLC
8 - Gaps
8 Gaps
Missing documents (sales, cash, etc.) Inventory losses (missing receiving reports) Items that walked off
8 Gaps
How is it done?
Check any sequence of numbers supposed to be complete, e.g. Cash receipts Sales slips Purchase orders
8 Gaps
Excel sort and check Excel formula Sequential numbers and dates
8 Gaps
Gap Testing - Example Results
Missing 1 2 1
Next Metric
1. 2.
3.
4. 5. 6.
7.
8. 9. 10.
Outliers Stratification Day of Week Round Numbers Made Up Numbers Market basket Trends Gaps Duplicates Dates
EZ-R Stats, LLC
9 - Duplicates
Duplicates
9 Duplicates
Same items same vendor, same invoice number, same invoice date, same amount Different items same employee name, same city, different social security number
9 - Duplicates
Duplicate Payments
Fuzzy
logic
9 - Duplicates
Levenshtein distance Soundex Like clause in SQL Regular expression testing in SQL Vendor/employee situations
Russian physicist
9 - Duplicates
How is it done?
First, sort file in sequence for testing Compare items in consecutive rows Extract exceptions for follow-up
9 - Duplicates
Count 4 2 2
Next Metric
1.
2. 3. 4.
5.
6. 7. 8.
9.
10.
Outliers Stratification Day of Week Round Numbers Made Up Numbers Market basket Trends Gaps Duplicates Dates
EZ-R Stats, LLC
10 - Dates
Date Checking
Payment issued?
Managing the business risk of fraud EZ-R Stats, LLC
10 Dates
10 Dates
10 Dates
Journal entries Employee expense reports Business telephone calls Invoices Receiving reports Purchase orders
10 Dates
Determination of Dates
Transactions when business is closed Federal Office of Budget Management An excellent fraud indicator in some cases
10 Dates
10 Dates
Federal Holidays
Established by Law Ten dates Specific date (unless weekend), OR Floating holiday
10 Dates
Office of Personnel Management Example of specific date Independence Day, July 4th (unless weekend) Example of floating date Martin Luther Kings birthday (3rd Monday in January) Floating Thanksgiving 4th Thursday in November
10 Dates
How it is done?
Programmatically count holidays for entire population For each group, count holidays Compare the two histograms (group and population) Sort descending by metric (chi square/d-stat)
EZ-R Stats, LLC
10 Dates
N 37 23 18
24135
34
312
0.2189
Two employees (10245 and 32325) were off the chart in terms of expense amounts incurred on a Federal Holiday.
Managing the business risk of fraud EZ-R Stats, LLC
Objective 3
Overview Explain Each Metric Examples of what it can detect How to assess results
Objective 3 - Summarized
1. 2. 3. 4. 5.
Understand why and how Understand statistical basis for quantifying differences Identify ten general tools and techniques Understand examples done using Excel How pattern detection fits in
Objective 4
Use of Excel
Objective 4
Excel templates
Variety of tests
Objective 4
Work with Ranges =sum, =average, =stdevp =largest(Range,1), =smallest(Range,1) =min, =max, =count Tools | Data Analysis | Descriptive Statistics
EZ-R Stats, LLC
Objective 4
Excel Histograms
Objective 4
Objective 4
Objective 4
Repeatable process Audit standardization Learning curve Streamlining of tests More efficient and effective Examples http://ezrstats.com/Macros/home.html
Objective 4
Many built-in functions right off the shelf with SQL Control totals Exception identification Drill down Quantification June 2008 article in the EDP Audit & Control Journal (EDPACS) SQL as an audit tool
http://ezrstats.com/doc/SQL_As_An_Audit_Tool.pdf
EZ-R Stats, LLC
Objective 4
Use of Excel
Objective 4 - Summarized
1. 2. 3. 4.
5.
Understand why and how Understand statistical basis for quantifying differences Identify ten general tools and techniques Understand examples done using Excel How Pattern Detection fits in
Next Fit
Managing the business risk of fraud EZ-R Stats, LLC
Objective 5
Objective 5
Business Analytics Fraud Pattern Detection Continuous fraud pattern detection Continuous Monitoring
EZ-R Stats, LLC
Objective 5
Business Analytics
Fraud analytics -> business analytics Business analytics -> fraud analytics
Objective 5
Fraud analytics can feed (CM) Continuous fraud pattern detection Use output from CM to tune fraud pattern detection
Objective 6 - Summarized
Understand the framework for managing the business risk of fraud Plan, perform and explain statistical sampling in audits Reduce audit costs using data mining, sequential sampling and other sampling techniques Apply SAS 56, the new SAS suite and the revised (2007) Yellow Book. Run, hands-on, the most productive analytic technique (regression analysis). Use data mining to introduce greater efficiency into the audit process, without losing effectiveness.
Kolmogorov-Smirnov http://tinyurl.com/y49sec Benfords Law http://tinyurl.com/3qapzu Chi Square tests http://tinyurl.com/43nkdh Continuous monitoring http://tinyurl.com/3pltdl
Market Basket
Apriori testing for ping ponging Temple University http://tinyurl.com/5vax7r Apriori program (open source) http://tinyurl.com/5qehd5 Article Medical ping ponging http://tinyurl.com/5pzbh4
Questions?
Contact info
Phone: E-mail:
(919)-219-1622