Statistics Cheat Sheet

the branch of mathematics in which data are used descriptively or inferentially to find or support answers for scientific and other quantifiable questions.
It encompasses various techniques and procedures for recording, organizing, analyzing, and reporting quantitative information.

Paired t-test
to compare means of two related groups
ex. compare weight of 20 mice before and after treatment
two conditions:
- pre and post treatment
- two different conditions, ex. two drugs
ASSUMPTIONS
- random selection
- normally distributed
- no extreme outliers
FORMULA
t = m / (s/√n)
m = sample mean of differences
s = SD of the differences, n = number of pairs
df = n - 1
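Where the FORMULA above might look in code: a minimal sketch assuming NumPy/SciPy; the before/after arrays are invented example measurements, not data from the source.

```python
import numpy as np
from scipy import stats

# hypothetical before/after measurements for paired subjects (made-up data)
before = np.array([20.1, 22.3, 19.8, 21.5, 23.0, 20.7, 22.9, 21.1])
after  = np.array([21.4, 23.1, 20.5, 22.8, 23.9, 21.2, 23.5, 22.0])

d = after - before                 # differences for each pair
n = len(d)
m = d.mean()                       # sample mean of differences
s = d.std(ddof=1)                  # SD of the differences

t = m / (s / np.sqrt(n))           # t = m / (s/√n)
df = n - 1
p = 2 * stats.t.sf(abs(t), df)     # two-tailed p value

# cross-check with SciPy's built-in paired t-test
t_check, p_check = stats.ttest_rel(after, before)
print(t, p, t_check, p_check)
```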
Difference - parametric test & non-parametric test

PROPERTIES                    PARAMETRIC                      NON-PARAMETRIC
assumptions                   yes                             no
value for central tendency    mean                            median/mode
probability distribution      normally distributed            user specific
population knowledge          required                        not required
used for                      interval data                   nominal, ordinal data
correlation                   Pearson                         Spearman
tests                         t-test, z-test, F-test, ANOVA   Kruskal-Wallis H test, Mann-Whitney U, Chi-square
t-distribution
aka Student's t-distribution = a probability distribution similar to the normal distribution but with heavier tails
used to estimate pop parameters for small samples
Tail heaviness is determined by degrees of freedom: gives lower probability to the centre and higher to the tails than the normal distribution; also higher kurtosis; symmetrical, unimodal, centred at 0, larger spread around 0
df = n - 1
above 30 df, use the z-distribution
t-score = number of SD from the mean in a t-distribution
we find:
- upper and lower boundaries
- p value
TO BE USED WHEN:
- small sample
- SD is unknown
ASSUMPTIONS
- continuous or ordinal scale
- random selection
- NPC
- equal SD for independent two-sample t-test
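A minimal sketch, assuming SciPy, of how the t-distribution gives the upper and lower boundaries and the p value mentioned above; the sample values and the hypothesized mean of 5.0 are invented.

```python
import numpy as np
from scipy import stats

x = np.array([4.8, 5.2, 5.5, 4.9, 5.1, 5.4, 5.0])   # made-up small sample
n, df = len(x), len(x) - 1
mean, sd = x.mean(), x.std(ddof=1)

# upper and lower boundaries: 95% confidence interval for the mean
t_crit = stats.t.ppf(0.975, df)                      # two-tailed critical t value
half_width = t_crit * sd / np.sqrt(n)
lower, upper = mean - half_width, mean + half_width

# p value for a one-sample t-test against a hypothesized mean of 5.0
t_stat = (mean - 5.0) / (sd / np.sqrt(n))
p_value = 2 * stats.t.sf(abs(t_stat), df)
print(lower, upper, p_value)
```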
Correlation Coefficient
a statistical measure of the strength of the relationship between the relative movements of two variables
value ranges from -1 to +1
-1 = perfect negative or inverse correlation
+1 = perfect positive correlation or direct relationship
0 = no linear relationship
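A brief sketch, assuming NumPy/SciPy, of computing a (Pearson) correlation coefficient in the -1 to +1 range described above; the two variables are invented example data.

```python
import numpy as np
from scipy import stats

x = np.array([1.0, 2.1, 2.9, 4.2, 5.1, 6.0])      # made-up variable 1
y = np.array([2.0, 4.3, 5.9, 8.1, 9.8, 12.2])     # made-up variable 2

r, p = stats.pearsonr(x, y)         # r lies in [-1, +1]; p tests H0: no linear relationship
r_matrix = np.corrcoef(x, y)[0, 1]  # same coefficient via the correlation matrix
print(r, p, r_matrix)
```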
Alternatives

PARAMETRIC                              NON-PARAMETRIC
one-sample z-test, one-sample t-test    one-sample sign test
one-sample z-test, one-sample t-test    one-sample Wilcoxon signed-rank test
two-way ANOVA                           Friedman test
one-way ANOVA                           Kruskal-Wallis test
independent-sample t-test               Mann-Whitney U test
one-way ANOVA                           Mood's median test
Pearson correlation                     Spearman correlation
z-test
for hypothesis testing
to check whether means of two populations are equal to each other when pop variance is known
we have knowledge of:
- SD/population variance and/or sample n=30 or more
if both unknown -> t-test
REJECT NULL HYPOTHESIS IF Z STATISTIC IS STATISTICALLY SIGNIFICANT WHEN COMPARED WITH CRITICAL VALUE
z-statistic / z-score = number representing the result of a z-test
z critical value divides the graph into acceptance and rejection regions
if the z statistic falls in the rejection region -> H0 can be rejected
TYPES
One-sample z-test
Two-sample z-test
each can be left-tailed, right-tailed, or two-tailed
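A compact sketch, assuming NumPy/SciPy, of a one-sample z-test and the critical-value comparison just described; the sample, hypothesized mean, and population SD are all assumed values.

```python
import numpy as np
from scipy import stats

# assumed setup: sample of n >= 30 with known population SD
rng = np.random.default_rng(42)
sample = rng.normal(loc=103, scale=15, size=36)
mu0, sigma = 100, 15                      # hypothesized pop mean, known pop SD

z_stat = (sample.mean() - mu0) / (sigma / np.sqrt(len(sample)))

alpha = 0.05
z_crit = stats.norm.ppf(1 - alpha / 2)    # two-tailed critical value (about 1.96)

# H0 is rejected if the z statistic falls in the rejection region
reject_h0 = abs(z_stat) > z_crit
p_value = 2 * stats.norm.sf(abs(z_stat))
print(z_stat, z_crit, reject_h0, p_value)
```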
Point Biserial correlation
measures the relationship between two variables:
- one continuous variable (ratio/interval scale)
- one naturally binary variable
rpbi = correlation coefficient
FORMULA:
rpb = ((M1 - M0) / Sn) * √(pq)
M1, M0 = means of the continuous variable in the two groups of the binary variable
Sn = SD (of the continuous variable)
p, q = proportions of cases in the two groups
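A short sketch, assuming SciPy, comparing the formula above with scipy.stats.pointbiserialr; the score and group arrays are invented example data.

```python
import numpy as np
from scipy import stats

# made-up data: a continuous score and a naturally binary variable (0/1)
score = np.array([12.0, 15.5, 11.2, 18.3, 14.1, 19.0, 13.4, 17.2])
group = np.array([0, 1, 0, 1, 0, 1, 0, 1])

# rpb = ((M1 - M0) / Sn) * √(pq)
m1 = score[group == 1].mean()
m0 = score[group == 0].mean()
sn = score.std(ddof=0)             # SD of the whole continuous variable
p = (group == 1).mean()
q = 1 - p
rpb_manual = ((m1 - m0) / sn) * np.sqrt(p * q)

# SciPy equivalent
rpb, pval = stats.pointbiserialr(group, score)
print(rpb_manual, rpb, pval)
```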
Two-sample z-test
to determine if means of two independent populations are equal or different
to find out if there is a significant difference between two populations by comparing sample means
knowledge of:
- SD and sample >30 in each group
eg. compare performance of 2 students, average salaries, employee performance, compare IQ, etc
FORMULA:
z = (x̄₁ - x̄₂) / √(s₁²/n₁ + s₂²/n₂)
s = SD
or, with a hypothesized difference between population means:
z = ((x̄₁ - x̄₂) - (µ₁ - µ₂)) / √(σ₁²/n₁ + σ₂²/n₂)
(µ₁ - µ₂) = hypothesized difference between pop means
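A minimal sketch of the two-sample z formula above, using NumPy and the normal distribution from SciPy; the two groups are fabricated illustrative samples.

```python
import numpy as np
from scipy import stats

# made-up samples from two independent groups (n > 30 in each group)
g1 = np.random.default_rng(0).normal(loc=52, scale=8, size=40)
g2 = np.random.default_rng(1).normal(loc=48, scale=9, size=45)

m1, m2 = g1.mean(), g2.mean()
s1, s2 = g1.std(ddof=1), g2.std(ddof=1)
n1, n2 = len(g1), len(g2)

# z = (x̄₁ - x̄₂) / √(s₁²/n₁ + s₂²/n₂), hypothesized difference = 0
z = (m1 - m2) / np.sqrt(s1**2 / n1 + s2**2 / n2)
p = 2 * stats.norm.sf(abs(z))      # two-tailed p value
print(z, p)
```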
ANOVA
Analysis of Variance
comparing several sets of scores
to test if means of 3 or more groups are equal
comparison of variance between and within groups
to check if sample groups are affected by the same factors and to the same degree
compare differences in means and variance of distribution

ONE-WAY ANOVA
"one-way" = number of IVs: a single IV with different (2 or more) levels/variations has a measurable effect on the DV
compare means of 2 or more independent groups
aka:
- one-factor ANOVA
- one-way analysis of variance
- between subjects ANOVA
Assumptions
- independent samples
- equal sample sizes in groups/levels
- normally distributed
- equal variance
F test is used to check statistical significance
higher F value -> higher likelihood that the difference observed is real and not due to chance
used in field studies, experiments, quasi-experiments
CONDITIONS:
- min 6 subjects
- same no of samples in each group
H0: µ1 = µ2 = µ3 ... = µk, i.e. all pop means are equal
Ha: at least one µi is different, i.e. at least one of the k pop means is not equal to the others
µi is the pop mean of group i
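A short sketch, assuming SciPy, of a one-way ANOVA with the F test described above; the three group arrays are invented example scores (6 subjects each, equal sample sizes).

```python
import numpy as np
from scipy import stats

# made-up scores for three independent groups (equal sample sizes, min 6 subjects)
g1 = np.array([23, 25, 21, 27, 24, 26])
g2 = np.array([30, 28, 32, 29, 31, 27])
g3 = np.array([22, 20, 24, 23, 21, 25])

# H0: µ1 = µ2 = µ3; a larger F (smaller p) suggests at least one group mean differs
f_stat, p_value = stats.f_oneway(g1, g2, g3)
print(f_stat, p_value)
```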
Pearson Correlation
Mann-Whitney U test