Presentation 3
Probability and Sampling Distributions
R as a set of statistical tables
• The R suite of programs provides a simple way to obtain values from the statistical tables of just about any probability distribution of interest, and it also allows easy plotting of the form of these distributions.
• There are four basic R commands that apply to the various distributions defined in R.
• Letting DIST denote the particular distribution and parameters the required parameter values, the four commands are dDIST(x, parameters) for the density or probability mass function, pDIST(q, parameters) for the cumulative distribution function, qDIST(p, parameters) for the quantile function, and rDIST(n, parameters) for random number generation.
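For example, with the Normal distribution (DIST = norm), a minimal illustration of the four commands:
> dnorm(0) # density of N(0,1) at 0
> pnorm(1.96) # cumulative probability P(Z <= 1.96)
> qnorm(0.975) # quantile: the z with P(Z <= z) = 0.975
> rnorm(5) # five random draws from N(0,1)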
[Figure: two density plots, with x-axes z and y]
Probability distributions
> z <- seq(-4, 4, by = 0.1)
> y <- dnorm(z)
> plot(z, y, type = "l") # draw the standard Normal density first
> y2 <- dt(z, df = 3)
> lines(z, y2, col = "red") # overlay Student's t density with 3 df
• One can see that Student's t density is very similar to the standard Normal density, except that the t density has an additional parameter called the degrees of freedom (df).
Exercise: Change df=3 to various values (e.g. df=30) and compare the two curves.
Note: the t density (drawn in red) has fatter tails.
Probability mass function
• For discrete distributions, where variables can take on only distinct values, it is preferable to draw a pin diagram (type="h" in plot), as in the sketch after this list.
• par(mfrow=c(2,1)) # split the plotting window into two rows of panels
• The distribution drawn corresponds to, for example, the number of 5s or 6s in 50 throws of a symmetrical die.
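The number of 5s or 6s in 50 throws of a fair die is binomial with size = 50 and prob = 1/3, so a minimal sketch of the corresponding pin diagram is:
> x <- 0:50
> plot(x, dbinom(x, size = 50, prob = 1/3), type = "h")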
Example 2: the Poisson distribution with λ = 0.2
> x <- 0:10
> y <- dpois(x, 0.2)
> data.frame(Prob = y, row.names = x)
> plot(x, y, type = "h", xlab = "Sequence Errors", ylab = "Probability")
Cumulative distribution & Quantiles
• R provides a useful mechanism for determining p-values: instead of searching through statistical tables, they can be obtained directly with the pDIST and qDIST functions. Some examples are shown below.
> pnorm(1.96, 0,1) # the probability that Z is less than or equal to 1.96
[1] 0.9750021
> 2*pnorm(-1.96) # 2-sided p-value for the normal distribution
[1] 0.04999579
> qnorm(0.975)
[1] 1.959964
> 2*pt(-2.43,df=13) # 2-sided p-value for t distribution
[1] 0.0303309
To find the probability of getting t = 1.50 (or greater) when df = 15:
• Method 1
• > pt(1.50, df = 15, lower.tail = FALSE)
• [1] 0.07718333
• Method 2
• > 1 - pt(1.50, df = 15)
• [1] 0.07718333
What is the probability of getting 12.1 or greater from a chi-square distribution with 8 degrees of freedom?
• # Method 1
• > pchisq(12.1, df = 8, lower.tail = FALSE)
• [1] 0.1467976
• # Method 2
• > 1 - pchisq(12.1, df = 8)
• [1] 0.1467976
• qt() calculates the quantile of the t distribution for a given probability value and degrees of freedom.
• The default argument lower.tail = TRUE returns the lower-tail probability P(X <= x); it has to be set to FALSE to obtain the upper-tail probability P(X > x).
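A minimal illustration of lower.tail with the standard Normal:
> pnorm(1.96) # lower tail, P(Z <= 1.96)
> pnorm(1.96, lower.tail = FALSE) # upper tail, P(Z > 1.96), equal to 1 - pnorm(1.96)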
> qt(0.025, df = 13)
[1] -2.160369
> qchisq(0.975,1)
[1] 5.023886
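The t-test examples below use the samples Method_1 and Method_2 from an earlier slide. Judging by the Wilcoxon output further on (W = 89, p = 0.007497), they appear to be the latent-heat-of-fusion data from "An Introduction to R" (Section 8.3); those values are assumed here so the examples can be reproduced:
> Method_1 <- c(79.98, 80.04, 80.02, 80.04, 80.03, 80.03, 80.04, 79.97, 80.05, 80.03, 80.02, 80.00, 80.02) # assumed values
> Method_2 <- c(80.02, 79.94, 79.98, 79.97, 79.97, 80.03, 79.95, 79.97) # assumed values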
One Sample t-test
> t.test(Method_1, mu = 80)
or
> t.test(Method_1, mu = 80, alternative = "two.sided", conf.level = 0.95)
Two Sample t-test
For the previous sample data:
(a) Test for the equality of the means of the two samples.
(b) Test for the equality of the variances of the two samples.
# Box plots provide a simple graphical comparison of the two samples.
> boxplot(Method_1, Method_2)
# The plot indicates that the first group tends to give higher results than the second.
• To test for the equality of the means of the two samples, we can use an unpaired t-test:
> t.test(Method_1, Method_2, alternative = "two.sided")
## which does indicate a significant difference (p < 0.05), assuming normality.
Comparison of variances
• By default, R's t.test function does not assume equality of variances in the two samples (it uses the Welch approximation to the degrees of freedom). We can, however, use the F-test to test for the equality of variances in the two samples, provided that the two samples are from normal populations.
• Checking homogeneity (approximate equality) of variances is, on the one
hand, a necessary precondition for a number of methods (for example
comparison of mean values) and on the other hand the heart of a number
of more sophisticated methods (such as analysis of variance).
• F-test for variance equality
• This test is based on the ratio of the two sample variances; the null hypothesis asserts that the ratio is one.
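As a quick illustration with the (assumed) Method data above, the F statistic is simply the ratio of the two sample variances:
> var(Method_1) / var(Method_2) # compared against an F distribution with (n1-1, n2-1) df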
• Given the above data sets, the R code for this test at the 0.95 confidence level is:
> var.test(Method_1, Method_2)
## which shows no evidence of a significant difference, and so we can use the classical
t-test that assumes equality of the variances:
> t.test(Method_1, Method_2, var.equal = TRUE, alternative = "two.sided")
Paired t-Test
• Paired tests are used when there are two measurements on the same
experimental unit.
• The theory is essentially based on taking differences and thus
reducing the problem to that of a one-sample test.
• First generate the following data:
> x <- sample(Method_1, 7, replace = FALSE)
> y <- sample(Method_2, 7, replace = FALSE)
> t.test(x, y, paired = TRUE)
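Because the paired test reduces to a one-sample test on the differences, the same result can be obtained with:
> t.test(x - y, mu = 0) # identical to the paired test above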
• All the tests seen so far assume normality of the two samples. The
two-sample Wilcoxon (or Mann Whitney) test only assumes a
common continuous distribution under the null hypothesis.
> wilcox.test(A, B) # A and B denote the two samples (here, Method_1 and Method_2)
Wilcoxon rank sum test with continuity correction
data: A and B
W = 89, p-value = 0.007497
alternative hypothesis: true location shift is not equal to 0
• The package exactRankTests is required when there are ties in the data, in order to conduct an exact test.
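A minimal sketch using that package's wilcox.exact() function on the samples above:
> library(exactRankTests)
> wilcox.exact(A, B) # exact Wilcoxon rank sum test; handles ties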
Nonparametric Tests of Group Differences
R provides functions for carrying out the Mann-Whitney U, Wilcoxon Signed Rank, Kruskal-Wallis, and Friedman tests.
• # independent 2-group Mann-Whitney U Test
wilcox.test(y ~ A) # where y is numeric and A is a binary factor
• # independent 2-group Mann-Whitney U Test
wilcox.test(y,x) # where y and x are numeric
• # dependent 2-group Wilcoxon Signed Rank Test
wilcox.test(y1,y2,paired=TRUE) # where y1 and y2 are numeric
• # Kruskal-Wallis Test: One-Way ANOVA by Ranks
kruskal.test(y ~ A) # where y is numeric and A is a factor
• # Randomized Block Design - Friedman Test
friedman.test(y~A|B)
# where y are the data values, A is a grouping factor
# and B is a blocking factor
χ2 test for I × J contingency table
Consider the following two categorical variables:
x <- as.factor(c("Milk","Milk","Milk","Milk","Tea","Tea","Tea","Tea"))
y <- as.factor(c("Milk","Milk","Milk","Tea","Milk","Tea","Tea","Tea"))
These vectors of categorical variables are converted into a contingency table in R by:
table(x,y)
y
x Milk Tea
Milk 3 1
Tea 1 3
To carry out the test, we use the following R code:
chisq.test(x,y)
R output:
Pearson's Chi-squared test with Yates' continuity correction
data: x and y
X-squared = 0.5, df = 1, p-value = 0.4795
• There is no significant association between the two variables.
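To see the effect of Yates' continuity correction (applied in the output above, and worth reading more about), the test can be rerun without it; a minimal sketch:
> chisq.test(x, y, correct = FALSE) # Pearson chi-square without the continuity correction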
Correlation coefficients for continuous variables
> x <- c(1, 2, 3, 5, 7, 9)
> y <- c(3, 2, 5, 6, 8, 11)
> cor.test(x, y, method = "pearson")
• If the linearity of a relationship or the normality of the residuals is
doubtful, a rank correlation test can be carried out. Mostly,
Spearman’s rank correlation coefficient is used:
> cor.test(x, y, method = "spearman")
Exercises
• 5.1 Do the values of the react data set (notice that this is a single vector, not a data frame) look reasonably normally distributed? Does the mean differ significantly from zero according to a t test?
• 5.2 In the data set vitcap, use a t test to compare the vital capacity for the two groups. Calculate a 99% confidence interval for the difference.
• 5.3 Perform the analyses of the react and vitcap data using
nonparametric techniques.