Week 1-12 Statistics
Week 1-12 Statistics
:*
¥
'
:*
Data :
Data that
collected
are
fact and
figure
summarised
are
analysed and
for
presentation and interpretation .
data
# Statistics is art
of learning from
Descriptive statistics :
Inference statistics :
lt is concerned with conclusion
drawing
from data .
Population :
total collection
Population is
of all elements that
interested in is called
we are
population .
Sample :
will be
The of the population that
subgroup
studied in detail is called *ample .
structured data :
These are data which are
organised
in
predefined fashion
.
Unstructured data :
these not
are data which are
cases : lt is a unit
for which data
is collected .
Variable : It
is characteristic that
varies across all units .
[ each student )
Eg
: case
Variable C name marks board , , ,
gender :)
Data
Numerical
Categorical
Qualitative variable Quantitative Variable
Describe Continuous
in
float cmeasure) in
integer counts
scale
of measurement ÷
Nominal
} categorical Data
>
.
→ Ordinal .
Interval and
} Numerical Data
→
,
→
Ratio
Nominal scale :
consist labels
when data is
characteristic
ofobservation or
name
is known
as
as nominal scale
of .
. . , ,
Weather →
comfortable uncomfortable
. or .
→
Some nominal variable can be
numerical ( like MIF
yecoaded 011 ) .
=
Ordinal scale :
Excellent
Eg good hotpoor
: or .
Cold, warm or
.
On this
diff b/w Excellent to
good
→
may
.
not
equal to
diff b/w good to
poor
.
.
Interval scale
of measurement :
alt has all the
data and in this interval
ordinal
b/w values property of
is
expressed in term of a unit
fixed
of measure
is called
than scale
internal scale
. measure
of
.
Interval data
always
> are numeric .
can
find diff b/w 2 value
.
, ,
Ratio scale :
lt has all
property of interval scale
and here ratio
of 2 scalemeasurement
then scale
are
meaningful ,
Run etc
Eg Height marks
weight
-
.
, , ,
,
Categorical data :
Frequency distribution
distribution :
Frequency of qualitative
value and their
data
is
listing of distinct
frequency .
Relative frequency the ratio of frequency :
observation .lt is
to total
of no
generally
-
used to 2 dataset
compare
.
leg
: -
AAABBCCCCD AAAAABCCCCCCDDD
B 11 2 '
2 B I I 0-06
C 1111 4 '
4 C ☒1 6 04
D 1 I •
L D 111 3 02
Total 10 Total 15
Charts
of categorical data :
We use
pie Chatto each cat data -
.
Bar
display value
of
horizontal
distinct
qualitative data
with relative
on axis
chart frequency
.
We to
bar each to other
use
category compare
.
A A
A
B
D D
B B
D C C
C
occupy
part
amountof graph should correspond
data represented (Diff bar
to
of . .
should not be
of diff width ) .
Trunk at ed
graph ( baseline of graph
→
You may
use
graph by using y-axis
.
%
90 -
A
80 -
80 _
☐
B
70 .
70 .
e
60
s←
.
50 .
(
misleading) Y-axis brake
Measure
it of
of2 centralin tendency :
is
type categorical data :
Median ,
Mode .
Mode
of of categorical column is
: Mode a
most
category
common .
largestwith
bar
largest pie
than
or .
→
If have 2
we
of there is than 2
category than it is multimodel
→
more .
Median : Median is middle observation of
sorted value 9m ordered form . .
AAA BCABAACCCDABC
Eg
→
"
"
A A A A A A A B B B C C C C D :D
Y ¥ median Codd
median ( even)
)ded
9-1 tries to divide data in 2
half .
Numerical data :
lf desecrate data is
single value data
→ .
Byabout
findingdata
frequency
→
know
.
you can
No
of people diff house 2,43 4,5 2,3 3,3
in .
→ ,
, , , 4,4 1,253,4
,
Converting in category : 2I 32 3 -
.
45
a
f- 5 4 I
5 _
5
4 _
u
3 _
3
2 _
2
I -
I
>
I 2 3 4 5
10 -
> there
organise data in number
of
understandable:
classes to make data
Each observation should
belong to
exactly
→
I class .
•
Marks is measured not counted so continues data .
30,40, 30,11 ,
27 , 34,45 48 41,93 , 21,47, 36,33g
, ,
Class interval f- Rt
10 -20 I ◦
0
20-30 3 •
05
30-40 8 •
4
40-50 8 •
4
Total -20 I
g-
7-
6-
5-
4-
3-
2-
I -
>
% So 30 Go ÉO
Stem -
and-
stem
leaf
5
Ig
→
75 = 7
75 , 78 =
7 5,8
Steam leaf
I 0 5
2 23 589
3 I 6
4 58
Descriptive measure :
Measures ft indicate
of central
tendency
> :
→
Measure of dispersion : those measures
indicate variability /Spread in data .
Measures
of central
tendency :
et
capture
> Mean
centre or
typical ness
of dataset .
→ Median
→
Mode
lt is most
commonly used
Mean ÷
lt is
measure .
→ mean
reffersas average
.
Population mean -_
µ =
×, -1×2-1×3 -1 . . .
-1in
N
Lecompte → mean
☒
471=7
2 12,5, 7,6 7,3 =
, ,
7-
2,105,5>7 6,7 ,
5--1261=21.33
% I. 2 3 45 Total { fixi
"n
% 2 3 54 I 15 Mean __ =✗-
Rift 2 6 15 16 5 44 Ifi
i= ,
4,51=2.93
mean __
Mean
for continuous data :
73¥ = 365
→ 36.5
approximation
is not mean be we
are not
midpoint
looking
to data but
. only seeing
add constant to in
→
of you
dataset than every point
old ☒ + constant
new ☒
your
= .
→
of you
dataset
multiply constant to
every
old
point in
than constant ☒ ☒
your
new = * .
outlier
Highly affected by
→
.
Median : -
a- is another
frequently
lt
used measure
divide dataset
of
central
tendency
.
in
top 50% & bottom 50%
-
= no
avg of Cnz
and -11 . n
-2
) .
Example →
2,9 ,
4, 6,7 ,
8
On odd .
On even .
" °
median 3rd median
A- 5
52+1 6=2--3 6-2+1--4
= =
=
,
is added to each
Of dataset
constant
point of
median
,
length
will be
does 'n
old change
median + c.
so new
each data
ofconstant is
multiplied to -
point
-
median
-
Than due to some
will be old
length
median ✗ c.
.
Mode :
value
it is most
frequently occurring
of dataset
occur more than
.
than
of value I
-
no
there is no mode .
is added to each
Of dataset
constant
point of
mode
,
length
will be
does in
old mode change so
+ c.
new
point
mode
Than due to some
-
will be oldmode
length ✗ c.
.
outlier
Let's
Don't
affect 2
by
data :
.
compare
D1 3. 3.3.3.3 D2 =L, 2,3,
→
4,5
→
finding DL measure
02
of central
tendency
:
above
descriptive
difference
measure
quantaténely
that indicate
,
we use
→
Range ,
→
Variances
→
standard dauiation ,
Interqiartile range
→
.
Range -9-1 is
defined
: difference as b/w
and lowest value
largest
dataset .
of a
Let's 2 data :
compare
DL 3. 3.3.3.3 D2 =L, 2,3,
→
4,5
Range =3 -3=0
Range 5-1=4
=
103--1,2 3 4,15
, , As we can see it do well in
So ,
Range is extremely enciline to outlier *
.
to measure
is consider daviatéon variability
dataset
One
way of
-
12345 Centrepoint =3 ☒ __
02 (R µ)2+(R2-NY2+ -16cm UP
- -
= , . . .
Sample R variance :
( Rn -502
5- = ( ,
-
Ñ)2+(kz-ÑJ+ .
.
.
+
n -1
(E , -
E) Ck , - IF
Eg DI Deviation from
Data
:
Squared Deviation
-
mean
.
I 68 68-59 =
9 81
2 79 79-59 =
20 400
6 70 70-59 = 11 121
7 61 61-59 = 2 4
Population variance -_
18,098--1898
Sample
189*-210.88
variance =
standard deviation .
ÑJ+
2
s = (R ,
-
B) + ( R2 -
.
.
.
+ ( Rn E) 2
-
n
-
Ds .
Sample Std .
deviation =
210.88
Of a constant is
multiplied to all data point
new 51 deviation old 51 deviation ✗ constant
= .
d- is affected outlier
by
-
.
percent
of data
of
.
To
find percentile :
-
Arrange data in
ascending order
-
.
it is odd don't
get integerthan get
'
if determine
'
we
part)
greater
=
.
But if np is
integer and ripof-11
in case even
-
.
avg of rip
than take .
data :
Example : Arranged
35,38
-
,
47 , 58
,
61, 66, 68,68, 70,79
P np
01 I (35+38)/2 =
36.5
025 25 =
47
05 5 (61+66)/2 =
63.5
0.75 75
=
68
I 10 =
79
→
Quartile :
fine number :
Minimum - summary
1st quartile ( lower )
-
02 03 ( median)
quartile
min a Max
2nd _
median
3rd ( upper )
quartile
-
Maximum
Interguartibe
→
:
The range
interguartileand
range
LIQR) is
difference b/w first third
quartile .
IQR= Qs -
Qs
Eg
-
3rd
quartile
(03-01)
,
03--68
IQR 1825 =
lt is also measure
of dispersion .
Contingency table
a- is also called
:
Eg
:
3
27 39
9 5 14
Total 38 62 100
>
Row relative frequencies : d- is
dividing
total
each row
by its row .
Gender No Yes
Eg Total
:
>
Association b/w two variable
whether
:
A- is
finding
variable
information about
one
provide information about
another variable .
of the column relative
frequency
→
row or
all than two
fornot
are same rows
variable are associated with each
other .
60 ?
no yes
100 -
no yes
40 _
75 .
50 _
20 _
25 _
38.0%62-0%100
from income
So ,
Total
High 52 % 290%2006
Medium 71.0% 62.94 66.06
low 23-6 % 81 % 14.08
Total 38 62 100
that
Scatter plot a- is
graph displays
: a
pair of values as
points on 2- ☐ plane .
7-
Age Height
I 75
←
100 - '
'
'
2 85
.
* .
60
3 94
_
↳ 101 20 -
5 1 08 ' '
%
"
¥
'
%
•
e 2 y s
>
Direction → Does pattern Up Down
,
or both .
>
,
→ Cavaliere > bit t.in ear or curve .
'
'
:
ii.÷:: : ::
:
: :
::
> Outliers > Outside pattern (exceptions) .
Measure of association :
Covariance
-
Correlation
( Yi 5) -
Age Height
I 75
Deviation
-2
✗ Deviation
-17.6
( a-
35.2
ly , -5)
7- 6
2 85
-
I -7.6
3 94 0 1-4 0 I
Age Height
I 6
Deviation
-2
✗ Deviation 's
2
( a- Cy , -5)
-
y
2 5 -1 I -1
3 4 O O O 2
↳ 3 I -
I -
I
5 2
2 -2 -4
☒ =3 5=4 4=40
2
' ' mean
Cov > 0 Corso \
Y Y
° ki E)70
Lying > ^
-
'
-
( Ri ÑKO
-
cyi.gl> 0
DO _
6- .
'
100 _ .
5- .
.
'
80 _
4 _
.
.
I -
60 3
_
_
.
( Rix )
40 Yi F) <
-° 2- .
(y g)< o
-
,
-
20 _
( Rix ) >0 I _
.
¥
" • " '
¥
'
%
' •
%
•
I 2 3 y 5 I 2 3 y 5
associated with
when
large ✗
large 's and
>
is
vice-versa the deviation
sign will be
associated with small Y and
same
When
large ✗
>
is
vice-versa the deviation
sign will different . .
Population covariance
n
:
Cr Illy g)
-
-
Cover g) ,
=
i=1
N
sample covariance
n
:
Cr Illy g)
-
-
Cover g) ,
=
i=1
n -1
1
Population covariance
-8,1=164
- 2
Pop -
Cov .
=
=
-2
Sample covariance
-841=205
-
Sam .cov=
-41=-2-5
Correlation A- is more
interpreted easily
:
d- is derived
from covariance
→ .
( r F) ( ,
-
, -5)
> Correlation =
it
=
COV ( R , Y)
"
( Ri -
Ei
n
( y ,→, Sxsy
E- I E- 1
Hi -732 , -55
Age Height
I 75
sq.Dev.ofxsq.Dev.ofxlri-xxy.FI
-
E- 4 -17.6=30976 35.2
2
" I -7.5=5776 76
85
-
=
= 0
3 94 0
1-42 1.96 :
4 101 [ = '
8.4%7056 8.4
5 108 Ñ= 4 15.4=23716 30.8
☒ =3 5=926 Elo E- 6772 5=82
Correlation __
Cov Cri) =82_ Or 2 0.9964
Sxsy THE 1.58×13-07
Hi -732 -55
Age Height
1 6
-
sq.Dev.ofxsq.Dev.ofxcri-xxy.FI
I y 22 = 4
y
= -
2 5 -5 =
I 5 = I -1
3 4 =
0 = 0 0
↳ 3 i =
i -
i = I -
I
5 2 22 =
& -22--4 -4
☒ =3 5=4 2=10 EIO E. = -10
correlation __
Cover,Y) =
-10 or -2.5 =
-
&
Sxsy Foxx 1-58×158
2
mean
Corr Correo
' '
>0 1
Y Y
° ki ñ )> 0
Lying) > ^
-
'
-
( Ri ñK0
-
cyi.gl> 0
DO _
6- .
'
100 _ .
5- .
.
-
80 _
4 _
.
,
I -
60 3
_
_
.
( Rix )
40 Yi -81<0 2- .
(y g)< o
-
,
-
20 _
( Rix ) >0 I _
.
¥
" " ' '
¥
'
%
' •
%
•
I 2 3 y 5 I 2 3 y 5
through line .
Point Bi serial
-
correlation coefficient :
.
,
value numerical
→
compute
column
mean
of
in respect to encoded cat column .
.
male/Female student
egPo
→ mean
of marks of .
→
and Ps is proportion of group Eg . :
-
Jo -51
Tpb =
Pops
Sx
PERMUTATION & COMBINATION
Adding rule of counting :
Eg
: There are 5 paints and we
of
possiblity to choose will be 5+4 10
ways
=
.
ways
.
Ni + Nat + Nr r'actions
possibility altogether for
'
. . .
.
of
possiblity to choose will be 5×4--20 ways
.
.
numbers -10 -
ways
.
ways
=
26×26×10 ✗ 10X 10 =
6760000 .
not allowed
If reputation : 2625-10987
=
26×25×10×9 ✗ 8×7--3276000
Eg → There alhokte who take part in 100m race
What are possible
are
ways
.
→ Position I 2 3 4 5 6 7 8
8 7 6 5 4 32 4
ways 8×7×6×5 ✗ 4×3×2×4=40.320
No
of ways
=
number )
n ! . n ! =
is
n ✗
called
(n -
1) ×
factorial
.1 . . ☒
denoted
by
Note : 0 ! =L
So i≤
9h
general for have ,
n
, we
,
! n ( = n ✗ n -
1) • a. ✗ ( n it 1) ✗ ( n
-
-
i) !
Eg :
6g,÷ =
6×5×4113×24
3×2×5
=
6×5×4--120 Or 6×5×4×34
34
6! ✗ 51
=
6×5×4×3 ! ✗
5×4 ! =
6×5×4×5=600
31×4 ! 3! 4!
Eg :
25×241123×22×21 ✗ . . # 25 !
=
=
22×21 ✗ . . . ✗ I 22 !
or some n' .
by formula
:
N ✗ (n - 1) ✗ . .
.
✗ (n - 81-1 )
& is denoted "
by Pr
MPs = N ✗ (n -
1) ✗ ( n 2) ✗ -
. .
✗ ( n -81-1 )
^
Pr = n ✗ (n -
1) ✗ ( n 2)✗
-
. . . ✗ In -8-1 1) ✗ ( n 8)( n
-
-
8- 1)* . .
✗ I
( n 8)( n
-
-
8- 1)× . .
✗ I
npr =
n !
a- 8) !
Special cases :
1.) npo = n ! ni I
n;-
= =
(n -031 .
2.) MPs = n !
= n
( n -
y!
3.) npn = I n ' n! where 0 ! = 1 .
G-
=
(n ng ! -
Eg : No
of ways of 8 person committee can be choosers
for
Chairman & vice chairman without repititon .
n -8 8! 56
ways
-
=
.
22 (8-2) !
No of 4
digit no can be formed using digits
1,2 , 3, 4,5 if no
digit is repeated .
>
=
n -5 -
=
5! =
120
8--4 1 !
How no can be
formed ? Same
many
even .
- _
2-
- -
-
-4 fixed
Sog N = 4
g
8--3
N = 10 ,
8--6
'°
Po =
151200
3W 3W =
3×3--9 32--9
ways
The possible permutation of
of objects
' '
no
.
r
denoted
by no
.
how ?
DATA times it
many
can be
rearranged
DATA
=
DATA DANA 198 ,
, we consider A different )
AiTAzDAaTA.D (
ATAD
similarly )
4! if total ways to
arrange
4 letter wordsin .
'
/2 !
arranged
'
DATA be in 4 ! 12
ways
=
can .
are
of I kind and rest distinct is equal to n !
P!
No of permutation
-
of
'
n'
object where ps is one kind &
Pa is diff .
17 ! Da ! . . .
Pn !
For statistics
' '
n =
to p=5 =
10 ! =
50 ↳ ◦ ° >
s
=
3 3! ✗3 ! ✗ 11×2! ✗ 1 !
it =3
a
=
I
i = 2
C =
I
circular
arrangement
:
A C
rotated
D B = B D Both are same
just .
C
A
A A
D B ≠ B D
C C
Mirror
images are
different .
anticlockwise rotation
9f
than
clockwise
will
or
be a- 1) !
are some
it
2
anticlockwise rotation different
9f clockwise or are
point will be Cn 1) !
means
if we
fix one
-
A C
D B ≠ B D
C
A
Ex Solve
for
. n .
"
Pu =
20hPa =
A- =
20 ✗ MI
En -41 ! In -27 !
( n 2) -
✗ ( n 3) -
=
20 do , D= -2 or
n=7✓
,
1%-3
T.hn?En:?; ---3-Bnn--y
=
§
= → 2h
= 20 B n -
-
to
(5-8) !
2×61
↳
¥÷y .
=
(7-8) ! :
-
(7-8) ! =
G- 8) ( 6- 8) (5-8) !
É¥=¥→%?¥→ → (7-8116-8)=12
82-130+30--0 0--3008--10
Combination:
In combination ab=ba
is
The electing possibleofation daterobject
number comber
by "C and is
given by: wer=u!)!
R!
In other word,
selecting object from 'n' objects
is same as
rejecting new objects from' objects
"In and "Co:1 for all values
1
of
=
n
Crest "Cr;
4-1
"Cr =
KUKR
Examples en with
paper is divided part
exam a in a
A student need to attend
↳95
question.
8 ques & min. of I have to done in both.
* TC3s+'cy"CutC, 3Cs=35+1757218-420
ways.
Total of ways to choose card 52 cards?
52Cn
no
526
from
270725
=> =
↳! 48!
All" card of same suits?
48
x32n=uxted:28
Select 11
there and
players from
bowler
17
player where
must be their
5 bowler
?
are
4
Ways
•
5cg ✗ 12
Cy 5!
=
×
12 !
=
5×950402=3960
4! 1 ! 7!5 ! 120
ways
In points of
that
general given
be drawn
in
, ,
number
point
line
segment
can
connecting is ncz .
→
Permutation is used when Onder matters .
→
Combination is used when order don't matter .
Eg
:
Perm : No
of ways 8 athlete can come 1st 2nd 43rd
.
athlete
Comb : of ways qualify
No 3
from 8 -
Similarly going
lines
Direction matters
to dot if :
Permutation - .
Probability :
an observation or outcome .
Random experiment : lt is experiment whose
an
Event : it is collection
of basic outcomes . 9T is subset of
*
ample space We event occurred if event is
say
.
contained in subset
Eg : Die roll ( 1.3.63 coin toss ( HH)
.
.
Intersection event
of event ( n ) : For 2 or more we
Venn
Disjoint
diagram event
ACB
A
☒
Subset
Null event and disjoint event :
Event without outcomes are called null event
any
.
( Ol )
Symbol
If intersection of A and B is null event than
A & B can't occur
together do A and B is
( An 13=0 )
disjoint
00
mutually exclusive event .
compliment of event an :
all
'
A compliment of A is denoted
by A means
outcome of sample spaces that are not in A.
Eg The
Coin toss
: 5- { Hit HT.TT TH} A {HT} A ÷ { HH.TT TH }
=
.
, , ,
A is contained B A is subset B- A CB
say by or
of
Eg 2 coin toss S { HH HT, TT , TH }
' =
.
,
A Ist = head =
{ HH , HT} ,
Both head =
{ HM } B =
we can BCA
say
ProbabilityClassical :
every
-
ñ
Even dice roll
Eg
: in -
no
6=(1,2/3,415,6)
3=12,4 b)
My 62 1-2
m = n = =
, = =
Relative 9T calculated
frequency is
by repeating
:
of experiment .
PC E) =
Giz %Eˢ
Eg 9h 1000 toss
you get
head 468 times
:
so
statement This
guess by person making
.
individual's
probability is measured by
degree of belief in event .
There tomorrow
Eg % chance is 70
you
will win
: .
Probability
For
Axioms
event E. the probability of E
:
is number
any
between 0 and I That is 0 ≤ PCE ) ≤ I .
, .
The
probability of sample space is 1. ( the outcome
with PCE ]=D
of exp
.
will be element
of sample spaces
The
probability of the union of disjoint event is equal
to sum of
prob of these events PCE UEa) =P(E) + RED . .
,
;)
•
P JE =
Pl Ei )
i I -
- E- 1
E,
3g Es
%
=
=
PIE U ED
} +3g 8- I
=
, =
=
General property of probability :
E- and E ' is
disjoint EUEC -5
-
PCS)=I PCE ) ,
-1 PCE )= I
'
PCE ] =L PCE)
'
-
P( 01--0
Probablity
5=0
-
of null event
et also work
-
disjoint
PCE.VE ) =P (E.) PIED . + -
P( Eines)
a •oB
E. UEz= IUIIUIII
I II III E ,
= IUII
Ez = IIUIII
Et Ez
• Mi Ein Ez = TI
①
PCE ) ,
PLIUII )
=
> PCI )+P(II) -
①
P( Ez) =P # VIII ) > PCI)tP(TI) -
④
PIE,nE2 ) =P ( TI ) > PHI ) -
From 11,1114W
get
1 we :
,
P( Eines)
Eg→ both
Customer probablity to
buy shirt 3 apart neither -2 •
1. what is •
is probability of customer to buy .
5=-3 ,
P= •
2 ,
Snp -_ •
1 then hither will be P (SUPT
:-P ( sup)= Pls] -1 PCP] PCPAS ) -
= .
3-1.2 -
•
I = •
4
thence PCSUP ) ? 1- 4
,
6 ◦
= •
Own smartphone
Gender No Yes Rowt .
Female 10 32 44
Male 14 42 56
Column 1- 24 76 100
Female •
10 •
32 •
44
Male ,y 42 56 Probof being male
•
. •
.
column 1- 76
owning smartphone
•
24 • 100 ☆ .
Joint
Probablityigt
intersection
represent probablity
events
of
2 of or more
.
Pl PCFO )tP(FO )
'
Lg : female) 10-1.34=-44
•
=
=
Planning Phone )= •
34-1.42=70
Conditional
probability
:
>
Among femalenotbuyers what chance
of owningmale
phone
' '
=
is
a row or column .
of contingency
Gender No Yes Total
Male
Female
10kt
14156
34/44
42/56
44
56
By Row
Total 24/10076/100 100
PCEIF ) d- is conditional
probability of E-
given
-
f- has occurred .
The of that
probability event E occurs
given
event
given by
:
f- occurs .is
PCE / F) = PCENF ) ; P( F) O E F
P( f)
5
Multiplication rule : conditional
probability formula
:
P (f) .
PCE I F) =
P ( EAF ) . P (f)
P (F)
•
: PCE n F) = P (f) •
PCE / F)
9T states that prob of .
both E &
Foaming together
is
equal to probability that F occurs multiplied by
cond prob of E F occurs
given
.
. .
lt is
useful to compute proto of intersection of event
Eg :
Cond I
Of 40
-
student male 17
female
23
list st female & 2nd male )
-
.
P ( f, n Ms) =
PCF ) ,
•
P ( Mal F.)
% 233-9 251
= •
• =
event
arbitrary no
of is robbers as :
PCE MEN . . . A En ) =
PLEA P( Eat E) PIE 31 Ein Ez ) . . . P (Ent Ein Ez . .
.
n En -
s )
Eg : A deck of
the
52 card .
divided in 4 pile randomly .
What
Ace of
is
probability of following events .
Er :
spade is in
any of pile 4 .
PLE ) ,
= I
=
3¥
E3 : AS AH AD . , in diff .
pile .
Pi and AM in P2 so to
we put As in
fillSorest
12-12=24 card A & As we have 50 cards
of no
'
. ,
A & Pa complement
of ways AD can be in is
PLEW EA Ez)= I
of -7¥ =
3¥
Pl Eal Ein Ean Es )
got by fixing AS AH
we
Ey : =
12 (
Firing 3 card in and3 deeds
49
choosing from 36149 complement
Pl Ein Ean Es MEU)=
3¥ 3¥ ¥9
:
Eg .
.
≈
0105
Eg
Since :
.
product of individual
probability -
§
( 1 1) ( 1 2) ( 1 3) ( 1 4) ( 1 5) ( 1 6)
, , , , , , , , , , , ,
( 2 1) ( 2 2) ( 2,3 ) ( 2 4) ( 2 5) ( 2 6)
, , , , , , , , , , ,
( 3 1) ( 3 2) ( 3,3 ) ( 3 4) ( 3 5) ( 3 6)
, , , , , , , , , , ,
g =
,
,, ,
( 4 , , ( 4,3 , ( g µ , ( g g, , g g ,
, , , , , , , , ,
(5 ,
1) ,
( 5 2) ( 5,3 ) ( 5 4) ( 5 5) ( 5 6)
, , , , , , , , ,
g, , , ,
, ,,, , ,
, ,
, ,, , , ,
, , ,
, , , ,, ,
,,
Define foll
Ed
.
:
events :
First outcome is 3
6136 =
1/6
Ez : Sum
of outcome is 8 .
5136
E3 : sum outcome
of
6136 =
116
is 7 .
AreEin
event
Ea
E. & Ea independent ? No
→ is 1st outcome 3 & sum of outcome is 8 .
PCE NED ≠
,
PC E.) ✗ PIEa) , so E. & Es are not independent .
independent .
'
Independence of E and F
E and F independent does E and F?
If are so
'
En F
E En F F
En F & EUF is
disjoint mutually& exclusive
'
.
'
So , P (E) PCEN F) +
=
PCEAF )
PCE ) =P(E) ✗ PlF) + PLEA F)
PCE ) P (E) xp (F)
-
PCENF )
' =
'
PCENFY =P (E) Ptfe) so , Elif are independent .
Independent of 3 events :
Pl Ena ) PC E) ✗ PC 6)
=
Pl Fn G) PC F) ✗ PCG)
=
9f all these 4
happens than it is independent event .
Law of total
probability
: lot E and F be events In order
to outcome to be E it must be in both E& For bein E and
.
not in F.
'
Ehf
E En F F E = ( En F) UCEMF c)
Formula & interpretation :
PCE ) =
PCE NF ) + PCEAFC)
=P (f) PCEIF ) + PCF c) PCE IF c)
> E
Fi FI F3 Fk -
E =
( En FDU ( En Fa) UCENF}) U ( ENE )
P (E) PLEA f.) + p( Enfz) -1 P( Enfz ) + P ( En Fk )
=
Than event E
for any
-
P (E) =
PCE / f.) Plf ) -1 PIE / f) PCE ) + PCE If3) P (B) +
•
, • . . .
+ PCE/ Fr ) PCFK)
"
p( Ei ) =
PIE / Fi )P( Fi )
E- I
prob of .
event F conditioned on E- we know
PCF / E) =
PC Fn E)
P (E)
From def:
PCFIE) =
PIFTE) -
PCEIF) PCF)
PCES ↑CF) PLEIF) + DCFY) PLEIF4
mutually for
event E.
any A CFIE)= PCEIF) PCF)
K
↑ (EIFi) P(Fi)
=I
Sample space: A
distribution
random variable is numerical
of outcome of statistical experiment.
S S
CL, 1), (1,2), (1,3), (1,4), (1,5), (1,6),
( 2 1), (2,2), (2,3), (2, 4), (2, 5), (2,6),
33;·13.3,23,)3C,Cs
1 ), ), 2), 33), ( ·U), (, 5), 2,6),
S -
(
3 3 3 is3 33.6
3
13.),2),)s3, he
of how
many
outcome will have smaller of the
outcome is 3?
( 1,1 ) 2 1
( 1,3 ) 4 1 ( 1,5) 6 1
( 2,1 ) 3 1 ( 2,3 ) 5 2 ( 2,5) 7
2
( 4,1 ) 5 1
( 4,3 ) 7 3 ( 4,5) 9 4
(5,1 ) 6 1 ( 5,3 ) 8 3 ( 5,5) 10 5
( 6,1 ) 7 1
( 6,3 ) 9 3 ( 6,5) 11 5
( 1,2 ) 3 1
5 1 ( 1,6 ) 7 1
( 1,4 )
( 2,2 ) 4 2 ( 2,4 ) 6 2 ( 2,6 ) 8 2
( 4,2 ) 6 2
( 4,4 ) 8 4 ( 4,6 ) 10 4
Relevant
✗ Value
{(
event
Probablitylk )
2 1,1 ) } V36
3 { ( 12112,1 ) } 2136
4 { (1,31122%13,1) } 3136
'
h O i r
G r - -
10 {14,6%(5,5%16,4)} 3136
11 { (5,6%6,5) } 2136
12 { ( 6,6 ) } 1136
=9 9136
3
- .
.
=7 7/36
↳
. . -
=5 5136
5 { (5,5%15,6%6,5) } =3 3136
6 { ( 6,6 ) } = I 1136
Answer
Eg :3 coin
S={
are tossed .
HHH.HHT.HTH.HTT.THH.TN/-eTTHsTTT}
are head ?
Of these 3. tosses How
many
"
" "
"
which result in head first time inpos ?
.
→ bet ✗ =
hoof head ,
✗ take value 0.1.2.3
Value
of ✗ Relevant event Probablitycx)
0 { (TTT) } 118
I {( HTT),(THT) > ( TTM)} 318
2 { ( HHT) , ( HTM) ,( TTH)} 318
3 { IHHH) }
'
13
Value
of ✗ Relevant event Probablitycx)
I {( HHH ),( HHTI.LHTHJ.CH/-TD 418
2 {CTHH)g( THT )} 218
3 {LTTH ) } 118
Nil { ( TTT)} 48
Eg: life insurance
agent has
death.
a
elderly client who are
paid th
upon
Next year
mayhappen
Conditions t
Money
Both so *
IB 2C
Only younger die A AeBa I (
Only
Both
die i
elder
survive
A? B
B
IC
8
Evalues
Assume A&B are
independent event with
probabilities PCA) == 05 & PIB =.10
Relevent event
value of X
"BC
* PCAY X PCBY 95 x.9 0855
= =
=
Probablity
In descrete
mass
function:
random variable. We take countable
value
no
of possible
So let X discrete random variable with
be a
is possible value. Then it can lebled as M., 232,.. n.
por descrete random variable 1, we define
probablity mass
for CPMF) Pins of x by
↑(Ri)=DCX= i)
Re
1x=Ui)
TE, ...
En
, then
1) (vi)-O, i 1.2, = ...
PCR) = 1
== 1
S S
CL, 1), (1,2), (1,3), (1,4), (1,5), (1,6),
( 2 1), (2,2), (2,3), (2, 4), (2, 5), (2,6),
33;·13.3,23,),3C,Cs
1 ), ), 2), 33), ( ·U), (, 5), 2,6),
S=
(
3 3 3 33 33?;6
3
13.),2),)s3, he
(x=1)
↑
if ff f gs t
verify P(xi)=8=I
To
I 1
=
ply:yis t 85
Sum:1 so, it is PMF.
Graph of probability mass function:
helpful to
It illustrate
is mass
for in a
graphical
0x-xi) the
format by plotting on y-axis
against
xi theon x-axis.
of previous example:
·
Comulative distribution
function:
Flal=P(x1a).
It can be
expressed by
possible
who
9fy a variable
descreterandom
is
rests... at
distribution for For X is a step function.
It will be step to. So for an
example.
Eg
·
We want to
analyse no
of credit card owned
people.
byRandom experiment:
Random variable: Selecting
No of
adults at random.
cc owned
by person
Probablty
Eg
: consider
-
Even
a
game of rolling dice
outcome You loose amt
- .
.
ondice.
you
hundred
winning always
will after many in ve
-
rolls so won't
we
play outcome is Ho
.
Relative each
frequency of
.
will be
So ,
gain If -2 f- +3£ - ↳
f- -1-501--6f- =
-0.5
oeffers as expectation of ✗
given by
:
E- (X ) = ✗i P (✗ =
✗i )
i =L
considered obs
9T is as
long run
avg of
ind
.
. . .
random variable:
Expectation of a
fun of a
expected value
giveg(x) of =
gxPCX=xi)
if a and
are constayEx) at +6
Eg:
Expectation of sum
of a random variable:
->
expected value of sum of
random variable is
same individual random values.
as sum
of
E(x+1)= ECX) + ECY)
Eg:
expectation of sum
of
and Var
many
expected value of sum of random variable is
same individual random values.
as sum
Let, X., Xe,...,
of
XK be K descrete rand. Var.. Then,
Exi =
Ei
Example: Tossing a coin three times
Toss a coin i times.
Let Xi be a random variable which equals 1 if the outcome is
a head, 0 otherwise.
E(Xi)=0.5
X1 +X2 +...+Xn is the total number of heads in n tosses of the coin.
E(X1 +X2 +...+Xn)=XE(Xi)=0.5×n
For n=3,X1+X2+X3 is equal to the number of heads in three tosses of a coin.
E(X1 +X2 +X3) = 3×0.5 = 1.5
This is the same expectation of number of heads in three tosses of a coin.
PMF:
Variance of
let x be a
a
fan of random
random variable,
variable:
let a be a constant,
than, Var (CX) c"Var(X)
=
Proof:
Ecaxtb)=aM+b,
VarCax-b) Eaxtb-aN-b=ECarx-4/2)
=
=
aLE (X-MP=a " Var (x)
Variance
of of
a random variable:
sum
Than, Var(x+Y) =
Var(X) + Var(X)
ig*"="
#
var Var(X i)
c f
=
standard deviation
To calculate standard daviation
of random variable:
of random var
we use $P(X) =
Varles (the sproot of variance (
Properties of std.dav:
SDCCX) =
23D(x)
5D(x+C)=5D(X)
+C
* E(x) +9 vex) SD(X)
Examples:
Bernoullitial: A trial or an experiment whose
either
outcomes can be
classified as success or
Burnoulli
fail ure is trial
3:5 success, failures
which
let x be a rand. Var takes I as success
is:
L
x=xi)
O
* 1 -
P If
->
Expected
EX)=
value of berous and variable:
0x11-D) +PXP: P
->
Variance of bernoulli rand. Var
vCx P-p2=PC/-) =
success&
failure equally likely.
are
->
success' 'p'
'failure' with probablity 'l-p'
↑
as or
-
let X is total no
of success that vitrials,
occurs in
that X is binomial rand. Var with parameter n&P.
is
n = no of ind.trial (Heren fixed.)
P: Prob of success.
Binomial rand Var.:
X is binomial rand. Var with parameter n&P
that represent no
of successin ind. Demouli bials
when each trial is a success with prob.p.x take
within
value 0.1.
j the probableypini
Example A
ofdistribution:
binomial
Examples:
Expectation and variance
of binomial random variable
Binomial rand. Var is no
of success in 'n'indeptrial
when each trial is a success with prob. P.
Written as X =
Xt yet... +
Xn
a, var. of sum
of ind.var is sum
of their variance:
Ver (x) =
npC1-P3
&| ⑨#
⑧| ↑
·
Hypergermetricdistribution
the
copulation
=>
must
bedded into one subsetoneis
and
without
Sampling replacement.
->
another
'm' items are a
type?'N-m are
type
N 2
- ~ -
~
2 N-m X 21-X
let x be no
of items of type 1, then PMt of
descrete rand Var X is called
hypergeometric dist
m N-m
P (x x)= 21 X
-
= X x =
0, 1, . . .
4
,
i
N ix 14sxkm, n-XXN-m
H
45. Ib X= 2
Best
↳ W, 2b
bits ↳O
W
YW, 3 b 30,2b X= (
W B
3
3w, 3b 3/b
F
L
3)
G
2 w, 3b X =
8
Variance:
is
let x
objects are
a
hypergeometric
follow selected distribution in which
from N objects with in of object
in one
type aN-m objects
of being
second
type.
Vas(XS
M Nm
n
=
Nn
N-d
Poisson distribution: It probab of of events
give no
occuring in
fixed of
interval time or
space.
X denote no
of times an event occurs.
age:
Probability mass
for of This distribution is
poisson:
is poison descrete
no of
per interval,
X events
ang
rand, var. Xv Poisson (x)
P(x=x) 2*x*, X: 0,1,2,
= ...
x
-
Rand Val X represent no. of interval per time interval
->e is mathematical constant 2. 718.
expectation of possonone variancetion of pain
on
continous random variable: It is the one that has
possible values that
form an interval
along real line.
It assume value in interval
can
any an
with
it.
Probability dist
of a continuous and
curve var. Is also
called probat density fr. denoted by FIM).
Area under PBF:
Consider ept.4's, where is less than 'b'
any
that assumes value that lies
The
prob a
blu
ab is
equal to area under curve
blew amb.
P(xE (a,b)) =P19xxxb) is area under curve byw a 9b.
=(u)
PlaxX1b) =
IEndn
Blue area b/w0&L.
a b
is 1.
Cumulative dist.
for continous
In:
hand. Var X
F1al=P1x19) =
fLecxdx
Since, prob of cant rand Var X assume value
single
is always were so it will be same as above.
Expectation a variance:
Expected E=xf(xdx
value:
Variance:
(x-E)f(x)dx
Uniform distribution: A continous rand. Var has
a
uniform distribution, denoted XvU(a,b),
probab-density for is: 1/ (b -a) a <cb
Fus
fews S o otherwise
-
soi
a = 2
b =
4
b-a = 2
hence,
t =
=
0.5
given by o otherwise
Cumulative dist
of Uniform dist:for (a)
Eas I
for class
F
or (22 2b)
rand.
Expectation & variance
I of uniform Var:
Expectation of (Xrvca,b);
E(X)= a+ B
*
erive:
ECX=/*xSCX, detoxdx tatr6=ba
Variance of X WU (a,b);
Var(x) =
(b-a) 2
12
CEXIT =
b-taltab (talba
D)
erives
Var(x) =
Gx2) -
t
Examplet
Answer
Non-uniform distribution: These are dust, which
not at all interval.
are same
point of time in a
exponential distribution: A
&
rand var whose pdf is
geven as
5cns<*e** xC,0
O otherwise W
is said to be
exp. distribution
L
with
parameter x.
caf of exp. fn:
fas=P(x(9)
=
Jaxe dxa
=- ex o
= 1-ea
ECXY:
fe
VaRCE)= z
XR
Example: