Probability Sam
ALL POSSIBLE OUTCOMES: HH, HT, TH, TT
THERE ARE 36 POSSIBLE OUTCOMES
THE RULES OF PROBABILITY
$$ P(H \mid B) = \frac{P(H)\,P(B \mid H)}{P(T)\,P(B \mid T) + P(D)\,P(B \mid D) + P(H)\,P(B \mid H)} $$
EXAMPLE 1:
A car repair firm employs three paint sprayers, Tomy, Dicky and
Harry. Tomy is responsible for painting 25% of all the cars
produced, Dicky for 35% and Harry for the remaining 40%. On
the basis of frequent quality inspections it is discovered that, on
average, 5% of the cars sprayed by Tomy fall below the minimum
standard, while the figure for Dicky is 8% and for Harry 10%.
a. If a car selected at random is judged to be sub-standard, what is
the probability that it was sprayed by Harry, P(H|B)?
b. If a car selected at random is judged to be standard, what is
the probability that it was sprayed by Tomy or Dicky?
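A minimal sketch of how Example 1 can be worked with Bayes' rule. The dictionary names are illustrative only; the percentages are the ones given in the example.

```python
# Priors: share of cars painted by each sprayer (from Example 1).
prior = {"Tomy": 0.25, "Dicky": 0.35, "Harry": 0.40}
# Likelihoods P(B | sprayer): sub-standard rate of each sprayer.
p_bad = {"Tomy": 0.05, "Dicky": 0.08, "Harry": 0.10}

# Total probability that a randomly selected car is sub-standard.
p_b = sum(prior[s] * p_bad[s] for s in prior)

# a. Bayes' rule: P(H | B) = P(H) P(B | H) / P(B).
p_harry_given_bad = prior["Harry"] * p_bad["Harry"] / p_b

# b. P(Tomy or Dicky | standard): same rule with the complement rates.
p_good = 1 - p_b
p_td_given_good = sum(prior[s] * (1 - p_bad[s]) for s in ("Tomy", "Dicky")) / p_good

print(f"P(B) = {p_b:.4f}")
print(f"P(Harry | sub-standard) = {p_harry_given_bad:.4f}")
print(f"P(Tomy or Dicky | standard) = {p_td_given_good:.4f}")
```

The denominator P(B) is the same sum that appears under the fraction bar in the rule of probability stated before the example.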
Example:2
$$ P(X = x) = C^{n}_{x}\,\pi^{x}\,q^{\,n-x}, \qquad q = 1 - \pi $$
Example:
The four engines of a commercial aircraft are designed so that they each operate independently. Tests,
carried out over a long period of time, show that there is a one-in-a-hundred chance of in-flight failure of
a single engine.
What is the probability that on a given flight:
a. No failures occur?
b. No more than two failures occur?
c. At least two failures occur?
d. At most 2 of the engines run well?
e. Fewer than two of the engines run well?
f. Two of the engines run well?
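The aircraft example can be checked with a short binomial sketch. The helper names `binom_pmf` and `binom_cdf` are hypothetical, written here only for illustration; n = 4 and the failure chance 0.01 come from the example.

```python
from math import comb

def binom_pmf(x, n, p):
    """P(X = x) for X ~ Binomial(n, p)."""
    return comb(n, x) * p**x * (1 - p)**(n - x)

def binom_cdf(x, n, p):
    """P(X <= x) for X ~ Binomial(n, p)."""
    return sum(binom_pmf(k, n, p) for k in range(x + 1))

n, p_fail = 4, 0.01          # four engines, one-in-a-hundred failure chance

p_a = binom_pmf(0, n, p_fail)        # a. no failures
p_b = binom_cdf(2, n, p_fail)        # b. no more than two failures
p_c = 1 - binom_cdf(1, n, p_fail)    # c. at least two failures

# Parts d to f count engines that run well, so the success probability is 0.99.
p_run = 1 - p_fail
p_d = binom_cdf(2, n, p_run)         # d. at most two run well
p_e = binom_cdf(1, n, p_run)         # e. fewer than two run well
p_f = binom_pmf(2, n, p_run)         # f. exactly two run well

for label, value in zip("abcdef", (p_a, p_b, p_c, p_d, p_e, p_f)):
    print(f"{label}. {value:.8f}")
```

Parts c and d give the same number, since "at least two failures" and "at most two engines run well" describe the same event.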
BINOMIAL DISTRIBUTION
THE PROBABILITY THAT A STUDENT WILL PASS THE
FINAL STATISTICS EXAM IS 0.80. IF 6 STUDENTS
ARE SELECTED AT RANDOM, WHAT IS THE
PROBABILITY THAT:
Example
The probability distribution of a student's score is
A = 30%, B = 45%, C = 15%, or D =
10%. If 7 students are selected at random,
what is the probability that:
A. More than 4 students will get score A or C.
B. At least 5 students will get D.
C. Fewer than 3 students will get B or C.
D. All of them will get A or B or C.
Upload your results to the MM academic office.
Example : 2
Suppose that a family has 5 children. What is the
probability that:
a. At least 3 of them are male.
b. All of them are female.
c. At most one of them is male.
d. The number of females is more than 4.
Big-Quiz (15 points out of 100)
1. Income ($/week)    Number of households
   100-149            3
   150-199            6
   200-249            8
   250-299            6
   300-349            3
a. Calculate the mean, median, mode and coefficient of variation.
b. What is the minimum income of the households in the top 5%?
c. What is the maximum income of the households in the bottom 10%?
d. What is the maximum income of the households within 15% above the mean?
Big-Quiz
3. Two dice are thrown 6 times. What is the
probability that:
a. A total of 9 on the two dice appears
more than 4 times.
b. A total of 6 on the two dice appears at
most once.
c. A total of 8 on the two dice appears at
least 5 times.
d. A total of 12 on the two dice appears 6
times.
Big-Quiz
2. Year    Sales (units)    Price ($)
   2015    100              25
   2016    125              23
   2017    150              22
   2018    175              20
A. Build a regression model and explain it.
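One possible way to fit the model for this quiz table is ordinary least squares. Treating price as the independent variable X and sales as the dependent variable Y is an assumption; the quiz itself does not say which variable depends on which.

```python
# Data from the quiz table (2015 to 2018).
# Assumption: price is X (independent), sales is Y (dependent).
price = [25, 23, 22, 20]
sales = [100, 125, 150, 175]

n = len(price)
mean_x = sum(price) / n
mean_y = sum(sales) / n

# Least-squares slope b and intercept a for Y = a + bX.
s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(price, sales))
s_xx = sum((x - mean_x) ** 2 for x in price)
b = s_xy / s_xx
a = mean_y - b * mean_x

print(f"Sales = {a:.2f} + ({b:.2f}) x Price")
```

Under this assumption the slope is negative: each $1 increase in price goes with a fall of roughly 15 units in sales over the four observed years.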
2. Zi ~ N(0, 1)
$$ Z_i = \frac{X_i - \mu}{\sigma} $$
Zi MEASURES THE DISTANCE OF ANY
PARTICULAR VALUE OF X FROM THE MEAN (µ),
MEASURED IN UNITS OF THE STANDARD
DEVIATION.
THE SIGN OF THE Z SCORE MAY BE POSITIVE,
NEGATIVE OR ZERO. IT DEPENDS ON THE
DIFFERENCE BETWEEN THE Xi VALUE AND ITS
MEAN (µ).
FINDING AREAS UNDER THE NORMAL CURVE
Zi-value PROBABILITY( AREAS )
1.65 0.4505
1.96 0.4750
1.28 0.3997
0.50 0.1915
1.00 0.3413
The tabulated area always starts at Zi = 0 (the mean) and
extends to the selected Zi value (the standardized difference between
the mean and the selected value of Xi).
Example:1 (z table: at the end of this book)
The scores of 250 students in a mathematics class follow the
normal distribution, with a mean of 69 and a standard
deviation of 10. What is the probability (the area under this
normal curve) for scores:
a. Between 60 and 65.
b. ≥ 80
c. Between 64 and 80.
d. If the minimum passing grade is 55, how many students fail
this exam?
e. What is the minimum score of the best 5%?
f. What is the maximum score of the 37.5% above the average?
g. What is the maximum score of the lowest-scoring 10% of students in
the class?
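The areas asked for in this example can be checked without a printed z table. The sketch below uses `NormalDist` from Python's standard `statistics` module (Python 3.8 or later). Reading part f as an area of 0.375 between the mean and the score is an interpretation, not something the example states outright.

```python
from statistics import NormalDist

scores = NormalDist(mu=69, sigma=10)   # mean and standard deviation from the example
n_students = 250

p_a = scores.cdf(65) - scores.cdf(60)       # a. between 60 and 65
p_b = 1 - scores.cdf(80)                    # b. 80 or more
p_c = scores.cdf(80) - scores.cdf(64)       # c. between 64 and 80
n_fail = n_students * scores.cdf(55)        # d. expected number scoring below 55
min_top5 = scores.inv_cdf(0.95)             # e. minimum score of the best 5%
# f. Assumed reading: area 0.375 between the mean and the score.
max_f = scores.inv_cdf(0.5 + 0.375)
max_low10 = scores.inv_cdf(0.10)            # g. maximum score of the lowest 10%

print(f"a={p_a:.4f} b={p_b:.4f} c={p_c:.4f} d={n_fail:.1f}")
print(f"e={min_top5:.2f} f={max_f:.2f} g={max_low10:.2f}")
```

Results may differ slightly from table-based answers, because the printed z table rounds z to two decimals.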
Example: 2
The minimum salary of the top 4% of employees is Rp
35 million. On the other hand, the maximum salary of the lowest
10% of employees is Rp 24 million. Assume that
the salaries of the employees are normally distributed.
Calculate the probability that an employee earns:
a. Rp 26 million or more.
4. PROBABILITY ( 5 QUESTIONS ):
1. INDEPENDENT TO CONDITIONAL PROB
2. BAYES THEOREM
3. BINOMIAL DISTRIBUTION
4. NORMAL DISTRIBUTION
5. BINOMIAL TO NORMAL
MID EXAM QUESTIONS
BINOMIAL DISTRIBUTION ( 2 QUEST)
NORMAL DISTRIBUTION ( 2 QUEST )
Non-Random sampling
Convenience sampling
Judgment sampling
Snowball sampling
HOW CAN SAMPLES BE USED TO MAKE ESTIMATES?
A SAMPLE IS A TOOL FOR INFERRING SOMETHING ABOUT A
POPULATION: SAMPLE STATISTICS ARE USED TO ESTIMATE
A CHARACTERISTIC OF THE
POPULATION (A PARAMETER).
$$ \bar{X} - t_{v}\,\frac{sd}{\sqrt{n}} \;\le\; \mu \;\le\; \bar{X} + t_{v}\,\frac{sd}{\sqrt{n}} $$
Level of Confidence
Required level of confidence    Value of α    Value from Z table
90%                             10%           ±1.65
95%                             5%            ±1.96
99%                             1%            ±2.58
Example: The prices at which a certain type of instant coffee was
being sold on a given day were collected from a random
sample of 45 shops around the country. The mean price was
$1.95 with a standard deviation of $0.27. Compute an 80%
confidence interval for the population mean.
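A short sketch of the instant-coffee interval. Because n = 45 is at least 30, it uses the normal (z) critical value rather than t; that choice is an assumption in line with the large-sample rule used elsewhere in these notes.

```python
from math import sqrt
from statistics import NormalDist

n, mean, sd = 45, 1.95, 0.27      # sample size, sample mean, sample SD
confidence = 0.80

# Two-sided critical value: the upper (1 - alpha/2) quantile of N(0, 1).
z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
margin = z * sd / sqrt(n)
lower, upper = mean - margin, mean + margin

print(f"z = {z:.3f}, interval = ({lower:.3f}, {upper:.3f})")
```

An 80% level is not in the level-of-confidence table, so its z value (about 1.28) has to be looked up or computed separately.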
3. A CONFIDENCE INTERVAL FOR A
PROPORTION OF POPULATION ( π )
THE POPULATION PROPORTION (π) CAN BE
ESTIMATED BY THE FORMULA:
P = X/n
P = THE FRACTION OR PERCENTAGE OF THE SAMPLE
HAVING A PARTICULAR TRAIT OF
INTEREST.
n = SAMPLE SIZE; X = THE NUMBER OF SAMPLE MEMBERS THAT
HAVE THE PARTICULAR TRAIT OF INTEREST.
CONFIDENCE INTERVAL FOR POPULATION PROPORTION
(π)
$$ P \pm Z_{v}\sqrt{\frac{P(1-P)}{n}} $$ (page 300, no. 18)
Application
The distribution of households that favor a certain bath soap in
West Java Province (use a 90% confidence level).
Brand           Number of households
Palmolive       1500
Lux             1800
Zest            1500
Beauty          1000
Minty           1200
Other brands    2000
A. Calculate the confidence interval for the proportion of households
that favor LUX bath soap for their families.
B. What sample size should be selected if the maximum
error is to be no more than 0.5%?
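A sketch of both parts of the bath-soap application. Using the sample proportion for Lux as the planning value in part B is an assumption; the question does not say which proportion to plug into the sample-size formula n = z² P(1 − P) / E².

```python
from math import sqrt, ceil
from statistics import NormalDist

households = {"Palmolive": 1500, "Lux": 1800, "Zest": 1500,
              "Beauty": 1000, "Minty": 1200, "Other brands": 2000}
n = sum(households.values())          # total sample size
p = households["Lux"] / n             # sample proportion favoring Lux

# A. 90% confidence interval for the population proportion.
z = NormalDist().inv_cdf(0.95)        # two-sided 90% level, about 1.645
margin = z * sqrt(p * (1 - p) / n)
lower, upper = p - margin, p + margin

# B. Sample size for a maximum error of 0.5 percentage points.
# Assumption: the sample proportion p is used as the planning value.
max_error = 0.005
n_required = ceil(z**2 * p * (1 - p) / max_error**2)

print(f"P = {p:.2f}, interval = ({lower:.4f}, {upper:.4f})")
print(f"required n = {n_required}")
```

With the rounded table value 1.65 instead of 1.645, the required sample size comes out somewhat larger.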
2. Confidence interval for the population mean (µ) when the
sample size is small (n < 30): use the t table (p. 722)
$$ (\bar{X}_1 - \bar{X}_2) \pm Z_{v}\,\sigma_{x} $$
σx = pooled standard deviation.
X̄1 = the sample mean of X1
X̄2 = the sample mean of X2
n1 = the sample size of X1
n2 = the sample size of X2
Example
A study was made to estimate the difference in salaries
of college professors in the private and state colleges
of Virginia. A random sample of 100 professors in the
private colleges showed an average of $15,000 per
month with a standard deviation of $1,200. A random
sample of 200 professors in state colleges showed an
average salary of $16,000 with a standard deviation
of $1,400. Find a 90% confidence interval for the
difference between the average salaries of professors
teaching in state and private colleges in Virginia.
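A sketch of the professors' salary interval. It takes the standard error of the difference as the square root of the summed variances of the two sample means, which is the usual large-sample form; whether this is exactly what the notes call σx is an assumption.

```python
from math import sqrt
from statistics import NormalDist

n1, mean1, sd1 = 100, 15000, 1200    # private colleges
n2, mean2, sd2 = 200, 16000, 1400    # state colleges

diff = mean2 - mean1                          # state minus private
se = sqrt(sd1**2 / n1 + sd2**2 / n2)          # standard error of the difference
z = NormalDist().inv_cdf(0.95)                # two-sided 90% level
lower, upper = diff - z * se, diff + z * se

print(f"difference = {diff}, se = {se:.2f}")
print(f"90% interval = ({lower:.2f}, {upper:.2f})")
```

The interval does not include zero, so at this confidence level the data point to higher average salaries in the state colleges.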
5. Estimating the difference between two population
means ( µ1- µ2 ), n< 30
$$ (\bar{X}_1 - \bar{X}_2) \pm t_{v}\,S_{P} $$
A taxi company is trying to decide whether to purchase brand A or brand B tires for its
fleet of taxis. To estimate the difference between the two brands, an experiment is conducted
using 12 tires of brand A and 14 tires of brand B. The results are:
                    Brand A          Brand B
Average wear-out    22,500 miles     23,600 miles
Std deviation       100 miles        50 miles
Compute a 95% confidence interval for the difference between the average wear-out of the
two brands.
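A sketch of the small-sample interval for the tire experiment, assuming equal population variances (pooled standard deviation). Python's standard library has no t distribution, so the critical value 2.064 (two-sided 95%, df = 24) is typed in from a t table. Reading "22.500" as 22,500 miles is also an assumption.

```python
from math import sqrt

# Assumption: the dot in "22.500" is a thousands separator.
n_a, mean_a, sd_a = 12, 22500, 100    # brand A
n_b, mean_b, sd_b = 14, 23600, 50     # brand B

df = n_a + n_b - 2
# Pooled standard deviation of the two samples.
sp = sqrt(((n_a - 1) * sd_a**2 + (n_b - 1) * sd_b**2) / df)
# Standard error of the difference between the two sample means.
se = sp * sqrt(1 / n_a + 1 / n_b)

t_crit = 2.064                        # t table, two-sided 95%, df = 24
diff = mean_b - mean_a
lower, upper = diff - t_crit * se, diff + t_crit * se

print(f"df = {df}, sp = {sp:.2f}, se = {se:.2f}")
print(f"95% interval = ({lower:.1f}, {upper:.1f}) miles")
```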
EXAMPLE: POPULATION PROPORTION
$$ \sum XY = a \sum X + b \sum X^{2} $$
Variable                                              Mean
G = economic growth rate (%)                          8
BR = routine expenditure absorption (Rp trillion)     45
BM = capital expenditure absorption (Rp trillion)     60
INF = inflation rate (%)                              7
INV = private investment absorption ($ billion)       30
n = number of observations = 28. t alpha, df (n - JV) = t 0.05, df 23 = ±2.069 (JV = number of variables).
F table: numerator = 4, denominator = 135, F table = 2.44.
Lag variable: BRt-1 contains a lag effect.
Ebr = dG/dBR x BR/G = 0.23 x 45/8 = 1.29 (elastic). If routine expenditure
absorption rises by 1%, the economic growth rate rises by 1.29%.
Ebm = dG/dBM x BM/G =
F table numerator = number of variables - 1 = 4. Denominator = JV (n - 1) = 5 (28 -
1) = 135.
A. Explain the meaning of the constant and the coefficients.
B. Calculate the elasticities and explain their meaning.
C. Explain the results of the t test, the F test and R-square.
D. Explain the result of the DW (autocorrelation) test for the
model.
Ho: the variables BR, BM, INF and INV jointly
have no significant effect on the economic growth
rate.
Ha: at least one of the independent variables above
has a significant effect on the economic growth rate.
Decision: Ho is rejected.
Conclusion: at least one of the
independent variables above has a significant
effect on the economic growth rate.
R-square = 0.85 = coefficient of determination.
Variation in the variables BR,
BM, INF and INV explains, or contributes,
85% of the variation
in the economic growth
rate, while the remaining 15% is explained by
other variables outside the model.
DW table: k = 4 (number of independent variables).
n = 28, dl = 1.1, du = 1.75. DW test = 1.98
            dl             du            4 - du           4 - dl
            1.1            1.75          2.25             2.9
Ho rejected | Inconclusive | Ho accepted | Inconclusive | Ho rejected
                             DW test = 1.98
Ho: there is no autocorrelation in the model.
Ha: there is autocorrelation in the model.
Ho is accepted, Ha is rejected. This means there is no sign of autocorrelation
(lag) in the model.
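The five decision regions of the DW table can be written as a small function. The function name and the returned labels are hypothetical, chosen only for this sketch; dl, du and the DW statistic are the values quoted for the model.

```python
def durbin_watson_decision(dw, dl, du):
    """Classify a DW statistic using the five regions of the DW table."""
    if dw < dl or dw > 4 - dl:
        return "reject Ho"        # autocorrelation is indicated
    if du <= dw <= 4 - du:
        return "accept Ho"        # no autocorrelation is indicated
    return "inconclusive"         # between dl and du, or between 4-du and 4-dl

decision = durbin_watson_decision(dw=1.98, dl=1.1, du=1.75)
print(decision)
```

With DW = 1.98 lying between du = 1.75 and 4 − du = 2.25, the statistic falls in the central region.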
Hypotheses Testing
Sales = f (promotion, Price, Pc )
TESTING HYPOTHESES ( CH: 10 P 116-521)
(6) Conclusion
1.Testing the Population Mean ( µ ), n ≥ 30
Example: Mr X, a mouthwash distributor, has stated that the
average cost to process a sales order is $13.25. Miss Y, the cost
controller, fears that the average cost of processing is more
than $13.25. She is interested in taking action if costs are high,
but she does not care if the actual average cost is below the
assumed value. A random sample of 100 orders had a sample
mean of $13.35; assume the std deviation is $0.50. Conduct
a test at the 5% level of significance. Can you conclude that
the average cost of a sales order is more than $13.25?
TEST STATISTIC FOR µ ( n ≥ 30)
1. Ho: µ ≤ $13.25
   Ha: µ > $13.25
4. TEST STATISTIC:
$$ Z_h = \frac{\bar{X} - \mu}{SD/\sqrt{n}} = \frac{13.35 - 13.25}{0.50/\sqrt{100}} = +2 $$
5. DECISION: Ho is rejected and Ha is accepted.
6. CONCLUSION: So we can conclude that the average cost
of a sales order is more than $13.25.
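The sales-order test can be reproduced in a few lines. This is a sketch of the one-tailed (right-tail) z test, with the critical value computed rather than read from the table.

```python
from math import sqrt
from statistics import NormalDist

mu0 = 13.25                      # hypothesized mean cost (Ho: mu <= 13.25)
n, mean, sd = 100, 13.35, 0.50   # sample size, sample mean, standard deviation
alpha = 0.05

z_h = (mean - mu0) / (sd / sqrt(n))        # test statistic
z_crit = NormalDist().inv_cdf(1 - alpha)   # right-tail critical value, about 1.645
reject_h0 = z_h > z_crit

print(f"Zh = {z_h:.2f}, critical value = {z_crit:.3f}, reject Ho: {reject_h0}")
```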
2. TEST STATISTIC FOR π
(POPULATION PROPORTION)
TEST STATISTIC:
$$ Z_h = \frac{P - \pi}{\sqrt{\pi(1-\pi)/n}} = -0.40 $$
(2). TESTING FOR A POPULATION PROPORTION (π)
RESEARCH AT THE FEB-UGM INDICATES THAT 50% OF THE
STUDENTS CHANGE THEIR MAJOR OF STUDY AFTER THEIR FIRST
YEAR IN A PROGRAM. A RANDOM SAMPLE OF 100 STUDENTS IN
THE BUSINESS PROGRAM REVEALED THAT 48% HAD CHANGED
THEIR MAJOR AREA OF STUDY AFTER THEIR FIRST YEAR OF THE
PROGRAM. Has there been a significant change in the proportion of students
who change their major after the first year study in this program? Test at the
0.10 level of significance.
Z table = ±1.65. Ho is accepted if Zh is greater than or equal to -1.65
and less than or equal to +1.65.
Ho is rejected if Zh is less than -1.65 or greater than +1.65.
Zh = -0.4. Decision: Ho is accepted, Ha is rejected.
Conclusion: the proportion of students who change
their major after one year of study has in fact not changed
significantly.
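A sketch of the two-tailed proportion test for the change-of-major example, using the hypothesized proportion π = 0.50 in the standard error.

```python
from math import sqrt
from statistics import NormalDist

pi0 = 0.50            # hypothesized population proportion
n, p = 100, 0.48      # sample size and sample proportion
alpha = 0.10

z_h = (p - pi0) / sqrt(pi0 * (1 - pi0) / n)      # test statistic
z_crit = NormalDist().inv_cdf(1 - alpha / 2)     # two-tailed critical value
reject_h0 = abs(z_h) > z_crit

print(f"Zh = {z_h:.2f}, critical value = ±{z_crit:.3f}, reject Ho: {reject_h0}")
```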
Example:2
Distribution of data after promoting Lux on television a few months ago.
Brand of bath soap    Number of households
Palmolive             1000
Lux                   1500
Zest                  1000
Beauty                750
Minty                 1250
Maya                  1000
Other brands          3500
Assumption: Before the promotion, the market share of Lux = 14%.
a. Can you conclude that the proportion of households that favor Lux is
different from 14%?
b. Can you conclude that the proportion of households that favor Lux is higher
than 14%? (Alpha = 15%).
5. Testing The Population Mean (µ ), n < 30
Test Statistic:
$$ t_h = \frac{\bar{X} - \mu}{SD/\sqrt{n}} = \frac{53.33 - 55}{2.16/\sqrt{6}} = -1.89 $$
Ho: accepted, Ha: rejected
Conclusion: the fuel consumption of the
motorcycle does not in fact differ from what the
manufacturer claims.
Example
The distribution of gasoline consumption of 6 samples of
Honda Astrea can be reported as follows:
Sample    Range (km/l)    Xi - X-bar    (Xi - X-bar)²
1         54              0.67          0.44
2         53              -0.33         0.108
3         56              2.67          7.128
4         52              -1.33         1.76
5         50              -3.33         11.08
6         55              1.67          2.78
X-bar = 53.33    Σ(Xi - X-bar)² = 23.28    SD = 2.16
The manufacturer claimed that the average gasoline
consumption was 55 km/l. Can you conclude that this claim is an
overestimate? (use alpha = 5%). Two-tailed: t alpha, df (n-1) = ±2.571
One-tailed: t table, t alpha, df (n-1) = t 0.05, df 5 = -2.015
Ho is accepted if th is greater than or equal to -2.015.
Ho is rejected if th is less than -2.015.
th = (53.33 - 55) / (2.16/√6) = -1.89
Decision: Ho is accepted, Ha is rejected.
Conclusion: the claimed average
fuel consumption of the motorcycle is
not in fact an overestimate. The manufacturer's
statement therefore holds.
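The Honda Astrea figures can be recomputed directly from the six observations. The one-tailed critical value -2.015 (alpha = 5%, df = 5) is taken from the t table, since Python's standard library has no t distribution.

```python
from math import sqrt
from statistics import mean, stdev

km_per_litre = [54, 53, 56, 52, 50, 55]   # the six sample observations
mu0 = 55                                  # manufacturer's claimed average
t_crit = -2.015                           # t table, one-tailed, alpha 0.05, df 5

n = len(km_per_litre)
x_bar = mean(km_per_litre)
sd = stdev(km_per_litre)                  # sample SD with divisor n - 1
t_h = (x_bar - mu0) / (sd / sqrt(n))      # test statistic
reject_h0 = t_h < t_crit                  # left-tail test

print(f"X-bar = {x_bar:.2f}, SD = {sd:.2f}, th = {t_h:.2f}, reject Ho: {reject_h0}")
```

Note that the deviation for sample 6 is positive (55 is above the mean of 53.33); the squared deviation is unaffected by the sign.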
Case Example
The regent (bupati) of a district states that the average
per capita income of its residents is $4500, with a
standard deviation of $600. A survey conducted
by BPS of 400 households shows an average
per capita income of $4600. Test
at alpha = 2%.
a. Can it be concluded that the average
per capita income of residents differs from $4500?
b. Can it be concluded that the average
per capita income of residents is higher than
$4500?
t test (n < 30)
The manufacturer of a supplement states
that the maximum average taurine content
of the product is 10.5 mg. A trial
by BPOM on 10 bottles
of the supplement shows
an average taurine content of 11 mg
with a variance of 9 mg. Test at
alpha = 10%: will BPOM
withdraw the product from circulation?
t alpha, df (n-1) = t 0.10, df 9.
t table = 1.383
Ho is accepted if th is less than or equal
to 1.383. Ho is rejected if th is greater than
1.383. th =
TESTING THE DIFFERENCE BETWEEN TWO
POPULATION MEANS
PARAMETRIC STATISTICS:
1. SPSS: PAIRED SAMPLES
2. EVIEWS: UNPAIRED SAMPLES:
EQUALITY OF VARIANCE TEST
3. EXCEL: DATA ANALYSIS
$$ S_d = \sqrt{\frac{P_c(1-P_c)}{n_1} + \frac{P_c(1-P_c)}{n_2}} = 0.054 $$
Z table = -1.65
Ho is accepted if Zh is greater than or equal
to -1.65. Ho is rejected if Zh is less
than -1.65. Zh = -2.22
Ho is rejected, Ha is accepted.
Conclusion: the proportion of teenage
women who like the perfume
is in fact smaller than the proportion of
adult women.
Z table = -1.65
Ho is accepted if Zh is greater than or equal to -1.65. Ho is rejected if
Zh is less than -1.65.
Zh = -2.22
Ho is rejected, Ha is accepted.
Conclusion: the proportion of teenage women who
like the perfume is indeed lower than the
proportion of adult women who like the Heavenly perfume.
Exercise
GPA data for MM-UGM students, 2019
                  MM-Jogya       MM-Jkt
Mean GPA          3.56           3.61
Std deviation     0.78           0.25
Sample size       32 students    35 students
Test at alpha = 4% whether the mean
GPA of MM-Jkt students is higher
than the mean GPA of MM-Jogya
students.
t table: t 0.01, df 26 = ±2.779
Ho is accepted if th is greater than or equal to -2.779
and less than or equal to +2.779. Ho is rejected
if th is less than -2.779 or greater than
+2.779.
th = +0.84, Ho is accepted, Ha is rejected.
Conclusion: the mean IQ of the
female students does not differ significantly from
the mean IQ of the male students.
6.TEST STATISTIC FOR (µ1 - µ2), n < 30
( USE- t TABLE )
TEST STATISTIC:
$$ t_h = \frac{\bar{X}_1 - \bar{X}_2}{\sqrt{\dfrac{(n_1-1)\,var_1 + (n_2-1)\,var_2}{n_1+n_2-2}\left(\dfrac{1}{n_1}+\dfrac{1}{n_2}\right)}} $$
th = +0.84. Ho is accepted, Ha is rejected.
So it can be concluded that the mean IQ of the
female students is not higher than the mean IQ of the
male students.
Testing (µ1 - µ2 ), n < 30
Dr. Dony, a psychologist, administered IQ tests to
determine if female FEB students were as smart as
male students. The random sample of 15 female
students had a mean score of 131 with std
deviation of 17. The random sample of 13 male
students had a mean of 126 and a std deviation of
14. At 0.01 level of significance:
a. is there a significant difference in their average
IQ? t table = +/- 2.779
b. Can you conclude that the average female
students IQ is greater than male students IQ?
t alpha, df (n1 + n2 - 2) = ±2.779
th = 0.84
Ho is accepted, Ha is rejected.
Conclusion: the mean IQ
of the female students does not differ
from the mean IQ of the male students.
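The value th = 0.84 can be reproduced from the summary statistics of the IQ example. The sketch uses the pooled-variance form of the test statistic and the two-tailed critical value ±2.779 quoted above for df = 26.

```python
from math import sqrt

n1, mean1, sd1 = 15, 131, 17     # female students
n2, mean2, sd2 = 13, 126, 14     # male students

df = n1 + n2 - 2
# Pooled variance: weighted average of the two sample variances.
pooled_var = ((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / df
t_h = (mean1 - mean2) / sqrt(pooled_var * (1 / n1 + 1 / n2))

t_crit = 2.779                   # t table, two-tailed, alpha 0.01, df 26
reject_h0 = abs(t_h) > t_crit

print(f"df = {df}, th = {t_h:.2f}, reject Ho: {reject_h0}")
```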
6. Dependent Samples test
Sample    Productivity of Workers
          After Training    Before Training    d     (d - d̄)    (d - d̄)²
1         235 units         228 units          7     2.4        5.76
2         210               205                5     0.4        0.16
3         231               219                12    7.4        54.76
4         242               240                2     -2.6       6.76
5         205               198                7     2.4        5.76
6         230               223                7     2.4        5.76
7         231               227                4     -0.6       0.36
8         210               215                -5    -9.6       92.16
9         225               222                3     -1.6       2.56
10        249               245                4     -0.6       0.36
Can you conclude that the training program changed the productivity of the
employees? (alpha = 5%).
Ho: µd = 0
Ha: µd ≠ 0
∑d = 46, d̄ = 46/10 = 4.6, SD = 4.4
$$ SD = \sqrt{\frac{\sum (d - \bar{d})^{2}}{n - 1}} $$
Test statistic:
$$ t_h = \frac{\bar{d}}{SD/\sqrt{n}} = +3.3 $$
Ho is rejected, Ha is accepted. So it can be concluded that average
worker productivity after the training changed
significantly compared with before the training.
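The dependent-samples test can be recomputed from the ten before/after pairs. The two-tailed critical value 2.262 (alpha = 5%, df = 9) is taken from a t table; it is not stated in the worked solution, so treat it as an added assumption.

```python
from math import sqrt
from statistics import mean, stdev

after = [235, 210, 231, 242, 205, 230, 231, 210, 225, 249]
before = [228, 205, 219, 240, 198, 223, 227, 215, 222, 245]

# Work on the paired differences d = after - before.
d = [a - b for a, b in zip(after, before)]
n = len(d)
d_bar = mean(d)
sd = stdev(d)                        # sample SD of the differences
t_h = d_bar / (sd / sqrt(n))         # test statistic

t_crit = 2.262                       # t table, two-tailed, alpha 0.05, df 9
reject_h0 = abs(t_h) > t_crit

print(f"sum d = {sum(d)}, d-bar = {d_bar:.1f}, SD = {sd:.2f}, th = {t_h:.2f}")
print(f"reject Ho: {reject_h0}")
```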
Example:2
Sample of customer    Score of: New menu    Old menu
1                     36                    35
2                     48                    46
3                     50                    51
4                     76                    74
5                     55                    55
6                     60                    59
7                     71                    72
8                     66                    64
9                     75                    76
Can you conclude that the new menu is more delicious than
the old one? (alpha = 1%).
7: CHI-SQUARE TEST
(NON-PARAMETRIC STATISTICS)
Chi-square tests are used in a procedure that
involves comparing
the sample frequencies of occurrence (Oij)
with the hypothetical or theoretical population
frequencies (Eij), the expected values
(goodness-of-fit test).
They can also be used to test the relationship between
variables (independence test).
The critical value of χ² depends on the number of
rows and columns.
Chi-square table
The chi-square test is always one-tailed, on the
right side of the curve.
The critical value is χ² α, df (r - 1)(c - 1).
Ho is accepted if the calculated test statistic is
less than or equal to the critical value.
Ho is rejected if the calculated test statistic is
greater than the critical value.
Chi-Square test
4. Test statistic:
$$ \chi^{2}_{h} = \sum \frac{(O_{ij} - E_{ij})^{2}}{E_{ij}} = 4.79 $$
Oij = observed frequency in the ith row and jth
column
Eij = expected frequency in the ith row and jth
column
EXAMPLE
A GARMENT COMPANY IN CAKUNG RECORDS THE
PERFORMANCE OF ITS LABOR PRODUCTIVITY
RANDOMLY.
WORK-SHIFT        THE LEVEL OF PRODUCTIVITY
                  LOW    MODERATE    HIGH
MORNING (I)       40     45          50
AFTERNOON (II)    60     55          60
NIGHT (III)       40     30          25
Based on this research, what is your conclusion?
(α = 5%). So it can be concluded that differences in
work shift do not significantly affect
worker productivity.
CHI-SQUARE: INDEPENDENCE TEST
THE DISTRIBUTION OF DATA COLLECTED BY UNILEVER INDONESIA
WAS RECORDED AS FOLLOWS:
Color of bath soap:    Pink    White    Yellow
Gender: Man            10      10       20
        Woman          25      20       5
What is your conclusion? (Use alpha = 5%)
E11 = 16, E12 = 13, E13 = 11
E21 = 19, E22 = 17, E23 = 14    Test stat = 18.5
Ho is accepted if the test statistic is less than or equal to 5.991.
Ho is rejected if the test statistic is greater than 5.991.
Test statistic: 36/16 + 9/13 + 81/11 + 36/19 + 9/17 + 81/14 = 18.51
Ho: rejected, Ha: accepted
Conclusion: men and women in fact
differ significantly in the color of soap
they prefer.
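The expected frequencies and the test statistic for the soap-color table can be computed as follows. The function name is hypothetical. Because the sketch keeps the expected frequencies unrounded, its statistic (about 17.87) differs slightly from the 18.51 obtained with expected values rounded to whole numbers; the decision is the same.

```python
def chi_square_statistic(observed):
    """Return (statistic, df) for a table of observed counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(observed):
        for j, o in enumerate(row):
            # Expected frequency: row total x column total / grand total.
            e = row_totals[i] * col_totals[j] / total
            stat += (o - e) ** 2 / e
    df = (len(observed) - 1) * (len(observed[0]) - 1)
    return stat, df

# Rows: man, woman. Columns: pink, white, yellow.
observed = [[10, 10, 20],
            [25, 20, 5]]
stat, df = chi_square_statistic(observed)

chi2_crit = 5.991                 # chi-square table, alpha 0.05, df 2
reject_h0 = stat > chi2_crit

print(f"chi-square = {stat:.2f}, df = {df}, reject Ho: {reject_h0}")
```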
ANALYSIS of VARIANCE
This method can be used to test the difference
among population means when the number of
samples is more than two (k > 2).
The steps of testing the hypothesis:
1. Calculate the variance within samples (σ²w).
2. Calculate the variance between (among) the
samples (σ²b).
3. Find the critical value in the F table.
4. Calculate the F statistic.
ANALYSIS OF VARIANCE
SAMPLE    FERTILIZER (kgs)
          A     B     C     D
1         10    12    11    9
2         12    11    10    10
3         13    10    9     8
4         11    10    10    8
5         14    12    10    10
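The ANOVA steps can be sketched on the fertilizer table as follows. The table comes without a stated question or significance level, so testing for a difference in mean yield at alpha = 5% is an assumption, and the critical value 3.24 (df 3 and 16) is typed in from an F table.

```python
from statistics import mean

# Yields (kgs) for five samples under each fertilizer.
yields = {
    "A": [10, 12, 13, 11, 14],
    "B": [12, 11, 10, 10, 12],
    "C": [11, 10, 9, 10, 10],
    "D": [9, 10, 8, 8, 10],
}
k = len(yields)                                    # number of groups
n_total = sum(len(v) for v in yields.values())     # total observations
grand_mean = mean(x for v in yields.values() for x in v)

# Step 1: variance within samples.
ss_within = sum((x - mean(v)) ** 2 for v in yields.values() for x in v)
var_within = ss_within / (n_total - k)

# Step 2: variance between samples.
ss_between = sum(len(v) * (mean(v) - grand_mean) ** 2 for v in yields.values())
var_between = ss_between / (k - 1)

# Steps 3 and 4: critical value and F statistic.
f_crit = 3.24          # F table, assumed alpha 0.05, df (3, 16)
f_stat = var_between / var_within

print(f"variance between = {var_between:.2f}, variance within = {var_within:.2f}")
print(f"F = {f_stat:.2f}, reject Ho: {f_stat > f_crit}")
```

Under these assumptions the F statistic exceeds the critical value, which would indicate that at least one fertilizer has a different mean yield.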