DJ 14 Ai&ds 3
DJ 14 Ai&ds 3
“ process of taking all the aspects of life and turning then into
Quantified data”.
Once we datafy things , we can’t transform their purposes and
then the into new forms of value.
Applications:
HRM- human resource mgnt
CRM – customer relationship mgnt
Banking
Insurance agency
Copmmercial Real Estate .
For example,
Descriptive :
It is represented general property of data .
It is used to summarize data in the form of names of
the variables, no of variables, no of samples.
It can be represented in numerical quantitative ,
categoricals nominal data, discrete.
Inferential probability distribution:
It is used to predict the future of the sample of
the population.
Supervisied model
Linear regression model
Logistic regression model
Polynomial regression
Classification models
Decision tree model
Bayes model
Navie bayes model
K nearest model
Knn model
Unsuperviused model
Clustering model.
Statistical models are used to make predictions or draw
conclusions.
These predictions are based on how two random variables are
connected.
The models will show a relationship between a variables.
For example
Age Weight
10 25
20 35
35 55
40 65
Weight = Ao +A1(age)+sigma
Y=mx+c
Y= response dependent
m= slope of the variable
x= predicted variable
Measure of Dispersion:
Range: range is the function of setting the value of min – max values
interval.
Variance:The variance measures how far the data points are spread
out from the average value.
follows:
Skewness: It describes how much statistical data distribution is
asymmetrical from the normal distribution, where distribution is
equally divided on each side
If the skewness is between -0.5 and 0.5, the data are fairly
symmetrical.
If the skewness is between -1 and – 0.5 or between 0.5 and 1,
the data are moderately skewed.
If the skewness is less than -1 or greater than 1, the data are
highly skewed.
Photo.
Neural network:
Normal distribution
Bernoulli distribution
Uniform distribution
Binomial distribution
Possion distribution
Exponential
Standard normal
For example, tossing a coin can only yield two outcomes heads or
tails.
The probability of getting head for a single unbiased coin toss will
be p=0.5 as there is an equal chance of getting a result.
Uniform Distribution:
Poisson Distribution:
given by
λ = Poisson parameter
Binomial distribution:
Probability of getting x successor states out of one trail.
Exponential : the exponential distribution is a continuous
probability distribution.
That often concerns the amount of time until some specific event
happens.
FOR example ,
Step 3: As this is a one-tailed test, αα = 100% - 95% = 5%. This can be used to
determine the critical value.
0.95 gives the required area under the curve. Now using a normal distribution
table, the area 0.95 is at z = 1.645. A similar process can be followed for a t-test.
The only additional requirement is to calculate the degrees of freedom given by
n - 1.
Step 4: Calculate the z test statistic. This is because the sample size is 30.
Furthermore, the sample and population means are known along with the
standard deviation.
z = ¯¯¯x−μσ√nx¯−μσn.
z = 112.5−10015√30112.5−1001530 = 4.56
Step 5: Conclusion. As 4.56 > 1.645 thus, the null hypothesis can be rejected.