Distribution Weibull Fitting
Distribution Weibull Fitting
com
Chapter 550
Introduction
This procedure estimates the parameters of the exponential, extreme value, logistic, log-logistic, lognormal,
normal, and Weibull probability distributions by maximum likelihood. It can fit complete, right censored, left
censored, interval censored (readout), and grouped data values. It also computes the nonparametric
Kaplan-Meier and Nelson-Aalen estimates of survival and associated hazard rates. It outputs various
statistics and graphs that are useful in reliability and survival analysis. When the choice of the probability
distribution is in doubt, the procedure helps select an appropriate probability distribution from those
available.
Features of this procedure include:
1. Probability plotting, hazard plotting, and reliability plotting for the common life distributions. The
data may be any combination of complete, right censored, left censored, and interval censored data.
2. Maximum likelihood and probability plot estimates of distribution parameters, percentiles, reliability
(survival) functions, hazard rates, and hazard functions.
3. Confidence intervals for distribution parameters and percentiles.
4. Nonparametric estimates of survival using the Kaplan-Meier procedure.
550-1
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Let T be the elapsed time until the occurrence of a specified event. The event may be death, occurrence of a
disease, disappearance of a disease, appearance of a tumor, etc. The probability distribution of T may be
specified using one of the following basic functions. Once one of these functions has been specified, the
others may be derived using the mathematical relationships presented.
1. Probability density function, f(t). This is the probability that an event occurs at time t.
2. Cumulative distribution function, F(t). This is the probability that an individual survives until time t.
𝑡𝑡
𝐹𝐹(𝑡𝑡) = � 𝑓𝑓(𝑥𝑥)𝑑𝑑𝑑𝑑
0
3. Survival function, S(t) or Reliability function, R(t). This is the probability that an individual survives
beyond time t. This is usually the first quantity that is studied. It may be estimated using the
nonparametric Kaplan-Meier curve or one of the parametric distribution functions.
𝑡𝑡
4. Hazard rate, h(t). This is the probability that an individual at time t experiences the event in the next
instant. It is a fundamental quantity in survival analysis. It is also known as the conditional failure
rate in reliability, the force of mortality in demography, the intensity function in stochastic
processes, the age-specific failure rate in epidemiology, and the inverse of Mill’s ratio in economics.
The empirical hazard rate may be used to identify the appropriate probability distribution of a
particular mechanism, since each distribution has a different hazard rate function. Some
distributions have a hazard rate that decreases with time, others have a hazard rate that increases
with time, some are constant, and some exhibit all three behaviors at different points in time.
𝑓𝑓(𝑡𝑡)
ℎ(𝑡𝑡) =
𝑆𝑆(𝑡𝑡)
𝑡𝑡
550-2
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
1
if 𝑡𝑡 < 𝑡𝑡1
𝑑𝑑
𝑆𝑆̂(𝑡𝑡) = �� �1 − 𝑖𝑖 � if 𝑡𝑡 ≤ 𝑡𝑡
1
𝑌𝑌𝑖𝑖
𝑡𝑡1 ≤𝑡𝑡
In the above equation, 𝑑𝑑𝑖𝑖 represents the number of deaths at time 𝑡𝑡𝑖𝑖 and 𝑌𝑌𝑖𝑖 represents the number of
individuals who are at risk at time 𝑡𝑡𝑖𝑖 .
The variance of S(t) is estimated by Greenwood’s formula
𝑑𝑑𝑖𝑖
𝑉𝑉� �𝑆𝑆̂(𝑡𝑡)� = 𝑆𝑆̂(𝑡𝑡)2 �
𝑌𝑌𝑖𝑖 (𝑌𝑌𝑖𝑖 − 𝑑𝑑𝑖𝑖 )
𝑡𝑡𝑖𝑖 <𝑡𝑡
The product limit estimator may be used to estimate the cumulative hazard function H(t) using the
relationship
� (𝑡𝑡) = − log�𝑆𝑆̂(𝑡𝑡)�
𝐻𝐻
where
𝑉𝑉� �𝑆𝑆̂(𝑡𝑡)�
𝜎𝜎𝑆𝑆2 (𝑡𝑡) =
𝑆𝑆̂ 2 (𝑡𝑡)
and z is the appropriate value from the standard normal distribution. We call this the Linear
(Greenwood) confidence interval.
550-3
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
𝑆𝑆̂(𝑡𝑡)1⁄𝜃𝜃 , 𝑆𝑆̂(𝑡𝑡)𝜃𝜃
where
1⁄2
𝑆𝑆̂(𝑡𝑡)
2
sin �max �0, arcsin (𝑆𝑆 ̂ (𝑡𝑡)1⁄2 − 0.5𝑧𝑧1−𝛼𝛼⁄2 𝜎𝜎𝑆𝑆 (𝑡𝑡) � � �� ≤ 𝑆𝑆(𝑡𝑡)
1 − 𝑆𝑆̂(𝑡𝑡)
1⁄2
𝜋𝜋 𝑆𝑆̂(𝑡𝑡)
≤ sin �min � , arcsin(𝑆𝑆̂(𝑡𝑡) + 0.5𝑧𝑧1−𝛼𝛼⁄2 𝜎𝜎𝑆𝑆 (𝑡𝑡) �
2 1 ⁄2
� ��
2 1 − 𝑆𝑆̂(𝑡𝑡)
This hazard function may be used to generate the Nelson-Aalen estimator of S(t) using the formula
𝑆𝑆̃(𝑡𝑡) = 𝑒𝑒 𝐻𝐻�(𝑡𝑡)
Using these formulas, a fourth set of confidence limits for S(t) may be calculated as
550-4
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Normal Distribution
The normal distribution is one of the most commonly used in statistics. However, it is used infrequently as a
lifetime distribution because it allows negative values while lifetimes are always positive. It has been found
that the logarithms of failure times may be fit by the normal distribution. Hence the lognormal has become
a popular distribution in reliability work, while the normal has been put on the sideline.
The normal distribution is indexed by a location (M) and a scale (S) parameter. A threshold parameter is
meaningless in this case, since it is an adjustment to the location parameter. Using these symbols, the
normal density function may be written as
1 1 𝑡𝑡−𝑀𝑀 2
− � �
𝑓𝑓(𝑡𝑡|𝑀𝑀, 𝑆𝑆) = 𝑒𝑒 2 𝑆𝑆 , − ∞ < 𝑀𝑀 < ∞, 𝑆𝑆 > 0, −∞ < 𝑡𝑡 < ∞
𝑆𝑆√2𝜋𝜋
Location Parameter - M
The location parameter of the normal distribution is often called the mean.
Scale Parameter - S
The scale parameter of the normal distribution is usually called the standard deviation.
Lognormal Distribution
The normal distribution is one of the most commonly used in statistics. Although the normal distribution
itself does not often work well with time-to-failure data, it has been found that the logarithm of failure time
often does. Hence the lognormal has become a popular distribution in reliability work.
The lognormal distribution is indexed by a shape (S) , a scale (M), and a threshold (D) parameter. Using these
symbols, the three parameter lognormal density function may be written as
2
1 1 ln(𝑡𝑡−𝐷𝐷)−𝑀𝑀
− � �
𝑓𝑓(𝑡𝑡|𝑀𝑀, 𝑆𝑆, 𝐷𝐷) = 𝑒𝑒 2 𝑆𝑆 , −∞ < 𝑀𝑀 < ∞, 𝑆𝑆 > 0, −∞ < 𝐷𝐷 < ∞, 𝑡𝑡 > 𝐷𝐷
(𝑡𝑡 − 𝐷𝐷)𝑆𝑆√2𝜋𝜋
It is often more convenient to work with logarithms to the base 10 (denoted by log) rather than logarithms
to the base e (denoted by ln). The lognormal10 density function is written as
2
1 1 log(𝑡𝑡−𝐷𝐷)−𝑀𝑀
− � �
𝑓𝑓(𝑡𝑡|𝑀𝑀, 𝑆𝑆, 𝐷𝐷) = 𝑒𝑒 2 𝑆𝑆 , −∞ < 𝑀𝑀 < ∞, 𝑆𝑆 > 0, −∞ < 𝐷𝐷 < ∞, 𝑡𝑡 > 𝐷𝐷
ln(10)(𝑡𝑡 − 𝐷𝐷)𝑆𝑆√2𝜋𝜋
550-5
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Shape Parameter - S
The shape parameter of the lognormal distribution of t is the standard deviation in the normal distribution
of ln(t-D) or log(t-D). That is, the scale parameter of the normal distribution is the shape parameter of the
lognormal distribution.
Scale Parameter - M
The scale parameter of the lognormal distribution t is the mean in the normal distribution of ln(t-D) or log(t-
D). That is, the location parameter of the normal distribution is the scale parameter of the lognormal
distribution.
Threshold Parameter - D
The threshold parameter is the minimum value of the random variable t. When D is set to zero, we obtain
the two parameter lognormal distribution. When a positive value is given to D, we are inferring that no
failures can occur between zero and D.
Reliability Function
The reliability (or survivorship) function, R(t), gives the probability of surviving beyond time t. For the
Lognormal distribution, the reliability function is
ln(𝑡𝑡 − 𝐷𝐷) − 𝑀𝑀
𝑅𝑅(𝑡𝑡) = 1 − Φ � �
𝑆𝑆
The conditional reliability function, R(t,T), may also be of interest. This is the reliability of an item given that it
has not failed by time T. The formula for the conditional reliability is
𝑅𝑅(𝑇𝑇 + 𝑡𝑡)
𝑅𝑅(𝑡𝑡) =
𝑅𝑅(𝑇𝑇)
Hazard Function
The hazard function represents the instantaneous failure rate. For this distribution, the hazard function is
𝑓𝑓(𝑡𝑡)
ℎ(𝑡𝑡) =
𝑅𝑅(𝑡𝑡)
550-6
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Weibull Distribution
The Weibull distribution is named for Professor Waloddi Weibull whose papers led to the wide use of the
distribution. He demonstrated that the Weibull distribution fit many different datasets and gave good
results, even for small samples. The Weibull distribution has found wide use in industrial fields where it is
used to model time to failure data.
The three parameter Weibull distribution is indexed by a shape (B), a scale (C), and a threshold (D)
parameter. Using these symbols, the three parameter Weibull density function may be written as
𝐵𝐵 𝑡𝑡 − 𝐷𝐷 (𝐵𝐵−1) −�𝑡𝑡−𝐷𝐷�𝐵𝐵
𝑓𝑓(𝑡𝑡|𝐵𝐵, 𝐶𝐶, 𝐷𝐷) = � � 𝑒𝑒 𝐶𝐶 , 𝐵𝐵 > 0, 𝐶𝐶 > 0, −∞ < 𝐷𝐷 < ∞, 𝑡𝑡 > 𝐷𝐷.
C 𝐶𝐶
The symbol t represents the random variable (usually elapsed time). The threshold parameter D represents
the minimum value of t that can occur. Setting the threshold to zero results in the common, two parameter
Weibull distribution.
Shape Parameter - B
The shape (or power) parameter controls the overall shape of the density function. Typically, this value
ranges between 0.5 and 8.0. The estimated standard errors and confidence limits displayed by the program
are only valid when B > 2.0.
One of the reasons for the popularity of the Weibull distribution is that it includes other useful distributions
as special cases or close approximations. For example, if
B=1 The Weibull distribution is identical to the exponential distribution.
B=2 The Weibull distribution is identical to the Rayleigh distribution.
B = 2.5 The Weibull distribution approximates the lognormal distribution.
B = 3.6 The Weibull distribution approximates the normal distribution.
Scale Parameter - C
The scale parameter only changes the scale of the density function along the time axis. Hence, a change in
this parameter has the same effect on the distribution as a change in the scale of time—for example, from
days to months or from hours to days. However, it does not change the actual shape of the distribution.
C is known as the characteristic life. No matter what the shape, 63.2% of the population fails by t = C+D.
Some authors use 1/C instead of C as the scale parameter. Although this is arbitrary, we prefer dividing by
the scale parameter since that is how you usually scale a set of numbers. For example, remember how you
create a z-score when dealing with the normal data or create a percentage by dividing by the maximum.
Threshold Parameter - D
The threshold parameter is the minimum value of the random variable t. Often, this parameter is referred
to as the location parameter. We use ‘threshold’ rather than ‘location’ to stress that this parameter sets the
minimum time. We reserve ‘location’ to represent the center of the distribution. This is a fine point and we
are not upset when people refer to this as the location parameter.
550-7
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
When D is set to zero, we obtain the two parameter Weibull distribution. It is possible, but unusual, for D to
have a negative value. When using a search algorithm to find the estimated value of D, a nonzero value will
almost certainly be found. However, you should decide physically if a zero probability of failure in the
interval between 0 and D is truly warranted.
A downward or upward sloping tail on the Weibull probability plot or values of B > 6.0 are indications that a
nonzero threshold parameter will produce a better fit to your data.
Negative values of D represent an amount of time that has been subtracted from the actual times. On the
other hand, positive values of D represent a period of time between the starting point and when any failures
can occur. For example, positive values of D may represent the amount of time between the production of
an item and when it is placed in service.
Reliability Function
The reliability (or survivorship) function, R(t), gives the probability of surviving beyond time t. For the Weibull
distribution, the reliability function is
𝑡𝑡−𝐷𝐷 𝐵𝐵
−� �
𝑅𝑅(𝑡𝑡) = 𝑒𝑒 𝐶𝐶
The reliability function is one minus the cumulative distribution function. That is,
𝑅𝑅(𝑡𝑡) = 1 − 𝐹𝐹(𝑡𝑡)
The conditional reliability function, R(t,T), may also be of interest. This is the reliability of an item given that it
has not failed by time T. The formula for the conditional reliability is
𝑅𝑅(𝑇𝑇 + 𝑡𝑡)
𝑅𝑅(𝑡𝑡) =
𝑅𝑅(𝑇𝑇)
Hazard Function
The hazard function represents the instantaneous failure rate. For this distribution, the hazard function is
𝑓𝑓(𝑡𝑡) 𝐵𝐵 𝑡𝑡 − 𝐷𝐷 𝐵𝐵−1
ℎ(𝑡𝑡) = = � �
𝑅𝑅(𝑡𝑡) 𝐶𝐶 𝐶𝐶
Depending on the values of the distribution’s parameters, the Weibull’s hazard function can be decreasing
(when B<1), constant (when B=1 at 1/C), or increasing (when B>1) over time.
550-8
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
1 𝑡𝑡 − 𝑀𝑀 𝑡𝑡 − 𝑀𝑀
𝑓𝑓(𝑡𝑡|𝑀𝑀, 𝑆𝑆) = exp � � exp �−exp � �� , 𝑆𝑆 > 0
𝑆𝑆 𝑆𝑆 𝑆𝑆
Exponential Distribution
The exponential distribution was one of the first distributions used to model lifetime data. It has now been
superceded by the Weibull distribution, but is still used occasionally. The exponential distribution may be
found from the Weibull distribution by setting B = 1.
The exponential distribution is a model for the life of products with a constant failure rate. The two
parameter exponential distribution is indexed by both a scale and a threshold parameter. The density of the
exponential distribution may be written as
1 𝑡𝑡 − 𝐷𝐷
𝑓𝑓(𝑡𝑡|𝑆𝑆, 𝐷𝐷) = exp �− �, 𝑆𝑆 > 0
𝑆𝑆 𝑆𝑆
Scale Parameter - S
The scale parameter changes the scale of the density function along the time axis. Hence, a change in this
parameter has the same effect on the distribution as a change in the scale of time—for example, from days
to months or from hours to days. However, it does not change the actual shape of the distribution.
Some authors use 1/S instead of S as the scale parameter. Although this is arbitrary, we prefer dividing by
the scale parameter since that is how a set of numbers is usually scaled. For example, remember how z-
scores are created when dealing with the normal distribution.
Threshold Parameter - D
The threshold parameter is the minimum value of the random variable t. Often, this parameter is referred
to as the location parameter. We use ‘threshold’ rather than ‘location’ to stress that this parameter sets the
minimum time. We reserve ‘location’ to represent the center of the distribution. This is a fine point and we
are not upset when people refer to this as the location parameter.
When D is set to zero, we obtain the two parameter exponential distribution. It is possible, but unusual, for
D to have a negative value. When using a search algorithm to find the estimated value of D, a nonzero value
will almost certainly be found. However, you should decide physically if a zero probability of failure in the
interval between 0 and D is truly warranted.
A downward or upward sloping tail on the exponential probability plot is an indication that a nonzero
threshold parameter will produce a better fit to your data.
550-9
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Negative values of D represent an amount of time that has been subtracted from the actual times. On the
other hand, positive values of D represent a period of time between the starting point and when any failures
can occur. For example, positive values of D may represent the amount of time between the production of
an item and when it is placed in service.
Reliability Function
The reliability (or survivorship) function, R(t), gives the probability of surviving beyond time t. For the
exponential distribution, the reliability function is
𝑡𝑡−𝐷𝐷
−� �
𝑅𝑅(𝑡𝑡) = 𝑒𝑒 𝑆𝑆
Hazard Function
The hazard function represents the instantaneous failure rate. For this distribution, the hazard function is
constant. The rate is given by the function
𝑓𝑓(𝑡𝑡) 1
ℎ(𝑡𝑡) = =
𝑅𝑅(𝑡𝑡) 𝑆𝑆
Logistic Distribution
The density of the logistic distribution may be written as
𝑡𝑡 − 𝑀𝑀
exp � 𝑆𝑆 �
𝑓𝑓(𝑡𝑡|𝑀𝑀, 𝑆𝑆) = , 𝑆𝑆 > 0
𝑡𝑡 − 𝑀𝑀 2
𝑆𝑆 �1 + exp � 𝑆𝑆 ��
Log-logistic Distribution
The density of the log-logistic distribution may be written as
ln(𝑡𝑡 − 𝐷𝐷) − 𝑀𝑀
exp � 𝑆𝑆
�
𝑓𝑓(𝑡𝑡|𝑀𝑀, 𝑆𝑆, 𝐷𝐷) = 2, 𝑆𝑆 > 0
ln(𝑡𝑡 − 𝐷𝐷) − 𝑀𝑀
(𝑡𝑡 − 𝐷𝐷)𝑆𝑆 �1 + exp � ��
𝑆𝑆
The log-logistic distribution is used occasionally to model lifetime data, but it is so similar to the lognormal
and the Weibull distributions that it adds little and is thus often dropped from consideration.
550-10
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Parameter Estimation
The parameters of the reliability distributions may be estimated by maximum likelihood or by applying least
squares regression to the probability plot. The probability plot method is popular because it uses a nice
graphic (the probability plot) which allows a visual analysis of the goodness of fit of the distribution to the
data. Maximum likelihood estimation is usually favored by statisticians because it has been shown to be
optimum in most situations and because it provides estimates of standard errors and confidence limits.
However, there are situations in which maximum likelihood does not do as well as the regression approach.
For example, maximum likelihood does not do a good job of estimating the threshold parameter. When you
want to include the threshold parameter in your model, we suggest that you use the regression approach to
estimate it and then treat the threshold value as a known quantity in the maximum likelihood estimation.
Maximum Likelihood
Maximum likelihood estimation consists of finding the values of the distribution parameters that maximize
the log-likelihood of the data values. Loosely speaking, these are the values of the parameters which
maximize the probability that the current set of data values occur.
The general form of the log-likelihood function is given by
𝐿𝐿�𝑃𝑃� = � ln �𝑓𝑓�𝑃𝑃, 𝑡𝑡𝑘𝑘 �� + � ln �𝑆𝑆�𝑃𝑃, 𝑡𝑡𝑘𝑘 �� + � ln �𝐹𝐹�𝑃𝑃, 𝑡𝑡𝑘𝑘 �� + � ln �𝑓𝑓�𝑃𝑃, 𝑡𝑡𝑢𝑢𝑢𝑢 � − 𝑓𝑓�𝑃𝑃, 𝑡𝑡𝑙𝑙𝑙𝑙 ��
𝐹𝐹 𝑅𝑅 𝐿𝐿 𝐼𝐼
where F stands for the set of failed items, R stands for the set of right censored items, L stands for the set of
left censored items, and I stands for the set of interval censored items. In the case of interval censored
observations, 𝑡𝑡𝑙𝑙𝑙𝑙 represents the first time of the interval and 𝑡𝑡𝑢𝑢𝑢𝑢 represents the last time of the interval.
Also, P represents one or two parameters as the case may be.
L(P) is maximized using two numerical procedures. First, a recently developed method called differential
evolution is used. This is a robust maximization procedure that only requires evaluations of the function, but
not its derivatives. The solution of the differential evolution phase is then used as starting values for the
Newton-Raphson algorithm. Newton-Raphson is used because it provides an estimate of the variance-
covariance matrix of the parameters, which is needed in computing confidence limits. The amount of
emphasis on the differential evolution phase as opposed to the Newton-Raphson phase may be controlled
using the maximum number of iterations allowed for each. Numerical differentiation is used to compute the
first and second order derivatives that are needed by the Newton-Raphson procedure.
550-11
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
𝑗𝑗 − 0.3
𝐹𝐹�𝑡𝑡𝑗𝑗 � =
𝑛𝑛 + 0.4
1
𝐹𝐹�𝑡𝑡𝑗𝑗 � =
𝑛𝑛 − 𝑗𝑗 + 1
1+� � 𝐹𝐹0.5,2(𝑛𝑛−𝑗𝑗+1),2𝑗𝑗
𝑗𝑗
𝑗𝑗
𝐹𝐹�𝑡𝑡𝑗𝑗 � =
𝑛𝑛 + 1
𝑗𝑗 − 3⁄8
𝐹𝐹�𝑡𝑡𝑗𝑗 � =
𝑛𝑛 + 1⁄4
• F(tⱼ) = [j - 0.5]/n
The following formula is sometimes used
𝑗𝑗 − 0.5
𝐹𝐹�𝑡𝑡𝑗𝑗 � =
𝑛𝑛
550-12
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Data Structure
Survival data is somewhat more difficult to enter because of the presence of various types of censoring.
Failed or Complete
A failed observation is one in which the time until the terminal event was measured exactly; for example,
the machine stopped working or the mouse died of the disease being studied.
Right Censored
A right censored observation provides a lower bound for the actual failure time. All that is known is that the
failure occurred (or will occur) at some point after the given time value. Right censored observations occur
when a study is terminated before all items have failed. They also occur when an item fails due to an event
other than the one of interest.
Left Censored
A left censored observation provides an upper bound for the actual failure time. All we know is that the
failure occurred at some point before the time value. Left censoring occurs when the items are not checked
for failure until sometime after the study has begun. When a failed item is found, we do not know exactly
when it failed, only that it was at some point before the left censor time.
Sample Dataset
Most data sets require two (and often three) variables: the failure time variable, an optional censor variable
indicating the type of censoring, and an optional count variable which gives the number of items occurring
at that time. If the censor variable is omitted, all time values represent failed items. If the count variable is
omitted, all counts are assumed to be one.
The table below shows the results of a study to test the failure rate of a particular machine. This particular
experiment began with 30 items being tested. After the twelfth item failed at 152.7 hours, the experiment
was stopped. The remaining eighteen observations were right censored. That is, we know that they will fail
at some time in the future. These data are contained in the WEIBULL database.
550-13
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Weibull Dataset
550-14
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Setup
To run this example, complete the following steps:
Variables Tab
_________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
Options Tab
_________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
Plots Tab
_________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
550-15
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Data Summary
Data Summary
────────────────────────────────────────────────────────────────────────
Type of
Observation Rows Count Percent (%) Minimum Maximum
────────────────────────────────────────────────────────────────────────────────────────────────────
Failed 12 12 40% 12.5 152.7
Right Censored 1 18 60% 152.7 152.7
Left Censored 0 0
Interval Censored 0 0
Total 13 30 100% 12.5 152.7
────────────────────────────────────────────────────────────────────────
This report displays a summary of the amount of data that were analyzed. Scan this report to determine if
there were any obvious data errors by double checking the counts and the minimum and maximum.
Parameter Estimation
Estimation Details
──────────────────────────────────────────────────
Differential Evolution Iterations 39
Newton-Raphson Restart 1
Newton-Raphson Iterations 10
User-Entered Random Seed 4002922
──────────────────────────────────────────────────
────────────────────────────────────────────────────────────────────────
* Probability plot estimates were generated with F(t) calculated using the approximate median and using the model Time = A +
B(F).
This report displays parameter estimates along with standard errors and confidence limits for the maximum
likelihood estimates. In this example, we have set the threshold parameter to zero so we are fitting the two-
parameter Weibull distribution.
550-16
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
𝑡𝑡−𝐷𝐷 𝐵𝐵
−� �
𝐹𝐹(𝑡𝑡) = 1 − 𝑒𝑒 𝐶𝐶
This is now in a linear form. If we let y = ln(-ln(1-F(t))) and x = ln(t), the above equation becomes
𝑦𝑦 = −𝐵𝐵[ln(𝐶𝐶)] + 𝐵𝐵𝐵𝐵
Using simple linear regression, we can estimate the intercept and slope. Using these estimates, we obtain
estimates of the Weibull parameters B and C as
𝐵𝐵 = slope
and
−intercept
𝐶𝐶 = exp � �
𝐵𝐵
We assumed that D was zero. If D is not zero, it is treated as a known value and subtracted from the time
values before the above regression is calculated.
𝑡𝑡−𝐷𝐷 𝐵𝐵
−� �
𝐹𝐹(𝑡𝑡) = 1 − 𝑒𝑒 𝐶𝐶
1
� � ln �−ln�1 − 𝐹𝐹(𝑡𝑡)�� + ln(𝐶𝐶) = ln(𝑡𝑡)
𝐵𝐵
This is now in a linear form. If we let x = ln(-ln(1-F(t))) and y = ln(t), the above equation becomes
1
𝑦𝑦 = 𝑥𝑥 + ln(𝐶𝐶)
𝐵𝐵
550-17
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Using simple linear regression, we can estimate the intercept and slope. Using these estimates, we obtain
estimates of the Weibull parameters B and C as
1
𝐵𝐵 =
slope
and
𝐶𝐶 = exp(intercept)
Parameter estimates for the other distributions are found in a similar manner.
𝐵𝐵�
𝐵𝐵�𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙,1−𝛼𝛼⁄2 =
𝑧𝑧1−𝛼𝛼⁄2 �𝑣𝑣𝑣𝑣1,1
exp � �
𝐵𝐵�
𝑧𝑧1−𝛼𝛼⁄2 �𝑣𝑣𝑣𝑣1,1
𝐵𝐵�𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢,1−𝛼𝛼⁄2 = 𝐵𝐵�exp � �
𝐵𝐵�
In the case of the Weibull distribution, the confidence limits for C are
𝐶𝐶̂
𝐶𝐶̂𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙,1−𝛼𝛼⁄2 =
𝑧𝑧1−𝛼𝛼⁄2 �𝑣𝑣𝑣𝑣2,2
exp � �
𝐶𝐶̂
𝑧𝑧1−𝛼𝛼⁄2 �𝑣𝑣𝑣𝑣2,2
𝐶𝐶̂𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢,1−𝛼𝛼⁄2 = 𝐶𝐶̂ exp � �
𝐶𝐶̂
In the case of all other distributions, the confidence limits for M are
�𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙,1−𝛼𝛼⁄2 = 𝑀𝑀
𝑀𝑀 � − 𝑧𝑧1−𝛼𝛼⁄2 �𝑣𝑣𝑣𝑣1,1
�𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢,1−𝛼𝛼⁄2 = 𝑀𝑀
𝑀𝑀 � + 𝑧𝑧1−𝛼𝛼⁄2 �𝑣𝑣𝑣𝑣1,1
550-18
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
In the case of all other distributions, the confidence limits for S are
𝑆𝑆̂
𝑆𝑆̂𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙,1−𝛼𝛼⁄2 =
𝑧𝑧1−𝛼𝛼⁄2 �𝑣𝑣𝑣𝑣2,2
exp � �
𝑆𝑆̂
𝑧𝑧1−𝛼𝛼⁄2 �𝑣𝑣𝑣𝑣2,2
𝑆𝑆̂𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢,1−𝛼𝛼⁄2 = 𝑆𝑆̂exp � �
𝑆𝑆̂
Log-Likelihood
This is the value of the log-likelihood function calculated using the maximum likelihood parameter
estimates. This is the value of the function being maximized. It is often used as a goodness-of-fit statistic.
You can compare the log-likelihood values achieved by each distribution and select as the best fitting the
one with the maximum value.
Note that we have found that several popular statistical programs calculate this value without including all
of the terms. Hence, they present erroneous values. The error occurs because they omit the 1/t term in the
denominator of the lognormal and the Weibull log-likelihood functions. Also, they may fail to include a
correction for using the logarithm to the base 10 in the lognormal10. Hopefully, future editions of these
programs will calculate the likelihood correctly.
Mean
This is the mean time to failure (MTTF). It is the mean of the random variable (failure time) being studied
given that the fitted distribution provides a reasonable approximation to your data’s actual distribution. In
the case of the Weibull distribution, the formula for the mean is
1
𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 = 𝐷𝐷 + 𝐶𝐶Γ �1 + �
𝐵𝐵
Median
The median is the value of t where F(t)=0.5. In the case of the Weibull distribution, the formula for the
median is
Mode
The mode of the Weibull distribution is given by
1 1⁄𝐵𝐵
𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 = 𝐷𝐷 + 𝐶𝐶 �1 − �
𝐵𝐵
550-19
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Sigma
This is the standard deviation of the failure time. The formula for the standard deviation (sigma) of a Weibull
random variable is
2 1
𝜎𝜎 = 𝐶𝐶 �Γ �1 + � − Γ 2 �1 + �
𝐵𝐵 𝐵𝐵
Variance-Covariance Matrix
This table gives the inverse of the Fisher information matrix evaluated at the maximum likelihood estimates
which is an asymptotic estimate of the variance-covariance matrix of the two parameters. These values are
calculated using numerical second-order derivatives.
Note that because these are numerical derivatives based on a random start provided by differential
evolution, the values of the last two decimal places may vary from run to run. You can stabilize the values by
changing the value of Derivatives constant, but this will have little effect on the overall accuracy of your
results.
550-20
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
This report displays the Kaplan-Meier product-limit survival distribution and hazard function along with
confidence limits. The formulas used were presented earlier. Note that these estimates do not use the
selected parametric distribution in any way. They are the nonparametric estimates and are completely
independent of the distribution that is being fit.
Note that censored observations are marked with a plus sign on their time value. The survival and hazard
functions are not calculated for censored observations. Also note that left censored and interval censored
observations are treated as failed observations for the calculations on this report.
Also note that the Sample Size is given for each time period. As time progresses, participants are removed
from the study, reducing the sample size. Hence, the survival results near the end of the study are based on
only a few participants and are therefore less precise. This shows up as a widening of the confidence limits.
550-21
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
This report displays the nonparametric estimate of the hazard rate, h(t). Note that this is not the cumulative
hazard function H(t) shown in the last report. It is the derivative of H(t). Since it is h(t) that needs to be
studied in order to determine the characteristics of the failure process, this report and its associated plot
(which is shown below) become very import.
The formula for the Nelson-Aalen estimator of the cumulative hazard is
0 if 𝑡𝑡 < 𝑡𝑡1
� (𝑡𝑡) = �
𝐻𝐻 𝑑𝑑𝑖𝑖
� if 𝑡𝑡1 ≤ 𝑡𝑡
𝑌𝑌𝑖𝑖
𝑡𝑡1 ≤𝑡𝑡
In the above equation, 𝑑𝑑𝑖𝑖 represents the number of deaths at time 𝑡𝑡𝑖𝑖 and 𝑌𝑌𝑖𝑖 represents the number of
individuals who are at risk at time 𝑡𝑡𝑖𝑖 .
550-22
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
The hazard rate is estimated using kernel smoothing of the Nelson-Aalen estimator as given in Klein and
Moeschberger (1997). The formulas for the estimated hazard rate and its variance are given by
1 𝑡𝑡 − 𝑡𝑡𝑘𝑘
ℎ�(𝑡𝑡) = � 𝐾𝐾 � � (𝑡𝑡𝑘𝑘 )
� ∆𝐻𝐻
𝑏𝑏 𝑏𝑏
𝐷𝐷
1 𝑡𝑡 − 𝑡𝑡𝑘𝑘 2
𝜎𝜎 2 �ℎ�(𝑡𝑡)� = � 𝐾𝐾 � � ∆𝑉𝑉� �𝐻𝐻
� (𝑡𝑡𝑘𝑘 )�
𝑏𝑏 2 𝑏𝑏
𝐷𝐷
� (𝑡𝑡𝑘𝑘 ) = 𝐻𝐻
Δ𝐻𝐻 � (𝑡𝑡𝑘𝑘 ) − 𝐻𝐻
� (𝑡𝑡𝑘𝑘−1 )
Three choices are available for the kernel function K(x) in the above formulation. These are defined
differently for various values of t. Note that the 𝑡𝑡𝑘𝑘 ’s are for failed items only and that 𝑡𝑡𝐷𝐷 is the maximum
failure time. For the uniform kernel the formulas for the various values of t are
1
𝐾𝐾(𝑥𝑥) = for 𝑡𝑡 − 𝑏𝑏 ≤ 𝑡𝑡 ≤ 𝑡𝑡 + 𝑏𝑏
2
where
𝑡𝑡
𝑞𝑞 =
𝑏𝑏
𝑡𝑡𝐷𝐷 − 𝑡𝑡
𝑟𝑟 =
𝑏𝑏
For the Epanechnikov kernel the formulas for the various values of t are
3
𝐾𝐾(𝑥𝑥) = (1 − 𝑥𝑥 2 ) for 𝑡𝑡 − 𝑏𝑏 ≤ 𝑡𝑡 ≤ 𝑡𝑡 + 𝑏𝑏
4
550-23
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
where
240(1 − 𝑞𝑞)2
𝐵𝐵 =
(1 + 𝑞𝑞)4 (19 − 18𝑞𝑞 + 3𝑞𝑞 2 )
𝑡𝑡
𝑞𝑞 =
𝑏𝑏
𝑡𝑡𝐷𝐷 − 𝑡𝑡
𝑟𝑟 =
𝑏𝑏
For the biweight kernel the formulas for the various values of t are
15
𝐾𝐾(𝑥𝑥) = (1 − 𝑥𝑥 2 )2 for 𝑡𝑡 − 𝑏𝑏 ≤ 𝑡𝑡 ≤ 𝑡𝑡 + 𝑏𝑏
16
where
1120(1 − 𝑞𝑞)3
𝐵𝐵 =
(1 + 𝑞𝑞)5 (81 − 168𝑞𝑞 + 126𝑞𝑞2 − 40𝑞𝑞 3 + 5𝑞𝑞 4 )
𝑡𝑡
𝑞𝑞 =
𝑏𝑏
𝑡𝑡𝐷𝐷 − 𝑡𝑡
𝑟𝑟 =
𝑏𝑏
𝑧𝑧1−𝛼𝛼⁄2 𝜎𝜎�ℎ�(𝑡𝑡)�
ℎ�(𝑡𝑡)exp �± �
ℎ�(𝑡𝑡)
Care must be taken when using these kernel-smoothed estimators since they are actually estimating a
smoothed version of the hazard rate, not the hazard rate itself. Thus, they may be biased and are greatly
influenced by the choice of the bandwidth b. We have found that you must experiment with b to find an
appropriate value for each dataset.
550-24
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
This report displays the maximum likelihood estimates of the hazard rate, h(t), based on the selected
probability distribution and the definition of the hazard rate
𝑓𝑓(𝑡𝑡)
ℎ(𝑡𝑡) =
𝑅𝑅(𝑡𝑡)
Asymptotic confidence limits are computed using the formula from Nelson (1991) page 294.
𝑧𝑧1−𝛼𝛼⁄2 𝑠𝑠�ℎ�(𝑡𝑡)�
ℎ�(𝑡𝑡)exp �± �
ℎ�(𝑡𝑡)
where
2 2
𝜕𝜕ℎ� 𝜕𝜕ℎ� 𝜕𝜕ℎ� 𝜕𝜕ℎ�
𝑠𝑠 �ℎ�(𝑡𝑡)� = �
2
� 𝑣𝑣𝑣𝑣1,1 + � � 𝑣𝑣𝑣𝑣2,2 + 2 � �� � 𝑣𝑣𝑣𝑣1,2
𝜕𝜕𝑃𝑃1 𝜕𝜕𝑃𝑃2 𝜕𝜕𝑃𝑃1 𝜕𝜕𝑃𝑃2
550-25
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
This report displays the estimated values of the cumulative failure distribution, F(t), at the time values that
were specified in the Times option of the Reports Tab. These failure values are the estimated probability
that failure occurs by the given time point. For example, the maximum likelihood estimate that a unit will fail
within 88 hours is 0.1989. The 95% confidence estimate of this probability is 0.1064 to 0.3543.
The asymptotic confidence limits are computed using the following formula:
where
𝑡𝑡 − 𝜇𝜇̂
𝑢𝑢� =
𝜎𝜎�
Note that limits for the Weibull, lognormal, and log-logistic are found using the corresponding extreme
value, normal, and logistic probability functions using the substitution y=ln(t).
550-26
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Parametric Reliability
Weibull Reliability
────────────────────────────────────────────────────────────────────────
Maximum Likelihood (MLE)
───────────────────────
95% Confidence
Probability Interval Limits
Failure Plot Estimate Estimate ────────────
Time of Survival of Survival Lower Upper
───────────────────────────────────────────────────────────────────────────
8 0.9890 0.9941 0.9378 0.9995
16 0.9738 0.9833 0.8989 0.9973
24 0.9566 0.9694 0.8655 0.9933
32 0.9381 0.9531 0.8351 0.9873
40 0.9186 0.9349 0.8065 0.9791
48 0.8986 0.9151 0.7791 0.9690
56 0.8781 0.8940 0.7523 0.9569
64 0.8573 0.8719 0.7259 0.9431
72 0.8363 0.8490 0.6995 0.9277
80 0.8151 0.8253 0.6728 0.9112
88 0.7940 0.8011 0.6457 0.8936
96 0.7729 0.7765 0.6181 0.8755
104 0.7520 0.7517 0.5898 0.8569
112 0.7311 0.7266 0.5609 0.8383
120 0.7105 0.7016 0.5312 0.8199
128 0.6901 0.6766 0.5009 0.8018
136 0.6699 0.6517 0.4702 0.7842
144 0.6500 0.6270 0.4393 0.7672
152 0.6305 0.6025 0.4083 0.7509
160 0.6112 0.5784 0.3775 0.7352
────────────────────────────────────────────────────────────────────────
This report displays the estimated reliability (survival) at the time values that were specified in the Times
option of the Reports Tab. Reliability may be thought of as the probability that failure occurs after the given
failure time. Thus, (using the ML estimates) the probability is 0.9531 that failure will not occur until after 32
hours. The 95% confidence for this estimated probability is 0.8351 to 0.9873.
Two reliability estimates are provided. The first uses the parameters estimated from the probability plot and
the second uses the maximum likelihood estimates. Confidence limits are calculated for the maximum
likelihood estimates. (They have not been derived for the probability plot estimates for all data situations).
The formulas used are as follows.
where
𝑡𝑡 − 𝑀𝑀�
𝑢𝑢� =
𝑆𝑆̂
� � + 𝑢𝑢�2 𝑉𝑉�𝑆𝑆̂� + 2𝑢𝑢�𝐶𝐶𝐶𝐶𝐶𝐶�𝑀𝑀
𝑉𝑉�𝑀𝑀 � , 𝑆𝑆̂�
𝑉𝑉� (𝑢𝑢�) =
𝑆𝑆̂ 2
550-27
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Note that limits for the Weibull, lognormal, and log-logistic are found using the corresponding extreme
value, normal, and logistic probability functions using the substitution y=ln(t).
Parametric Percentiles
Weibull Percentiles
────────────────────────────────────────────────────────────────────────
Maximum Likelihood (MLE)
────────────────────────
95% Confidence
Probability Plot Interval Limits
Failure Time Estimate of Estimate of ────────────
Percentage Failure Time Failure Time Lower Upper
──────────────────────────────────────────────────────────────────────────────────────
5 26.9 33.4 14.2 78.4
10 47.4 53.8 28.5 101.4
15 66.8 71.6 42.7 120.3
20 85.7 88.4 56.5 138.3
25 104.7 104.5 69.7 156.8
30 124.1 120.5 82.2 176.7
35 144.0 136.5 93.9 198.6
40 164.7 152.8 104.8 222.9
45 186.5 169.6 115.0 250.1
50 209.5 187.0 124.7 280.5
55 234.3 205.4 134.0 314.8
60 261.1 225.0 143.1 353.7
65 290.7 246.1 152.1 398.3
70 323.8 269.5 161.3 450.3
75 361.9 295.8 170.9 512.1
80 407.1 326.5 181.3 588.2
85 463.5 364.1 193.0 686.7
90 540.0 413.9 207.4 826.0
95 664.5 492.6 227.8 1065.0
────────────────────────────────────────────────────────────────────────
This report displays failure time percentiles and, for the maximum likelihood estimates, confidence intervals
for those percentiles. For example, the estimated median failure time is 187 hours. The 95% confidence
limits for the median time are 124.7 to 280.5. Note that these limits are very wide for two reasons. First, the
sample size is small. Second, the shape parameter is less than 2.0.
The estimated 100pth percentile and associated confidence interval is computed using the following steps:
1. Compute 𝑤𝑤𝑝𝑝 = 𝐹𝐹 −1 (𝑝𝑝)
� + 𝑤𝑤𝑝𝑝 𝑆𝑆̂. Note that in the case of the Weibull and exponential distributions, we let
2. Compute 𝑦𝑦𝑝𝑝 = 𝑀𝑀
� = ln�𝐶𝐶̂ � and 𝑆𝑆̂ = 1 ∕ 𝐵𝐵�.
𝑀𝑀
3. Compute 𝑉𝑉�𝑦𝑦𝑝𝑝 � = 𝑉𝑉𝑉𝑉1,1 + 𝑦𝑦𝑝𝑝2 𝑉𝑉𝑉𝑉2,2 + 2𝑦𝑦𝑝𝑝 𝑉𝑉𝑉𝑉1,2 .
4. For the normal, extreme value, and logistic distributions, the confidence interval for the percentile is
given by
550-28
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
For the lognormal, exponential, Weibull, and log-logistic distributions, the confidence interval for the
percentile is given by
For the lognormal base 10 distribution, the confidence interval for the percentile is given by
This report gives percentiles of the estimated life remaining after a certain time period. For example, the
estimated median remaining life of items reaching 80.0 hours is 139.9 hours.
550-29
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
The percentile and associated confidence interval of residual (remaining) life is computed using the
following steps:
𝑦𝑦𝑝𝑝 −𝑀𝑀�
1. Compute 𝑧𝑧𝑝𝑝 = . Note that in the case of the Weibull and exponential distributions, we let
𝑆𝑆̂
� = ln�𝐶𝐶̂ � and 𝑆𝑆̂ = 1 ∕ 𝐵𝐵�. Also note that for the normal, extreme value, and logistic distributions,
𝑀𝑀
𝑦𝑦𝑝𝑝 = 𝑡𝑡. For the lognormal, Weibull, and log-logistic distributions, 𝑦𝑦𝑝𝑝 = 𝑒𝑒 𝑡𝑡 . And for the lognormal base
10 distribution, 𝑦𝑦𝑝𝑝 = 10𝑡𝑡 .
3. Compute 𝑝𝑝1 = 𝑝𝑝0 (1 + 𝑃𝑃), where P is the percentile of residual life to be estimated.
� + 𝐹𝐹 −1 (𝑝𝑝1 )𝑆𝑆̂. Note that in the case of the Weibull and exponential distributions, we
4. Compute 𝑤𝑤𝑝𝑝 = 𝑀𝑀
� = ln�𝐶𝐶̂ � and 𝑆𝑆̂ = 1 ∕ 𝐵𝐵�.
let 𝑀𝑀
For the normal, extreme value, and logistic distributions, the estimate is given by
𝑇𝑇𝑝𝑝 = 𝑤𝑤𝑝𝑝
For the lognormal, exponential, Weibull, and log-logistic distributions, the estimate is given by
𝑇𝑇𝑝𝑝 = 𝑒𝑒 𝑤𝑤𝑝𝑝
𝑇𝑇𝑝𝑝 = 10𝑤𝑤𝑝𝑝
Survival Plot
Survival Plot
─────────────────────────────────────────────────────────────────────────
This plot shows the product-limit survivorship function (the step function) as well as the parametric survival
plot and associated confidence intervals. If there are several groups, a separate line is drawn for each group.
550-30
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
This plot shows the parametric and nonparametric cumulative hazard functions for the data analyzed.
Confidence limits for the parametric cumulative hazard function are also given.
If you have several groups, then a separate line is drawn for each group. The shape of the hazard function is
often used to determine an appropriate survival distribution.
550-31
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
This plot shows the parametric and nonparametric hazard rate plots with confidence limits for the
parametric hazard rate. This plot is especially useful for studying the shape of the nonparametric hazard
rate and comparing that with the parametric hazard rate. When selecting a probability distribution to
represent a set of data, it is important to determine if the parametric hazard rate plot has a general shape
that is consistent both with the nonparametric hazard rate and with your prior knowledge of the hazard
distribution. This plot allows you to make this comparison.
Note that the asymptotic confidence intervals are not well behaved for some distributions. If the confidence
intervals seem to have zero width at some point along the plot, you should realize that they fall into this
category and ignore them.
Probability Plot
Probability Plot(s)
─────────────────────────────────────────────────────────────────────────
This is the Weibull probability plot of these data. The expected quantile of the theoretical distribution is
plotted on the horizontal axis. The natural logarithm of the time value is plotted on the vertical axis. Note
that censored points are not shown on this plot. Also note that for tied data, only one point is shown for
each set of ties.
This plot lets you investigate the goodness of fit of the selected probability distribution to your data. If the
points seem to fall along a straight line, the selected probability model may be useful. If the plot shows a
downward curve, the value of the threshold parameter, D, may need to be increased. If the plot shows an
upward curve, the value of the threshold parameter may need to be decreased. Or you may need to select a
different distribution.
You must decide whether the probability distribution is a good fit to your data by looking at this plot and by
comparing the value of the log-likelihood to that of other distributions.
550-32
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
(𝑛𝑛 + 1) − 𝑂𝑂𝑝𝑝
𝐼𝐼𝑗𝑗 =
1 + 𝑐𝑐
where 𝐼𝐼𝑗𝑗 is the increment for the jth failure; n is the total number of data points, both censored and
uncensored; 𝑂𝑂𝑝𝑝 is the order of the previous failure; and c is the number of data points remaining in the data
set, including the current data. Implementation details of this procedure may be found in Dodson (1994).
Left censored and interval censored data are treated as failures for making the probability plots.
550-33
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Setup
To run this example, complete the following steps:
Variables Tab
___________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
Options Tab
___________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
Plots Tab
___________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
550-34
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Data Summary
Data Summary
────────────────────────────────────────────────────────────────────────
Type of
Observation Rows Count Percent (%) Minimum Maximum
────────────────────────────────────────────────────────────────────────────────────────────────────
Failed 12 12 40% 12.5 152.7
Right Censored 1 18 60% 152.7 152.7
Left Censored 0 0
Interval Censored 0 0
Total 13 30 100% 12.5 152.7
────────────────────────────────────────────────────────────────────────
This report displays a summary of the data that were analyzed. Scan this report to determine if there were
any obvious data errors by double-checking the counts and the minimum and maximum.
This report displays the values of the log-likelihood for each distribution along with the estimated values of
its parameters. Since our desire is to maximize the likelihood, under normal circumstances, we would pick
the distribution at the top of the report since it has the largest likelihood value. In this example, we would
select the Weibull distribution.
550-35
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Probability Plots
Probability Plot(s)
─────────────────────────────────────────────────────────────────────────
550-36
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
By studying these probability plots, we can determine which distributions fit the data the best. In this
example, since there are only a few observations, it is difficult to select one distribution over another. We
can see that our candidate from the last section, the Weibull distribution, certainly cannot be removed on
the basis of its probability plot. Without further information, our decision would be to select the Weibull
distribution to fit these data.
550-37
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Readout105 Dataset
Setup
To run this example, complete the following steps:
550-38
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Variables Tab
___________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
Options Tab
___________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
Plots Tab
___________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
Data Summary
Data Summary
────────────────────────────────────────────────────────────────────────
Type of
Observation Rows Count Percent (%) Minimum Maximum
────────────────────────────────────────────────────────────────────────────────────────────────────
Failed 0 0
Right Censored 1 16 40% 1500 1500
Left Censored 1 2 5% 24 24
Interval Censored 8 22 55% 24 1500
Total 10 40 100% 24 1500
────────────────────────────────────────────────────────────────────────
This report displays a summary of the data that were analyzed. We note that the number of rows and the
total count appear to be correct.
550-39
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
It appears that the Weibull distribution is a reasonable choice for the parametric distribution, although the
shape parameter is less than one. This may point to the need for a nonzero threshold value.
To finish this example, you would view the probability plots. Finally, you would try fitting the Weibull
distribution to these data. We will leave that to you to do. Simply change the Distribution box to Weibull and
rerun the procedure.
550-40
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Setup
To run this example, complete the following steps:
Variables Tab
___________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
Reports Tab
___________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
We will show only those portions of the printout that are necessary to answer the two questions that were
posed at the beginning of this section.
550-41
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Parameter Estimation
Estimation Details
──────────────────────────────────────────────────
Differential Evolution Iterations 32
Newton-Raphson Restart 1
Newton-Raphson Iterations 7
User-Entered Random Seed 2928475
──────────────────────────────────────────────────
────────────────────────────────────────────────────────────────────────
* Probability plot estimates were generated with F(t) calculated using the approximate median and using the model Time = A +
B(F).
This report shows the estimated parameters. We are particularly interested to see that the shape parameter
is almost exactly one. The confidence limits for the estimated shape parameter include one between them.
Remember that when the shape parameter is one, the Weibull distribution reduces to the exponential
distribution, a distribution which ‘has no memory.’ From this, we get an indication that the failure pattern of
the fans does not change over time. That is, the failure rate does not change as the fans get older.
550-42
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
This report presents the estimated failure proportions at various time periods. We note that at 8000 hours,
the maximum likelihood estimate for the proportion failing is 0.2471. The 95% confidence limits are 0.1459
to 0.3999. That is, almost 25% of the fans can be expected to fail by 8000 hours—a very high failure rate.
Management will have to change the fans to decrease the proportion failing!
550-43
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
This plot shows both the parametric and nonparametric estimates of the hazard rates. First, we analyze the
nonparametric estimate. Notice that the line wanders up and then down, but it does not extend outside the
confidence limits of the parametric hazard rate. The sharp rise at the end of the plot is due to a lack of data
in this region and should be ignored. We see that the parametric estimate of the hazard rate, the middle
horizontal line, is a reasonable approximation for the nonparametric line. The above considerations again
lead us to conclude that the failure rates do not change with age.
This ends this example. Notice how quickly we have been able to answer the two questions posed by
management. The only task that we did not complete was to make sure that the Weibull distribution was
appropriate for these data. A quick look at the probability plot will show you that it is.
550-44
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Setup
To run this example, complete the following steps:
Variables Tab
_________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
Options Tab
_________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
Reports Tab
_________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
550-45
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Plots Tab
_________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
Survival Tab
Confidence Limits (Distribution Fit Line) .......... Checked
550-46
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Output
Survival Plot
────────────────────────────────────────────────────────────────────────
The modified survival plot is displayed with numbers at risk, cumulative censoring, and cumulative events at
each reference time point. You can further modify the plot as required to suit your needs.
550-47
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
This plot is displayed with default at-risk table settings to give you an idea of what you get if you make no
modifications to the table.
550-48
© NCSS, LLC. All Rights Reserved.