Report Reliability in Fatigue
Report Reliability in Fatigue
Report Reliability in Fatigue
Tomas Torstensson
9th January 2004
Reliability in fatigue
On the choice of distributions in the load-strength model
Abstract
In this thesis the influence of the choice of distributions in the loadstrength model is considered. Accurate predictions of the failure probability
is very useful when aiming at the most cost effective design of a component. Two distributions for load and strength are evaluated, the lognormal
distribution and the Weibull distribution. From the load-strength model
the failure probability can be determined which is the probability that the
component in question fails within a specific time. The main conclusion is
that the lognormal distribution should be used rather than the Weibull distribution, especially when the data available is limited. A possible way of
updating the model with observed failure rates using Bayesian methods is
also suggested.
Tillf
orlitlighet inom utmattning
Val av f
ordelningar i last-styrka-modellen
Sammanfattning
I detta exjobb undersoks vilken p
averkan olika fordelningsval har p
a
last-styrka-modellen. Noggranna forutsagelser av felsannolikheten ar mycket anvandbara for kostnadseffektiv dimensionering av komponenter. Tv
a
fordelningar for lasten och styrkan studeras, lognormalfordelningen och
Weibullfordelningen. I last-styrka-modellen kan felsannolikheten beraknas,
d.v.s. sannolikheten att den aktuella komponenten g
ar sonder inom en viss
tid. Huvudslutsatsen ar att lognormalfordelningen bor anvandas snarare an
Weibullfordelningen, i synnerhet vid begransad tillg
ang p
a data. Ett mojligt
satt att uppdatera modellen med felutfall med hjalp av Bayesianska metoder
foresl
as ocks
a.
Acknowledgments
I would like to thank my supervisors Par Johannesson, Jacques de Mare
and Thomas Svensson for their extensive counseling and encouragement. The
frequent meetings have made it possible to get feedback and advice in order
to make progress with the project. I wish to thank my examiner Gunnar
Englund. I am also grateful to Bengt Johannesson at Volvo Trucks and
Bertil Jonsson at Volvo Articulated Haulers. I had access to their reports
and measurement data and my Masters thesis could not have been carried
out without that material. I wish to thank Fraunhofer-Chalmers Research
Centre for giving me this opportunity to do my Masters thesis and it has
been a pleasure to work here. A special thanks to Marlo who reviewed the
manuscript. Finally I would like to thank Ebba for her love and support.
Contents
1 Introduction
2 Background
2.1 Normal distribution . . . . . . . . . . . . . .
2.2 Lognormal distribution . . . . . . . . . . . .
2.3 Three parameter Weibull distribution . . . .
2.4 A target customer . . . . . . . . . . . . . . .
2.4.1 Duty based on normal distribution .
2.4.2 Duty based on lognormal distribution
2.4.3 Duty based on Weibull distribution .
2.5 Failure probability . . . . . . . . . . . . . .
2.5.1 Entire population . . . . . . . . . . .
2.5.2 The target customer . . . . . . . . .
2.6 Distributions for duty and capacity . . . . .
2.7 Models for duty and capacity . . . . . . . .
2.7.1 Estimation of capacity . . . . . . . .
2.7.2 Estimation of duty . . . . . . . . . .
2.8 Applications . . . . . . . . . . . . . . . . . .
2.9 Duty intensity . . . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
8
8
9
9
9
10
10
10
11
11
11
11
12
13
13
13
13
3 Applications in industry
3.1 Volvo Trucks . . . . . . . . . . .
3.1.1 Load-strength model . . .
3.1.2 Load-strength simulations
3.2 Volvo Articulated Haulers . . . .
3.2.1 Estimation of capacity . .
3.2.2 Estimation of duty . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
15
15
15
16
16
16
18
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
4.2.1
4.2.2
4.2.3
4.2.4
4.2.5
.
.
.
.
21
22
23
24
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
25
25
27
27
28
28
29
30
31
31
32
32
32
33
33
33
.
.
.
.
.
.
36
36
37
38
38
40
41
42
42
43
46
4.3
4.4
4.5
4.6
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
49
50
52
56
58
List of Figures
4.1
4.2
4.3
4.4
4.5
4.6
4.7
4.8
. . . . . .
. . . . . .
. . . . . .
. . . . . .
= 6 106 .
= 6 106 .
. . . . . .
. . . . . .
.
.
.
.
.
.
.
.
25
26
26
27
29
30
34
34
Failure
Failure
Failure
Failure
Failure
Failure
probability
probability
probability
probability
probability
probability
Pf
Pf
Pf
Pf
Pf
Pf
for
for
for
for
for
for
different
different
different
different
different
different
59
59
60
60
61
61
List of Tables
4.1 The entities mE and sE when = 0.65. . . . . . . . . . . . . . 31
4.2 The entities mE and sE when = 1.30. . . . . . . . . . . . . . 31
5.1 Weibull parameter values for different distribution fits. . . . . 40
5.2 Failure probability for different capacity distributions. . . . . . 40
B.1
B.2
B.3
B.4
B.5
B.6
B.7
B.8
B.9
B.10
B.11
B.12
Estimation
Estimation
Estimation
Estimation
Estimation
Estimation
Estimation
Estimation
Estimation
Estimation
Estimation
Estimation
of
of
of
of
of
of
of
of
of
of
of
of
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
52
53
53
53
53
54
54
54
54
55
55
55
Chapter 1
Introduction
The load-strength model is a tool for reliability analysis in fatigue.The damage that an external cyclic load causes to a material is called the fatigue of
the material. The load itself can be characterized by its local maxima and
minima. A good way to examine the load is the rainflow count method,
see Johannesson [3], which gives the load amplitudes. Some definitions are
needed in order to describe the model
C = Capacity (strength) ,
D = Duty (load) .
The capacity can, e.g., represent the strength of a vehicle component and
the duty is then the load that the component is exposed to. The basis of the
model is that failure occurs (the component breaks) if the duty exceeds the
capacity, i.e. D > C. In lecture notes from a course for Swedish industry, see
Johannesson and de Mare [2], the load-strength model and several applications are described. The duty naturally depends on the time or distance the
component has been used. The idea is that both capacity, C, and duty, D,
are modelled as random variables. The scatter in the strength of the components is modelled by C and the scatter in the load is modelled by D. The
scatter in the strength is easy to understand (material and manufacturing
properties), but the scatter in the load is more complex. It consists of several
parts. Duty varies based on how the vehicle is driven, the road conditions,
and so on. Therefore it is much harder to find and motivate a suitable model
for the duty than the capacity. The failure probability is the probability that
failure occurs
Pf = P (D > C) .
The model can be used in several ways. One way of using the model is in
the case where there is a restriction on the failure probability for a critical
6
component. The objective could also be to find the minimum life cycle cost.
The total cost is a sum of the manufacturing and operation cost. When the
failure probability decreases (stronger component) the manufacturing cost
increases and for the operation cost the relationship is reversed. This means
that an optimal design failure probability can be found which minimizes the
life cycle cost for the component. Then the component can be adjusted by
changing the manufacturing procedure in order to satisfy that condition.
The load-strength model have been used by PSA Peugeot Citroen, see
Thomas et al. [7], and Volvo Construction Equipment, see Olsson [5] and
Samuelsson [6]. They use different assumptions for the distributions of the
capacity and load. Volvo Construction Equipment uses a model in which
capacity and duty is based on the three parameter Weibull distribution. PSA
Peugeot Citroen models both capacity and duty with the normal distribution.
In Chapter 2 the background of the load-strength model is described
and the model and definitions are introduced. The use of the load-strength
model in industry is described in Chapter 3. In Chapter 4 the estimation
of parameters in the lognormal and Weibull distribution are examined. A
couple of methods for estimation of the parameters in a three parameter
Weibull distribution are examined. The basis of the estimation is data from
Volvo Trucks. Also the quantitative differences that depends on the choice of
distribution are studied. The model of Volvo Articulated Haulers is analyzed
in Chapter 5. In Chapter 6 feedback using Bayesian methods is examined.
Finally in Chapter 7 conclusions are drawn and proposals for how to use the
load-strength model in the future are suggested.
Chapter 2
Background
The capacity, C, and the duty, D, are assumed to be continuous random
variables. A continuous random variable X is defined by its density function,
fX (x), and its distribution function, FX (x).
P (a X b) =
Zb
a
d
FX (x)
dx
In this thesis principally two distributions will be considered, the lognormal
distribution and the three parameter Weibull distribution (used by Volvo).
The reason why these distributions are used is discussed in Section 2.6. In
some examples in this chapter the normal distribution (used by PSA Peugeot
Citroen) will also be considered.
fX (x) =
2.1
Normal distribution
< x < .
2.2
Lognormal distribution
2.3
1
exp
!
, > 0, x > .
For this distribution there is a explicit expression for the distribution function
!
Zx
x
.
FX (x) = fX (y) dy = 1 exp
2.4
A target customer
2.4.1
2.4.2
If the duty is lognormal the 100p% customer is found in the following way
log D
log zp
P (D zp ) = P (log D log zp ) = P
log zp
=p
=
which gives
log zp
= p
zp = e+p .
Assuming that the mean and variance is the same as in the example with
the normal distribution yields z0.90 = 379 MPa.
2.4.3
Assume that the duty is inversely proportional to a Weibull distributed random variable, i.e. D = 1/Y where Y W (, , 0).
!
y
FY (y) = 1 exp
, y 0, , > 0 .
2.5
Failure probability
There are different kinds of failure probabilities. One of them is the probability that a failure occurs considering the entire population. Another is the
probability that failure occurs for the 100p% customer.
2.5.1
Entire population
The probability that a failure occurs for the entire population is denoted by
Pf and is calculated as
Pf = P (D > C).
2.5.2
2.6
Until now some different distributions have been assumed for duty and
capacity. The choice in a specific case depends on the application. There
are three different models that are natural in the fatigue context.
1.
2.
3.
D=normal
C=normal
1/D=Weibull C=Weibull
D=lognormal C=lognormal
1. PSA Peugeot Citroen uses a model where both duty and capacity are
normally distributed. The capacity is interpreted as the fatigue limit
which explains the assumption of normal distribution.
11
2.7
According to the method that Volvo Construction Equipment uses the damage of a component that is accumulated over time must be transformed to a
scalar value and one way to do that is to use the Palmgren-Miner hypothesis
for accumulated fatigue damage
d=
M
X
i=1
1
N (Si )
where d is the damage. The function that describes the number of cycles to
failure for the component in interest is denoted by N (). The different load
amplitudes are Si , i = 1, . . . , M . Furthermore damage 1 corresponds to
failure.
Basquins equation is frequently used to describe N ()
N = C (S)k
where k is the Wohler exponent. The capacity, C, is assumed to be a random
variable. Now the damage can be written as
d=
M
X
(Si )k
i=1
D=
M
X
D
C
(Si )k .
i=1
Note that failure occurs when d > 1 which is consistent with the previous
definition of failure D > C.
12
2.7.1
Estimation of capacity
MC
X
i=1
M
C
X
1
C Sik
Sik .
i=1
2.7.2
Estimation of duty
The duty, D, is estimated from a load process and an observation is determined by the formula
MD
X
D=
Sjk
j=1
where MD is the total number of load cycles. The mean and the standard
deviation of D can be estimated from load measurements on different customers.
2.8
Applications
In a typical application only the relation between C and D is of interest. Furthermore, a value for the Wohler exponent is chosen, e.g. Volvo Construction
Equipment (VCE) often uses k = 3 since the components are welded. In that
case C and D are determined from the expressions
CV CE =
MC
X
Si3
Sj3 .
j=1
i=1
2.9
DV CE =
MD
X
Duty intensity
14
Chapter 3
Applications in industry
3.1
Volvo Trucks
Volvo Trucks has made extensive trials in order to predict the life distribution
by means of the load strength model.
3.1.1
Load-strength model
where the ni isPthe number of cycles until a failure occurs at the load
level Si and
ni is the total number of cycles until failure occurs.
i
3.1.2
Load-strength simulations
3.2
3.2.1
Estimation of capacity
ZNc
da
=
B (S)k f (a)k
dN
a0
1
B
Zac
da
= Nc (S)k
f (a)k
a0
where a0 is the initial crack length and ac (20 mm here) is the crack length
defined as failure. The number of cycles until failure is denoted by NC .
Extracting Nc gives
Zac
da
1
(S)k
Nc =
B
f (a)k
A0
Zac
da
f (a)k
A0
3.2.2
Estimation of duty
The duty is determined by driving in different ways (normal and forced driving) for one hour and after that making an assumption of how common the
different types of driving styles are. Once that is made an estimation of the
can be determined. In these calculadistribution of the duty intensity, D,
tions the Wohler exponent k = 3 which is a value that is often chosen in
this context. The duty is assumed to be based on a three parameter Weibull
distribution which is fitted to the observations.
18
Chapter 4
Inference based on data from
Volvo Trucks
In this section estimation of parameters in the lognormal and the Weibull
distribution will be examined. The basis of the examination will be the data
from Volvo Trucks. The different estimation methods have been implemented
in Matlab. All numerical calculations in this thesis have been carried out
through the use of Matlab.
4.1
x > 0.
1X
=
log xi ,
n i=1
n
1X
(log xi 1 )2 .
2 =
n i=1
19
2 =
1 X
(log xi 1 )2
n 1 i=1
4.2
where is the shape parameter, is the scale parameter, and is the location
parameter. Note that is a threshold. Given a sample x1 , x2 , . . . , xn from
X1 , X2 , . . . , Xn the task is to find an estimator of . The likelihood function
is
n Y
1 Y
!
n
n
n
Y
xi
xi
L() =
fX (xi ) =
exp
i=1
i=1
i=1
Normally L or log L is maximized in order to find the MLE of , but the problem with the three parameter Weibull distribution is that it has a threshold
which means that L() can be singular due to the factor
(xk )1
where xk = min xi
1in
Since > xi i it is only the factor with the minimum xi that has to be
examined. The expression can be examined by letting tend to xk
<1:
=1:
lim (xk )1 =
x
k
lim (xk )1 = 1
xk
>1:
lim (xk )1 = 0
xk
4.2.1
!
xp
= p
x
= log(1 p)
xp = + ( log(1 p))1/ .
s = i, j, k
log(1
p
)
t
k
k
= log
log
.
log(1 pi )
ti
21
where the estimate also is found with help from the sample percentiles
=
ti tk t2j
.
ti + tk 2tj
If this estimated value exceeds the smallest observed value, y1 , then it is not
permissible and = y1 should be used instead, but the probability that such
a situation occurs is very small.
The values for pi and pk can be chosen such that the asymptotic variance of
the estimator is minimized, see Dubey [1], which yields
pi = 0.16731,
pk = 0.97366.
A efficient estimator of can be found by using the 1st, 2nd and nth ordered
observation of the sample. When the estimation of is determined it is an
easy task to find the estimation of . Finally the estimators are
y1 yn y22
,
y1 + yn 2y2
=
+ yd0.63ne ,
t
log(1
p
)
k
k
log
.
= log
log(1 pi )
ti
=
The advantage of these estimators are both their simplicity and accuracy,
especially when n is small which suits the load-strength application since it
and .
often involves small samples. These estimates will be denoted by ,
4.2.2
Let X W (, , 0) and Y W (, , ). Assume that is known or estimated. Then it is possible to transform the three parameter Weibull distributed random variable into a two parameter such one. The advantage of
this approach is that there will be no singularity problem since the threshold
has already been estimated.
22
4.2.3
A natural estimation of the location parameter is = x(1) . The disadvantage with this estimation is that it will always exceed the true value. It is
therefore of interest to examine the expectation of X(1)
Z
E(X(1) ) =
x fX(1) (x)dx .
!
!
x
x
= 1 exp n
= 1 exp
where
1
=
=
.
n1/
The calculations above show that X(1) W (, , ). The expectation of
X(1) can be determined since the expectation of a two parameter Weibull
distributed variable is known. Assume that Y W (, , 0). Then it follows
that
+1
E(Y ) =
23
E(X(1) ) = +
= + 1/
.
+1
.
n1/
Therefore
+1
= x(1) 1/
n
4.2.4
= x(1)
n1/
+1
x 10
Mix
Zan
Iter
VTC
ML2
Elim
1.5
0.5
4
5
x 10
4.2.5
4.2.6
1
0.8
0.6
Mix
Zan
Iter
VTC
ML2
Elim
Empir
0.4
0.2
0
6
5
x 10
x 10
7
Mix
Zan
Iter
VTC
ML2
Elim
6
5
4
3
2
1
0
10
15
5
x 10
26
1
0.8
0.6
Mix
Zan
Iter
VTC
ML2
Elim
Empir
0.4
0.2
0
0.5
1.5
2
4
x 10
4.3
4.3.1
Simulations
where n is the number of observations. Both the parameters and the number
of observations have values that are similar to the ones that were determined
from the real observations. This is very important since it means that the
conclusions from the simulations are in some way also true for real cases. For
each setting of parameters 10 000 iterations have been carried out to get an
accurate estimate of the m.s.e. The results are found in tables in Appendix
B. The column V ( ) (%) means the proportion in percent of the m.s.e,
V ( )
i.e. 100 m.s.e.(
) . The remaining part of the m.s.e. is due to the bias of the
estimates and it is shown in the column b()2 (%).
4.3.2
Conclusions
The direct method (Zan) and the method that first uses the direct method
in order to estimate and then computes MLE of the other two parameters
(Mix) are the best methods (smallest m.s.e.). For almost every setting of
parameters these methods are first and second best. It is not possible to
conclude which of the two methods is best when looking only at the simulations (see Appendix B). The direct method is simple to use which makes it
more suitable for use in industrial applications.
4.4
Since the duty depends on the driven distance a design distance must be
chosen. A reasonable design distance is 1 000 000 km which means that s =
The distribution of D can be determined since
1 000 000. The duty D = s D.
= 1/Y where Y W (Y , Y , Y ).
the distribution of 1/D is known. Let D
D = sD
d
1
fD
fD (d) =
s
s
1
1
1
d = P Y
= 1 FY
Y
d
d
1
1
=
fY
2
d
d
= P (D
=P
d)
FD (d)
= fY 1 1
fD (d)
d
d2
which implies that
s
1
Y !
Y s ds Y Y
Y
exp d
,
fD (d) =
Y d2
Y
Y
28
0<d<
s
.
Y
11
x 10
distance = 5e+05
distance = 1e+06
distance = 1.5e+06
0.5
1.5
2.5
11
x 10
4.5
Failure probability
s
Y
c=C
s
ZY
c=C
fC (c)
s
c
y=Y
fC (c) FY
fY (y) dy dc =
s
c
dc .
29
s
Y
c=C
11
x 10
distance = 5e+05
distance = 1e+06
distance = 1.5e+06
0.5
1.5
2.5
11
x 10
ZZ
Z
Z
Pf = P (D > C) =
fD ( )
fC (c) dc d
fC,D dc d =
c<
=C
fD ( ) [FC (c)]C d =
=C
c=C
fD ( )FC ( ) d .
=C
In general none of the two integrals for the failure probability can be solved
analytically and numerical methods, e.g. Simpsons rule which is used in this
thesis, must be applied.
4.5.1
Simulations
According to the results of the simulations, see 4.3.2, the methods Zan and
Mix are chosen for further examination. Suppose that the true failure probability is p and one of the methods gives the approximation p . A good
measure of how close the estimate is to the real value is
Eexp
p
= lg
p
30
where lg is the logarithm to the base 10. One problem is how to deal with
the cases when p = 0 since then Eexp = . If those cases are dealt with
separately it is possible to determine the accuracy of the two methods when
p 6= 0. The probability that the estimated Pf is less or equal to zero is
denoted by p0 . The mean (mE ) and standard deviation (sE ) of the error
Eexp can be determined by simulating a number of times, in this case 10 000.
When = 0.65, Pf = 1.4490 104 and when = 1.30, Pf = 1.0634 105 .
It is also of interest to examine how the proportion p0 changes when the
Zan
Mix
mE
0.105
-0.0457
n=5
sE
1.032
1.063
p0
0.4373
0.4386
mE
-0.5007
-0.3364
n=10
sE
0.9605
0.9155
p0
0.2626
0.2627
Zan
Mix
mE
0.9168
0.5553
n=5
sE
1.6
1.657
p0
0.7507
0.7619
n=10
mE
sE
-0.7226 1.508
-0.4021 1.463
p0
0.7808
0.7772
4.5.2
Conclusions
It is hard to draw any conclusions from the simulations that are summarized
in Tables 4.1 and 4.2 since the probability that Pf = 0 is so high. This
measure of the error, Eexp would probably be better in a situation where
the failure probability always is positive, e.g. when either the capacity or the
duty is lognormal.
4.5.3
Sensitivity analysis
big that influence is. One way to do that is to vary one of the parameters
and keeping the other two fixed and then calculate the failure probability for
each combination. The figures are found in Appendix D. The m.s.e. for the
direct method is included in the captions.
4.5.4
Conclusions
Since the estimations of the parameters are dependent the figures in Appendix D do not show the true relationship, but nevertheless they give some
qualitative information. Anyhow it is clear that the failure probability depends mostly on the value of . Therefore the method of determining , if
there should be a threshold at all, will have a big influence on the final result.
Probably it would be better to use a more robust model.
4.6
There are two properties that have to be examined in order to decide how
to model the duty. The first of them, which will be discussed in this section
is the qualitative property of the model. The other of the two properties are
composed of the computational properties, such as stability and accuracy.
4.6.1
which gives
s
=
Y + Y ( log p)1/Y
s
Y
= 15.6 1010
s
Y
= 7.81 1010
4.6.2
In this case there is no upper limit. The quantiles in the lognormal distribution can be found numerically for the two cases.
1. City: z0.50 = 6.98 1010 , z0.95 = 25.0 1010
2. Highway: z0.50 = 2.83 1010 , z0.95 = 8.77 1010
4.6.3
Here there is no upper limit. The quantiles are given by the same expression
as for the three parameter Weibull distribution, but with Y = 0.
1. City: z0.50 = 6.22 1010 , z0.95 = 34.7 1010
2. Highway: z0.50 = 2.60 1010 , z0.95 = 15.3 1010
4.6.4
A model that describes duty in a good way should give reasonable results for
different driven distances (s). One way to visualize this is to determine Pf
for different s and the resulting graphs are found in Figures 4.7 and 4.8. The
curves that are denoted by drivers corresponds to the failure probability for
one certain driver, i.e. one duty intensity value (di ). From this the empirical
distribution function for the duty is generated and it is determined by the
relations
P (D < s di ) = 0
P (D s di ) = 1
33
10
VTC
logn
ML2
drivers
10
10
Pf
10
10
10
10
6
6
x 10
driven distance
10
VTC
logn
ML2
drivers
10
10
P
f
3
10
10
10
10
4
driven distance
8
6
x 10
34
In Figures 4.7 and 4.8 it is clear that the upper limit results in strange
properties for the failure probability. It is zero until a certain distance and
then it suddenly increases very fast. This is not reasonable because the failure
probability should increase in a smoother way as it does for the other two
distributions. It is also interesting to note how much this upper limit differs
in the two cases.
There is also a big difference between the lognormal fit and the Weibull
fit without upper limit (corresponds to two parameter Weibull distribution).
This is due to the fact that the density function for the lognormal distribution decreases faster for larger values compared to the Weibull distribution,
especially for smaller s. That is why the failure probability differs with orders of magnitude for small s. Consequently the load-strength model is very
sensitive with respect to the choice of distribution. Therefore the model must
be compared with real outcomes before it can be used.
35
Chapter 5
Analysis of Volvo Articulated
Haulers model
The model by Volvo Articulated Haulers is described more precisely in Section 3.2. One interesting aspect of this approach is the model for calculating
capacity. It is stated that
Zac
1
C=
B
da
f (a)k
A0
where the random variables and parameters are described in Section 3.2. The
function f () has been determined by tests
f (a) = 0.1388 a2 + 0.35 a + 5.4 .
Since the assumption is that A0 , the initial crack length, and B, a material parameter, are normally distributed tests have been done in order to
determine the mean and variance. The results of these tests are that
A0 N (mA0 , A2 0 ) = N (10, 1.78) ,
B N (mB , B2 ) = N (1.832 1013 , 2.098 1027 ) .
5.1
Model properties
In the model by Volvo Articulated Haulers it is assumed that the initial crack,
A0 , should be greater than zero and less than the crack length by failure, i.e.
0 A0 20 .
36
5.2
which gives
P (xl X xu ) =
=
Zxu
xl
xu m
xl m
fX (x)dx =
=1
xu m
xl m
.
For simulating from this distribution, values are generated from the original
normal distribution and then values outside the interval [xl , xu ] are not used.
5.3
Simulations
5.4
Conclusions
In the Figures 5.1 and 5.2 it is clear that the lognormal distribution gives
a better fit than the Weibull distribution especially for low capacity values
which is the most important part of the distribution. One reason for this difference is that the smallest observed value has a big influence on the Weibull
distribution fit which means that the density function will be comparatively
large for small capacity values. When the lognormal fit is carried out the
smallest capacity value will not have that big of an influence. In this model
the theoretical threshold for the capacity is zero (corresponds to B
or A0 ac ) which means that a distribution with a threshold can not be
justified. It is also of interest to compare the Weibull distribution fit carried
out by Volvo Articulated Haulers, from thirty capacity values, with the one
in this thesis which is based on 10 000 capacity values. The result is found in
Table 5.1. Since the threshold depends very much on the smallest capacity
38
1
Zan
logn
Empir
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0.2
0.4
0.6
0.8
1.2
1.4
1.6
1.8
2
9
x 10
10
x 10
Zan
logn
Empir
6
9
x 10
39
5.5
Failure probability
In the Volvo Articulated Haulers report a distribution for the duty (corre [duty/h], is found
sponds to 1 hours of driving), actually the duty intensity D
1
= where Y W (Y , Y , Y ). The parameter values are Y = 1.0,
and D
Y
Y = 1.065 105 and Y = 3.5 107 . Just three observations were used in
order to determine the parameters which means that the uncertainty is very
high. A reasonable choice of design life for the machine is 10 000 hours which
corresponds to t = 5 000 hours in use (half of the time the engine is idling)
which means that the duty D = t D = Yt . In this case the failure probability
has been determined for different capacity distributions: Weibull, lognormal
and empirical distribution (using 10 000 capacity values). The result is found
in Table 5.2. The failure probabilities are all close to 30% which is the value
that was calculated by Bertil Jonsson in the Volvo Articulated Haulers report. The reason that the values are similar is that the failure probability
is comparatively high which means that the overlap of the distributions of
capacity and duty is large which makes the calculation stable with respect
to the choice of distribution. Since the failure probability of course depends
on the distribution of the duty more extensive experiments with more duty
Parameter
Zan (n=10000)
1.49
1.34 109
1.95 108
Failure probability
0.3353
0.3319
0.3103
x 10
Zan
logn
duty
1.2
0.8
0.6
0.4
0.2
0.5
1.5
2.5
3.5
4.5
5
9
x 10
5.6
One way to visualize the result is to plot the density functions for capacity
and duty. The curves are found in Figure 5.3. Since the density functions
for capacity and duty overlap to a rather large extent it is natural that the
failure probability will be as big as 30%.
41
Chapter 6
Feedback using Bayesian
estimates
In the Volvo Articulated Haulers model the failure rate (proportion of failures in the field) was known, but it has not been used in the model. A way to
improve the estimates by using the failure rate is to use Bayesian methods.
The Bayesian method will here be used for a simple update of the model but
it could be generalized to a more sophisticated update.
6.1
Let it be assumed that both capacity and duty are lognormal. Since the
data from the Volvo Articulated Haulers report is being used the distribution of the duty intensity, which in that model is based on a three parameter Weibull distribution, will be transformed to a lognormal distribu LN ( , 2 ) where
tion with the same mean and variance. This gives D
D
D
= 1.139. The distribution
= 11.97 and 2 = V (log D)
D = E(log D)
D
of the capacity is determined from the extensive simulations which gives
C LN (C , C2 ) where C = 20.97 and C2 = 0.3386. The design life of the
Then
machine, t, is assumed to be 5 000 h which gives the duty D = t D.
2
it will hold that D LN (D + log t, D ). The failure probability, Pf , can be
determined
+ log t C
Pf = P (D > C) = P (log D log C > 0) = Dq
2
2
D
+ C
and with the numerical values
11.97 + log 5 000 20.97
= (0.3972) = 0.346.
Pf =
1.139 + 0.3386
42
This value can be compared to the failure probability when the duty was
based on a three parameter Weibull distribution which gave Pf = 0.335 and
this means that the transformation to a lognormal duty seems reasonable.
6.2
Bayesian estimates
For each machine included in the set of observed machines a random variable
is associated
i = 1{Di >Ci } ,
i = 1, 2, . . . , n.
n
X
i .
i=1
p = Pf .
The prior mean value of is just the estimation of D that was determined
from the duty observations. The prior variance of has here been set to
1, which seems reasonable. It is not evident what variance to use. One
idea could be to take into account the spread in the estimate of D , but
this is hard to carry out since the lognormal distribution of the duty was
transformed from a Weibull distribution and not fit from data. The issue of
choosing the variance is actually a question about how much influence the
prior distribution will have on the posterior distribution, i.e. the final model.
A small prior variance means that the duty observations will have a great
influence on the model, and a large variance means that the failure rate will
have a greater influence. Therefore this issue must be carefully examined in
developing this type of Bayesian methods for the load-strength model.
Due to the Bayesian method a reasonable estimation of D would be
E(|s197 = 18). In general
Z
E(|Sn = sn ) =
f|Sn ( |sn ) d .
The density function f|Sn ( |sn ) can be found via Bayes theorem, see Lindgren [4]
f|Sn ( |sn ) = R
fSn | (sn | ) f ( )
.
fSn | (sn | ) f ( )d
n
psn (1 p)nsn ,
fSn | (sn | ) = P (Sn = sn | = ) =
sn
sn = 0, 1, . . . , n
where
+ log 5 000 20.97
.
p =
1.139 + 0.3386
The normalization integral can then be solved numerically
18
899
Z
Z
917
12.45
12.45
fSn | (sn | )f ( )d =
1
1.216
1.216
18
1
( 11.97)2
d = 0.001427 = C 1 .
exp
2
1
2 1
Now the posterior distribution of can be determined
18
899
12.45
12.45
917
f|Sn ( |sn ) = C
1
18
1.216
1.216
2
1
( 11.97)
.
exp
21
2 1
44
x 10
C
D, before update
D, after update
6
0.5
1.5
2.5
9
x 10
Figure 6.1: Density functions for C and D before and after update.
Since the posterior distribution is known both the mean and variance can be
determined numerically
E(|S917 = 18) = 9.963
V (|S917 = 18) = E(2 |S917 = 18) (E(|S917 = 18))2 = 99.267 9.96262
= 0.0132.
Let
be the estimation of D . The prior estimate
= E() = 11.97 and
the posterior estimate
= E(|S917 = 18) = 9.96 with variance 0.0132.
This means that the parameter D in the load-strength model has decreased
from 11.97 to 9.963 due to the failure rate. It is of interest to compare the
distribution before and after the update. In Figure 6.1 the density functions
are plotted and it can be observed that the distribution of the duty has moved
to the left which means that the duty values in general are much lower after
the update. The model after the update gives the new failure probability
9.963 12.45
Pf =
= (2.0455) = 0.0204.
1.216
which means that the failure probability has decreased from 35% to 2%.
The reason that the new distribution fits well to the observation of the actual
failure probability is a combination of the fact that the number of machines
that is observed (n = 917) is large which gives a high accuracy in the proportion 2% and that the variance where set to 1. This means that the loadstrength model adapts almost completely to the failure rate. Therefore it
45
would be interesting to see how much the result would differ if it instead was
one out of 50 machines that were broken. This is still 2% but the accuracy
is much lower.
Assume that the prior distribution of is the same as before, i.e.
N (11.97, 1) and that the observation s50 = 1 is given. Then the normalization
integral is
Z
1
49
Z
50
12.45
12.45
fSn | (sn | )f ( )d =
1
1.216
1.216
1
1
( 11.97)2
exp
= 0.02690.
21
2 1
In this case the posterior mean and variance of will be
E(|S50 = 1) = 10.228
V (|S50 = 1) = E(2 |S50 = 1) (E(|S50 = 1))2 = 104.76 10.2282
= 0.156.
This means that the estimate
has decreased from 11.97 to 10.23. This
updated parameter value corresponds to the failure probability
10.23 12.45
Pf =
= (1.826) = 0.0339.
1.216
This value is somewhat larger than 2% which was the result in the other
case. Therefore the observation s50 = 1 had less influence on the model than
the observation s917 = 18, just as predicted.
6.3
Conclusions
46
Chapter 7
Conclusions and discussion
The main focus of this report is examining the properties of the load-strength
model with respect to the choice of distributions for capacity and duty. Two
distributions for duty and capacity have been examined, the Weibull and
the lognormal distribution. The estimation methods were evaluated based
on the data measurements from Volvo Trucks. Since it was not obvious
how to estimate the parameters if an entity is modelled as three parameter
Weibull distributed, different methods were considered and the effectiveness
was determined by extensive simulations. A very simple direct method was
one of the two most effective ones and therefore it is recommendable to use
that one. Since the estimation method is general this method could also be
useful in other applications where the three parameter Weibull distribution
is used.
It is also important to study the properties of the distributions in the
context of the load-strength model. The assumption that one over the duty
is three parameter Weibull distributed leads to some strange
intensity 1/D
properties for the distribution of the duty, D, if the shape parameter < 1.
By plotting the density function when < 1 one can see that a rather
high proportion of the probability mass is close to the upper limit which is
obviously unreasonable. Results suggest that a reasonable condition when
using the three parameter Weibull distribution, for modelling D, is that the
shape parameter > 1. Furthermore, the Weibull assumption implies an
upper limit for the duty and a lower limit for the capacity which means that
a safe distance is established. If one were to drive less than this distance
the failure probability is zero. Then, the failure probability increases rather
rapidly as you drive on past that safe distance. In contrast if both capacity
and duty are assumed to be lognormal there will always be an overlap of
the distributions, also for very short distances, which means that the failure
probability will increase more smoothly as the distance increases.
47
The failure probability depends on the upper tail of the duty distribution
and the lower tail of the capacity distribution. Many observations are needed
in order to determine the tail of a distribution with high accuracy. Therefore,
the number of observations must be increased considerably in order to obtain
reasonable accuracy in the calculation of the failure probability. The fact that
the failure probability depends mostly on the tails means that the choice of
distribution has a great impact on the final result. This is true even if there is
no threshold involved. For example, the upper tail of the duty distribution is
in general significantly thinner if the lognormal distribution is used compared
to the tail if it is based on a two parameter Weibull distribution.
In the Volvo Articulated Haulers report a model for determining the distribution of the capacity is used in which no rig test is needed. It is our
opinion that it would be very interesting to examine this method more rigorously. From this model one can easily obtain a large number of capacity
values and then a distribution fit can be carried out. The lognormal distribution seems to describe the capacity distribution better than the three
parameter Weibull distribution in this case. We think that the distributions
of the initial crack length a0 and the parameter B could be determined more
precisely which in turn would improve the model. Further on we think the
model must be compared to rig tests on the same type of components in order
to find out if it works properly. If the model were to give a good description
of the capacity it would be very useful since it is cheaper than a rig test and
it is easier to apply.
Feedback has been carried out in order to use the failure rate measured
in the field. A natural approach is the Bayesian method. Even though only
a simple example has been examined in this report the result of this example shows how powerful this method is. In this example the model adapts
very closely to the failure rate. This is good since this is a direct observation of what we want to predict. If capacity and duty values are observed
they are only indirect observations of the failure probability. Observations
of the failure rate could be useful in many ways. First, the model could
be updated with the Bayesian model. Secondly, this information could be
used for the improvement of the original load-strength model. Then it can,
e.g., be examined if the load-strength model is unbiased with respect to the
failure probability. Examinations of these kind of properties can be the basis
for how to weigh the capacity and duty observations in comparison to the
failure rates. In addition, if the times or distances when components fail
were observed it would be very useful in determining the distributions for
capacity and duty. Then time or distance properties of the model could also
be further examined and the ability to predict the future failure rate could
be checked.
48
7.1
Future research
When using the model it is our opinion that the accuracy in the estimation of
the distribution parameters should be included which would give a confidence
interval for the calculated failure probability. Doing this would gain insight
into what extent you can trust the result.
In general the model can be compared to the use of safety factors. Safety
factors are tools for handling component design. They are not very accurate, but they will continue to be used as long as there is no other method
that works better. Hopefully it will turn out that the load-strength model,
if used properly, gives more precise and accurate results. Since statistics are
involved in the load-strength model it is very important that accuracy in the
predictions are rigorously examined. We think that it is a great challenge
to further develop this model. The fact that recent progress in computer
technology makes it possible to collect huge amounts of data from field measurements creates a situation where statistical methods can be increasingly
useful.
49
Appendix A
Estimated parameters with
different methods
The Weibull distributions fits are carried out on the duty observations from
City and Highway driving and the resulting parameter values are tabulated
for the different estimation methods.
0.634
0.669
8.81 106
2.47 105
6.57 106
1.40 105
0.51
0.670
2.00 105
2.68 105
6.57 106
1.40 105
1.393
1.160
1.89 105
3.77 105
1.25 106
8.62 106
50
0.53
0.85
1.06 105
3.12 106
6.41 106
1.28 105
1.514
1.469
2.05 105
4.94 105
0.508
0.793
4.50 106
2.46 105
6.30 106
1.23 105
51
Appendix B
Results from Weibull inference
The different estimation methods are compared with respect to the mean
squared error (m.s.e.). Two different parameter sets are used
1. = 0.65, = 105 , = 6 106
2. = 1.30, = 105 , = 6 106
and then the m.s.e. is determined for the estimation of , and , respectively. For each combination 10 000 samples have been used in order to give
a high accuracy in the calculation of the m.s.e.
Zan
Mix
Iter
ML2
Elim
m.s.e.( )
0.1315
0.1773
0.3742
1.343
0.1852
V ( ) (%)
95.9
97.5
64.6
22.1
99.9
b()2 (%)
4.1
2.5
35.4
77.9
0.1
Rank
1
2
4
5
3
52
Zan
Mix
Iter
ML2
Elim
m.s.e.( )
2.092 1010
5.391 1011
2.046 1010
2.431 1010
5.317 1011
V ( ) (%)
86.5
96.9
77.4
35.9
62.3
b()2 (%)
13.5
3.1
22.6
64.1
37.7
Rank
4
2
3
5
1
Zan
Mix
Iter
Elim
m.s.e.( )
4.453 1012
4.412 1012
6.473 1012
4.721 1012
V ( ) (%)
87.6
87.1
85.5
98.4
b()2 (%)
12.4
12.9
14.5
1.6
Rank
2
1
4
3
Zan
Mix
Iter
ML2
Elim
m.s.e.( )
0.0571
0.04224
0.2085
0.9158
0.08776
V ( ) (%)
99.5
95.7
59.1
21.6
99.0
b()2 (%)
0.5
4.3
40.9
78.4
1.0
Rank
2
1
4
5
3
Zan
Mix
Iter
ML2
Elim
m.s.e.( )
6.219 1011
2.551 1011
5.601 1011
1.679 1010
3.422 1011
V ( ) (%)
94.4
95.1
85.3
24.7
74.1
b()2 (%)
5.6
4.9
14.7
75.3
25.9
Rank
4
1
3
5
2
53
Zan
Mix
Iter
Elim
m.s.e.( )
5.461 1013
5.488 1013
2.229 1012
7.973 1013
V ( ) (%)
76.2
76.4
70.2
99.9
b()2 (%)
23.8
23.6
29.8
0.1
Rank
1
2
4
3
Zan
Mix
Iter
ML2
Elim
m.s.e.( )
0.448
0.3782
0.391
1.234
0.4347
V ( ) (%)
59.6
84.7
97.2
13.1
80.1
b()2 (%)
40.4
15.3
2.8
86.9
19.9
Rank
4
1
2
5
3
Zan
Mix
Iter
ML2
Elim
m.s.e.( )
2.508 1011
2.574 1011
2.484 1011
7.611 1011
3.672 1011
V ( ) (%)
99.9
58.2
92.1
15.6
50.0
b()2 (%)
0.1
41.8
7.9
84.4
50.0
Rank
2
3
1
5
4
Zan
Mix
Iter
Elim
m.s.e.( )
9.955 1012
9.93 1012
7.911 1012
9.264 1012
V ( ) (%)
64.1
61.1
99.7
83.1
b()2 (%)
35.9
38.9
0.3
16.9
Rank
4
3
1
2
54
Zan
Mix
Iter
ML2
Elim
m.s.e.( )
0.2424
0.2298
0.3711
1.278
0.2774
V ( ) (%)
83.8
66.2
71.7
10.3
99.0
b()2 (%)
16.2
33.8
28.3
89.7
1.0
Rank
2
1
4
5
3
Zan
Mix
Iter
ML2
Elim
m.s.e.( )
1.133 1011
1.278 1011
1.399 1011
6.335 1011
1.558 1011
V ( ) (%)
94.0
53.0
91.7
9.9
84.5
b()2 (%)
6.0
47.0
8.3
90.1
15.5
Rank
1
2
3
5
4
Zan
Mix
Iter
Elim
m.s.e.( )
3.624 1012
3.631 1012
4.502 1012
3.793 1012
V ( ) (%)
46.3
47.6
91.7
99.4
b()2 (%)
53.7
52.4
8.3
0.6
Rank
1
2
4
3
55
Appendix C
Proportion p0 for different n
The entity p0 is the proportion of cases when the numerical calculation of
the failure probability, Pf , gives the result zero even though the actual value
is not zero. This reflects how sensitive the calculation of Pf is when the distribution of the duty has an upper limit and the distribution of the capacity
has a lower limit (threshold).
56
0.5
0.4
0.3
p0
0.2
0.1
10
20
30
40
50
n
60
70
80
90
100
0.9
0.8
0.7
0.6
p
0.5
0.4
0.3
0.2
10
20
30
40
50
n
60
70
80
90
100
57
Appendix D
Sensitiveness in failure
probability
The failure probability is determined by varying one of the parameters at
the time and keeping the other two fixed. Since the parameter estimates are
not independent this does not give the true variation but nevertheless some
qualitative information. The parameters that are varied corresponds to the
distribution of the duty.
58
10
Pf
10
10
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.1
1.2
10
Pf
10
10
0.9
1.1
1.2
1.3
1.4
1.5
1.6
1.7
59
10
10
Pf
10
10
0.5
1.5
2
5
x 10
10
10
10
10
10
0.5
1.5
2
5
x 10
60
10
10
10
10
10
Pf
10
10
10
10
10
2.5
3.5
4.5
5.5
6.5
6
x 10
10
10
10
10
Pf
10
10
10
10
10
10
10
11
10
2.5
3.5
4.5
5.5
6.5
6
x 10
61
Bibliography
[1] Dubey, S.D. (1967). Some percentile estimators for Weibull parameters,
Technometrics 9.
[2] Johannesson, P. and de Mare, J. (2002), (2003-12-19). Problemdriven statistikkurs, Utmattning, belastning och tillf
orlitlighet,
<http://www.fcc.chalmers.se/pj/UTMIS/ProblemdrivenStatistikkurs/>
[3] Johannesson, P. (1999). Rainflow analysis of switching markov loads,
Doctoral Thesis in Mathematical Sciences, Lund Institute of Technology,
ISBN: 91-628-3784-2.
[4] Lindgren, B.W. (1998). Statistical theory, Fourth edition, Florida, Chapman & Hall, ISBN: 0-412-04181-2.
[5] Olsson, K.E. (1989). Fatigue reliability prediction, Scandinavian Journal
of Metallurgy 18.
[6] Samuelsson, J. (1997). Fatigue design of construction equipment, Volvo
Technology Report.
[7] Thomas, J.J., Perroud, G., Bignonnet, A. and Monnet, D. (1999). Fatigue design and reliability in the automotive industry, In Fatigue Design
and Reliability, ESIS publication 23, edited by Marquis, G. and Solin,
J.
[8] Zanakis, S.H. (1979). A Simulation Study of Some Simple Estimators for
the Three-Parameter Weibull Distribution, J. Statist. Comput. Simul.,
Vol. 9.
62