Gompertz Models
This chapter discusses the two Gompertz models that are used in Weibull++: the standard Gompertz and the modified Gompertz.
The Standard Gompertz Model
The Gompertz reliability growth model is often used when analyzing reliability data. It is most applicable when the data set follows a smooth curve, as shown in the plot below.
The Gompertz model is mathematically given by Virene [1]:
$$R = a b^{c^{T}}$$
where:
- $R$ is the system's reliability at development time, launch number or stage number $T$
- $a$ is the upper limit that the reliability approaches asymptotically as $T \to \infty$, or the maximum reliability that can be attained
- $ab$ is the initial reliability at $T = 0$
- $c$ is the growth pattern indicator (small values of $c$ indicate rapid early reliability growth and large values of $c$ indicate slow reliability growth)
As can be seen from the mathematical definition, the Gompertz model is a 3-parameter model with the parameters $a$, $b$ and $c$. The solution for the parameters, given $T_i$ and $R_i$, is accomplished by fitting the best possible curve through the data points. Many methods are available, all of which tend to be numerically intensive. When analyzing reliability data in the Weibull++ software, you have the option to enter the reliability values in percent or in decimal format. However, $a$ will always be returned in decimal format and not in percent. The estimated parameters in the Weibull++ software are unitless. The next section presents an overview and background on some of the most commonly used algorithms/methods for obtaining these parameters.
Parameter Estimation
Linear Regression
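The method of least squares requires that a straight line be fitted to a set of data points. If the regression is on $Y$, the line is fitted so that the sum of the squares of the vertical deviations from the points to the line is minimized. If the regression is on $X$, the line is fitted so that the sum of the squares of the horizontal deviations from the points to the line is minimized. To illustrate the method, this section presents a regression on $Y$. Consider the linear model given by Seber and Wild [2]:
$$Y_i = \beta_0 + \beta_1 X_{i1} + \beta_2 X_{i2} + \cdots + \beta_p X_{ip}$$
or in matrix form, where bold letters indicate matrices:
$$\mathbf{Y} = \mathbf{X}\boldsymbol{\beta}$$
where:
$$\mathbf{Y} = \begin{bmatrix} Y_1 \\ Y_2 \\ \vdots \\ Y_N \end{bmatrix}, \qquad \mathbf{X} = \begin{bmatrix} 1 & X_{11} & \cdots & X_{1p} \\ 1 & X_{21} & \cdots & X_{2p} \\ \vdots & \vdots & & \vdots \\ 1 & X_{N1} & \cdots & X_{Np} \end{bmatrix}$$
and:
$$\boldsymbol{\beta} = \begin{bmatrix} \beta_0 \\ \beta_1 \\ \vdots \\ \beta_p \end{bmatrix}$$
The vector $\boldsymbol{\beta}$ holds the values of the parameters. Now let $\hat{\boldsymbol{\beta}}$ be the estimates of these parameters, or the regression coefficients. The vector of estimated regression coefficients is denoted by:
$$\hat{\boldsymbol{\beta}} = \begin{bmatrix} \hat{\beta}_0 & \hat{\beta}_1 & \cdots & \hat{\beta}_p \end{bmatrix}^{T}$$
Solving for $\boldsymbol{\beta}$ in the matrix form of the equation requires the analyst to left multiply both sides by the transpose of $\mathbf{X}$, or $\mathbf{X}^{T}$:
$$\mathbf{X}^{T}\mathbf{Y} = (\mathbf{X}^{T}\mathbf{X})\hat{\boldsymbol{\beta}}$$
Now the term $\mathbf{X}^{T}\mathbf{X}$ is a square matrix, and it is invertible provided the columns of $\mathbf{X}$ are linearly independent. Taking it to the other side of the equation gives:
$$\hat{\boldsymbol{\beta}} = (\mathbf{X}^{T}\mathbf{X})^{-1}\mathbf{X}^{T}\mathbf{Y}$$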
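As a minimal sketch of the normal-equations solution described above, the following Python snippet fits a straight line by least squares. It assumes NumPy is available and borrows the growth-time and reliability values from the device example later in this chapter purely as illustrative inputs; the variable names are hypothetical.

```python
import numpy as np

# Growth time (months) and reliability (%) taken from the device example
# in this chapter; used here only to illustrate a straight-line fit.
T = np.array([0, 1, 2, 3, 4, 5], dtype=float)
Y = np.array([58.0, 66.0, 72.5, 78.0, 82.0, 85.0])

# Design matrix X: a column of ones (intercept) and the regressor T.
X = np.column_stack([np.ones_like(T), T])

# Normal equations: (X^T X) beta_hat = X^T Y.
# Solving the linear system avoids forming the explicit inverse.
beta_hat = np.linalg.solve(X.T @ X, X.T @ Y)

print("intercept, slope:", beta_hat)
```

Solving the linear system rather than inverting the matrix explicitly is a common numerical choice; both give the same estimate when the matrix is well conditioned.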
Non-linear Regression
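Non-linear regression is similar to linear regression, except that a curve is fitted to the data set instead of a straight line. Just as in the linear scenario, the sum of the squares of the horizontal or vertical distances between the curve and the points is to be minimized. In the case of the non-linear Gompertz model $R = ab^{c^{T}}$, let:
$$Y_i = f(T_i, \boldsymbol{\delta}) = a b^{c^{T_i}}$$
where:
$$T_i = T_1, T_2, \ldots, T_N, \qquad i = 1, 2, \ldots, N$$
and:
$$\boldsymbol{\delta} = \begin{bmatrix} a \\ b \\ c \end{bmatrix}$$
The Gauss-Newton method can be used to solve for the parameters $a$, $b$ and $c$ by performing a Taylor series expansion on $f(T_i, \boldsymbol{\delta})$. The non-linear model is then approximated with linear terms, and ordinary least squares is employed to estimate the parameters. This procedure is performed iteratively and generally leads to a solution of the non-linear problem.
This procedure starts by using initial estimates of the parameters $a$, $b$ and $c$, denoted as $g_1^{(0)}$, $g_2^{(0)}$ and $g_3^{(0)}$, where the superscript $(0)$ is the iteration number. The Taylor series expansion approximates the mean response, $f(T_i, \boldsymbol{\delta})$, around the starting values $g_1^{(0)}$, $g_2^{(0)}$ and $g_3^{(0)}$. For the $i^{th}$ observation:
$$f(T_i, \boldsymbol{\delta}) \simeq f(T_i, \mathbf{g}^{(0)}) + \sum_{k=1}^{3} \left[\frac{\partial f(T_i, \boldsymbol{\delta})}{\partial \delta_k}\right]_{\boldsymbol{\delta} = \mathbf{g}^{(0)}} \left(\delta_k - g_k^{(0)}\right)$$
where:
$$\mathbf{g}^{(0)} = \begin{bmatrix} g_1^{(0)} & g_2^{(0)} & g_3^{(0)} \end{bmatrix}^{T}$$
Let:
$$f_i^{(0)} = f(T_i, \mathbf{g}^{(0)}), \qquad \nu_k^{(0)} = \delta_k - g_k^{(0)}, \qquad D_{ik}^{(0)} = \left[\frac{\partial f(T_i, \boldsymbol{\delta})}{\partial \delta_k}\right]_{\boldsymbol{\delta} = \mathbf{g}^{(0)}}$$
For the Gompertz model the partial derivatives are:
$$\frac{\partial f}{\partial a} = b^{c^{T_i}}, \qquad \frac{\partial f}{\partial b} = a\,c^{T_i}\,b^{c^{T_i} - 1}, \qquad \frac{\partial f}{\partial c} = a\,b^{c^{T_i}}\ln(b)\,T_i\,c^{T_i - 1}$$
So the equation becomes:
$$Y_i \simeq f_i^{(0)} + \sum_{k=1}^{3} D_{ik}^{(0)} \nu_k^{(0)}$$
or by shifting $f_i^{(0)}$ to the left of the equation:
$$Y_i - f_i^{(0)} \simeq \sum_{k=1}^{3} D_{ik}^{(0)} \nu_k^{(0)}$$
In matrix form this is given by:
$$\mathbf{Y}^{(0)} \simeq \mathbf{D}^{(0)} \boldsymbol{\nu}^{(0)}$$
where:
$$\mathbf{Y}^{(0)} = \begin{bmatrix} Y_1 - f_1^{(0)} \\ \vdots \\ Y_N - f_N^{(0)} \end{bmatrix}, \qquad \mathbf{D}^{(0)} = \begin{bmatrix} D_{11}^{(0)} & D_{12}^{(0)} & D_{13}^{(0)} \\ \vdots & \vdots & \vdots \\ D_{N1}^{(0)} & D_{N2}^{(0)} & D_{N3}^{(0)} \end{bmatrix}$$
and:
$$\boldsymbol{\nu}^{(0)} = \begin{bmatrix} \nu_1^{(0)} \\ \nu_2^{(0)} \\ \nu_3^{(0)} \end{bmatrix}$$
Note that the equation is in the form of the general linear regression model given in the Linear Regression section. Therefore, the estimate of the parameters $\boldsymbol{\nu}^{(0)}$ is given by:
$$\hat{\boldsymbol{\nu}}^{(0)} = \left(\mathbf{D}^{(0)T}\mathbf{D}^{(0)}\right)^{-1}\mathbf{D}^{(0)T}\mathbf{Y}^{(0)}$$
The revised estimated regression coefficients in matrix form are:
$$\mathbf{g}^{(1)} = \mathbf{g}^{(0)} + \hat{\boldsymbol{\nu}}^{(0)}$$
The least squares criterion measure, $Q$, should be checked to examine whether the revised regression coefficients lead to a reasonable result. According to the Least Squares Principle, the solution for the parameters is the set of values that minimizes $Q$. With the starting coefficients, $\mathbf{g}^{(0)}$, $Q$ is:
$$Q^{(0)} = \sum_{i=1}^{N} \left[Y_i - f(T_i, \mathbf{g}^{(0)})\right]^2$$
And with the coefficients at the end of the first iteration, $\mathbf{g}^{(1)}$, $Q$ is:
$$Q^{(1)} = \sum_{i=1}^{N} \left[Y_i - f(T_i, \mathbf{g}^{(1)})\right]^2$$
For the Gauss-Newton method to work properly and to satisfy the Least Squares Principle, the relationship $Q^{(k+1)} < Q^{(k)}$ has to hold for all $k$, meaning that $\mathbf{g}^{(k+1)}$ gives a better estimate than $\mathbf{g}^{(k)}$. The problem is not yet completely solved. Now $\mathbf{g}^{(1)}$ are the starting values, producing a new set of values $\mathbf{g}^{(2)}$. The process is continued until the following relationship has been satisfied:
$$Q^{(s-1)} - Q^{(s)} \simeq 0$$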
When using the Gauss-Newton method or some other estimation procedure, it is advisable to try several sets of starting values to make sure that the solution gives relatively consistent results.
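The iteration described in this section can be sketched in Python as follows. This is a minimal illustration, not the Weibull++ implementation: it assumes the model form $R = ab^{c^{T}}$, uses NumPy, works with reliability in percent, and takes the six data points from the device example later in this chapter. The starting values are rounded initial estimates of the kind produced by the grouping procedure in the next section, and all function names are hypothetical.

```python
import numpy as np

def gompertz(T, a, b, c):
    """Gompertz reliability R = a * b**(c**T)."""
    return a * b ** (c ** T)

def jacobian(T, a, b, c):
    """Partial derivatives of the model with respect to a, b and c."""
    f = gompertz(T, a, b, c)
    cT = c ** T
    return np.column_stack([
        f / a,                      # df/da
        f * cT / b,                 # df/db
        f * np.log(b) * T * cT / c  # df/dc
    ])

def gauss_newton(T, Y, g0, tol=1e-10, max_iter=50):
    g = np.asarray(g0, dtype=float)
    Q = np.sum((Y - gompertz(T, *g)) ** 2)
    for _ in range(max_iter):
        resid = Y - gompertz(T, *g)
        D = jacobian(T, *g)
        # Ordinary least squares on the linearized model gives the correction.
        nu = np.linalg.lstsq(D, resid, rcond=None)[0]
        g_new = g + nu
        Q_new = np.sum((Y - gompertz(T, *g_new)) ** 2)
        if not np.isfinite(Q_new) or Q_new > Q:
            break  # the step did not reduce Q, so keep the previous estimate
        converged = (Q - Q_new) <= tol * Q
        g, Q = g_new, Q_new
        if converged:
            break
    return g, Q

T = np.arange(6, dtype=float)
R = np.array([58.0, 66.0, 72.5, 78.0, 82.0, 85.0])  # percent
g0 = (94.16, 0.6156, 0.7311)                         # assumed starting values

g_fit, Q_fit = gauss_newton(T, R, g0)
print("a, b, c:", g_fit, "Q:", Q_fit)
print("R(12):", gompertz(12.0, *g_fit))
```

The sketch stops as soon as a step fails to reduce $Q$; a production routine would more likely damp the step instead, which is the idea behind the Levenberg-Marquardt method.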
Choice of Initial Values
The choice of the starting values for the nonlinear regression is not an easy task. A poor choice may result in a lengthy computation with many iterations. It may also lead to divergence, or to convergence to a local minimum rather than the global one. Good initial values result in fast computations with few iterations and, if multiple minima exist, lead to a solution that is a minimum.
Various methods were developed for obtaining valid initial values for the regression parameters. The following procedure is described by Virene [1] in estimating the Gompertz parameters. This procedure is rather simple. It will be used to get the starting values for the Gauss-Newton method, or for any other method that requires initial values. Some analysts use this method to calculate the parameters when the data set is divisible into three groups of equal size. However, if the data set is not equally divisible, it can still provide good initial estimates.
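Consider the case where $m$ observations are available in the form shown next. Each reliability value, $R_i$, is measured at the specified times, $T_i$.
$$T_i:\; T_0, T_1, \ldots, T_{m-1} \qquad R_i:\; R_0, R_1, \ldots, R_{m-1}$$
where:
- $m = 3n$
- $n$ is equal to the number of items in each equally sized group
- $T_i - T_{i-1}$ is constant
The Gompertz reliability equation is given by:
$$R = a b^{c^{T}}$$
and:
$$\ln(R) = \ln(a) + c^{T}\ln(b)$$
Define:
$$S_1 = \sum_{i=0}^{n-1} \ln(R_i), \qquad S_2 = \sum_{i=n}^{2n-1} \ln(R_i), \qquad S_3 = \sum_{i=2n}^{m-1} \ln(R_i)$$
Then:
$$\frac{S_3 - S_2}{S_2 - S_1} = \frac{\sum_{i=2n}^{m-1} c^{T_i} - \sum_{i=n}^{2n-1} c^{T_i}}{\sum_{i=n}^{2n-1} c^{T_i} - \sum_{i=0}^{n-1} c^{T_i}}$$
Without loss of generality, take $T_i = i$; then:
$$\frac{S_3 - S_2}{S_2 - S_1} = \frac{c^{2n} - c^{n}}{c^{n} - 1} = c^{n}$$
Solving for $c$ yields:
$$c = \left(\frac{S_3 - S_2}{S_2 - S_1}\right)^{1/n}$$
Considering the definitions for $S_1$ and $S_2$, given above, then:
$$S_1 - n\ln(a) = \ln(b)\sum_{i=0}^{n-1} c^{i}, \qquad S_2 - n\ln(a) = \ln(b)\sum_{i=n}^{2n-1} c^{i}$$
or:
$$\frac{S_1 - n\ln(a)}{S_2 - n\ln(a)} = \frac{1}{c^{n}}$$
Reordering the equation yields:
$$a = \exp\left[\frac{1}{n}\left(S_1 + \frac{S_2 - S_1}{1 - c^{n}}\right)\right]$$
If the reliability values are in percent, then $a$ needs to be divided by 100 to return the estimate in decimal format. Consider the definitions for $S_1$ and $S_2$ again, where:
$$S_1 - S_2 = \ln(b)\left(\sum_{i=0}^{n-1} c^{i} - \sum_{i=n}^{2n-1} c^{i}\right) = \ln(b)\,\frac{(1 - c^{n})^{2}}{1 - c}$$
Reordering the equation above yields:
$$b = \exp\left[\frac{(S_2 - S_1)(c - 1)}{(1 - c^{n})^{2}}\right]$$
Therefore, for the special case where the time interval increment is 1, the parameters are:
$$c = \left(\frac{S_3 - S_2}{S_2 - S_1}\right)^{1/n}, \qquad a = \exp\left[\frac{1}{n}\left(S_1 + \frac{S_2 - S_1}{1 - c^{n}}\right)\right], \qquad b = \exp\left[\frac{(S_2 - S_1)(c - 1)}{(1 - c^{n})^{2}}\right]$$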
To estimate the values of the parameters $a$, $b$ and $c$, do the following:
- Arrange the currently available data in terms of $T$ and $R$, as in the table below. The $T$ values should be chosen at equal intervals and increasing in value by 1, such as one month, one hour, etc.

Design and Development Time vs. Demonstrated Reliability Data for a Device

Group Number | Growth Time $T$ (months) | Reliability $R$ (%) | $\ln R$ |
---|---|---|---|
1 | 0 | 58 | 4.060 |
1 | 1 | 66 | 4.190 |
1 | | | $S_1$ = 8.250 |
2 | 2 | 72.5 | 4.284 |
2 | 3 | 78 | 4.357 |
2 | | | $S_2$ = 8.641 |
3 | 4 | 82 | 4.407 |
3 | 5 | 85 | 4.443 |
3 | | | $S_3$ = 8.850 |

- Calculate the natural log $\ln R$.
- Divide the column of values for $\ln R$ into three groups of equal size, each containing $n$ items. There should always be three groups. Each group should always have the same number, $n$, of items, measurements or values.
- Add the values of the natural log $\ln R$ in each group, obtaining the sums identified as $S_1$, $S_2$ and $S_3$, starting with the lowest values of the natural log $\ln R$.
- Calculate $c$ using the following equation:
$$c = \left(\frac{S_3 - S_2}{S_2 - S_1}\right)^{1/n}$$
- Calculate $a$ using the following equation:
$$a = \exp\left[\frac{1}{n}\left(S_1 + \frac{S_2 - S_1}{1 - c^{n}}\right)\right]$$
- Calculate $b$ using the following equation:
$$b = \exp\left[\frac{(S_2 - S_1)(c - 1)}{(1 - c^{n})^{2}}\right]$$
- Write the Gompertz reliability growth equation.
- Substitute the value of $T$, the time at which the reliability goal is to be achieved, to see if the reliability goal will indeed be attained or exceeded by $T$.
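The steps above can be sketched in a few lines of Python. This is an illustrative helper under stated assumptions: the function name is hypothetical, the observations are equally spaced with an increment of 1, the count is divisible by three, and the input is the six reliability values (in percent) from the table above. Because it uses unrounded logarithms, its results can differ slightly in the last digits from hand calculations based on the rounded table entries.

```python
import math

def gompertz_initial_values(R):
    """Three-group initial estimates (a, b, c) for R = a * b**(c**T)."""
    m = len(R)
    if m % 3 != 0:
        raise ValueError("the number of observations must be divisible by 3")
    n = m // 3
    logs = [math.log(r) for r in R]

    # Sums of ln(R) over the three equally sized groups.
    S1 = sum(logs[:n])
    S2 = sum(logs[n:2 * n])
    S3 = sum(logs[2 * n:])

    c = ((S3 - S2) / (S2 - S1)) ** (1.0 / n)
    a = math.exp((S1 + (S2 - S1) / (1.0 - c ** n)) / n)
    b = math.exp((S2 - S1) * (c - 1.0) / (1.0 - c ** n) ** 2)
    return a, b, c

R = [58.0, 66.0, 72.5, 78.0, 82.0, 85.0]  # percent, at T = 0..5 months
a, b, c = gompertz_initial_values(R)
print(a, b, c)
```

Since the input is in percent, the returned $a$ is also in percent and would be divided by 100 to express it in decimal format.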
Confidence Bounds
The approximate reliability confidence bounds under the Gompertz model can be obtained with non-linear regression. Because reliability is always between 0 and 1, the logit transformation is used to obtain the confidence bounds on reliability so that the endpoints of the confidence interval stay within this range.
where is the total number of groups (in this case 3) and is the total number of items in each group.
Example - Standard Gompertz for Reliability Data
A device is required to have a reliability of 92% at the end of a 12-month design and development period. The following table gives the data obtained for the first five months.
- What will the reliability be at the end of this 12-month period?
- What will the maximum achievable reliability be if the reliability program plan pursued during the first 5 months is continued?
- How do the predicted reliability values compare with the actual values?
Group Number | Growth Time $T$ (months) | Reliability $R$ (%) | $\ln R$ |
---|---|---|---|
1 | 0 | 58 | 4.060 |
1 | 1 | 66 | 4.190 |
1 | | | $S_1$ = 8.250 |
2 | 2 | 72.5 | 4.284 |
2 | 3 | 78 | 4.357 |
2 | | | $S_2$ = 8.641 |
3 | 4 | 82 | 4.407 |
3 | 5 | 85 | 4.443 |
3 | | | $S_3$ = 8.850 |
Solution
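After generating the table above and calculating the last column to find $S_1$, $S_2$ and $S_3$, proceed as follows (here $n = 2$):
- Solve for the value of $c$:
$$c = \left(\frac{8.850 - 8.641}{8.641 - 8.250}\right)^{1/2} = 0.7311$$
- Solve for the value of $a$:
$$a = \exp\left[\frac{1}{2}\left(8.250 + \frac{8.641 - 8.250}{1 - 0.7311^{2}}\right)\right] = 94.16\%$$
- This is the upper limit for the reliability as $T \to \infty$.
- Solve for the value of $b$:
$$b = \exp\left[\frac{(8.641 - 8.250)(0.7311 - 1)}{(1 - 0.7311^{2})^{2}}\right] = 0.6156$$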
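Now that the initial values have been determined, the Gauss-Newton method can be used. Substituting the observed reliabilities and the initial estimates, the residual vector, the matrix of partial derivatives and the parameter correction vector become: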
The estimate of the parameters is given by:
The revised estimated regression coefficients in matrix form are:
If the Gauss-Newton method works effectively, then the relationship has to hold, meaning that gives better estimates than , after . With the starting coefficients, , is:
And with the coefficients at the end of the first iteration, , is:
Therefore, the Gauss-Newton iteration is moving in the right direction. The iterations are continued until the convergence relationship is satisfied. Note that the Weibull++ software uses a different analysis method, the Levenberg-Marquardt method. This method combines the best features of the Gauss-Newton method and the method of steepest descent, and occupies a middle ground between the two. The estimated parameters using Weibull++ are shown in the figure below.
They are:
The Gompertz reliability growth curve is:
- The achievable reliability at the end of the 12-month period of design and development is:
- The maximum achievable reliability from Step 2, or from the value of $a$, is 0.9422.
- The predicted reliability values, as calculated from the standard Gompertz model, are compared with the actual data in the table below. It may be seen in the table that the Gompertz curve appears to provide a very good fit for the data used because the equation reproduces the available data with less than 1% error. The standard Gompertz model is plotted in the figure below the table. The plot identifies the type of reliability growth curve that the equation represents.
Comparison of the Predicted Reliabilities with the Actual Data

Growth Time (months) | Gompertz Reliability (%) | Raw Data Reliability (%) |
---|---|---|
0 | 57.97 | 58.00 |
1 | 66.02 | 66.00 |
2 | 72.62 | 72.50 |
3 | 77.87 | 78.00 |
4 | 81.95 | 82.00 |
5 | 85.07 | 85.00 |
6 | 87.43 | |
7 | 89.20 | |
8 | 90.52 | |
9 | 91.50 | |
10 | 92.22 | |
11 | 92.75 | |
12 | 93.14 | |
Example - Standard Gompertz for Sequential Data
Calculate the parameters of the Gompertz model using the sequential data in the following table.
Run Number | Result | Successes | Observed Reliability (%) |
---|---|---|---|
1 | F | 0 | |
2 | F | 0 | |
3 | F | 0 | |
4 | S | 1 | 25.00 |
5 | F | 1 | 20.00 |
6 | F | 1 | 16.67 |
7 | S | 2 | 28.57 |
8 | S | 3 | 37.50 |
9 | S | 4 | 44.44 |
10 | S | 5 | 50.00 |
11 | S | 6 | 54.55 |
12 | S | 7 | 58.33 |
13 | S | 8 | 61.54 |
14 | S | 9 | 64.29 |
15 | S | 10 | 66.67 |
16 | S | 11 | 68.75 |
17 | F | 11 | 64.71 |
18 | S | 12 | 66.67 |
19 | F | 12 | 63.16 |
20 | S | 13 | 65.00 |
21 | S | 14 | 66.67 |
22 | S | 15 | 68.18 |
Solution
Using Weibull++, the parameter estimates are shown in the following figure.
Cumulative Reliability
For many kinds of equipment, especially missiles and space systems, only success/failure data (also called discrete or attribute data) is obtained. Conservatively, the cumulative reliability can be used to estimate the trend of reliability growth. The cumulative reliability is given by Kececioglu [3]:
$$\bar{R}(N) = \frac{N - r}{N}$$
where:
- $N$ is the current number of trials
- $r$ is the number of failures
It must be emphasized that the instantaneous reliability of the developed equipment is increasing as the test-analyze-fix-and-test process continues. In addition, the instantaneous reliability is higher than the cumulative reliability. Therefore, the reliability growth curve based on the cumulative reliability can be thought of as the lower bound of the true reliability growth curve.
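As a small illustration of the cumulative reliability calculation, the sketch below recomputes the Observed Reliability column of the sequential data table from the run-by-run results. The function name and the string encoding of the results ("S" for success, "F" for failure) are assumptions made for this example.

```python
def cumulative_reliability(results):
    """Cumulative reliability (%) after each trial: 100 * (N - r) / N."""
    reliabilities = []
    failures = 0
    for N, outcome in enumerate(results, start=1):
        if outcome == "F":
            failures += 1
        reliabilities.append(100.0 * (N - failures) / N)
    return reliabilities

# Results of runs 1 through 22 from the sequential data table.
runs = "FFF" + "S" + "FF" + "S" * 10 + "FSF" + "SSS"

rel = cumulative_reliability(runs)
for run_number, value in enumerate(rel, start=1):
    print(run_number, round(value, 2))
```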
The Modified Gompertz Model
Sometimes, reliability growth data with an S-shaped trend cannot be described accurately by the Standard Gompertz or Logistic curves. Because these two models have fixed values of reliability at the inflection points, only a few reliability growth data sets following an S-shaped reliability growth curve can be fitted to them. A modification of the Gompertz curve, which overcomes this shortcoming, is given next [5].
If we apply a shift in the vertical coordinate, then the Gompertz model is defined by:
$$R = d + a b^{c^{T}}$$
where:
- $R$ is the system's reliability at development time $T$, or at launch number $T$, or stage number $T$
- $d$ is the shift parameter
- $d + a$ is the upper limit that the reliability approaches asymptotically as $T \to \infty$
- $d + ab$ is the initial reliability at $T = 0$
- $c$ is the growth pattern indicator (small values of $c$ indicate rapid early reliability growth and large values of $c$ indicate slow reliability growth)
The modified Gompertz model is more flexible than the original, especially when fitting growth data with S-shaped trends.
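The fixed inflection-point reliability mentioned above can be checked with a short derivation. It assumes the standard form $R = ab^{c^{T}}$ and the shifted form $R = d + ab^{c^{T}}$, with $0 < b < 1$ and $0 < c < 1$; the auxiliary variable $u$ is introduced here only for convenience.

$$
\begin{aligned}
u(T) &= c^{T}\ln b, \qquad R = a e^{u}, \qquad \frac{du}{dT} = u \ln c \\
\frac{dR}{dT} &= a\,e^{u}\,u \ln c \\
\frac{d^{2}R}{dT^{2}} &= a (\ln c)^{2}\,e^{u}\,u\,(u + 1) = 0 \;\Rightarrow\; u = -1 \\
R_{\text{inflection}} &= \frac{a}{e} \approx 0.368\,a \;\;\text{(standard)}, \qquad R_{\text{inflection}} = d + \frac{a}{e} \;\;\text{(modified)}
\end{aligned}
$$

In the standard model the curve always inflects at about 36.8% of its upper limit (and only for $T \ge 0$ when $b \le 1/e$), whereas the shift parameter $d$ lets the inflection reliability move relative to the upper limit $d + a$.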
Parameter Estimation
To implement the modified Gompertz growth model, initial values of the parameters $a$, $b$, $c$ and $d$ must be determined. When analyzing reliability data in Weibull++, you have the option to enter the reliability values in percent or in decimal format. However, $a$ and $d$ will always be returned in decimal format and not in percent. The estimated parameters in Weibull++ are unitless.
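Given that $R = d + ab^{c^{T}}$ and $\ln(R - d) = \ln(a) + c^{T}\ln(b)$, it follows that $S_1$, $S_2$ and $S_3$, as defined in the derivation of the Standard Gompertz model, can be expressed as functions of $d$:
$$S_1(d) = \sum_{i=0}^{n-1} \ln(R_i - d), \qquad S_2(d) = \sum_{i=n}^{2n-1} \ln(R_i - d), \qquad S_3(d) = \sum_{i=2n}^{3n-1} \ln(R_i - d)$$
Modifying the equations for estimating parameters $c$, $a$ and $b$, as functions of $d$, yields:
$$c(d) = \left[\frac{S_3(d) - S_2(d)}{S_2(d) - S_1(d)}\right]^{\frac{1}{n I}}$$
$$a(d) = \exp\left[\frac{1}{n}\left(S_1(d) + \frac{S_2(d) - S_1(d)}{1 - [c(d)]^{n I}}\right)\right]$$
$$b(d) = \exp\left[\frac{\left(S_2(d) - S_1(d)\right)\left([c(d)]^{I} - 1\right)}{\left(1 - [c(d)]^{n I}\right)^{2}}\right]$$
where $I$ is the time interval increment. At this point, you can use the initial constraint of:
$$d + ab = R_0, \quad \text{the original level of reliability at } T = 0$$
Now there are four unknowns, $a$, $b$, $c$ and $d$, and four corresponding equations. The simultaneous solution of these equations yields the four initial values for the parameters of the modified Gompertz model. With these initial values, the Gauss-Newton procedure is similar to the one discussed before. It starts by using initial estimates of the parameters $a$, $b$, $c$ and $d$, denoted as $g_1^{(0)}$, $g_2^{(0)}$, $g_3^{(0)}$ and $g_4^{(0)}$, where the superscript $(0)$ is the iteration number.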
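The Taylor series expansion approximates the mean response, $f(T_i, \boldsymbol{\delta}) = d + ab^{c^{T_i}}$ with $\boldsymbol{\delta} = [a, b, c, d]^{T}$, around the starting values $\mathbf{g}^{(0)} = [g_1^{(0)}, g_2^{(0)}, g_3^{(0)}, g_4^{(0)}]^{T}$. For the $i^{th}$ observation:
$$f(T_i, \boldsymbol{\delta}) \simeq f(T_i, \mathbf{g}^{(0)}) + \sum_{k=1}^{4} \left[\frac{\partial f(T_i, \boldsymbol{\delta})}{\partial \delta_k}\right]_{\boldsymbol{\delta} = \mathbf{g}^{(0)}} \left(\delta_k - g_k^{(0)}\right)$$
where:
$$\frac{\partial f}{\partial a} = b^{c^{T_i}}, \qquad \frac{\partial f}{\partial b} = a\,c^{T_i}\,b^{c^{T_i} - 1}, \qquad \frac{\partial f}{\partial c} = a\,b^{c^{T_i}}\ln(b)\,T_i\,c^{T_i - 1}, \qquad \frac{\partial f}{\partial d} = 1$$
Let:
$$f_i^{(0)} = f(T_i, \mathbf{g}^{(0)}), \qquad \nu_k^{(0)} = \delta_k - g_k^{(0)}, \qquad D_{ik}^{(0)} = \left[\frac{\partial f(T_i, \boldsymbol{\delta})}{\partial \delta_k}\right]_{\boldsymbol{\delta} = \mathbf{g}^{(0)}}$$
Therefore:
$$Y_i \simeq f_i^{(0)} + \sum_{k=1}^{4} D_{ik}^{(0)} \nu_k^{(0)}$$
or by shifting $f_i^{(0)}$ to the left of the equation:
$$Y_i - f_i^{(0)} \simeq \sum_{k=1}^{4} D_{ik}^{(0)} \nu_k^{(0)}$$
In matrix form, this is given by:
$$\mathbf{Y}^{(0)} \simeq \mathbf{D}^{(0)} \boldsymbol{\nu}^{(0)}$$
where:
$$\mathbf{Y}^{(0)} = \begin{bmatrix} Y_1 - f_1^{(0)} \\ \vdots \\ Y_N - f_N^{(0)} \end{bmatrix}, \qquad \mathbf{D}^{(0)} = \begin{bmatrix} D_{11}^{(0)} & \cdots & D_{14}^{(0)} \\ \vdots & & \vdots \\ D_{N1}^{(0)} & \cdots & D_{N4}^{(0)} \end{bmatrix}, \qquad \boldsymbol{\nu}^{(0)} = \begin{bmatrix} \nu_1^{(0)} \\ \nu_2^{(0)} \\ \nu_3^{(0)} \\ \nu_4^{(0)} \end{bmatrix}$$
The same reasoning as before is followed here, and the estimate of the parameters $\boldsymbol{\nu}^{(0)}$ is given by:
$$\hat{\boldsymbol{\nu}}^{(0)} = \left(\mathbf{D}^{(0)T}\mathbf{D}^{(0)}\right)^{-1}\mathbf{D}^{(0)T}\mathbf{Y}^{(0)}$$
The revised estimated regression coefficients in matrix form are:
$$\mathbf{g}^{(1)} = \mathbf{g}^{(0)} + \hat{\boldsymbol{\nu}}^{(0)}$$
To see if the revised regression coefficients will lead to a reasonable result, the least squares criterion measure, $Q$, should be checked. According to the Least Squares Principle, the solution for the parameters is the set of values that minimizes $Q$. With the starting coefficients, $\mathbf{g}^{(0)}$, $Q$ is:
$$Q^{(0)} = \sum_{i=1}^{N} \left[Y_i - f(T_i, \mathbf{g}^{(0)})\right]^2$$
With the coefficients at the end of the first iteration, $\mathbf{g}^{(1)}$, $Q$ is:
$$Q^{(1)} = \sum_{i=1}^{N} \left[Y_i - f(T_i, \mathbf{g}^{(1)})\right]^2$$
For the Gauss-Newton method to work properly, and to satisfy the Least Squares Principle, the relationship $Q^{(k+1)} < Q^{(k)}$ has to hold for all $k$, meaning that $\mathbf{g}^{(k+1)}$ gives a better estimate than $\mathbf{g}^{(k)}$. The problem is not yet completely solved. Now $\mathbf{g}^{(1)}$ are the starting values, producing a new set of values $\mathbf{g}^{(2)}$. The process is continued until the following relationship has been satisfied:
$$Q^{(s-1)} - Q^{(s)} \simeq 0$$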
As mentioned previously, when using the Gauss-Newton method or some other estimation procedure, it is advisable to try several sets of starting values to make sure that the solution gives relatively consistent results. Note that Weibull++ uses a different analysis method, the Levenberg-Marquardt method. This method combines the best features of the Gauss-Newton method and the method of steepest descent, and occupies a middle ground between the two.
Confidence Bounds
The approximate reliability confidence bounds under the modified Gompertz model can be obtained using non-linear regression. Because reliability is always between 0 and 1, the logit transformation can be used to obtain the confidence bounds on reliability so that the endpoints of the confidence interval stay within this range.
where is the total number of groups (in this case 4) and is the total number of items in each group.
Example - Modified Gompertz for Reliability Data
A reliability growth data set is given in columns 1 and 2 of the following table. Find the modified Gompertz curve that represents the data and plot it comparatively with the raw data.
Time (months) | Raw Data Reliability (%) | Gompertz Reliability (%) | Logistic Reliability (%) | Modified Gompertz Reliability (%) |
---|---|---|---|---|
0 | 31.00 | 25.17 | 22.70 | 31.18 |
1 | 35.50 | 38.33 | 38.10 | 35.08 |
2 | 49.30 | 51.35 | 56.40 | 49.92 |
3 | 70.10 | 62.92 | 73.00 | 69.23 |
4 | 83.00 | 72.47 | 85.00 | 83.72 |
5 | 92.20 | 79.94 | 93.20 | 92.06 |
6 | 96.40 | 85.59 | 96.10 | 96.29 |
7 | 98.60 | 89.75 | 98.10 | 98.32 |
8 | 99.00 | 92.76 | 99.10 | 99.27 |
Solution
To determine the parameters of the modified Gompertz curve, use:
$$S_1(d) = \sum_{i=0}^{2} \ln(R_i - d), \qquad S_2(d) = \sum_{i=3}^{5} \ln(R_i - d), \qquad S_3(d) = \sum_{i=6}^{8} \ln(R_i - d)$$
and:
$$d + ab = R_0$$
for $R_0 = 31\%$, the equation above may be rewritten as:
$$d + ab - 31 = 0$$
with $a$, $d$ and the reliability values expressed in percent.
The equations for parameters $a$, $b$, $c$ and $d$ can now be solved simultaneously. One method for solving these equations numerically is to substitute different values of $d$, which must be less than $R_0$, into the last equation shown above, and plot the results along the y-axis with the value of $d$ along the x-axis. The value of $d$ can then be read from the x-intercept. This can be repeated for greater accuracy using smaller and smaller increments of $d$. Once the desired accuracy on $d$ has been achieved, the value of $d$ can then be used to solve for $a$, $b$ and $c$. For this case, the initial estimates of the parameters are:
Now, since the initial values have been determined, the Gauss-Newton method can be used. Therefore, substituting and , become:
The estimate of the parameters is given by:
The revised estimated regression coefficients in matrix form are given by:
With the starting coefficients , is:
With the coefficients at the end of the first iteration, , is:
Therefore:
Hence, the Gauss-Newton method works in the right direction. The iterations are continued until the relationship of has been satisfied. Using Weibull++RGA, the estimators of the parameters are:
Therefore, the modified Gompertz model is:
Using this equation, the predicted reliability is plotted in the following figure along with the raw data. As you can see, the modified Gompertz curve represents the data very well.
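The initial-value search described in this solution, in which trial values of $d$ are substituted until $d + ab - R_0$ crosses zero, can be sketched with a simple bisection in place of reading the x-intercept from a plot. This is an illustration under stated assumptions, not the Weibull++ procedure: the function names are hypothetical, the three-group formulas are taken as $c$, $a$ and $b$ expressed in terms of $S_1(d)$, $S_2(d)$ and $S_3(d)$ with a time increment of 1, reliabilities are in percent, and the bracket of 30.0 to 30.9 was chosen by trial evaluation for this data set.

```python
import math

# Raw reliability data (%) at T = 0..8 months from the example table.
R = [31.0, 35.5, 49.3, 70.1, 83.0, 92.2, 96.4, 98.6, 99.0]

def initial_abc(R, d, interval=1.0):
    """Three-group estimates of a, b, c for a trial shift d (d < min(R))."""
    n = len(R) // 3
    logs = [math.log(r - d) for r in R]
    S1, S2, S3 = sum(logs[:n]), sum(logs[n:2 * n]), sum(logs[2 * n:])
    cn = (S3 - S2) / (S2 - S1)            # equals c**(n * interval)
    c = cn ** (1.0 / (n * interval))
    a = math.exp((S1 + (S2 - S1) / (1.0 - cn)) / n)
    b = math.exp((S2 - S1) * (c ** interval - 1.0) / (1.0 - cn) ** 2)
    return a, b, c

def constraint(d):
    """Value of d + a*b - R0; the initial d is the root of this function."""
    a, b, _ = initial_abc(R, d)
    return d + a * b - R[0]

lo, hi = 30.0, 30.9                        # assumed bracket for this data set
if not (constraint(lo) < 0.0 < constraint(hi)):
    raise ValueError("the bracket does not contain a sign change")

for _ in range(60):                        # bisection on the constraint
    mid = 0.5 * (lo + hi)
    if constraint(mid) < 0.0:
        lo = mid
    else:
        hi = mid

d = 0.5 * (lo + hi)
a, b, c = initial_abc(R, d)
print("a, b, c, d:", a, b, c, d)
```

The values found this way are only starting points for the subsequent non-linear regression and need not match the final fitted parameters.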
More Examples
Standard Gompertz for Grouped per Configuration Data
A new design is put through a reliability growth test. The requirement is that after the ninth stage the design will exhibit an 85% reliability with a 90% confidence level. Given the data in the following table, do the following:
- Estimate the parameters of the standard Gompertz model.
- What is the initial reliability at $T = 0$?
- Determine the reliability at the end of the ninth stage and check to see whether the goal has been met.
Stage | Number of Units | Number of Failures |
---|---|---|
1 | 10 | 5 |
2 | 8 | 3 |
3 | 9 | 3 |
4 | 9 | 2 |
5 | 10 | 2 |
6 | 10 | 1 |
7 | 10 | 1 |
8 | 10 | 1 |
9 | 10 | 1 |
Solution
- The data is entered in cumulative format and the estimated standard Gompertz parameters are shown in the following figure.
- The initial reliability at $T = 0$ is equal to:
- The reliability at the ninth stage can be calculated using the Quick Calculation Pad (QCP) as shown in the figure below.
The estimated reliability at the end of the ninth stage is equal to 91.92%. However, the lower limit at the 90% 1-sided confidence bound is equal to 82.15%. Therefore, the required goal of 85% reliability at a 90% confidence level has not been met.
Comparing Standard and Modified Gompertz
Using the data in the following table, determine whether the standard Gompertz or modified Gompertz would be better suited for analyzing the given data.
Stage | Reliability (%) |
---|---|
0 | 36 |
1 | 38 |
2 | 46 |
3 | 58 |
4 | 71 |
5 | 80 |
6 | 86 |
7 | 88 |
8 | 90 |
9 | 91 |
Solution
The standard Gompertz Reliability vs. Time plot is shown next.
The standard Gompertz seems to do a fairly good job of modeling the data. However, it appears that it is having difficulty modeling the S-shape of the data. The modified Gompertz Reliability vs. Time plot is shown next. As expected, the modified Gompertz does a much better job of handling the S-shape presented by the data and provides a better fit for this data.