Criticality Analysis
Introduction
Failure mode criticality, as used in FMECA, can be generically described as the joint probability of three events:
- The probability of failure of the component.
- The probability that the mode under consideration was the culprit.
- The probability that this mode would result in a system failure (which for the case of non-redundancy or if in series =1).
This number is then used to provide relative rankings for the different failure modes. This general approach is described in Kececioglu and expanded further here.
General Criticality Number Computation
Mathematically, and for a specific failure mode,
where:
- Criticality number for the failure mode.
- Conditional probability of mission loss.
- Failure mode ratio.
- Unreliability or probability of failure due to that mode at time t.
- Unreliability or probability of failure of the component at time t.
In the case that the probability of occurrence of each mode is defined, and the modes are assumed to be in a series, then a better way to obtain a criticality value is:
Note that this differs from the specialized definition given in Task 102 of MIL-STD-1629A, Section 3.2.1.6 (described next).
Task 102 MIL-STD-1629A
Specifically, a procedure is given for computing the “Failure mode criticality number” when an assumption of a constant failure mode (i.e., an exponential distribution) is used and for small values of lambda:
This is given by:
where:
- Criticality number for a failure mode.
- Conditional probability of mission loss.
- Failure mode ratio.
- Part failure rate.
- Mission Duration.
In addition to the constant failure rate assumption, it is important to note that this formulation assumes that the part failure rate values for a mode are relatively small and thus the following simplification is made:
This approximation holds true when using an exponential distribution and small values of lambda (<1E-4), but not for all values. ReliaSoft chose to use the general formulation instead of the specialized and limited 1629A formulation.
Example
To illustrate this assume that a component has three failure modes (FM1, FM2 and FM3) where any mode can cause system failure (i.e., a series system). The unreliability is defined by the failure model of each mode at t=1,000 hours. (In this case FM1 is Weibull (, hrs) and FM2 and 3 are exponential with MTTF=10,000 hours.)
Then:
and:
Derivation
Given that the component failed before time T, the probability that it is caused by mode B is:
The upper bound of P1 is:
-
- .
The lower bound of P1 is:
Thus, the probability that the failure is caused by mode B is:
Thus the approximation:
is within the bounds.
Similarly, given that the component failed before time T, the probability that it is caused by mode A is:
Then the sum of P1 and P2 is:
Thus the approximation:
is within these bounds.
Also note: