Free Form Folio: Analysis Results
When accessed from a free form folio, the Analysis
Summary window will contain detailed information about analysis
results, including information that describes how each factor
and factorial interaction affects the response that is currently
selected on the control
panel.
If the current response data has been analyzed, you
can open the window by clicking the View
Analysis Summary icon on the control panel.
If the current response data has not been analyzed, the icon will still
be available so you can view the folio's analysis history.
Select an item in the Available Report Items panel to display it on
the spreadsheet. Each item is described next.
Analysis Results
The Analysis of Variance (ANOVA)
table and the Likelihood
table provide general information about the effects of
the factors and factorial interactions on the selected response.
This information may be presented for individual factors and interactions
or for groups of factors and interactions, depending on your analysis
setting on the control panel.
ANOVA Table Columns
-
Source of Variation
is the source that caused the difference in the observed output
values. This can be a factor, factorial interaction, curvature,
block, error, etc. If your design includes more than one factor
and you have selected to use grouped terms in the analysis (specified
on the Analysis Settings page of the control panel), the effects
will be grouped by order (i.e., main effects, two-way interactions,
etc.). Sources displayed in red are considered to be significant.
-
The number of Degrees of
Freedom for the Model
is the number of regression coefficients for the effects included
in the analysis (e.g., two coefficients might be included in the
regression table for a given main effect). The number of degrees
of freedom for the Residual
is the total number of observations minus the number of parameters
being estimated.
-
Sum of Squares is
the amount of difference in observed output values caused by this
source of variation.
-
Mean Squares is
the average amount of difference caused by this source of variation.
This is equal to Sum of Squares/Degrees of Freedom.
-
F Ratio is the ratio
of Mean Squares of this source of variation and Mean Squares of
pure error. A large value in this column indicates that the difference
in the output caused by this source of variation is greater than
the difference caused by noise (i.e., this source affects the
output).
-
P Value (alpha error
or type I error) is the probability that an equal amount of variation
in the output would be observed in the case that this source does
not affect the output. This value is compared to the risk level
(alpha) that you specify on the Analysis Settings page of the
control panel. If the p
value is less than alpha, this source of variation is considered
to have a significant effect on the output. In this case, the
term and its p value
will be displayed in red.
-
The following values are shown underneath the ANOVA table,
and they indicate how well the model fits the data:
-
S is the standard
error of the noise. It represents the magnitude of the response
variation that is caused by noise. Lower values indicate better
fit.
-
R-sq is the
percentage of total difference that is attributable to the
factors under consideration. It is equal to Sum of Squares(factor)/Total
Sum of Squares. Higher values usually indicate better fit.
-
R-sq(adj) is
an R-sq value that is adjusted for the number of parameters
in the model. Higher values indicate better fit.
-
PRESS is the
prediction error sum of squares, which provides a measure
of the model’s validity. The lower the PRESS value, the better
the model’s predictive ability.
-
R-sq(pred) is
a measure of how well the model predicts new observations.
It is equal to 1-PRESS/Total Sum of Squares. The larger the
value, the more accurate the model’s predictions are likely
to be.
Likelihood Table Columns (for reliability
DOE)
- Model displays the
model for which the results apply.
- Reduced assumes
that the product life is the same at different
levels of the factor.
- Full assumes
that the product life is different at different
levels of the factor.
- Degrees of Freedom
is the degrees of freedom of this source of variation.
This is also the number of parameters in the model
for this source.
- Ln(Likelihood Value)
is the logarithm transformation of the likelihood
value for this source of variation.
- Likelihood Ratio
is the likelihood ratio value for this source of variation.
- P Value is the probability
that LR is from a chi-squared distribution, which
would indicate that the factor has no effect on product
life. This value is compared to the risk level (alpha)
that you specify on the Analysis Settings page of
the control panel. If the p
value is less than alpha, then the factor is considered
to have a significant effect on product life. In this
case, the effect and its p
value will be displayed in red.
The Regression table and
the MLE Information table
provide specific information on the contribution of each predictor
to the variation in the response and an analysis of the significance
of this contribution.
Regression Table Columns
-
Term is the factor,
factorial interaction, curvature, block, etc. under consideration.
Terms displayed in red are considered to be significant. In cases
where there is no error in the model, significant effects are
determined according to Lenth’s method and the term names are
displayed in red and followed by an asterisk (*).
-
Coefficient is the
regression coefficient of the term, which represents the contribution
of the term to the variation in the response.
-
Standard Error is
the standard deviation of the regression coefficient.
-
Low Confidence and
High Confidence
are the lower and upper confidence bounds on the regression coefficient.
-
T Value is the normalized
regression coefficient, which is equal to Coefficient/Standard
Error.
-
P Value (alpha error
or type I error) is the probability that an equal amount of variation
in the output would be observed in the case that this term does
not affect the output. This value is compared to the risk level
(alpha) that you specify on the Anaysis Settings page of the control
panel. If the p value
is less than alpha, this source of variation is considered to
have a significant effect on the output. In this case, the term
and its p value will
be displayed in red.
MLE Information Table Columns (for reliability DOE)
- Term
is the factor, factorial interaction, etc. under consideration.
Terms displayed in red are considered to be significant.
- Effect
is a measure of how much the response value (Y) changes
when the value of the corresponding term in the model
(using coded values) changes from -1 to 1.
- Coefficient
is the regression coefficient of the term, which represents
the contribution of the term to the variation in the
response.
- Standard Error
is the standard deviation of the regression coefficient.
- Low Confidence
is the lower confidence bound of the regression coefficient,
using Fisher bounds.
- High Confidence
is the upper confidence bound of the regression coefficient,
using Fisher bounds.
- Z Value
is the normalized regression coefficient, which is
equal to Coefficient/Standard Error.
- P Value
is the probability that an equal amount of variation
in the output would be observed in the case that this
source does not affect the output. This value is compared
to the risk level (alpha) that you specify on the
Analysis Settings page of the control panel. If the
p value is
less than alpha, this source of variation is considered
to have a significant effect on the output. In this
case, the term and its p
value will be displayed in red.
The Regression Equation information
is presented using multiple tables.
Regression Equation
-
The Response table
displays the response that the regression equation applies to
and the units of measurement that were entered for the response
(if any).
-
The Additional Settings
table shows the transformation and risk level you entered for
the response.
-
The Significant Terms table
is applicable only when at least one term was found to be significant.
It shows the significant terms in the Name column and the associated
regression coefficients in the Coefficient column.
-
The Equation tables
show the regression coefficients for the model of the selected
response. For example, consider this table:
The corresponding model for this table is y = 9.3903 - 0.8467x1 + 1.2813x2
+ 0.7303x1x2.
Additional Results
All of the following tables provide information that was generated from
the main calculations. The available tables will vary depending on the
design type you are working with. The results that could be available
include:
Alias
Structure
This item is available for all designs with
at least two factors. It describes the alias structure for the design,
taking into account only the terms
you've selected to include in the analysis. Together with your
engineering knowledge, you can use this information to help determine
whether any important interaction information was lost due to aliasing.
When aliased terms exist, the following areas will be shown:
-
-
Terms selected to be
in the model lists all the terms that are considered
for inclusion in the regression model (i.e., the selections
in the Select Terms window).
-
Terms included in the
model lists all the selected terms that are included
in the model. The alias structure determines which terms are
excluded.
-
Alias Structure
lists the aliased effects based on the selected terms. For
example, A • B = A • B + C • D means the interaction effect
A • B is aliased because it is indistinguishable from effect
C • D. Therefore, the model cannot include both interaction
terms; it will include only one (e.g., A • B).
Alias
Summary
The terms in the first column of this table
are aliased with the terms shown in the second column. Only the terms
in the first column are included in the model.
Var/Cov
Matrix
This shows the variance/covariance matrix,
which is available for one factor R-DOE designs and all other designs
with two or more factors. The diagonal elements in this matrix are
used to calculate the coefficients in the MLE or Regression Information
table.
Diagnostic
Information
This table is available
for one factor R-DOE designs and all other designs with two or more
factors. It displays various analysis results for each run and highlights
significant values. The following columns are included:
-
-
Run Order
is the randomized order, generated by the software, in which
it is recommended to perform the runs to avoid biased results.
Note that any changes made to the Run Order column on the
Data tab will be reflected here.
-
Standard
Order is the basic order of runs, as specified in the
design type, without randomization. Note that any changes
made to the Standard Order column on the Data tab will be
reflected here.
-
Actual
Value (Y) is the observed response value for the run,
as entered in the response column on the Data tab.
-
Predicted
Value (YF) is the response value predicted by the model
given the factor settings used in the run.
-
Residual
(or "regular residual") is the difference between
the actual value (Y) and the predicted value (YF) for the
run.
-
Standardized
Residual is the regular residual for the run divided
by the constant standard deviation across all runs.
-
Studentized
Residual is the regular residual for the run divided
by an estimate of its standard deviation.
-
External
Studentized Residual is the regular residual for the
run divided by an estimate of its standard deviation, where
the run in question is omitted from the estimation.
-
Leverage
is a measure of how much the run influences the predicted
values of the model, stated as a value between 0 and 1, where
1 indicates that the actual response value of the run is exactly
equal to the predicted value (i.e.
the predicted value is completely dependent upon the observed
value).
-
Cook’s
Distance is a measure of how much the output is predicted
to change if the run is deleted from the analysis.
Values that are considered to be significant,
or outliers, are displayed in red. For the residual columns, significant
or critical values are those that fall outside the residual’s upper
or lower bounds, calculated based on the specified alpha (risk) value.
The ReliaWiki resource portal has more information
on how significant values are determined for the Leverage and Cook's
Distance columns at: http://www.reliawiki.org/index.php/Multiple_Linear_Regression_Analysis.
Least
Squares Means
This table shows the predicted response values
for the given factor levels. It includes the following columns:
-
-
Effect is the
main effect or interaction used to predict the response. The
coefficients for effects not used in the prediction are set
to zero.
-
Level is the
combination of factor levels used to predict the response.
-
Mean is the
predicted response value.