The data set used in this example is available in the example database installed with the software (called "Weibull20_DOE_Examples.rsgz20"). To access this database file, choose File > Help, click Open Examples Folder, then browse for the file in the Weibull sub-folder.
The name of the example project is "Multiple Linear Regression - Ice Cream Consumption."
The multiple linear regression folio provides an easy way to quickly analyze existing data. In this example, ice cream consumption per capita was measured in four-week periods from March 18, 1951 to July 11, 1953 in order to determine whether it depended on the price of ice cream, the weekly family income and/or the mean temperature.
To investigate the predictors, the analysts create a multiple linear regression folio, and enter the predictor and response values on the Data tab. The design matrix and the response data are given in the "Ice Cream Consumption" folio. The following steps describe how to create this folio on your own.
On the Home tab, click the Multiple Linear Regression icon in the Insert gallery.
Right-click the column headings to rename each of the columns.
Click the arrow inside a heading to specify whether the column contains predictor data (e.g., mean temperature) or response data (i.e., ice cream consumption).
The data set for this example is given in the "Ice Cream Consumption" folio of the example project. After you enter the data from the example folio, you can perform the analysis by clicking the Calculate icon.
The ANOVA table and the Regression Information table for the model are available in the Analysis Summary window after you click the View Analysis Summary icon on the control panel. The R-sq and R-sq(adj) values in the ANOVA table indicate that the model fits the data relatively well.
The Regression Information table indicates that weekly family income and mean temperature are good predictors for ice cream production.
Since the price of ice cream does not seem to predict ice cream consumption, the folio was duplicated in the "Reduced Model" folio, where the price column was removed from the analysis by clicking the arrow in the heading and clearing all the check boxes. Click the View Analysis Summary icon in the new folio to view the final regression model.
Click the Plot icon to view the residual vs. order plot, which you can use to look for outliers in the data set.
Based on the analysis, weekly family income and mean temperature are found to be the most significant factors in ice cream consumption. From the residual vs. order plot, we see that there are a few outliers but the majority of our data set fits the model. Thus, if desired, we could use the Prediction window to predict future ice cream consumption based on different combinations of family income and mean temperature.