• Home
  • Software
  • Industries
  • Application Areas
  • Academic
  • Training
  • Support
  • User Community
Free Trial Buy JMP International About Us Contact News and Events Blog Resource Center
  • JMP FAQs
  • System Requirements
  • Alphabetical List of Features
  • Downloads
  • Knowledge Base
  • Books and JMP
  • Report a Problem
  • JMP Administrators
  • Find Your Serial Number or Site ID
  • Why Register JMP?
  • Register JMP

*On SAS Download site

Statistical FAQs

Can I perform a one-way analysis of variance with only summary data in JMP Software?

Computing an Analysis of Variance with Summary Statistics in JMP® Software

In some instances, an experimenter may want to perform an Analysis of Variance (ANOVA), but only the summary statistics are available. Most statistical programs are designed to compute ANOVA models with the full, complete set of data. However, David A. Larson describes a method to generate surrogate data from the summary statistics which can be used to fit the ANOVA of interest(1). That is, if the analysis is comparing k categories, and only the summary statistics (ni, meani,s2i I= 1, 2, 3, …, k) are available, then data can be generated to perform the desired analysis. Utilizing Larson's ideas, JMP Software can be used to perform this type of analysis. The following is an example demonstrating the appropriate steps taken to fit this ANOVA within JMP.

Suppose all the information available is the summary statistics below:

Treatment Sample Size Mean Standard Deviation
A415.21.5769
B612.81.7216
C619.01.6038
D517.12.1424
E314.51.5524

The first step is to create a JMP data table with the above data (Table 1).

JMP data table

Table 1: The JMP data table.

According to Larson, two new columns will need to be generated. So, create two new columns named "Xi's" and "Xn's" with Formula Column properties. Then, using JMP's Formula Editor, define these formulas:

formula for Xis
formula for Xns

Once these columns are created, they will need to be “stacked.” From the Tables menu, select Stack and choose the columns “Xi's” and “Xn's” to be stacked, and also, change the name of the stacked column from the default “_Stacked_” to “Y” (Figure 1).

Stack dialog box
Figure 1: The “Stack” dialog box.

Now, all that is necessary to run the model is an appropriate frequency column. Using the If function (found in the JMP Formula Editor's Conditional list) create one more additional column “Frequency” with the formula as shown below in Figure 2.

Frequency formula
Figure 2: The “if” selection and the formula for “Frequency.”

The final data table should appear as shown in Table 2.

Final data table
Table 2: Final data table.

The surrogate data has been generated, so the ANOVA can now be performed. From the Analyze menu, choose Fit Y by X. Specify “Treatment” as the “X”, “Y” as the “Y”, and “Frequency” as the “Freq” and click OK to run the analysis. The first item seen in the output is a scatterplot of the points. From the Oneway Analysis pulldown menu, choose Means/Anova/t Test to get the resulting output as seen in Figure 3.

ANOVA resuls
Figure 3: ANOVA results.

Oneway ANOVA
Summary of Fit

Rsquare0.690838
Adj Rsquare0.625752
Root Mean Square Error1.751986
Mean of Response15.85833
Observations (or Sum Wgts)24

Analysis of Variance
SourceDF Sum of SquaresMean SquareF Ratio Prob > F
Treatment4130.3183332.5796 10.61410.0001
Error1958.319653.0695
C. Total23188.63798

Means for Oneway ANOVA
LevelNumberMean Std ErrorLower 95%Upper 95%
A415.20000.8760 13.36717.033
B612.80000.7152 11.30314.297
C619.00000.7152 17.50320.497
D517.10000.7835 15.46018.740
E314.50001.0115 12.38316.617

Std Error uses a pooled estimate of error variance

As you can see, the means are exactly those that were specified in the initial summary statistics. The standard errors given are estimated using a pooled estimate of the error variance. To compare all the results, Table 3 gives the actual data from which the summary data is generated. The results from an analysis of variance using the actual data will match perfectly the output given with the summary statistics.

real data
Table 3: Actual data.

In conclusion, if only the summary statistics are available for an oneway analysis, the method described above can be followed to generate surrogate data in JMP Software to complete the desired analysis of variance.

REFERENCES.

Larson, David A. (1992), “Analysis of Variance With Just Summary Statistics
       as Input,” American Statistician, 46, 151-152.


FAQ # 1604
Last Updated: 2002 Nov 22

Browse JMP FAQs

General FAQs

Statistical FAQs


Support Documents

JMP FAQs

Knowledge Base

System Requirements

Alphabetical List of Features

Known Issues


Contact JMP Support

Report a Problem

Update a Report

support@jmp.com

919.677.8008 (US)

International Support via Worldwide SAS Offices

 
SAS | JMP is a business unit of SAS. SAS Campus Drive, Building S, Cary, NC, 27513 • Phone: 1.919.677.8000
© 2008 SAS Institute, Cary, NC, USA. All Rights Reserved. Terms of Use • Privacy Policy