Deborah Lynn Guber
The University of Vermont
Journal of Statistics Education v.7, n.2 (1999)
Copyright (c) 1999 by Deborah Lynn Guber, all rights reserved. This text may be freely shared among individuals, but it may not be republished in any medium without express written consent from the author and advance notification of the editor.
Key Words: Multiple regression; Omitted variable bias; Partial correlation; Scatterplot.
Using data from the 1997 Digest of Education Statistics, this teaching case addresses the relationship between public school expenditures and academic performance, as measured by the SAT. While an initial scatterplot shows that SAT performance is lower, on average, in high-spending states than in low-spending states, this statistical relationship is misleading because of an omitted variable. Once the percentage of students taking the exam is controlled for, the relationship between spending and performance reverses to become both positive and statistically significant. This exercise is ideally suited for classroom discussion in an elementary statistics or research methods course, giving students an opportunity to test common assumptions made in the news media regarding equity in public school expenditures.
1 In the debate over educational reform there are few statistics that provoke as much public controversy as the inequality of school expenditures. Used as a bellwether by some, money would seem to measure academic quality, or at the very least, the ability to pay for the resources children need to learn effectively for things such as textbooks and libraries, better teacher salaries, and state-of-the-art computer facilities. To the extent that money matters, affluent communities willing and able to pay high property taxes to finance local schools are pitted against poorer communities who argue that the prevailing system is unfair. That argument has met with some success in recent years. For example, Vermonts Act 60 attempts to equalize spending across the state using a "Robin Hood" approach that would raise property taxes in wealthy towns and use those revenues to better schools in poorer areas. While such efforts tend to polarize neighbors and communities into an intense ideological debate, they do seem to beg one essential question that is largely empirical: "Does money really matter?"
2 Intuitively, while it may be difficult to deny the importance of money and all it can purchase, critics of educational reform in Vermont and elsewhere often counter that the amount of money schools spend per student has no systematic relationship to educational performance. They argue that it is not money per se that matters, but rather how wisely those funds are spent, and often they seem to have an arsenal of statistics to support that claim. As newspaper columnist George Will (1993, p. C7) bluntly puts it:
[T]he 10 states with the lowest per pupil spending included four North Dakota, South Dakota, Tennessee, Utah among the 10 states with the top SAT scores. Only one of the 10 states with the highest per pupil expenditures Wisconsin was among the 10 states with the highest SAT scores. New Jersey has the highest per pupil expenditures, an astonishing $10,561, which teachers unions elsewhere try to use as a negotiating benchmark. New Jerseys rank regarding SAT scores? Thirty-ninth... The fact that the quality of schools... [fails to correlate] with education appropriations will have no effect on the teacher unions' insistence that money is the crucial variable. The public education lobby's crumbling last line of defense is the miseducation of the public.
3 For Will, and others, the message is clear money simply does not correlate with educational excellence. Or does it? If there is a case to be made for equalizing public school expenditures across districts (and even states), data of the kind George Will cites are especially troublesome and worthy of a closer look. The objective of this exercise is to introduce students to this continuing debate and allow them the opportunity to build their own conclusions upon a base of solid statistical reasoning.
4 This exercise was designed for use in an elementary statistics course or an introductory class in social science research methods. The learning objectives are three-fold:
5 The variables in this dataset were extracted from the 1997 Digest of Education Statistics, an annual publication of the U.S. Department of Education. Variables from a number of different tables were downloaded from the National Center for Education Statistics (NCES) website and merged into a single data file. While this exercise uses the most current data available, similar variables are analyzed in Hawkins (1994) and Powell and Steelman (1984, 1996).
6 Since the data are aggregated to the state level, the first variable specifies each state by name, in alphabetical order, allowing students to easily identify outliers. The next four variables all represent educational "inputs," that is, factors that should (but may not) influence student performance in ways we would expect. These variables include average annual expenditures per student, the student/teacher ratio, the average annual salary for teachers in each state, and the percentage of all eligible students taking the SAT. Finally, the three remaining variables in the dataset measure educational "output" (e.g., performance) using the SAT. Verbal and math scores appear separately, and an average total score is given for each state.
7 Although some guidance from the instructor is to be expected, students should be encouraged to examine the data thoroughly before attempting to resolve the spending/performance paradox. They should note the general properties of each of the variables, including their range and any obvious outliers. They may also wish to rank the states in ascending or descending order based on their public school expenditures and/or SAT performance. Do the same states appear at or near the top of each list? Why or why not? To answer that question easily, students should be encouraged the view the relationship between spending and performance visually in a scatterplot (Figure 1).
Figure 1. Current Per-Pupil Expenditure by Average SAT Score
8 As the scatterplot demonstrates, the relationship between expenditures and performance appears to be negative that is, the states that score highest on the SAT are the states that, on average, spend less money per student. For example, high spending states such as New Jersey, New York, Alaska, and Connecticut all post aggregate scores well below the national average combined score of 966.
9 These results can be further summarized in a simple
bivariate regression. As the model shown in
Table 1 suggests,
every $1,000 increase in spending per student per year is
associated with a decline of nearly 21 points in
the average statewide SAT score, an estimate that easily
reaches conventional levels of statistical significance
Table 1. SAS Regression Output for Bivariate Regression Model
General Linear Models Procedure
Dependent Variable: TOTALSAT
Sum of Mean
Source DF Squares Square F Value Pr > F
Model 1 39722.0592 39722.0592 8.13 0.0064
Error 48 234585.6208 4887.2004
Corrected Total 49 274307.6800
R-Square C.V. Root MSE TOTALSAT Mean
0.144808 7.237506 69.9085 965.920
Source DF Type I SS Mean Square F Value Pr > F
EXPEND 1 39722.0592 39722.0592 8.13 0.0064
Source DF Type III SS Mean Square F Value Pr > F
EXPEND 1 39722.0592 39722.0592 8.13 0.0064
T for H0: Pr > |T| Std Error of
Parameter Estimate Parameter=0 Estimate
INTERCEPT 1089.293718 24.54 0.0001 44.38995020
EXPEND -20.892174 -2.85 0.0064 7.32820865
10 The strength of this relationship might seem puzzling to students at first. After all, it suggests not that money is irrelevant to academic performance, but that spending more on public schools only seems to exacerbate the problem. Or does it? Just as SAT scores vary widely from state to state, so too does the percentage of high school students taking the exam. In fact, participation ranges from a high of 81% in Connecticut to a low of 4% in Utah. That rate of participation is determined, in part, by academic interest in attending college, but also by the preferred standardized test in that region. As Hawkins (1994) points out, the ACT remains a popular alternative to the SAT among college admissions offices in certain parts of the country. In these states, a select group of bright students intent on attending competitive out-of-state schools are more likely to take (and score well on) the SAT.
11 The consequence of this may not be immediately apparent to the class, but the issue should lead to a lively discussion. For example, if the very best students in each state self-select themselves into the pool of test-takers, the average score naturally rises. Conversely, in states such as Connecticut, Massachusetts, New York, and New Jersey, where a substantial majority of high school students take the SAT, a less elite population of students contributes to a lower average score. The strength and importance of this relationship can be confirmed with a scatterplot and/or a Pearsons correlation coefficient (Figure 2).
Figure 2. Average SAT Score by the Percentage of All Eligible Students Taking the Test
12 Does the composition of test-takers influence our
original question relating to school expenditures and
academic performance? Indeed it does. Without controlling
for the percentage of students taking the exam, the
correlation between spending and performance was a
surprising -.381
13 A dramatic improvement in model fit can also be seen in
a second multiple regression model, shown in
Table 2. SAS Regression Output for Multiple Regression Model
General Linear Models Procedure
Dependent Variable: TOTALSAT
Sum of Mean
Source DF Squares Square F Value Pr > F
Model 2 224787.621 112393.810 106.67 0.0001
Error 47 49520.059 1053.618
Corrected Total 49 274307.680
R-Square C.V. Root MSE TOTALSAT Mean
0.819473 3.360474 32.4595 965.920
Source DF Type I SS Mean Square F Value Pr > F
TAKESAT 1 215874.533 215874.533 204.89 0.0001
EXPEND 1 8913.088 8913.088 8.46 0.0055
Source DF Type III SS Mean Square F Value Pr > F
TAKESAT 1 185065.562 185065.562 175.65 0.0001
EXPEND 1 8913.088 8913.088 8.46 0.0055
T for H0: Pr > |T| Std Error of
Parameter Estimate Parameter=0 Estimate
INTERCEPT 993.8316590 45.52 0.0001 21.83323352
TAKESAT -2.8509292 -13.25 0.0001 0.21511232
EXPEND 12.2865185 2.91 0.0055 4.22431595
14 While the analysis presented up to this point is quite simple to reproduce in the classroom, it should be noted that this dataset is worth exploring further using more sophisticated techniques. For instance, if time and interest permit, students may want to test the importance of school expenditures in a more complex model. Several additional variables are provided for this purpose, including student/teacher ratios and average annual teacher salaries.
15 The dataset also gives an instructor the opportunity to address common hazards in regression analysis. For example, astute students who visually plot the residuals from the second regression model might question an irregular U-shaped pattern that clearly hints at a nonlinearity problem involving the percentage of students taking the SAT (Figure 3). Since an assumption of linearity is key to ordinary least squares regression estimation, students could be instructed on a variety of possible solutions, including the addition of a polynomial term to the equation or a logarithmic transformation (Lewis-Beck 1980). Powell and Steelman (1984, 1996) address this very issue at some length and find that the best-fitting nonlinear regression equation is one that includes both the percentage of test-takers and its square root, a conclusion that could be tested and confirmed in the classroom.
Figure 3. Residual Plot for Multiple Regression Model
16 As the debate over public school expenditures rages on, it is clear that simple statistical analyses can mislead pundits, policymakers, and parents alike. The goal of this exercise is to encourage students to be acutely sensitive to these debates and to the way in which statistics are used to inform various political arguments publicized in the news media.
17 Aside from that note of warning, however, what contribution does this dataset make to our understanding of the public school debate? Do the data conclusively "prove" that spending more money on public education produces better results? Hardly, although the case is instructive in many ways. As Hawkins (1994, p. 26) warns, "The question of whether money matters in the public schools is a legitimate research question. In fact, under the right research conditions, it might be possible to answer it. Simple correlations based on aggregated data from the 50 states, however, dont begin to adequately explain the complexities." For example, knowing that money matters does not solve the riddle of "when," and "under what conditions." Nor does it tell us how that money should be spent in order to produce the biggest bang for the buck. In the end, the strength of this exercise rests not only on what issues are resolved, but also on the importance of those that remain.
18 The file sat.dat contains the raw data. The file sat.txt is a documentation file containing a brief description of the dataset.
Columns
1 - 16 Name of state (in quotation marks)
18 - 22 Current expenditure per pupil in average daily attendance
in public elementary and secondary schools, 1994-95 (in
thousands of dollars)
24 - 27 Average pupil/teacher ratio in public elementary and
secondary schools, Fall 1994
29 - 34 Estimated average annual salary of teachers in public
elementary and secondary schools, 1994-95 (in thousands
of dollars)
36 - 37 Percentage of all eligible students taking the SAT, 1994-95
39 - 41 Average verbal SAT score, 1994-95
43 - 45 Average math SAT score, 1994-95
47 - 50 Average total score on the SAT, 1994-95
Values are aligned and delimited by blanks. There are no missing values.
Lewis-Beck, M. S. (1980), Applied Regression: An Introduction. Newbury Park, CA: Sage Publications.
National Center for Education Statistics (1997), Digest of Education Statistics [Online], Washington, DC: U.S. Department of Health, Education, and Welfare, Education Division. (http://nces01.ed.gov/pubs/digest97/index.html)
Will, G. F. (September 12, 1993), "Meaningless Money Factor," The Washington Post, C7.
Deborah Lynn Guber
Assistant Professor of Political Science
The University of Vermont
P.O. Box 54110
Burlington, VT 05405-4110
JSE Homepage | Subscription Information | Current Issue | JSE Archive (1993-1998) | Data Archive | Index | Search JSE | JSE Information Service | Editorial Board | Information for Authors | Contact JSE | ASA Publications