What Happens When a School Gets a Failing Grade? It Gets Better

By Matt Barnum

June 1, 2016

States will be required to give every school a “comprehensive, summative” score rating its performance under draft regulations proposed by the Department of Education for implementing the nation’s new education law.

State education departments typically provide a “data dashboard” for each school that includes a variety of metrics. If the new proposal goes into effect, they will have to combine those metrics into one total score — a letter grade, number, ranking, among other options — which some states and districts already do.

Some educators and analysts are skeptical that such an approach is supported by evidence. Josh Starr, the head of Phi Delta Kappa International and former Montgomery County Schools superintendent, tweeted as much. A report from the University of Colorado’s National Education Policy Center released last year claimed that A–F school report cards “merit a failing grade.”

Education Department proposes rules for judging schools https://t.co/k27WQSh77r What's evidence that letter grades have had positive effect

— Josh Starr (@JoshuaPStarr) May 27, 2016

In fact, there is evidence that suggests giving grades to schools has an impact on student outcomes, at least for schools that get the lowest grade. According to research in New York City and Florida, these schools respond by improving their students’ test scores — and it doesn’t appear that gains are just the result of gaming the system or teaching to the test.

School grades lead to gains in New York City and Florida

In 2014, New York City removed letter grades from its school report cards in favor of a dashboard approach providing disaggregated ratings on a variety of measures, including test scores, strength of instruction, and quality of school environment.

Two earlier studies published in peer-reviewed journals showed that, under the old system, students in New York City schools that received an ‘F’ made somewhat larger-than-expected test score gains the following year.¹ The researchers were able to determine the impact of the letter grade by comparing schools near the score cutoff between an F and a D or C. These schools are similar; the only difference is how they were labeled.²

New research from University of Colorado at Colorado Springs Professor Marcus Winters, released by the Manhattan Institute, a conservative think tank, uses more recent data that once more shows students in F schools, under the old system, making significant test score improvements.³

Winters then looked at the new system, reconstructing the report cards using 2014 data to determine the grades schools would have received under the previous regime. Based on 2015 results, he finds that once letter grades stopped being used, the test score bump associated with them disappeared.

Devora Kaye, a spokesperson for the New York City Department of Education, said in an email, “Letter grades were misleading and oversimplified school quality which is why the new Snapshot evaluates schools using multiple measures and more data so families have a complete picture of a school.”

The research in New York City generally squares with multiple studies from Florida, which also show test scores increased after schools received an F.

Gaming or learning?

Some observers have questioned whether the gains were the result of greater learning — especially given a good deal of evidence that evaluation by test scores can lead to unintended consequences like cheating and teaching to the test.

We can’t say for certain, but the research gives us reason to believe that the gains were at least in part educationally meaningful.

One of the New York studies showed increased parental satisfaction, measured by district-administered school surveys, along with the rise in test scores. The schools tended to spend more time on direct instruction, which some research has found to be quite effective. On the other hand, student satisfaction in schools dropped as this shift occurred.

Another of the New York City studies found test score gains persisted two years after schools received an F rating, suggesting that the improvement wasn’t solely the result of interventions like short-term test-taking strategies.

In perhaps the most comprehensive review of school grades, a Florida study found that after receiving a failing grade⁴, schools “appear to focus on low-performing students, lengthen the amount of time devoted to instruction, adopt different ways of organizing the day and learning environment of the students and teachers, increase resources available to teachers, and decrease principal control.” In response to accountability pressure, in other words, schools didn’t just start teaching to the test; they made significant changes. Moreover, test score gains persisted three years after the initial improvement.

Another study of Florida found that improvements on low-stakes exams were about half the size of those on the high-stakes tests used for accountability purposes. This suggests that gains may have resulted from some combination of “gaming” and meaningful improvement.

Unanswered questions, unintended consequences

This research should not be read as closing the case on whether giving letter grades to schools is a good idea.

Perhaps the biggest caveat is that studies have generally focused on the impact of grades on low-performing schools. It’s harder to know how the report card approach affects the system as a whole; that is, the research can show that F schools make more gains than D schools, but it doesn’t tell us the aggregate effect of using letter grades as opposed to a different system. (However, research has found that stringent accountability systems generally tend to improve overall student achievement.)

There also may be unintended consequences of school letter grades. For instance, New York City faced frequent complaints that letter grades bounced around significantly from year to year for no apparent reason; there is also evidence that the system reduced parental support for higher standards when they led to lower school grades.

In Florida, one study showed that a failing grade created a significant increase in teacher turnover, particularly among the most effective teachers. (The research also found that teachers who remained showed improvement, consistent with the studies finding gains in student achievement.)

There are many factors to consider when designing an accountability system; reasonable people can disagree about how to weigh the costs and benefits of different approaches. It is clear, though, that by some measures, students in low-performing schools appear to benefit from grading schools.

Disclosure: The Walton Family Foundation, which is a funder of The 74, also funded the Manhattan Institute study on New York City school accountability.

Footnotes:

1. One study showed gains in both math and English, with larger increases in math; the other showed gains in English but no gains in math.

2. Some may wonder whether the gains are simply the result of “regression to the mean.” That’s quite unlikely because the researchers’ approach compared schools with relatively similar starting levels of achievement.

3. The gains Winters finds are statistically significant but about half the size of those found in one of the previous studies.

4. A failing letter grade also meant that students at the school had the opportunity to use a voucher to attend a private school. The study examined the combined impact of the letter grade and this ‘voucher threat,’ though other research suggests that most of the school improvement was due to stigma associated with the failing grade.

Matt Barnum is a senior staff writer at The 74.

@matt_barnum matt@the74million.org
