Explore

Support The 74 and stories like this one. Donate Today!

News

New Research: Chicago Teacher Eval Pilot Moved Out Struggling Teachers, Kept Stronger Ones

By Matt Barnum

September 20, 2016

Untangle Your Mind!

Most Popular

literacy
We Started Grouping Students by Reading Ability vs. Grade. Here’s What Happened
core topics
New Report: States Need to Up Their Game on Preparing Elementary Math Teachers
school choice
More Than a Third of Homeschool Families Also Use Public Schools, New Data Shows
Indiana
Tiny Indiana District With Online School Worth Millions Ordered To Close
commentary
It’s Time to Reject Chronic Absenteeism as the New Normal in Student Attendance

New research on a Chicago teacher evaluation pilot put in place when Arne Duncan was running the city’s schools shows it worked to move out struggling teachers while not sacrificing effectives ones.

“The overall quality of the teachers in treatment schools improved as a result of the … initiative,” the study finds.

The evaluation system studied is particularly noteworthy because as secretary of education, Duncan pushed states across the country to adopt evaluation models that in some respects mirrored the Chicago pilot.

“This paper provides evidence that teacher evaluation reform … has the potential to improve the overall quality of teachers and teaching in our nation’s schools,” wrote researchers Lauren Sartain of the University of Chicago and Matthew Steinberg of the University of Pennsylvania.

(The 74: Research Suggests D.C.’s Tough Teacher Evaluation System Helped Students. 7 Big Lessons for Other Cities)

The study, titled “Teachers’ Labor Market Responses to Performance Evaluation Reform,” appeared in the August edition of the peer-reviewed Journal of Human Resources.

It examined a pilot evaluation program introduced in a random set of Chicago elementary schools in the 2008–09 school year.

The randomization is key because it allowed the researchers to be confident that any differences between schools where the program was piloted and those where it wasn’t were caused by the evaluation system.

A separate study, by the same researchers, found that the program led to gains in student achievement for the first group of schools in the initiative.

The latest research looks at what happened to teacher turnover in these pilot schools. In aggregate there was no effect; that is, attrition went neither up nor down. That the program didn’t prompt an across-the-board exodus is probably a positive finding.

“Policymaker[s] and district leaders would likely be concerned if a teacher evaluation system induced turnover for the average teacher in the district, particularly in light of the aim of evaluation reform — to provide the majority of teachers with feedback to improve their practice,” the paper says.

Sartain and Steinberg then look specifically at non-tenured teachers who had received a poor evaluation rating under the previous system. Here they see a jump in teacher turnover: In the pilot schools, about 35 percent (eight out of 23) of those teachers left, compared with just 4 percent (one out of 23) of teachers in the control schools.

This is a big difference in percentage terms, but it represents a small number of actual teachers, since very few got low ratings. (Despite the small numbers, though, the results were highly statistically significant, so they weren’t likely to have been caused by chance.) This also helps explain why the net impacts on turnover were negligible even while the effects for this specific group were large.

Even more encouraging, the teachers who left were replaced by ones who were subsequently rated higher. This suggests, contrary to frequently heard concerns, that schools may be able get effective replacements for teachers they dismiss.

Notably, the study finds that tenured teachers with low ratings were no more likely to leave the classroom under the pilot system. Although they can’t be sure, the researchers chalk this up to the lengthy process necessary to dismiss a tenured teacher.

“Job protections guaranteed to tenured teachers appear to have sufficiently insulated them from the type of job displacement experienced by their non-tenured counterparts,” the researchers said.

The pilot was different from now-prevailing evaluation models in that student test scores were not considered part of teachers’ scores. Teachers were evaluated solely based on classroom observation.

Other research has also found that simply providing more information to principals on their teachers’ performance can increase turnover of less-effective teachers.

The latest study provides encouraging evidence for the teacher evaluation systems that gained traction under President Barack Obama and Duncan’s Race to the Top initiative. They have been politically controversial because they led to a large increase in student testing, and they were disappointing even to some reform advocates because — like the systems they replaced — they rated the vast majority of teachers as effective or better.

(The 74: Test Scores and Teacher Evals: A Complex Controversy Explained)

Still, research has found that new evaluations have likely led to small upticks in the numbers of teachers deemed less than effective.

Although much of the pushback has focused on testing, under these systems, the majority of teacher evaluations are based on classroom observations.

Source: Education Finance and Policy

This Chicago study — along with evidence from Washington, D.C. — provides some reason for optimism about these new systems. For one thing, it shows that even if relatively few teachers get poor marks, new evaluation systems may still lead to increases in the number of struggling teachers who leave the classroom, without hurting overall turnover.

Another recent study from Chicago — this one done based on its relatively new evaluation system, known as REACH, currently in use citywide — found less top-heavy evaluation ratings compared with the previous system.

Source: University of Chicago Consortium on School Research

Even more encouraging, Chicago teachers and, especially, school principals gave the system fairly high marks, with majorities saying it “will lead to improved student learning.” Despite anecdotal claims suggesting otherwise, this squares with other studies.

Source: University of Chicago Consortium on School Research

But in one major respect, teachers are not happy with Chicago’s current evaluation system: its use of student test scores. The majority said that assessments were not a fair way to measure their performance.

The older pilot program — which, again, didn’t use test scores at all — suggests that incorporating student growth in evaluations is not necessary to induce less-effective teachers to leave. In fact, some argue that assessment-based evaluation can undermine other aspects of the evaluation process, such as teacher collaboration; others suggest that stringent evaluations may drive prospective teachers away from the classroom. (There’s little firm evidence that either of these claims are true.)

Test-based evaluations, depending on how exactly they’re designed, tend to award more lower ratings than classroom observations — perhaps one reason teachers don’t like them.

The Chicago studies, then, highlight both the benefits and the drawbacks of using observations as the sole evaluation measure. Teachers would likely prefer such a system, which still seems able to weed out a handful of the least-effective teachers (at least the ones without tenure).

On the other hand, test scores seem to provide information distinct from classroom observations, which tend to be biased against teachers with struggling students, according to research from Chicago and elsewhere.

And with the feds no longer pressuring states to adopt certain evaluation schemes, these are exactly the sorts of trade-offs that districts and states must consider, as many once again consider revamping their systems.

Get stories like these delivered straight to your inbox. Sign up for The 74 Newsletter

Republish This Article Learn More

Matt Barnum is a senior staff writer at The 74.

@matt_barnum matt@the74million.org

Republish This Article

We want our stories to be shared as widely as possible — for free.

Please view The 74's republishing terms.


                <h1>New Research: Chicago Teacher Eval Pilot Moved Out Struggling Teachers, Kept Stronger Ones</h1>

                <h2></h2>

                <p class="sans">By <a rel="author" href="https://www.the74million.org/contributor/matt-barnum/">Matt Barnum</a></p>

                <img src="https://www.the74million.org/wp-content/uploads/2017/01/1473196450_8736.png">

                <p>This story first appeared at <a href="https://www.the74million.org">The 74</a>, a nonprofit news site covering education. <a href="https://www.the74million.org/about/newsletters/?utm_source=republish-button&utm_medium=website&utm_campaign=republish">Sign up for free newsletters from The 74</a> to get more like this in your inbox.</p>
                <div class="article__paragraph">
<div class="article__paragraph opening" dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">New </span><a href="http://jhr.uwpress.org/content/51/3/615.abstract">research</a> on a Chicago teacher evaluation pilot put in place when Arne Duncan was running the city’s schools shows it worked to move out struggling teachers while not sacrificing effectives ones.</div>
<p dir="ltr"><span>“The overall quality of the teachers in treatment schools improved as a result of the … initiative,” the study finds.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">The evaluation system studied is particularly noteworthy because as secretary of education, Duncan pushed</span> states across the country to <a href="https://www.the74million.org/article/the-war-over-evaluating-teachers-where-it-went-right-and-how-it-went-wrong">adopt evaluation models</a> that in some respects mirrored the Chicago pilot.</p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">“This paper provides evidence that teacher evaluation reform … has the potential to improve the overall quality of teachers and teaching in our nation’s schools,” wrote researchers Lauren Sartain of the University of Chicago and Matthew Steinberg of the University of Pennsylvania.</span></p>
<p dir="ltr"><em><strong><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">(</span><a href="https://www.the74million.org/article/research-suggests-dcs-tough-teacher-evaluation-system-helped-students-7-big-lessons-for-other-cities">The 74: Research Suggests D.C.’s Tough Teacher Evaluation System Helped Students. 7 Big Lessons for Other Cities</a>)</strong></em></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">The study, titled </span><a href="http://jhr.uwpress.org/content/51/3/615.abstract">“Teachers’ Labor Market Responses to Performance Evaluation Reform,”</a> appeared in the August edition of the peer-reviewed <em>Journal of Human Resources</em>.</p>
<p dir="ltr"><span>It examined a pilot evaluation program introduced in a random set of Chicago elementary schools in the 2008–09 school year.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">The randomization is key because it allowed the researchers to be confident that any differences between schools where the program was piloted and those where it wasn’t were caused by the evaluation system.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">A </span><a href="http://educationnext.org/better-observation-make-better-teachers/">separate study</a>, by the same researchers, found that the program led to gains in student achievement for the first group of schools in the initiative.</p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">The latest research looks at what happened to teacher turnover in these pilot schools. In aggregate there was no effect; that is, attrition went neither </span><span>up nor down. That the program didn’t prompt an across-the-board exodus is probably a positive finding.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">“Policymaker[s] and district leaders would likely be concerned if a teacher evaluation system induced turnover for the average teacher in the district, particularly in light of the aim of evaluation reform — to provide the majority of teachers with feedback to improve their practice,” the paper says.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">Sartain and Steinberg then look specifically at non-tenured teachers who had received a poor evaluation rating under the previous system. Here they see a jump in teacher turnover: In the pilot schools, about 35 percent (eight out of 23) of those teachers left, compared with just 4 percent (one out of 23) of teachers in the control schools.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">This is a big difference in percentage terms, but it represents a small number of actual teachers, since very few got low ratings. (Despite the small numbers, though, the results were highly statistically significant, so they weren’t likely to have been caused by chance.) This also helps explain why the net impacts on turnover were negligible even while the effects for this specific group were large.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">Even more encouraging, the teachers who left were replaced by ones who were subsequently rated higher. This suggests, contrary to frequently heard concerns, that schools may be able get effective replacements for teachers they dismiss.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">Notably, the study finds that tenured teachers with low ratings were no more likely to leave the classroom under the pilot system. Although they can’t be sure, the researchers chalk this up to the lengthy process necessary to dismiss a tenured teacher.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">“Job protections guaranteed to tenured teachers appear to have sufficiently insulated them from the type of job displacement experienced by their non-tenured counterparts,” the researchers said.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">The pilot was different from now-prevailing evaluation models in that student test scores were not considered part of teachers’ scores. Teachers were evaluated solely based on classroom observation.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22"><a href="https://www0.gsb.columbia.edu/faculty/jrockoff/papers/Information%20and%20Evaluation%20RSKT%20February%202011.pdf">Other research</a> </span>has also found that simply providing more information to principals on their teachers’ performance can increase turnover of less-effective teachers.</p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">The latest study provides encouraging evidence for the </span><a href="https://www.the74million.org/listicle/5-key-lessons-from-the-successes-and-failures-of-president-obamas-teacher-evaluation-reforms">teacher evaluation systems</a> that gained traction under President Barack Obama and Duncan’s Race to the Top initiative. They have been politically controversial because they led to a <a href="https://www.the74million.org/article/arne-duncans-wrong-turn-on-reform-how-federal-dollars-fueled-the-testing-backlash">large increase</a> in student testing, and they were disappointing even to some reform advocates because — like the systems they replaced — they rated the vast majority of teachers as effective or better.</p>
<p dir="ltr"><em><strong><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">(</span><a href="https://www.the74million.org/flashcard/test-scores-and-teacher-evals-a-complex-controversy-explained/1">The 74: Test Scores and Teacher Evals: A Complex Controversy Explained</a>)</strong></em></p>
<p dir="ltr"><span>Still, </span><a href="http://scholar.harvard.edu/mkraft/publications/revisiting-widget-effect-teacher-evaluation-reforms-and-distribution-teacher">research</a> has found that new evaluations have likely led to small upticks in the numbers of teachers deemed less than effective.</p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">Although much of the pushback has focused on testing, under these systems, the majority of teacher evaluations </span><a href="http://www.mitpressjournals.org/doi/pdf/10.1162/EDFP_a_00186">are based</a> on classroom observations.</p>
<div class="article-image-container">
<div title="Source: Education Finance and Policy"><img decoding="async" class="article-image" src="https://www.the74million.org/wp-content/uploads/2017/01/1474409901_9432.jpg" /></div>
<div class="article__image-caption">Source: <a href="http://www.mitpressjournals.org/doi/pdf/10.1162/EDFP_a_00186">Education Finance and Policy</a></div>
</div>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">This Chicago study — along with evidence from Washington, D.C. — provides some reason for optimism about these new systems. For one thing, it shows that even if relatively few teachers get poor marks, new evaluation systems may still lead to increases in the number of struggling teachers who leave the classroom, without hurting overall turnover.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">Another </span><a href="https://consortium.uchicago.edu/sites/default/files/publications/Teacher%20Evaluation%20in%20Practice-Aug2016-Consortium.pdf">recent study</a> from Chicago — this one done based on its relatively new evaluation system, known as REACH, currently in use citywide — found less top-heavy evaluation ratings compared with the previous system.</p>
<div class="article-image-container">
<div title="Source: University of Chicago Consortium on School Research"><img decoding="async" class="article-image" src="https://www.the74million.org/wp-content/uploads/2017/01/1474411222_7536.jpg" /></div>
<div class="article__image-caption">Source: <a href="http://consortium.uchicago.edu/sites/default/files/publications/Teacher%20Evaluation%20in%20Practice-Aug2016-Consortium.pdf">University of Chicago Consortium on School Research</a></div>
</div>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">Even more encouraging, Chicago teachers and, especially, school principals gave the system fairly high marks, with majorities saying it “will lead to improved student learning.” </span>Despite anecdotal claims suggesting otherwise, this squares with <a href="https://ies.ed.gov/ncee/edlabs/regions/northeast/pdf/REL_2016133.pdf">other studies</a>.</p>
<div class="article-image-container">
<div title="Source: University of Chicago Consortium on School Research"><img decoding="async" class="article-image" src="https://www.the74million.org/wp-content/uploads/2017/01/1474411573_8186.jpg" /></div>
<div class="article__image-caption">Source: <a href="https://consortium.uchicago.edu/sites/default/files/publications/Teacher%20Evaluation%20in%20Practice-Aug2016-Consortium.pdf">University of Chicago Consortium on School Research</a></div>
</div>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">But in one major respect, teachers are not happy with Chicago’s current</span> evaluation system: its use of student test scores. The majority said that assessments were not a fair way to measure their performance.</p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">The older pilot program — which, again, didn’t use test scores at all — suggests that incorporating student growth in evaluations is not necessary to induce less-effective teachers to leave. In fact, some </span><a href="http://edr.sagepub.com/content/44/2/117.abstract">argue</a> that assessment-based evaluation can undermine other aspects of the evaluation process, such as teacher collaboration; others suggest that stringent evaluations may drive prospective teachers away from the classroom. (There’s little firm evidence that either of these claims are true.)</p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">Test-based evaluations, depending on how exactly they’re designed, tend to award more lower ratings than classroom observations — perhaps one reason teachers don’t like them.</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">The Chicago studies, then, highlight both the benefits and the drawbacks of using observations as the sole evaluation measure. Teachers would likely prefer such a system, which still seems able to weed out a handful of the least-effective teachers (at least the ones without tenure).</span></p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">On the other hand, test scores seem to </span><a href="http://www.rajchetty.com/chettyfiles/w19424.pdf">provide</a> <a href="https://uteach.utexas.edu/sites/default/files/WalkingtonMarderMET2013.pdf">information</a> <a href="http://scholar.harvard.edu/files/mkraft/files/kraft_-_teacher_layoffs_-_publication_version_0.pdf">distinct</a> from classroom observations, which tend to be biased against teachers with struggling students, according to research from <a href="https://consortium.uchicago.edu/sites/default/files/publications/Teacher%20Evaluation%20in%20Chicago-Jan2016-Consortium.pdf">Chicago</a> and <a href="https://www.the74million.org/article/2-new-reports-show-that-we-really-dont-have-a-great-way-to-evaluate-teachers">elsewhere</a>.</p>
<p dir="ltr"><span id="docs-internal-guid-5d031f91-4996-fc74-c101-6e07679c7c22">And with the feds no longer pressuring states to adopt certain evaluation schemes, these are exactly the sorts of trade-offs that districts and states must consider, as many once again consider revamping their systems.</span></p>
</div>

Contact Us

Follow Us

Explore

New Research: Chicago Teacher Eval Pilot Moved Out Struggling Teachers, Kept Stronger Ones

Untangle Your Mind!

Most Popular

We Started Grouping Students by Reading Ability vs. Grade. Here’s What Happened

New Report: States Need to Up Their Game on Preparing Elementary Math Teachers

More Than a Third of Homeschool Families Also Use Public Schools, New Data Shows

Tiny Indiana District With Online School Worth Millions Ordered To Close

It’s Time to Reject Chronic Absenteeism as the New Normal in Student Attendance

On The 74 Today