Explore

Support The 74 and stories like this one. Donate Today!

News

Have We Built a Better Test? 9 Ways ‘Next Generation Assessments’ Are Different — and the Same

By Matt Barnum

February 11, 2016

Untangle Your Mind!

Most Popular

school choice
Big Tax Bill Passes Senate With Less ‘Beautiful’ Plan for National School Choice
school choice
More Than a Third of Homeschool Families Also Use Public Schools, New Data Shows
Indiana
Tiny Indiana District With Online School Worth Millions Ordered To Close
commentary
It’s Time to Reject Chronic Absenteeism as the New Normal in Student Attendance
data analysis
Suspensions for Students with Disabilities Are Far More Frequent in These States

In 2010, then–U.S. Secretary of Education Arne Duncan promised to usher in the “next generation of assessments” that move “beyond the bubble tests.”

Duncan invested hundreds of millions ¹ of federal dollars into supporting the development of two sets of Common Core–aligned tests — called PARCC² and Smarter Balanced — that are now in use in many states across the country.

The goals of these tests were ambitious. As Duncan put it, ”I am convinced that this new generation of state assessments will be an absolute game-changer in public education. For the first time, millions of schoolchildren, parents, and teachers will know if students are on-track for colleges and careers — and if they are ready to enter college without the need for remedial instruction. … For the first time, many teachers will have the state assessments they have longed for — tests of critical thinking skills and complex student learning that are not just fill-in-the-bubble tests of basic skills but support good teaching in the classroom.”

However, a number of states have dropped the federally supported tests and instead created their own version of Common Core-aligned assessments. By recent counts about half of the states that adopted Common Core are using either Smarter Balanced or PARCC; others are using home-brewed exams that purport to align to the standards.

So has Duncan’s vision come to pass? Here’s what we know about the new tests and how they’re different (and the same) as the ones they replaced:

Smarter Balanced and PARCC are pretty well aligned with the Common Core standards.

A new study by Nancy Doorey and Morgan Polikoff, released by the Fordham Institute, a pro–Common Core think tank, compares Smarter Balanced and PARCC to the Massachusetts state test — considered by some the gold standard of state assessments — and the ACT Aspire, a test used by three states and made by the company that also produces the college admissions exam, the ACT.³

The report finds that the new assessments generally do a better job, particularly in English, of emphasizing the most important content in the Common Core and requiring “the range of thinking skills, including higher order skills, called for by those standards.”

Whether this is a good thing depends on what you think of the Common Core and whether its emphasis on gathering evidence from the text and explaining mathematical thinking is worthwhile. But for states that are using one of the federally backed tests this is certainly good news, since academic standards and the tests used to assess them should be aligned.

As Polikoff put it, “Even if you're not a fan of Common Core, [it] is the set of standards that's in place in most states … We want those assessments to accurately assess what's in the standards."

Smarter Balanced and PARCC have a lot more questions that aren’t multiple choice.

True to Duncan’s word, both sets of new assessments significantly reduce the reliance on multiple choice items as compared to the ACT Aspire and the Massachusetts state test. In fact, according to the Fordham study, most questions on both the PARCC and Smarter Balanced tests are not traditional multiple choice.

However, it’s not readily apparent just how different these new types of questions actually are. Some of them combine traditional multiple choice questions with questions where students have to cite sections of the text in their answer or they require the student to perform a task on a computer, such as dragging and dropping or highlighting text.

Check out sample questions from Smarter Balanced and PARCC to judge for yourself.

Smarter Balanced and PARCC take a lot longer for students to complete.

Probably because of the inclusion of so many non-traditional questions — which are more time intensive — Smarter Balanced and PARCC take about five-and-a-half and seven-and-a- half hours, respectively. In contrast, both the ACT Aspire and Massachusetts exam take a little over three hours, according to the Fordham study.⁴

In response to complaints about length, PARCC shortened its test by an hour and a half, though at six hours it’s still significantly longer than the ACT Aspire and Massachusetts test. Meanwhile New York recently announced that its Common Core-aligned assessments would be untimed. This highlights a key trade off in test design: more in-depth, challenging tasks mean longer test times. But state officials may be damned if they do and damned if they don’t since members of the growing opt-out movement have complainted about both too many multiple choice questions and that the tests take too long.

Smarter Balanced and PARCC both have some quality issues to work out.

Although the Fordham report gives the tests high marks on alignment with the Common Core, the study also raises some pointed questions about basic quality issues with the exams.

On the math tests, reviewers gave the ACT Aspire and Massachusetts exam higher marks on overall quality than both Smarter Balanced and PARCC. A small number of questions on those exams were difficult to read or inaccurate. In a few cases, questions appeared to have more than one right possible answer, the Fordham report notes.

On the English tests, ACT Aspire, the Massachusetts state test, and PARCC all got top marks, but Smarter Balanced got a lower rating because of instances of spelling errors and questions with more than one potentially correct answer.

The report notes, ”Although this concern applies to a small percentage of items, the review panels expressed the need that a very high bar be set on the quality of items used on consequential tests.”

Next generation assessments are more likely to be completed on a computer — but that’s led to some major problems.

Students now take Smarter Balanced, PARCC and a number of state tests on a computer, with the hope that the technology will allow for more item variety, fewer security issues, and quicker turnaround of results.⁵

But this ideal has run into numerous challenges as states across the country — including Indiana, Tennessee, Montana, Wisconsin, and Nevada, among others — have struggled to get technology-based testing off the ground.

Smarter Balanced and PARCC also have paper-and-pencil versions of their tests that are supposed to be used if a computer-based option is not feasible. But a recent report from Education Week showed that PARCC scores were significantly lower for students who took the exam by computer, raising questions about whether the two versions were equivalent. Some research has found that students less familiar with technology don’t perform as well on computer-based exams.

PARCC predicts college success at a similar rate as the Massachusetts state test and the SAT.

A 2015 study found that PARCC, the Massachusetts state exam, and the SAT all were modestly predictive of first-year grades in college; none stood out as better or worse.⁶

On the one hand, this finding suggests that PARCC is no better — at least in terms of assessing college readiness — than the test Massachusetts was already using. On the other, the Massachusetts exam has long been seen as a high-quality test, so for PARCC to be viewed as equally good can be portrayed as positive news. Of course, predicting first-year grades may not be the only way to judge the quality of state exams; for instance, there is also alignment with state standards.

For what it’s worth, officials in Massachusetts decided to adopt a hybrid exam that includes aspects of both PARCC and its existing state test.

PARCC and Smarter Balanced may help better distinguish between more and less effective teaching.

A new study ⁷, released by Harvard’s Center for Education Policy Research, found that PARCC and Smarter Balanced tests had more “instructional sensitivity” than previous tests used in several states.⁸ What this means is that there was more fine-tuned variation in teachers’ impacts on how students scored on the Smart Balanced and PARCC tests than past state tests. This suggests that the new assessments may be better suited for use in teacher evaluation.

Some teachers say the new tests are better — but most don’t want them to count in teacher evaluation.

The National Network of State Teachers of the Year, which supports Common Core, assembled a panel of top teachers to compare PARCC and Smarter Balance to previous state tests.⁹ The teachers overwhelmingly believed that the new tests were better indicators of quality teaching and learning than the old ones. However, the Harvard study, mentioned earlier, found that teachers in several states only felt moderately prepared to teach students what they needed to know for the Smarter Balanced or PARCC assessments.

Teachers are skeptical of having the new tests count towards their evaluation: a Gallup poll reported that eighty 80 percent did not want new Common Core test scores ever linked to their evaluations. Still, the Harvard study found students scored higher on math tests when they were connected to their teacher’s evaluation.¹⁰

There’s still a lot to learn about these new assessments.

One of the most important unanswered questions is how PARCC and Smarter Balanced compare to the many state assessments that have been redesigned to align with the Common Core. There just isn’t much evidence on whether these tests are Common Core–aligned in name only — like many textbooks — or are actually doing a good job assessing the skills in the standards.

It’s also not clear how to think about some of the tradeoffs and purported benefits of these new tests. Is the extra time required worth it? How valuable is the is the ability to compare student test scores across states, even as many states have dropped out? Will the technological glitches eventually work themselves out?

And perhaps the key unknown question is a political one: Will any new states adopt the Smarter Balanced or PARCC — or will any more leave?

Footnotes:

1. Still, tests account for only a small fraction — just $27 per student according to a 2012 study — of education spending nationally. (return to story)

2. PARCC stands for Partnership for Assessment of Readiness for College and Career. (return to story)

3. It’s important to note that in an appendix to the report, responses from all four testing groups indicate ongoing efforts to improve the tests, including in response to finding from the Fordham study. (return to story)

4. These estimates are based on the average time to complete both the math and English sections for each per, in grades 5–8. (return to story)

5. Another potential benefit of computer-based assessments is that they can be “adaptive” (as opposed to “fixed form”). This means that test changes difficulty based on students’ performance, measured by how they do on previous questions. The idea is that the questions will be at an appropriate level to challenge students, and can measure growth of both high- and low-achieving students. Smarter Balanced’s summative assessment is computer adaptive, but PARCC’s isn’t. (return to story)

6. The study has a significant limitation, however, in that it is based on administering each test to first-year college students and then looking at how well those students’ scores correlated with their GPA. Ideally, the study would look at how well test performance in high school correlated with success in college. But because such a study would take a significantly longer time — multiple years — to conduct, the researchers used this less ideal, but still informative approach. (return to story)

7. Disclosure: This study was funded in part by Bloomberg Philanthropies, which is also a funder of The Seventy Four. (return to story)

8. The states studied were Delaware, Massachusetts, Maryland, New Mexico, and Nevada. (return to story)

9. Specifically, the state tests examined were those of New Hampshire, Delaware, Illinois, and New Jersey. (return to story)

10. The study is correlational, so it can’t be assumed that linking test scores to teacher evaluation had a causal effect on student test scores. (return to story)

Get stories like these delivered straight to your inbox. Sign up for The 74 Newsletter

Republish This Article Learn More

Matt Barnum is a senior staff writer at The 74.

@matt_barnum [email protected]

Republish This Article

We want our stories to be shared as widely as possible — for free.

Please view The 74's republishing terms.


                <h1>Have We Built a Better Test? 9 Ways ‘Next Generation Assessments’ Are Different — and the Same</h1>

                <h2></h2>

                <p class="sans">By <a rel="author" href="https://www.the74million.org/contributor/matt-barnum/">Matt Barnum</a></p>

                <img src="https://www.the74million.org/wp-content/uploads/2017/01/1444927664_7748.png">

                <p>This story first appeared at <a href="https://www.the74million.org">The 74</a>, a nonprofit news site covering education. <a href="https://www.the74million.org/about/newsletters/?utm_source=republish-button&utm_medium=website&utm_campaign=republish">Sign up for free newsletters from The 74</a> to get more like this in your inbox.</p>
                <div class="article__paragraph">
<div class="article__paragraph opening" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">In 2010, then–U.S. Secretary of Education Arne Duncan </span><a href="http://www.ed.gov/news/speeches/beyond-bubble-tests-next-generation-assessments-secretary-arne-duncans-remarks-state-leaders-achieves-american-diploma-project-leadership-team-meeting">promised</a> to usher in the “next generation of assessments” that move “beyond the bubble tests.”<a id="1" name="1"></a><a id="2" name="2"></a></div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Duncan invested </span><a href="http://www.ed.gov/news/press-releases/us-secretary-education-duncan-announces-winners-competition-improve-student-assessments">hundreds of millions</a><a href="#Footnotes"><sup>1</sup></a> of federal dollars into supporting the development of two sets of <a href="https://www.the74million.org/flashcard/understanding-the-common-core-what-it-is-what-it-isnt/1">Common Core</a>–aligned tests — called PARCC<a href="#Footnotes"><sup>2</sup></a> and Smarter Balanced — that are now in use in many states across the country.</div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">The goals of these tests were ambitious. As Duncan put it, ”</span>I am convinced that this new generation of state assessments will be an absolute game-changer in public education. For the first time, millions of schoolchildren, parents, and teachers will know if students are on-track for colleges and careers — and if they are ready to enter college without the need for remedial instruction. … For the first time, many teachers will have the state assessments they have longed for — tests of critical thinking skills and complex student learning that are not just fill-in-the-bubble tests of basic skills but support good teaching in the classroom.”</div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">However, a </span><a href="http://www.cleveland.com/metro/index.ssf/2015/06/ohio_dumps_the_parcc_common_core_tests_after_woeful_first_year.html">number</a> <a href="http://blogs.edweek.org/edweek/state_edwatch/2015/06/maine_leaves_common-core_test_consortium.html">of states</a> have dropped the federally supported tests and instead created their own version of Common Core-aligned assessments. By recent counts about half of the states that adopted Common Core are using either Smarter Balanced or PARCC; others are using home-brewed exams that purport to align to the standards.</div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">So has Duncan’s vision come to pass? Here’s what we know about the new tests and how they’re different (and the same) as the ones they replaced:<a id="3" name="3"></a></span></div>
<ol style="list-style-type:decimal;">
<li dir="ltr">
<div class="article__intro" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Smarter Balanced and PARCC are pretty well aligned with the Common Core standards.</span></div>
</li>
</ol>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53"><a href="http://edexcellence.net/publications/evaluating-the-content-and-quality-of-next-generation-assessments">A new study</a> by Nancy Doorey and Morgan Polikoff, released by the Fordham Institute, a pro–Common Core think tank, compares Smarter Balanced and PARCC to the Massachusetts state test — considered by some the gold standard of state assessments — and the ACT Aspire, a test used by three states and made by the company that also produces the college admissions exam, the ACT.<a href="#Footnotes"><sup>3</sup></a></span></div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">The report finds that the new assessments generally do a better job, particularly in English, of emphasizing the most important content in the Common Core and requiring “the range of thinking skills, including higher order skills, called for by those standards.”</span></div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Whether this is a good thing depends on what you think of the Common Core and whether its emphasis on gathering evidence from the text and explaining mathematical thinking is worthwhile. But for states that are using one of the federally backed tests this is certainly good news, since academic standards and the tests used to assess them should be aligned.</span></div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">As Polikoff put it, “</span>Even if you're not a fan of Common Core, [it] is the set of standards that's in place in most states … We want those assessments to accurately assess what's in the standards."</div>
<ol style="list-style-type:decimal;">
<li dir="ltr" value="2">
<div class="article__intro" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Smarter Balanced and PARCC have a lot more questions that aren’t multiple choice.</span></div>
</li>
</ol>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">True to Duncan’s word, both sets of new assessments significantly reduce the reliance on multiple choice items as compared to the ACT Aspire and the Massachusetts state test. In fact, according to the Fordham study, most questions on both the PARCC and Smarter Balanced tests are not traditional multiple choice. </span></div>
<hr />
<div class="article-image-container">
<div class="article__paragraph" title=""><img decoding="async" class="article-image" src="https://www.the74million.org/wp-content/uploads/2017/01/1455145259_1556.png" /></div>
</div>
<hr />
<div class="article__paragraph" dir="ltr"><span>However, it’s not readily apparent just how different</span> these new types of questions actually are. Some of them combine traditional multiple choice questions with questions where students have to cite sections of the text in their answer or they require the student to perform a task on a computer, such as dragging and dropping or highlighting text.</div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Check out sample questions from </span><a href="http://www.smarterbalanced.org/sample-items-and-performance-tasks/">Smarter Balanced</a> and <a href="http://www.parcconline.org/assessments/practice-tests">PARCC</a> to judge for yourself.</div>
<ol style="list-style-type:decimal;">
<li dir="ltr" value="3">
<div class="article__intro" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Smarter Balanced and PARCC take a lot longer for students to complete.<a id="4" name="4"></a></span></div>
</li>
</ol>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Probably because of the inclusion of so many non-traditional questions — which are more time intensive — Smarter Balanced and PARCC take about five-and-a-half and seven-and-a- half hours, respectively. In contrast, both the ACT Aspire and Massachusetts exam take a little over three hours, according to the Fordham study.<a href="#Footnotes"><sup>4</sup></a></span></div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">In response to complaints about length, PARCC </span><a href="http://www.nj.com/education/2015/05/parcc_states_vote_to_shorten_testing_time.html">shortened</a> its test by an hour and a half, though at six hours it’s still significantly longer than the ACT Aspire and Massachusetts test. Meanwhile New York recently <a href="http://ny.chalkbeat.org/2016/01/27/students-will-not-face-time-limits-on-this-years-state-tests-official-says/">announced</a> that its Common Core-aligned assessments would be untimed. This highlights a key trade off in test design: more in-depth, challenging tasks mean longer test times. But state officials may be damned if they do and damned if they don’t since members of the growing opt-out movement have complainted about both too many multiple choice questions and that the tests take too long.</div>
<ol style="list-style-type:decimal;">
<li dir="ltr" value="4">
<div class="article__intro" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Smarter Balanced and PARCC both have some quality issues to work out.</span></div>
</li>
</ol>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Although the Fordham report gives the tests high marks on alignment with the Common Core, the study also raises some pointed questions about basic quality issues with the exams.</span></div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">On the math tests, reviewers gave the ACT Aspire and Massachusetts exam higher marks on overall quality than both Smarter Balanced and PARCC. A small number of questions on those exams were difficult to read or inaccurate</span>. In a few cases, questions appeared to have more than one right possible answer, the Fordham report notes.</div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">On the English tests, ACT Aspire, the Massachusetts state test, and PARCC all got top marks, but Smarter Balanced got a lower rating because of instances of spelling errors and questions with more than one potentially correct answer.</span></div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">The report notes, ”Although this concern applies to a small percentage of items, the review panels expressed the need that a very high bar be set on the quality of items used on consequential tests.”  </span></div>
<ol style="list-style-type:decimal;">
<li dir="ltr" value="5">
<div class="article__intro" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Next generation assessments are more likely to be completed on a computer — but that’s led to some major problems.<a id="5" name="5"></a></span></div>
</li>
</ol>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Students now take Smarter Balanced, PARCC and a number of state tests on a computer, with the hope that the technology will allow for more item variety, fewer security issues, and quicker turnaround of results.<a href="#Footnotes"><sup>5</sup></a></span></div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">But this ideal has run into numerous challenges as states across the country — including </span><a href="https://www.the74million.org/article/how-a-months-long-delay-on-test-scores-could-be-the-final-blow-for-indianas-istep-test">Indiana</a>, <a href="http://tn.chalkbeat.org/2016/02/08/tennessee-ed-officials-are-back-to-the-drawing-board-after-online-testing-fiasco/#.VrodzLTZfww">Tennessee</a>, <a href="http://missoulian.com/news/state-and-regional/computer-glitches-hit-common-core-tests-in-montana/article_ea3e5761-1528-5f43-ba01-59dd9398095a.html">Montana</a>, <a href="http://www.jsonline.com/news/education/latest-glitch-delays-common-core-exam-in-wisconsin-b99469929z1-297708641.html">Wisconsin</a>, and <a href="http://lasvegassun.com/news/2015/apr/14/computer-glitch-halts-common-core-testing-nevada/">Nevada</a>, among <a href="https://www.washingtonpost.com/news/answer-sheet/wp/2015/04/25/more-than-a-dozen-states-report-trouble-with-computerized-common-core-tests/">others</a> — have struggled to get technology-based testing off the ground.</div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Smarter Balanced and PARCC also have paper-and-pencil versions of their tests that are supposed to be used if a computer-based option is not feasible. But a </span><a href="http://www.edweek.org/ew/articles/2016/02/03/parcc-scores-lower-on-computer.html">recent report</a> from Education Week showed that PARCC scores were significantly lower for students who took the exam by computer, raising questions about whether the two versions were equivalent. Some <a href="http://blogs.edweek.org/edweek/DigitalEducation/2016/02/comparing_paper_computer_test_scores_research.html">research</a> has found that students less familiar with technology don’t perform as well on computer-based exams.<a id="6" name="6"></a></div>
<ol style="list-style-type:decimal;">
<li dir="ltr" value="6">
<div class="article__intro" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">PARCC predicts college success at a similar rate as the Massachusetts state test and the SAT.</span></div>
</li>
</ol>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">A 2015 </span><a href="http://www.mathematica-mpr.com/our-publications-and-findings/publications/predictive-validity-of-mcas-and-parcc-comparing-10th-grade-mcas-tests-to-parcc-integrated-math-ii">study</a> found that PARCC, the Massachusetts state exam, and the SAT all were modestly predictive of first-year grades in college; none stood out as better or worse.<a href="#Footnotes"><sup>6</sup></a></div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">On the one hand, this finding suggests that PARCC is no better — at least in terms of assessing college readiness — than the test Massachusetts was already using. On the other, the Massachusetts exam has long been seen as a high-quality test, so for PARCC to be viewed as equally good can be portrayed as positive news. Of course, predicting first-year grades </span><a href="http://morganpolikoff.com/2015/10/27/do-the-content-and-quality-of-state-tests-matter/">may not be the only way</a> to judge the quality of state exams; for instance, there is also alignment with state standards.</div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">For what it’s worth, officials in Massachusetts </span><a href="https://www.bostonglobe.com/metro/2015/11/17/state-education-board-vote-whether-replace-mcas/aex1nGyBYZW2sucEW2o82L/story.html">decided to adopt</a> a hybrid exam that includes aspects of both PARCC and its existing state test.<a id="7" name="7"></a><a id="8" name="8"></a></div>
<ol style="list-style-type:decimal;">
<li dir="ltr" value="7">
<div class="article__intro" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">PARCC and Smarter Balanced may help better distinguish between more and less effective teaching.</span></div>
</li>
</ol>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">A new </span><a href="http://cepr.harvard.edu/files/cepr/files/teaching-higher-report.pdf?m=1454988762">study</a><a href="#Footnotes"><sup>7</sup></a>, released by Harvard’s Center for Education Policy Research, found that PARCC and Smarter Balanced tests had more “instructional sensitivity” than previous tests used in several states.<a href="#Footnotes"><sup>8</sup></a> What this means is that there was more fine-tuned variation in teachers’ impacts on how students scored on the Smart Balanced and PARCC tests than past state tests. This suggests that the new assessments may be better suited for use in <a href="https://www.the74million.org/flashcard/test-scores-and-teacher-evals-a-complex-controversy-explained/1">teacher evaluation</a>.<a id="9" name="9"></a></div>
<ol style="list-style-type:decimal;">
<li dir="ltr" value="8">
<div class="article__intro" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Some teachers say the new tests are better — but most don’t want them to count in teacher evaluation.</span></div>
</li>
</ol>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">The National Network of State Teachers of the Year, which supports Common Core, </span><a href="http://www.nnstoy.org/wp-content/uploads/2015/11/Right-Trajectory-FINAL.pdf">assembled a panel</a> of top teachers to compare PARCC and Smarter Balance to previous state tests.<a href="#Footnotes"><sup>9</sup></a> The teachers overwhelmingly believed that the new tests were better indicators of quality teaching and learning than the old ones. However, the Harvard <a href="http://cepr.harvard.edu/files/cepr/files/teaching-higher-report.pdf?m=1454988762">study</a>, mentioned earlier, found that teachers in several states only felt moderately prepared to teach students what they needed to know for the Smarter Balanced or PARCC assessments.<a id="10" name="10"></a></div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">Teachers are skeptical of having the new tests count towards their evaluation: a </span><a href="http://www.gallup.com/poll/178997/teachers-favor-common-core-standards-not-testing.aspx">Gallup poll</a> reported that eighty 80 percent did not want new Common Core test scores ever linked to their evaluations. Still, the Harvard study found students scored higher on math tests when they were connected to their teacher’s evaluation.<a href="#Footnotes"><sup>10</sup></a></div>
<ol style="list-style-type:decimal;">
<li dir="ltr" value="9">
<div class="article__intro" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">There’s still a lot to learn about these new assessments.</span></div>
</li>
</ol>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">One of the most important unanswered questions is how PARCC and Smarter Balanced compare to the many state assessments that have been redesigned to align with the Common Core. There just isn’t much evidence on whether these tests are Common Core–aligned in name only — </span><a href="http://www.edweek.org/ew/articles/2015/03/04/most-math-curricula-found-to-be-out.html">like many textbooks</a>  — or are actually doing a good job assessing the skills in the standards.</div>
<div class="article__paragraph" dir="ltr"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">It’s also not clear how to think about some of the tradeoffs and purported benefits of these new tests. Is the extra time required worth it? How valuable is the is the ability to compare student test scores across states, even as many states have dropped out? Will the technological glitches eventually work themselves out?</span></div>
<div class="article__paragraph"><span id="docs-internal-guid-d075130f-cd65-66d4-ce39-8920cc70bf53">And perhaps the key unknown question is a political one: Will any new states adopt the Smarter Balanced or PARCC — or will any more leave?</span></div>
<div class="article__paragraph">
<hr />
<div class="article__intro"><a id="Footnotes" name="Footnotes"></a>Footnotes:</div>
<div class="article__paragraph">
<p><em>1. <span style="line-height: 1.833; letter-spacing: 0.045em;">Still, tests account for only a small fraction — just $27 per student according to a 2012 </span><a href="http://www.brookings.edu/~/media/research/files/reports/2012/11/29-cost-of-assessment-chingos/11_assessment_chingos_final_new.pdf" style="line-height: 1.833; letter-spacing: 0.045em;">study</a><span style="line-height: 1.833; letter-spacing: 0.045em;"> — of education spending nationally. (<a href="#1">return to story</a>)</span></em></p>
<p><em><span style="line-height: 1.833; letter-spacing: 0.045em;">2. PARCC stands for Partnership for Assessment of Readiness for College and Career. (<a href="#2">return to story</a>)</span></em></p>
<p><em>3. It’s important to note that in an appendix to the report, responses from all four testing groups indicate ongoing efforts to improve the tests, including in response to finding from the Fordham study. <span style="line-height: 1.833; letter-spacing: 0.045em;">(<a href="#3">return to story</a>)</span></em></p>
<p><em>4. These estimates are based on the average time to complete both the math and English sections for each per, in grades 5–8. <span style="line-height: 1.833; letter-spacing: 0.045em;">(<a href="#4">return to story</a>)</span></em></p>
<p><em>5. Another potential benefit of computer-based assessments is that they can be “adaptive” (as opposed to “fixed form”). This means that test changes difficulty based on students’ performance, measured by how they do on previous questions. The idea is that the questions will be at an appropriate level to challenge students, and can measure growth of both high- and low-achieving students. Smarter Balanced’s summative <a href="http://www.smarterbalanced.org/smarter-balanced-assessments/computer-adaptive-testing/">assessment</a> is computer adaptive, but PARCC’s isn’t. <span style="line-height: 1.833; letter-spacing: 0.045em;">(<a href="#5">return to story</a>)</span></em></p>
<p><em>6. The study has a significant limitation, however, in that it is based on administering each test to first-year college students and then looking at how well those students’ scores correlated with their GPA. Ideally, the study would look at how well test performance in high school correlated with success in college. But because such a study would take a significantly longer time — multiple years — to conduct, the researchers used this less ideal, but still informative approach. <span style="line-height: 1.833; letter-spacing: 0.045em;">(<a href="#6">return to story</a>)</span></em></p>
<p><em>7. Disclosure: This study was funded in part by Bloomberg Philanthropies, which is also a <a href="https://www.the74million.org/page/supporters">funder</a> of The Seventy Four. <span style="line-height: 1.833; letter-spacing: 0.045em;">(<a href="#7">return to story</a>)</span></em></p>
<p><em>8. The states studied were Delaware, Massachusetts, Maryland, New Mexico, and Nevada. <span style="line-height: 1.833; letter-spacing: 0.045em;">(<a href="#8">return to story</a>)</span></em></p>
<p><em>9. Specifically, the state tests examined were those of New Hampshire, Delaware, Illinois, and New Jersey. <span style="line-height: 1.833; letter-spacing: 0.045em;">(<a href="#9">return to story</a>)</span></em></p>
<p><em>10. The study is correlational, so it can’t be assumed that linking test scores to teacher evaluation had a causal effect on student test scores. <span style="line-height: 1.833; letter-spacing: 0.045em;">(<a href="#10">return to story</a>)</span></em></p>
</div>
</div>
</div>

Contact Us

Follow Us

Explore

Have We Built a Better Test? 9 Ways ‘Next Generation Assessments’ Are Different — and the Same

Untangle Your Mind!

Most Popular

Big Tax Bill Passes Senate With Less ‘Beautiful’ Plan for National School Choice

More Than a Third of Homeschool Families Also Use Public Schools, New Data Shows

Tiny Indiana District With Online School Worth Millions Ordered To Close

It’s Time to Reject Chronic Absenteeism as the New Normal in Student Attendance

Suspensions for Students with Disabilities Are Far More Frequent in These States

On The 74 Today