Not surprisingly, a composite teacher evaluation measure that mixes classroom observations and student survey results
with test score gains is generally no better and sometimes much worse at predicting out of sample test score gains.
In the very next sentence, he points out that the MET project has two stated premises guiding its work — that, whenever feasible, teacher evaluations should be based «to a significant extent» on student test score gains; and that other components of evaluations (such as observations), in order to be considered valid, must be correlated
with test score gains.
Even if observation scores are completely uncorrelated
with test score gains, parents and communities might prefer that teachers score well on them.
What's more, significantly improved predictive power from a mixture of classroom observations
with test score gains could have made the case for why we need both.
Not exact matches
The latest round of state standardized academic
test scores showed
gains both across New York State and locally.But rather than celebrate the largest bump since New York adopted new
tests tied to the Common Core Learning Standards, education officials reported the increases
with caution.
The agreement allows the new evaluation system to proceed, but delays the impact of state
test scores until teachers have
gained experience
with Common Core standards and
tests.
«In addition to
gains in achievement
test scores we also saw improvements in engagement
with school, such as an increase in attendance of about 2.5 weeks per year» said Jonathan Guryan, Associate Professor of Human Development and Social Policy in the Institute for Policy Research at Northwestern University and Co-director of the University of Chicago Urban Education Lab.
The largest
gains for the
test — the Kentucky Instructional Results System, or KIRIS — came in reading and mathematics,
with fewer students
scoring at the «novice,» or lowest, level and more students
scoring at the «proficient» and «distinguished» levels.
While these schools succeed in generating
test -
score gains for students of all cognitive abilities, it is still the case that students
with strong fluid cognitive skills learn more.
In the school
with more teachers, the diffusion of responsibility for
test -
score gains across many teachers may erode the incentive that any individual teacher has to increase effort in the classroom.
But
gains in precision obtained by increasing the number of students observed will be offset by losses associated
with failing to control for baseline
test scores.
For example, the effect of a one - hour later start time on math
scores is roughly 14 percent of the black - white
test -
score gap, 40 percent of the gap between those eligible and those not eligible for free or reduced - price lunch, and 85 percent of the
gain associated
with an additional year of parents» education.
By comparing each student's
gain to
gains among students who performed at a similar level and would have experienced a similar, natural shift toward the average
score, I can better separate legitimate
test -
score gains and losses from change associated
with mean reversion.
These new systems depend primarily on two types of measurements: student
test score gains on statewide assessments in math and reading in grades 4 - 8 that can be uniquely associated
with individual teachers; and systematic classroom observations of teachers by school leaders and central staff.
Thus we use a method that in effect compares the
test -
score gains of individual students in charter schools
with the
test -
score gains made by the same students when they were in traditional public schools.
And the situation is even worse because most regulators making decisions about what choice schools should be opened, expanded, or closed are not relying on rigorously identified
gains in
test scores — they just look primarily at the levels of
test scores and call those
with low
scores bad.
If charter schools were primarily established in response to dissatisfaction
with traditional public schools, they would tend to be located in areas
with low - quality traditional public schools where students would tend to make below - average
test -
score gains.
We are still left
with the question of whether regulators are any good at identifying which schools will contribute to
test score gains.
And adapting the models to accommodate a project requires a lot of skill — and bravery, if the publisher claims you won't get the promised
test score gains if you monkey around
with their sequence and instructional methods!
In our study, the teachers
with larger
gains on low - cost state math
tests also had students
with larger
gains on the Balanced Assessment in Mathematics, a more - expensive - to -
score test designed to measure students» conceptual understanding of mathematics.
But other states
with large spending increments — New York, Wyoming, and West Virginia, for example — had only marginal
test -
score gains to show for all that additional expenditure.
The council's Beating the Odds VI report, a city - by - city analysis of student performance, recently revealed that urban students»
scores on state assessments in reading and math as well as on the more rigorous federal
test — the National Assessment of Educational Progress (NAEP)-- are rising,
with urban students making the most
gains in mathematics.
Contrast this information
with what we know about the relationship between credentials and classroom effectiveness, as measured by student
test -
score gains.
Unfortunately, no excuses charters don't seem to produce long - term benefits that are commensurate
with their huge
test score gains.
States should report how a school's
test scores and
test -
score gains compare
with the
scores and
gains of demographically similar schools across the state and locally.
They suggest that in a school
with more teachers, «the diffusion of responsibility for
test -
score gains across many teachers may erode the incentive that any individual teacher has to increase effort in the classroom.»
Under the new rules, private schools
with 30 or more FTC scholarship students must release to the public
gain scores on standardized
tests for those students.
It's true that students from those schools who did enroll in post-secondary schooling were more likely to go to a 4 than 2 year college, but it is unclear if this is a desirable outcome given that it may be a mismatch for their needs and this more nuanced effect is not commensurate
with the giant
test score gains.
The
gains are not an artifact of the elimination of social promotion in 3rd grade or of the ease
with which low
test scores can be lifted.
Among African Americans,
test -
score gains were also larger in the states
with genuine alternative certification.
This year, the state of California distributed $ 100 million to teachers in schools that started
with test scores in the bottom half of schools in 1999 and achieved large
gains in performance between 1999 and 2000.
It is possible that the disproportionately large
gains in
test -
score performance in the states
with genuine alternative certification were due to some other factor, possibly other education reforms those states were introducing at the same time they were widening the door to the teaching profession.
Second, Rick thinks there is an inconsistency in my suspicion that
test - prep and manipulation are largely responsible for
test score improvements by Milwaukee choice schools after they were required to take high - stakes
tests, while I interpret research from Florida as showing schools made exceptional
test score gains when faced
with the prospect of having vouchers offered to their students if
scores did not improve.
However, given the amount of sampling variation and other non-persistent fluctuations in
test -
score levels and
gains, schools
with particularly low
test scores in one year would be expected to bounce back in subsequent years.
Also, there is much information to be
gained from having individual conversations
with students who have these contradictions between their standardized
test scores and their classroom grades and performance.
This argument begs the question about how large correlations should be to be considered as indicators of adult outcomes, and it also discounts recent research showing that
test scores improvements related to effective teachers were correlated
with gains in adult labor - market outcomes.
To
test for this possibility, we compared the
gains made by F schools
with the performance of an even smaller subset of schools whose 2002
test scores were similar but had never received an F (which we termed low - performing non-F schools).
With improved educational materials readily available, home schoolers are winning spelling and geography bees,
scoring off the charts on statewide
tests, and
gaining access to elite colleges.
Always - D schools, which, faced
with the real danger of receiving their first F, had some incentive to improve, made a relative
gain of 4.3 scale -
score points on the math FCAT and 1.3 percentile points on the Stanford - 9 math
test.
It is not enough to show that omitted family characteristics have not been confounded
with value - added as a predictor of future
test -
score gains.
Classrooms
with relatively big
gains on this year's
test and relatively small
gains on next year's
test will
score high on this indicator.
The second type of classroom, which we used as a control group, consisted of classrooms
with large
test -
score gains but no evidence of cheating in their answer strings, a sign of plain - old good teaching.
«We find that following a ban on phone use, student
test scores improve by 6.41 %... our results indicate that there are no significant
gains in student performance if a ban is not widely complied
with,» Beland and Murphy write in the LSE paper.
He also found that
gain scores from the two
tests were not as strongly correlated,
with a 0.61 correlation of PISA and TIMSS
gains from 2003 to 2015.
The researchers found, for example, that a school
with improvements in school climate and violence in one time period tended not to see
test score gains in a subsequent time period.
Arlington schools as a whole saw notable
gains in reading and math
scores,
with 86 percent and 87 percent of students districtwide passing those subject
tests.
Overall
scoring patterns in New York State remained largely unchanged,
with black and Hispanic students making small proficiency
gains but remaining at least 20 percentage points behind white
test - takers.
Nor is it the frequency
with which states and districts manipulate the
scoring of the
tests to produce inflated
gains.
All schools
with at least 30 students in grades 3 — 10 in two or more consecutive years will have standardized
test score gains analyzed by state researchers.
In just two years, Green Dot has improved
test scores and achieved impressive
gains in leading indicators, and has done so
with less funding per student than the Los Angeles Unified School District and the national average.