Using value-added data to evaluate Tennessee teachers

To close the achievement gap between poor and affluent students in Tennessee, some students may need to learn at double the rate of their high-performing peers, according to Tennessee Department of Education materials.

But this goal could create a potential Catch-22 for teachers, who for the first time this year will be measured—and rated—on whether all of their students make large gains on standardized tests, as determined by the controversial statistical formula known among researchers as “value-added modeling.”

Tennessee teachers — Millington Middle School teacher Kay Obenchain walks through her 7th grade enriched math class to check on students as they review for testing. Obenchain has been teaching full-time at the school for 8 years. (Mike Brown/The Commercial Appeal)

“There’s something suspicious about that formula,” said Keith Williams, president of the Memphis Education Association, a teachers union. “You’re using something that has some real flaws.”

In Tennessee, 45 percent of teachers teach in subjects with standardized tests, and for more than a decade, Tennessee has rated these teachers using their students’ progress on the tests. School officials use complex statistics to predict how individual students will perform, based on their past scores.

Teachers whose students achieve higher than predicted scores are deemed highly effective. Teachers whose students don’t hit their predicted marks are seen as less so.

Until now, the state did nothing more than report the data to districts, with no consequences for teachers. This year, however, student test-score growth will count for 35 percent of a teacher’s year-end evaluation. Districts will use the data to decide which teachers deserve tenure and which should be fired. (Another 15 percent of a teacher’s score is made up of achievement measures chosen by the district, and 50 percent is based on classroom observations and other measures.)

The 55 percent of teachers who don’t teach in subjects with standardized tests will be rated based on the test-score ratings of other teachers in their schools.

Teacher evaluations

The Hechinger Report and Memphis Commercial Appeal recently teamed up to produce a series on new teacher effectiveness measures in Tennessee.

Read the rest of the series

You can also read our previous series on the similar issues in Milwaukee and Florida.

Under the Tennessee 5-point rating system, teachers defined as a 3, or “at expectations,” are those whose students make at least a year’s worth of growth on state tests. To receive an “above expectations” score of 4 or 5, which new teachers must do for two years to get tenure, a teacher’s students must demonstrate more than a year of growth.

Whether to use test-score data in teacher hiring and firing decisions has fueled heated debates in states across the country. Until recently, most teachers were evaluated based only on infrequent classroom observations by principals. Now, more than two dozen states are looking to student test-scores to supplement observations, spurred on by the Obama administration’s Race to the Top federal grant competition in which Tennessee was a first-round winner.

“Relative to what exists today, ‘value-added’ does a much better job of predicting how a teacher is going to be in the future,” said Dan Goldhaber, director of the Center for Education Data & Research at the University of Washington. But, he added, “some people don’t think that test-scores are the right way to judge the output of students.”

The statistical formulas are highly complex—the one used in Tennessee is especially complicated—and, critics say, therefore not transparent. Research has suggested that the calculations are best used for identifying the very best and very worst teachers, but less reliable when it comes to rating teachers in the middle. A 2010 study by Mathematica Policy Research found that formulas using three years of test-score data misclassify 1 in 4 teachers.

Educators and researchers have also debated whether the models should account for poverty and other factors that can make a difference in how students perform. And teachers and advocates like Williams worry about “a ceiling effect,” in which teachers with high-achieving students receive low ratings because their students have less room for improvement.

“Research has shown practically no relationship between the entering academic achievement level for a class of students and a teacher’s subsequent value-added estimate,” Kelli Gauthier, a spokesperson for the Tennessee Department of Education, said. “There should be little concern that teachers with an entering high level of students would have difficulty receiving a 4 or 5 level status.”

William Sanders, a former University of Tennessee researcher who now works for SAS, a private business-intelligence company, developed Tennessee’s formula (and is often called the father of value-added modeling). SAS now administers the state’s teacher ratings based on standardized tests, and its formula is considered private intellectual property.

Sanders has countered critics calling for more transparency by arguing that his formula’s complexity makes it more accurate than simpler versions. The “layered model,” as it is called by researchers, collects between three and five years of test-score data for each student in as many subjects as possible, including reading, math, science and social studies, in order to make predictions about how a student will score on a given test.

It also looks into the “future,” says Sanders, recording how students do as they progress on to the next grade and giving credit to their previous teachers for how they perform.

The equations don’t factor in any individual student characteristics, like poverty or special-education status, in contrast to formulas in Florida and Washington, D.C. By comparing individual students to themselves over long periods of time, Sanders argues, statistical errors are reduced and “you don’t need to make any kind of adjustment.”

For some Memphis teachers, the biggest concern with the new system is the fact that the majority of teachers don’t teach in subjects with standardized tests.

“That’s the piece I don’t like,” said Detra Humble, a science teacher at Manassas High School in Memphis. “My level of performance is on the backs of other teachers.”

School administrators argue that shared scores will lead to more collaboration among teachers, however.

“The big lift is on” teachers of tested subjects, said Kriner Cash, superintendent of the Memphis City Schools. “What I say is it should not only be only on them. It should be on everybody.”

A version of this story appeared in the Memphis Commercial Appeal on February 6, 2012.

3 replies on “Using value-added data to evaluate Tennessee teachers”

At The Hechinger Report, we publish thoughtful letters from readers that contribute to the ongoing discussion about the education topics we cover. Please read our guidelines for more information. We will not consider letters that do not contain a full name and valid email address. You may submit news tips or ideas here without a full name, but not letters.

By submitting your name, you grant us permission to publish it with your letter. We will never publish your email address. You must fill out all fields to submit a letter.

Letters are closed