During the impending NFL strike in 2011—the act of a union—I drew a comparison between how the public in the U.S. responds to unionization in different contexts:
“I am speaking about the possible NFL strike that hangs over this coming Super Bowl weekend: a struggle between billionaires and millionaires, which, indirectly, shines an important light on the rise of teacher and teacher union-bashing in the US. Adam Bessie, in Truthout, identifies how the myth of the bad teacher has evolved.”
Once again, the NFL is facing a situation that I believe and even hope is another harbinger of how education reform can be halted: A suit filed by the family of Junior Seau:
“The family said the league not only ‘propagated the false myth that collisions of all kinds, including brutal and ferocious collisions, many of which lead to short-term and long-term neurological damage to players, are an acceptable, desired and natural consequence of the game,’ but also that ‘the N.F.L. failed to disseminate to then-current and former N.F.L. players health information it possessed’ about the risks associated with brain trauma.”
This law suit has prompted a considerable amount of debate concerning whether or not the NFL as we currently know it could be dramatically reconfigured under the pressure of more law suits. In other words, the inherent but often ignored or concealed dangers of football are now being exposed by legal action, in much the same way as the tobacco industry was unmasked and thus the entire culture of smoking has radically changed in the last couple decades.
With the release of the Education Policy Analysis Archives (EPAA) Special Issue on “Value-Added Model (VAM) Research for Educational Policy,” a similar question should now be raised about the future of implementing high-stakes accountability policies that focus on teacher evaluation and retention through VAM-style metrics.
“High-Stakes Implementation of VAM,…Premature”
Two articles in the special issue from EPAA examines the validity and reliability of VAM-based teacher evaluation in high-stakes settings and then places these policies in the context of legal ramifications faced by districts and states for those policies.
“The Legal Consequences of Mandating High Stakes Decisions Based on Low Quality Information: Teacher Evaluation in the Race-to-the-Top Era” (Baker, Oluwole, & Green, 2013) identifies the current trend: “Spurred by the Race-to-the-Top program championed by the Obama administration and a changing political climate in favor of holding teachers accountable for the performance of their students, many states revamped their tenure laws and passed additional legislation designed to tie student performance to teacher evaluations” (p. 3). Because of the political and public momentum behind reforming teacher evaluation, Baker, Oluwole, and Green seek “to bring some urgency to the need to re-examine the current legislative models that put teachers at great risk of unfair evaluation, removal of tenure, and ultimately wrongful dismissal” (p. 5).
While Baker, Oluwole, and Green offer a detailed and evidence-based examination of the VAM-based and student growth model approaches to high-stakes teacher accountability, they ultimately place the weaknesses of reform policies in the context of potential challenges from teachers who believe they have been wrongfully evaluated or dismissed:
“In this section, we address the various legal challenges that might be brought by teachers dismissed under the rigid statutory structures outlined previously in this article. We also address how arguments on behalf of teachers might be framed differently in a context where value-added measures are used versus one where student growth percentiles are used. Where value-added measures are used, we suspect that teachers will have to show that while those measures were intended to attribute student achievement to their effectiveness, the measures failed to do so in a number of ways. That is, where value-added measures are used to assign effectiveness ratings, we suspect that the validity and reliability, as well as understandability of those measures would need to be deliberated at trial. However, where student growth percentiles are used, we would argue that the measures on their face are simply not designed for attributing responsibility to the teacher, and thus making such a leap would necessarily constitute a wrongful judgment. That is, one would not necessarily even have to vet the SGP measures for reliability or validity via any statistical analysis, because on their face they are invalid for this purpose.”
The analysis ultimately discredits both the use of narrow metrics to determine teacher quality and the high-stakes policies being implemented using those metrics, concluding with the ironic consequences of these policies: “Overly prescriptive, rigid teacher evaluation mandates, in our view, are likely to open the floodgates to new litigation over teacher due process rights. This is likely despite the fact that much of the policy impetus behind these new evaluation systems is the reduction of legal hassles involved in terminating ineffective teachers” (pp. 18-19).
In “Legal Issues in the Use of Student Test Scores and Value-added Models (VAM) to Determine Educational Quality” (Pullin, 2013), the rapid increase of VAM-based accountability is further examined in the context of “a wide array of potential legal issues [that] could arise from the implementation of these programs” (p. 2).
Pullin notes the motivation for reforming teacher evaluation:
“VAM initiatives are consistent with a highly publicized press from the business community and many politicians to make government services more like private business, data-driven to measure productivity and accountability (Kupermintz, 2003). VAM approaches are in part a response to concerns that the current system of selecting and compensating teachers based their education and credentials is insufficient for insuring teacher quality (Corcoran, 2011; Gordon, Kane & Staiger, 2006; Hanushek & Rivkin, 2012; Harris, 2011). There have been increasing expressions of concern that teacher evaluation practices are not robust and do not improve practice (Kennedy, 2010). In the contemporary public policy context, much of the support for the use of student test scores for educator evaluation comes from a concern that the current system for evaluation is ineffective and that the current legal protections for teachers are too cumbersome for schools seeking to terminate teachers (Harris, 2009, 2011).”
While a business model for addressing quality control of a work force may seem efficient, Pullin highlights that legal ramifications are likely with these new models.
Pullin’s analysis offers a detailed and useful examination of previous court cases involving the use of test scores to evaluate educators, including recent cases involving VAM, concluding that the picture is not clear on how the courts may rule in the future, but that a pattern exists of “heavy judicial deference to state and local education policymakers and the allure of using test scores to make decisions about education quality” (p. 5).
Further, Pullin notes “there are differences of perspective among social scientists about VAM and the defensibility of using it to make high-stakes decisions about educators,” further complicating the concerns of legal action (p. 9).
While raising many other complications, Pullin also notes that students and parents may enter legal battles using VAM metrics “to substantiate their own legal claims that schools are not meeting their obligations to provide education” (p. 14).
Pullin concludes with a sobering look at teacher quality reform built on VAM and implemented in high-stakes environments:
“In the broad contemporary public policy context for education reform, the desire for accountability and transparency in government, coupled with heavily financed criticisms of public school teachers and their unions, may mean that VAM initiatives will prevail. The concerns of education researchers about VAM, coupled with legal obligations for the validity and reliability of education and evaluation programs should require judges and education policymakers to take a closer look for future decision-making. At the same time, the social science research community should be generating substantial new and persuasive evidence about VAM and the validity and reliability of all of its potential uses. For public policymakers, there are strong reasons to suggest that high-stakes implementation of VAM is, at best, premature and, as a result, the potential for successful legal challenge to its use is high. The use of VAM as a policy tool for meaningful education improvement has considerable limitations, whether or not some judges might consider it legally defensible.” (p. 17)
Like the NFL, federal and state governments may soon be compelled to reform the reform movement under the threat of legal action from a variety of stakeholders since the science of teacher evaluation remains far behind the curve of implementation, particularly when teacher evaluation is high-stakes and based on VAM and other metrics linked to student test scores.
The special issue from EPAA is yet another call for political leadership to pause if not end wide-scale teacher evaluation and retention models that pose legal, statistical, and funding challenges that those leaders appear unwilling to acknowledge or address.
Blogging at Education Week, Larry Ferlazzo posted a series of blogs addressing ways to prepare students for Common Core State Standards (CCSS) in English/Language Arts. In a response post, Ferlazzo and Stephen Krashen—an outspoken scholar, along with Susan Ohanian, who steadfastly rejects implementing CCSS and the inevitable tests to follow—shared a series of exchanges.
Krashen, in part, argues that implementing flawed practice simply because CCSS requires them is inexcusable:
No. There is no evidence supporting this view. There is massive evidence for the superiority of comprehensible input/reading as by far the best way (really the only way) to develop academic vocabulary and academic writing. Just because the common core demands these competencies, doesn’t mean we should use ineffective and painful methods to try to teach them.
Ferlazzo takes a different view, one committed to implementing CCSS as well as possible since their adoption is a done deal, he believes:
I can think of no realistic political scenario that would stop Common Core from being implemented for at least ninety percent of millions of teachers and students in the United States. I have also not heard anyone else share one, though I am all ears….
Given that political reality on the ground, I think the political capital of teachers, students and their families is better spent on other issues that also affect the working and learning conditions in our schools and the living conditions in our communities — teacher evaluation procedures, adequate funding for schools, class size, parent engagement — just to name a few. In my political judgment, teachers and their allies are much more likely to be able to influence those issues.
While I think it’s useful to debate which instructional strategies might be most engaging and effective for our students and also enable teachers to say they are implementing Common Core, I just [think] it’s less useful to fight a battle that has already been lost.
Given the tremendous political, professional, and commercial momentum behind CCSS, Ferlazzo appears to have a solid point. But this exchange raises an important question about fatalism and teacher professionalism that is much larger than just debating CCSS
Fatalism and Teacher Professionalism
The debate between Ferlazzo and Krashen mirrors a similar debate within the National Council of Teachers of English, one in which Krashen, Ohanian, and I have had little success as we have argued for teacher professionalism and autonomy instead of implementing CCSS and preparing students for the tests with commercial materials focusing on those standards and the new tests.
Concurrent with the debate at EdWeek, as well, has been faculty at Garfield High School refusing to implement MAP testing. Jesse Hagopian, a teacher at Garfield, explains:
America faces incredible challenges: endless war, climate change and worldwide economic implosion. Our kids will need both traditional academic abilities and innovative critical-thinking skills to solve these real problems. If we inundate our students with standardized testing year-round, these larger lessons are lost.
Garfield’s teachers are preparing students for the real-life tests they will face, and reject the computer multiple-choice rituals that fail to measure grade-level content — not to mention character, commitment, courage or talent.
Since this act of professional conscience by Garfield teachers, a group of educators has issued a statement of support, rejecting the misuse and abuse associated with high-stakes standardized tests.
If implementing CCSS is inevitable as Ferlazzo claims and if school, district, state, or federal mandates will continue to support those standards and the related high-stakes tests, teaching is reduced to an act of fatalism, and in effect, teachers are de-professionalized and students are similarly reduced to passive recipients of state-mandated knowledge, what Paulo Freire (1998) labeled as “the bureaucratizing of the mind” (p. 102).
Fatalism about inevitable education reform or current policy and practices benefits neither students nor teachers—and ultimately devalues education in a free society.
For students, Freire challenges the prescriptive nature of standards and high-stakes testing stemming from a neoliberal ideology:
If I am a pure product of genetic, cultural, or class determination, I have no responsibility for my own action in the world and, therefore, it is not possible for me to speak of ethics….It means that we know ourselves to be conditioned but not determined. It means recognizing that History is time filled with possibility and not inexorably determined—that the future is problematic and not already decided, fatalistically….The most dominant contemporary version of such fatalism is neoliberalism….From the standpoint of such an ideology, only one road is open as far as educative practice is concerned: adapt the student to what is inevitable, to what cannot be changed. In this view, what is essential is technical training, so that the student can adapt and, therefore, survive. This book…is a decisive NO to an ideology that humiliates and denies our humanity. (pp. 26-27)
If teachers, then, see CCSS implementation or fulfilling ploicies to implement MAP testing as requirements of their role as compliant workers, they have succumbed to “conformity in the face of situations considered to be irreversible because of destiny,” Freire explains (1998, p.102). Then, “To that degree, there is no room for choice. There is only room for well-behaved submission to fate. Today. Tomorrow. Always,” Freire believes, adding, “I have always rejected fatalism. I prefer rebelliousness because it affirms my status as a person who has never given in to the manipulations and strategies designed to reduce the human person to nothing” (pp. 102-103).
And here is where I must side with Krashen.
To see CCSS or MAP testing as inevitable, to see our roles as educators being reduced to technicians working to implement CCSS or MAP testing as well as possible, to allow students to be reduced to “a pure product of genetic, cultural, or class determination” is to render both teachers and students fatalistic—both as tools of others’ determinations and as products of those who create the inevitable system.
The financial, cultural, and human costs of fatalism are simply too high.