Fourth and Eighth-graders across New York State are in the midst of taking their 2012 standardized “science performance tests.” With some exceptions (see the May 2012 memo issued by the NYS Department of Education), “all public school students in Grade 4 and Grade 8 must take the State assessments administered for their grade level.”
The 2012 manual for administrators and teachers involved in the fourth-grade tests says that NYS Science Performance Test is meant “to serve as a basis for determining students’ needs for academic intervention services in science.” The test is designed to measure the content and skills contained in the Elementary-Level Science Core Curriculum, Grades K–4. It has two parts, a written test and a performance test.
The Written Test consists of multiple-choice and open-ended questions and requires about one hour to administer. The Performance Test (Form A) consists of hands-on tasks set up at three stations and requires about 75 minutes to administer.
Here is a question from the 2011 written test:
It rained on a hot summer afternoon and a puddle formed. After several hours, the puddle was gone. Which two processes made the puddle form and then disappear?
A) precipitation followed by evaporation
B) deposition followed by evaporation
C) precipitation followed by runoff
D) deposition followed by runoff
The hands-on portion of the 2012 test involves setting up stations along prescribed diagrams. The three stations are named: Measuring Objects and Liquids, Electrical and Magnetic Testing, and Ball and Ramp. At each station, test administrators are supposed to print out the diagram and fold it on the dotted line and tape it to the bottom of the station so that the diagram faces the student.
Here is what the “measuring station” looks like:

How wonderfully mechanistic and well-organized. If only it all proceeded like clockwork, giving educators diagnostic tools that they could immediately use as “a basis for determining students’ needs for academic intervention services in science.”
Is the testing system set up to achieve this goal? Probably not.
Students are tested in late May each year. They finish school and head into a long vacation where much of the material they were tested on likely slips their minds. If educators a) spent the summer carefully analyzing each student’s results in the context of the instruction they received in the preceding elementary school years, b) accurately identified the gaps in each student’s conceptual knowledge, and then c) planned the types of interventions that will happen in the following years to close those gaps, there could be hope of successful intervention.
But having this productive conversation of “How do we use this information to improve things?” is not the normal inclination that surfaces when test results are released. That is in part because of how those of us who write about and study educational policy react when results are released. We are rarely neutral or scientific in how we present standardized science test results.
Instead we want to attack or defend. We get into arguing about issues like whether or not standardized testing is even valid; whether it is even a reasonable proxy for “real learning” (especially in the sciences); whether test results should be used to determine teacher performance or pay; and whether we can address the great imbalances in achievement that we see by school, district or socioeconomic group.
These discussions are important. There is no doubt they will adamantly continue. But just once in a while, perhaps, at least for the interest of arguments’ sake, we in the pundit world might consider if and how the testing that is happening (and will continue to happen no matter what we write) could have some useful purpose. We think for a living. Perhaps we could open our minds and consider this particular thought experiment. If we care about education and want to support educators and students in the trenches, we owe them that much.






