RS is the average of scores given by two human readers;
all the others are computer programs.
To anyone who’s ever written an essay for a standardized test—be it the SAT, the ACT, the GMAT, or others—it should come as no surprise that getting a high-scoring essay is a matter of following a formula. The SAT is not the time to show off your lyrical ability or demonstrate your awareness of the nuances of morality: when the prompt is “Is it better to have loved and lost than never loved at all?” it’s hard to argue “It depends” in 25 minutes. Just take a stance, come up with two supporting examples, and hammer that baby out.
Turns out, though, that standardized test essays are so formulaic that test-scoring companies can use algorithms to grade them. And before you get worried about machines giving you a bad score because they’ve never taken an English class, said algorithms give the essays the same scores as human graders do, according to a large study that compared nine such programs with humans readers. The team used more than 20,000 essays on eight prompts, and you can see in the figure to the right, the mean scores found by the programs and the people were so close that they appear as one line on a chart of the results.
The way bubbles are filled in encodes quite a bit of identifying information
What’s the News: Standardized tests aren’t as impersonal as you might think. Much as detectives analyze a note’s handwriting to pinpoint its author, scientists have developed a way to identify test-takers, voters, and so on just from the way they fill in bubbles.