The Absurdity of the Hold Statistic
In yesterday's game against the Arizona Diamondbacks, Washington Nationals closer Drew Storen had what could only be described as a meltdown appearance. If not for the triage work of Todd Coffey, Sean Burnett and Henry Rodriguez, as well as a five-run 11th courtesy of a Rick Ankiel sacks-juiced walk and a Mike Morse grand slam, Storen would have squandered the curly-W.
All in all, Drew Storen's butcher job amassed a dismal -0.449 Win Probability Added (-0.457 WPA if you ask FanGraphs instead of Baseball Reference).
He also earned the hold.
Wait, what's that you say? Storen earned the hold even though it was his runners who tied the game? He earned the hold even though he was on the hook for the loss should Coffey allow another inherited runner to score?
Yes, that's correct. In fact, Storen's HD is the 10th worst in WPA terms since the stat was invented, and the 4th worst in a winning game. It's easily the most detrimental hold this season.
And had the Nationals lost in the 9th, Storen would have kept his HD and earned the loss, while Coffey would get the blown save and the Nationals would drop another game below .500. How is this possible? I'll tell you after the jump.
Ten Worst Holds (in WPA Terms) Since the Statistic was Invented in 1986:
| Rk | Player | Tm | Opp | Rslt | Scr | Dec | IP | ERA | WPA | RE24 | aLI | Date |
| 1 | Steve Karsay | TEX | SEA | W | 6-4 | H | 0.1 | 27.00 | -0.475 | -3.033 | 2.642 | 23-Aug-05 |
| 2 | Lee Guetterman | NYY | TOR | L | 7-11 | H, L | 0.2 | 27.00 | -0.474 | -2.639 | 3.037 | 25-Aug-91 |
| 3 | Willie Hernandez | DET | BOS | L | 5-6 | H, L | 0.1 | 81.00 | -0.470 | -3.076 | 3.350 | 06-Apr-88 |
| 4 | Dan Plesac | PIT | SFG | L | 10-12 | H, L | 0.1 | 108.00 | -0.469 | -3.038 | 2.236 | 13-Aug-96 |
| 5 | Carlos Marmol | CHC | CHW | W | 5-4 | H | 0.1 | 54.00 | -0.468 | -3.080 | 2.702 | 26-Jun-09 |
| 6 | Kyle McClellan | STL | FLA | W | 6-4 | H | 0.1 | 54.00 | -0.464 | -2.907 | 2.650 | 13-Aug-08 |
| 7 | Danny Kolb | ATL | WSN | L | 6-8 | H, L | 0.1 | 108.00 | -0.464 | -2.902 | 2.184 | 02-Jun-05 |
| 8 | Scott Garrelts | SFG | STL | L | 6-7 | H, L | 0.1 | 81.00 | -0.456 | -2.519 | 2.996 | 09-Jul-87 |
| 9 | Cal Eldred | STL | MIL | L | 4-9 | H, L | 0.1 | 108.00 | -0.455 | -2.736 | 2.162 | 16-Jun-03 |
| 10 | Drew Storen | WSN | ARI | W | 9-4 | H | 0.1 | 81.00 | -0.449 | -2.731 | 2.398 | 05-Jun-11 |
Before I continue, I should note that the hold is not an official MLB stat, like the save or the win. The hold was invented in 1986 by John Dewan (of Fielding Bible and STATS, Inc. fame) and Mike O'Donnell as a way to give middle relievers credit the way that the save proffers similar credit for closers. The definition varies, but typically a reliever earns a hold whenever he enters in a save situation, records at least one out, and leaves the game still in a save situation.
This is where the stat becomes problematic. Closers, by definition, cannot finish a losing game and earn a save at the same time. However, if the closer blows the save, the setup man still earns his hold. Moreover, if the setup man leaves the game after a hold opportunity, but the next pitcher allows an inherited runner to score the winning run, the first pitcher still earns the hold because he fulfilled the above criteria. Thus, a setup man can earn the HD and the L in the same outing.
In fact, this has happened quite often. Relievers have logged 1,652 holds in games where A) his team lost and B) he posted a zero or negative WPA since the advent of the stat. That's an average of 63 questionable holds per season, a great number of which qualified as losses. The total number of HDs "earned" in losing efforts since 1986 exceeds 5,000.
You may be thinking, "Sure, holds are irrational and arbitrary, but they're no worse than saves, right?" Wrong. You may already know that Fangraphs employs two stats that remedy inherent irrationality of holds and saves, called meltdowns and shutdowns. As further evidence of the absurdity of the hold stat, a whopping 3,278 holds classify as meltdowns since the hold was invented, compared to four saves.
I don't want to be too hard on the stat here. Dewan and O'Donnell were well-intentioned in trying to quantify the value of middle relief. I even include holds as a stat in my fantasy league in order to open up the player universe to all pitchers. That said, the history of the hold stat, including the embarrassing effort of Drew Storen in yesterday's game, serves to remind us that we need to move on from saves and holds.
A parting note: the all-time holds leader is former Yankees and Braves setup man Mike Stanton (not that one, the other one) with 266, of which 34 (13%) occurred in losing games.
Sources: Baseball Reference's Play Index; MLB Rules, Regulations and Statistics; Rob Neyer (link no longer available); Wikipedia
21 comments
|
0 recs |
Do you like this story?
Comments
missed an option
all three
Choose me, Ash!
by Pikachu on Jun 6, 2011 5:45 PM EDT reply actions 1 recs
Hah, fair enough
Too easy, though
Blogger and Editor, Rational Pastime Blog. Twitter: @RationalPastime.
What is the alternative counting stat?
I really do want to open up my fantasy league to more pitchers — but I also need to have reasonably well-understood counting stats rather than pure ratio categories. I can see quality starts rather than wins. But the whole saves/holds categories are a mess.
I’d like to see a category total bases against (as a measure of how hard batters hit v that pitcher) since that does seem to me to be a measure of how close they are getting to trouble. But that stat isn’t recorded anywhere common. And I’d prefer not to get into some arbitrary incomprehensible sabermetric counting stat.
what do other LM’s here do?
If you really want to run your own league, use Meltdowns and Shutdowns
Which aren’t arbitrary. Those WPA numbers are based on average outcomes in winning and losing efforts as calculated by Tom Tango. Of course, most fantasy sites don’t offer this stat, so this is really if you wanted to run a league like we did before the Internet, where the commissioner manually handles the numbers.
Blogger and Editor, Rational Pastime Blog. Twitter: @RationalPastime.
you should look into linear weights
You should consider switching to a linear weights based scoring system, like the one being used in Fangraphs’ Ottoneu leagues (http://ottoneu.fangraphs.com/support), which was based on a system developed by Justin Merry using Tiger Tango’s linear weights.
I was going to propose that
But I’m not sure that qualifies as “some arbitrary incomprehensible sabermetric counting stat.” or not.
Blogger and Editor, Rational Pastime Blog. Twitter: @RationalPastime.
I may join that next year
but yeah — that would qualify as incomprehensible – and I’m a guy who tutored stats in college. Incomprehensibility being – can this stat be easily explained to a regular baseball fan. If not, then the league will be composed entirely of fellow stat heads.
points breakdown
Although the linear weights that the scoring system is based on is pretty stats heavy, the actual categories aren’t.
Hitting
AB -1.0
H 5.6
2B 2.9
3B 5.7
HR 9.4
BB 3.0
HBP 3.0
SB 1.9
CS -2.8
Pitching
IP 5.0
K 2.0
BB -3.0
HBP -3.0
HR -13.0
SV 5.0
HOLDS 4.0
Can't you just say that it's based on the number of runs the events typically generate in an inning?
The calculations behind it are complex but the concept is relatively simple.
Blogger and Editor, Rational Pastime Blog. Twitter: @RationalPastime.
Loved this article. Nice research. It was probably difficult (if possible at all) to add links to the chart but it would have been nice to be able to have the link to baseball reference page for each game available.
Thanks for the positive feedback.
That’s a good suggestion. I’ll give that a shot in the future.
Blogger and Editor, Rational Pastime Blog. Twitter: @RationalPastime.
Great info.
Though one typo- Mike Stanton was an All-Star in 2001. If the “three excellent shortstops” you’re referring to were Jeter, Nomar, and A-Rod, he did pick A-Rod and Jeter, but Nomar was not deserving that season anyway because he was injured and didn’t play at all during the first half.
That being said, Eric Chavez had an amazing year, and wasn’t an All-Star, so he (or maybe Twins 3B Corey Koskie) should have been an All-Star over a setup man…
PS- Is there any way on FanGraphs to sort splits leaders up to the All-Star Break of a season?
Unless you're a pitcher or Gustavo Molina, kindly SWING THE BAT and ignore the Binder's bunt signal.
Scott
I also recall Christian Guzman being deserving, but I may be confusing my years here (as I did above). Thanks for the heads-up.
Blogger and Editor, Rational Pastime Blog. Twitter: @RationalPastime.
Looks like Guuuz did play, but Miguel Tejada did not.
Either way, there was a good deal of debate over whether Torre was playing favorites picking his own setup man when there were other deserving players.
Blogger and Editor, Rational Pastime Blog. Twitter: @RationalPastime.
there's a great one at junk stats
Armando Almanza walked two batters and threw a wild pitch, didn’t throw a single pitch, but got a hold thanks to a caught stealing.
http://junkstats.com/mlb/appearances-without-throwing-a-strike/
Not actually affiliated with whygavs.
I want to hear more about the saves that were also meltdowns
I mean, if you get the save, the WP when you leave is 100% by definition. So the pitchers must have come in in the eighth (or earlier) in a save situation, gave up a bunch of runs but not enough to surrender the lead… and then somehow got into a situation where finishing the ninth didn’t give them enough WPA to make up for the eighth-inning meltdown. Maybe if their team scores a lot in the top of the ninth so they don’t get much WPA for the ninth?
Actually I’ll bet that most of these are three-inning saves where the reliever almost surrenders a big lead, but still, I’d like to hear more. Can we see Worst WPA in a save at junkstats?
Not actually affiliated with whygavs.
You can look it up using the Baseball Reference Game Index
But if you’re not a subscriber, here are the meltdown-saves if you want to take a closer look at them:
05/14/86: Gossage SDP @ PIT (2.0 IP, -0.075 WPA)
04/27/04: Shields ANA @ DET (2.1 IP, -0.189 WPA)
07/15/97: Taylor OAK @ SEA (1.1 IP, -0.070 WPA)
05/08/87: Quisenberry KCR @ CLE (1.2 IP, -0.069 WPA)
Blogger and Editor, Rational Pastime Blog. Twitter: @RationalPastime.
awesome, thanks!
I’m not a subscriber, so I needed it. (And I am a biscuit conditional enthusiast, which made your reply — the rare double-biscuit conditional — all the sweeter.)
…wow, really surprised that none of these were three-inning jobs.
Not actually affiliated with whygavs.
by WHYG Zane Smith on Jun 9, 2011 7:36 AM EDT up reply actions

by 
































