thinking about wWAR
I really like adarowski's wWAR concept. It's a great way to quickly distinguish those players who were good for a long time from those who were great for a shorter amount of time. For instance, being a Boston fan, I love Pedro Martinez, and wWAR does a little better than career WAR in accounting for the absurdity of his peak years when considering his overall career in a broad context. While being aware of the caveat that WAR "means something" while wWAR does not, I will submit that every calculation means SOMETHING, you just have to decide what that something is. In wWAR's case, it is a zeroth-order approximation of the additional value a team gets from a player who is better than average for a particular season.
There is a common notion that exemplary player seasons are even more valuable to a team's success than their WAR, and I think the intuition is correct. If you had Josh Hamilton (8 WAR) and Ryan Theriot (0 WAR) on your team last season, they would have the same (theoretical) impact as Vernon Wells (4 WAR) and Brandon Phillips (4 WAR). But if I told you to choose between the Wells+Phillips duo and Hamilton without specifying the second player, you would take Hamilton every time because the average 2B player will be better than Theriot and will give you ~2 WAR. That is, major league rosters are limited, and there is an opportunity cost associated with spreading the WAR over more players. wWAR at least goes in the right direction with this.
Nonetheless, objections have been raised to the wWAR methodology. In particular, azruavatar notices that the wWAR components are (somewhat) arbitrary (because, e.g., the WAE cutoff is 3 rather than oh-I-don't-know 3.2) and it seems weird that someone should all of a sudden start getting double (or triple) credit for his home runs at some point in the season. Both adarowski and azruavatar talk a bit about wWAR as a tool to think about Hall of Fame candidates (and I'm particularly looking forward to adarowski's latest project), but I would like to stay away from this since we would all want tools to compare the careers of players even if the Hall of Fame didn't exist.
Anyway, if you think wWAR needs to be improved upon, just replace the components (WAR+WAE+WAM) with another (possibly more complex) function that maps single-season WAR to a number that more accurately reflects the opportunity savings, fear inducement, and overall star power that a great player season affords a team. The function f(WAR)=Real( (WAR-0.3)^1.3 ) weights value similarly to wWAR, but it is a smooth mapping that removes the arbitrary cutoffs of the wWAR components and starts giving extra credit to anyone above average.
Or perhaps a function like f(WAR)=2*WAR-2 would be a reasonable first-order approximation. The rationale is that the average player on a team will give you ~2 WAR. If four of those players could play a single position and occupy a single roster spot, then you'd have Josh Hamilton. Josh Hamilton takes the contributions from four players, condenses them into a single roster spot, and clears the way for, on average, 6 more WAR from the players who will occupy those 3 other spots. WAR-2 is the average opportunity gained from having a certain amount of WAR condensed from average players into a single player. It turns out that the function f(WAR)=2*WAR-2 is pretty similar to wWAR, but it is smooth and it penalizes player seasons that are below average.
In any event, using a smooth mapping like these eliminates the arbitrary cutoffs that some people might find displeasing. Of course, unless you justify the weighting, the weighting itself will be arbitrary. But at least in my mind, a function that gives extra credit for excellence is a reasonable tool for considering the careers of players.
12 comments
|
0 recs |
Do you like this story?
Comments
Not trying to pick on you, because you're far from the only person to argue this, but
There is a common notion that exemplary player seasons are even more valuable to a team’s success than their WAR, and I think the intuition is correct.
It’s not.
If you had Josh Hamilton (8 WAR) and Ryan Theriot (0 WAR) on your team last season, they would have the same (theoretical) impact as Vernon Wells (4 WAR) and Brandon Phillips (4 WAR). But if I told you to choose between the Wells+Phillips duo and Hamilton without specifying the second player, you would take Hamilton every time because the average 2B player will be better than Theriot and will give you ~2 WAR.
Ummm, yeah, you’d rather have 10 WAR in two roster spots than 8 WAR. If the question is Hamilton + Theriot vs. Wells + Phillips, a lot of people would say the Hamilton/Theriot duo is more valuable because of the upside associated with consolidating eight wins into a single roster spot. It’s a common pitfall, and in doing so you’re ignoring the risk associated with consolidating eight wins into a single roster spot.
That is, major league rosters are limited, and there is an opportunity cost associated with spreading the WAR over more players.
There’s also a risk associated with not doing so that nobody seems to want to acknowledge. If an 8-win player and a 0-win player were really more valuable than two 4-win players, we’d see teams pay for wins on a non-linear scale. They don’t!
"The WAR folks like yunel apparently. i know this, bobby cox hated going to war with this guy." - Jon Heyman
Beyond the Box Score / Capitol Avenue Club / shwitter: @CapitolAvenue
What are the stats on teams payments to free agents?
Is it truly linear? I’ve seen some years where it is and some where it isn’t—an exponential rise as the projected wRC per season increases.
I’d love to see the data.
Writer at Beyond the Box Score and tortured Mets fan (is there any other kind?)
I agree!
Front offices need to balance the risk and reward of paying a player for future performance. But I’m not sure we need to bother ourselves with this when we’re evaluating a season that’s already happened. Hamilton’s 8 WAR season allowed the Rangers to have a higher overall win ceiling than four 2 WAR players would have, so I’m willing to give him credit for it.
Yes
The ceiling is certainly higher, but teams don’t pay based on best-case scenarios, they pay based on least-error projections. Balancing the risk and the upside leads to a linear scale.
"The WAR folks like yunel apparently. i know this, bobby cox hated going to war with this guy." - Jon Heyman
Beyond the Box Score / Capitol Avenue Club / shwitter: @CapitolAvenue
The money also isn't even.
Two 4 WAR guys might cost $20M total (at $2.5M per win, which is about MLB average). An 8 WAR guy would be $20M plus some sort of premium, so maybe $25M. Then the extra 2 WAR player is another $5M. So these are actually two very different scenarios. You can have 8 WAR for $20M or 10 WAR for $30M. The latter gets you more wins, but where does the extra money come from? In the first scenario, you only have to spend $10M on 2 wins to be equal. Sounds like a better situation to me.
Thanks for the FanPost, Jared.
Regarding this:
Anyway, if you think wWAR needs to be improved upon, just replace the components (WAR+WAE+WAM) with another (possibly more complex) function
Amen. Like WAR, wWAR is really just a framework that can be easily adapted. I’m very open to discussion of ways to do this better.
On Twitter: @baseballtwit
pennants added
Here’s a relevant link on how to value marginal gains in one player’s WAR:
http://www.hardballtimes.com/main/article/pennants-added/
“The value of an additional win peaks at four wins above replacement; four years of average performance (two WAR) is more valuable than one huge MVP year (eight WAR)! That is certainly unexpected. Pennants Added is supposed to help players who had great peaks, but instead the measure rewards players who are above-average for long periods of times.
So does that mean that Pennants Added is wrong? No, what it means is that our assumptions about peak are wrong. A high peak might be exciting to watch but it is not as valuable as a steady record of above-average performance.
If we focus on what really matters—winning the World Series—we should downgrade Sandy Koufax, and value someone like Bert Blyeleven that much more."
If we accept this, then wWAR doesn’t seem to be measuring any kind of real value, so much as it is taking the anti-compiler bias of HOF voters and applying it to WAR.
Again, wWAR is NOT TRYING to measure some sort of "real value"
It’s a measure of greatness. 4×2 WAR might be more valuable than 1×8 WAR for helping a team win games. But the point of wWAR is that it wants players to be great, not the most valuable, necessarily.
Neither pennants added nor wWAR are wrong, they are just answering different questions.
okay, but how can we separate greatness from value?
There seems to be a presumption that compilers aren’t as “great,” but I’m not sure what that’s based on apart from the historical perceptions of most HOF voters and fans.
I agree, it's not necessarily based on something logical, often just a personal preference.
However, one objective rationale is that better seasons are more scarce. In general, a season that’s 1 win better is twice as rare. So if you assign “points” like this…
WAR—pts
1—1
2—2
3—4
4—8
5—16
6—32
7—64
8—128
Then adding up points tells you how scarce a player’s seasons were compared to everyone, all-time. Looking at the results, it appears to overrate things at the high end: you need 16(!) 4 WAR seasons to match just one 8 WAR season. But that’s a personal opinion again.
Hall of Pennants Added vs. Hall of wWAR.
I wonder how similar they would be.
They’d both kick the ass of the Hall of “Fame”.
On Twitter: @baseballtwit
It looks like Pennants Added gives us pretty much the same answers as regular WAR
I’m enjoying your series btw

by 






























