Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: The Latest NBA Trade Rumors

The Genesis of wWAR

Readers of these fine pages know that I've been all wrapped up in a little thing I like to call wWAR (or, Weighted WAR) lately. Reader Chris Levy sent me an email asking about wWAR, so I figured I should probably step back and explain it a bit. Here's Chris' email:

Adam,

I was wondering if I could ask you a couple of questions about your wWAR construct.

When WAR entered the discussion I felt as if the lights in the room had been turned on, no longer having to weight the merits of a great slugger versus a great hitter or a great base stealer. However, like you I was a bit troubled by the effect the dreaded compilers have had whenever using career stats to 'rank' someone.

I think yours is a novel approach. Which WAR data did you use? Sean Scott's from BaseballProjection/Baseball-Reference? The version from FanGraphs? Somewhere else or your own custom construct?

I notice you use Wins Above Excellence (3.0+) and Wins Above MVP (6.0+). I've always been working with the assumption that 5.0+ was AS and 8.0+ was MVP. Where did WAE and WAM come from?

I'm always trying to stay up to date with WAR developments. I've tried counting only 8.0+/5.0+ seasons myself.

- Chris

First of all, thanks to Chris for writing in. I've gotten a decent amount of email about wWAR and it's very exciting. People seem to dig it. And I dig that.

So, where did it come from?

Star-divide

It all does come from Sean Smith's WAR model that started off at BaseballProjection.com and then moved on to Baseball-Reference.com. Why use Rally's WAR (or rWAR)? Well, this was the first WAR implementation to go all the way back to 1871. Since my research spans throughout all of baseball history, this was a very compelling reason to choose rWAR. I'm also a big fan of Rally's model for pitchers when analyzing the past. I still think FanGraphs WAR is the tool to use when predicting pitcher performance. But I'm interested in the Hall of Fame—past performance. You don't build a Hall case based on how good you should have been.

What's Wins Above Excellence? As if Sean Smith hadn't completely blown my mind already, he casually introduced Wins Above Excellence in a post over at The Hardball Times.

Sean wrote:

Time to introduce a new junk stat. For this measure, I'm looking at how many wins a player has above three in a season, though his season total can never be below zero. This gives a player credit for great seasons, and ignores anything where a player is average or below, it neither adds nor hurts a player's case for greatness. A great player should not be penalized if he hangs around past his peak contributing a only little bit to his teams.

I love this idea. This takes just the player's years as a productive player and cuts out the false starts as a 19 year old or the few seasons of part time play in their 40s where they tried to hang on too long. I mean, if Ted Simmons retired five years earlier, he would have been worth 53.2 wins instead of 50.4. WAE won't penalize him for that.

How about Wins Above MVP? WAM is something I came up with after playing with WAE a little bit further. 3.0 seemed like a low baseline for "excellence", so I wanted to see what a 6.0+ version of WAE would look like. Wins Above MVP seemed like a nifty name. I wrote about it on this very site.

Wow, so the average WAR for all MVP winners thoughout the history of the National League is 7.62. The average for all MVP winners in American League history also happens to be 7.62. My quick math tells me the average of the two of those is 7.62.

As that list shows, there are plenty of MVP winners that don't reach 7.62 WAR. Still, the fact that Willie Stargell won and MVP award with a 2.3 WAR season doesn't mean I should use that as my baseline either. So what baseline should I use for finding MVP-type seasons? I hesitate to use the average because, by definition, that means 50% of the players who did win the MVP award didn't even reach that level.

So, let's go back to my original baseline of 6.0 WAR. Of the 181 players who have won MVP awards in MLB history, 133 of them achieved a WAR of 6.0 or above. That's 73%. That, to me, sounds like a pretty good baseline.

And that's how Wins Above MVP was born.

And why wWAR? As Chris references, WAR can be rather kind to "compilers". Hall of Fame voters, of course, look for peak performance in addition to longevity. The idea behind wWAR is to give extra credit to excellent seasons (WAE) and then even more credit for MVP-type seasons (WAM). The formula is ridiculously easy and seems to work well. It is simply:

wWAR = WAR + WAE + WAM

Let's look at a couple players who have similar WAR. George Sisler posted 50.4 WAR in his career and Tony Perez posted 50.5. Do they have equally good Hall of Fame cases? Heck no. Sisler's WAE was 24.6 and his WAM was 6.0. This gives Sisler a wWAR of 81.0. Meanwhile, Perez posted WAE and WAM numbers of 15.6 and 0.7, respectively. That comes out to a wWAR of 66.8. wWAR recognizes that George Sisler has a better Hall of Fame case than Tony Perez. Straight up WAR does not.

On the pitching side, take Don Sutton and Sandy Koufax. Sutton's 70.8 WAR bests Koufax's 54.5 by a substantial amount. Of course, Koufax's value was condensed into just a few seasons. As a result, his wWAR of 97.7 beats Sutton's 87.9 easily.

I really only see wWAR as useful in Hall of Fame discussions. WAR actually means something — how many wins the player was worth above replacement level. wWAR just morphs that to show players who condensed their value into peak years better. That's not particularly more valuable. It's just more Hall-worthy.

Update: (3/15/2011) I just posted the last of a series of articles where I created the "Hall of wWAR". I re-populated the Hall of Fame based purely on wWAR. Have a look!

The Hall of wWAR
Catchers | First Basemen | Second Basemen | Third Basemen | Shortstops
Left Fielders | Center Fielders | Right Fielders | Designated Hitters
Pitchers

Comment 26 comments  |  0 recs  | 

Do you like this story?

Comments

Display:

Comparison

Thanks for the overview, Adam!

Ok, so with that in mind, I’d be interested to see folks’ thoughts on this system in comparison to JAWS.
http://www.baseballprospectus.com/glossary/index.php?mode=viewstat&stat=477

JAWS = Career WARP + WARP of 7 best seasons (as I understand it)

wWAR = WAR + WAE + WAM

I have my own thoughts, but I’d like to see others’. Pros/cons of each system? Which makes more sense in terms of understanding career value? Star value? “Hall of Famer” value?
-j

by JinAZ on Jan 19, 2011 8:56 AM EST reply actions  

Ignoring the other differences between the systems,

I prefer wWAR. Yes, 3 and 6 are arbitrary for giving extra credit (and maybe something more continuous would be better) but I’d rather have arbitrary cutoffs for the “HoF worthiness” of each season, than arbitrary cutoffs for how many seasons count for peak.

by Sky Kalkman on Jan 19, 2011 9:46 AM EST up reply actions  

Arbitrary cutoffs will always be an issue.

3 WAR, 6 WAR, 7 seasons…

I use rWAR, so the inspiration was really to create sort of a JAWS for rWAR. The big difference, as JinAZ notes, is the 7 seasons vs. the extra credit for 3 and 6.

On Twitter: @baseballtwit

by adarowski on Jan 19, 2011 9:49 AM EST up reply actions  

I guess that's my take too

Different players have different-sized peaks. If someone has a 3-year peak of brilliance, and otherwise is league average, I think wWAR will better capture that than JAWS. I also like that wWAR gives special recognition to insanely good seasons, as that seems to better reflect “fan memory…” A lot of people tend to remember the best years of a player rather than his ho-hum years (e.g. Dave Parker or Dale Murphy).

Might be interesting to do a comparison of rankings for a set of players between JAWS and wWAR, just to get an idea of how different they are. You could even use a JAWS-ish stat based on rWAR so we only compare the “peak weighting method” and not the differences of rWAR and WARP. Just a thought!
-j

by JinAZ on Jan 19, 2011 9:57 AM EST up reply actions  

Related idea.

I’ve fooled around with measures of “peakiness” — a number that would describe the “style” of a career, on a range from compiler (Baines) to peaker (Albert Belle), regardless of greatness. One idea using wWAR is wWAR/totWAR. The higher the number, the “peakier” the player.

by Sky Kalkman on Jan 19, 2011 11:46 AM EST reply actions  

Sounds good.

Though it might make upper-crust players like Pujols/Williams/Ruth look really peaky when they just spend a lot of time in the higher wWAR regions.

See Data Differently: Beyond the Box Score | @justinbopp

by Justin Bopp on Jan 19, 2011 1:36 PM EST up reply actions  

I love the idea, Adam, and I know this is a minor hangup, but I'm not sold on the name.

wWAR makes it sound like a third calculation method of WAR, like fWAR and rWAR.

by danmerqury on Jan 19, 2011 2:20 PM EST reply actions  

PAW?

Peak-Adjusted WAR?

Ah, nested acronyms…

On Twitter: @baseballtwit

by adarowski on Jan 19, 2011 3:59 PM EST via mobile up reply actions  

Hmm.

Maybe. Could even be as simple as capitalizing the first W, so it’s WWAR.

by danmerqury on Jan 20, 2011 1:51 AM EST up reply actions  

There's another way to look at this

The thing is, how do we know that the fifth WAR is more valuable than the first? David Gassko looked at this a couple of years ago and found that the incremental value of a WAR peaks at around four:

http://www.hardballtimes.com/main/article/pennants-added/

http://www.hardballtimes.com/main/article/pennants-added-revisited/

In other words, a WAR of eight is twice as big as a WAR of four, but it isn’t twice as valuable.

by studes on Jan 19, 2011 4:35 PM EST reply actions  

True, but it depends on the question.

I think there’s a difference between value and greatness, though. Extra wins above 4 might not be incrementally more valuable, but — in my opinion — they can be incrementally more Great.

It’s a complicated question.

by Sky Kalkman on Jan 19, 2011 5:07 PM EST up reply actions  

Can we quantify that?

Trouble is, I find exercises like this (and I’ve done plenty of my own, by the way) pretty much arbitrary, unless we can somehow quantify greatness. Two ideas occur to me:

- Develop a score based on how rare each level of WAR is
- Don’t add a WAR player to all teams, as David did, but just to an average team. Perhaps the curve would look different then.

Just a couple of ideas.

by studes on Jan 19, 2011 5:45 PM EST up reply actions  

I favor #1

Via B-Ref’s Play Index, here are the relative rarities of X WAR seasons since 1950. Red is cumulative, blue is between X WAR and X+1 WAR.

WAR_rarity

One other issue is randomness. In any system where you give more credit for better seasons, a lucky season can seriously inflate a HoF case.

by Sky Kalkman on Jan 19, 2011 7:17 PM EST up reply actions  

Compared to 12+ WAR seasons since 1950, here is the relative frequency of the lower bins:

11+ 1x
10+ 3x
9+ 6x
8+ 12x
7+ 26x
6+ 51x
5+ 84x

Until the 5+ bin, nearly doubles each time you move down the ladder (and 11+/12+ are equal)

by Sky Kalkman on Jan 19, 2011 8:02 PM EST up reply actions  

Very cool!

Very cool, Sky. I ran a different kind of data pull, but got similar results (I looked only at batters). There’s obviously a bell curve here. When I looked at batters with 500+ plate appearances, the curve peaked between 2 and 3 WAR.

So you could set up a scale that starts with 3 WAR, like you have now, and then multiplies each incremental WAR by a factor:

Up to 4: times 1.4
Between 4 and 5: 1.6
Between 5 and 6: 1.8
For each increment above: 2.0

So a player with 8 WAR would yield 1.4 plus 1.6 plus 1.8 plus two times 2, for a total of 8.04 SuperWAR.

Looking at David’s derivative chart of playoff probabilities from his article, a scale like this wouldn’t be too dissimilar from placing this player on an 81-win team.

Just an idea.

by studes on Jan 19, 2011 8:55 PM EST up reply actions  

Looks painful!

Just for fun, I ran SuperWAR for all batters 1955 onward (the year I was born, natch). I then compared changes in ranks between WAR and SuperWAR, and fun things jumped out. Lenny Dykstra, for example, jumps from 391 in WAR to 175 in SuperWAR. Bret Boone takes a big jump, too. Fun stuff.

by studes on Jan 19, 2011 9:57 PM EST up reply actions  

Yes, this is quite fun.

I liked seeing how even going from WAR to wWAR affected some players. Lou Whitaker’s WAR is super high, for example. But he has this rep for not being dominant, just steady. His wWAR evens that out a bit, but still has him well within Hall of Fame range. Like, WELL within it.

On Twitter: @baseballtwit

by adarowski on Jan 19, 2011 11:25 PM EST up reply actions  

Should be logarithmic

This scale should be logarithmic instead of linear. In other words, you probably want WAR to be an exponent of a base of two, or something like that.

by studes on Jan 20, 2011 5:28 AM EST up reply actions  

I admit I was skeptical

But you’re all turning me into a believer.

Blogger and Editor, Rational Pastime Blog. Twitter: @RationalPastime.

by J-Doug on Jan 20, 2011 2:26 AM EST reply actions  

The key is that it's really just used for Hall of Fame cases.

They player isn’t really providing any additional value. They’re just condensing it better into great seasons.

On Twitter: @baseballtwit

by adarowski on Jan 21, 2011 9:14 AM EST up reply actions  

Then help convince me

I like studes’ idea above of making the weights more empirical (right now they’re pretty arbitrary with the 1/2/3 weighting at 0/3/6), and I think that would have a bit more value because everyone would have their peaks judged on the rarity of that peak, rather than arbitrary cutoffs. Once I get home, I may try to throw something together to help show what I mean.

My Work: Henkakyuu. Entice me to use twitter more @henkakyuu

by jmaciel on Jan 21, 2011 1:16 PM EST up reply actions  

Comments For This Post Are Closed


User Tools

We use numbers and stuff.
Community Guidelines
Why be a member?

FanPosts

Community blog posts and discussion.

Recent FanPosts

Small
The Value of Playoff Games and "Championship Probability Added"
Small
Prince Fielder in Comerica Park
Crystal_ball_small
Sparky vs Buck
Img_3830_small
BtBS Fantasy League
Small
Context Neutral Run and RBI projections
Small
Free Agent Compensation
Img_0001_small
Value of Various Plate Approaches
Strike_three2_small
Effect of Foul Area on Strikeouts: AL 1954-68: Erratum
Small
Baseball on a stick
Small
Player Evaluating Statistic

+ New FanPost All FanPosts >

Follow us on Facebook!

Follow us on Twitter!

SaberGraphics

MLB Daily Dish

Get the latest MLB Trade Rumors, Transactions, and News at MLB Daily Dish!


Managing Editor:

Jbopp-kc_small Justin Bopp

Columnists:

Adam_small adarowski

Dme_small Satchel Price

Closeup4_small J-Doug

Carlosicon_small Julian Levine

Billy_and_daddy_4th_of_july_small Bill Petti

Featuring:

Dayton_small Jeff Zimmerman

12475953_small Jacob Peterson

Picture-6_small Chris St. John

Btbpro_small Dave Gershman

229331_10150183361996591_674441590_6760167_6637860_n3_small Lewie Pollis

Img_3830_small David Fung