Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: The Latest NBA Trade Rumors

Baseball-Reference's WAR in Plain English

Rick Reuschel leads all eligible non-Hall of Famers in Baseball-Reference's pitching Wins Above Replacement. (photo via Wikipedia)

"My Thing" is comparing players across eras to formulate fun historical debates. Because of this, Baseball-Reference's implementation of WAR (commonly known as bWAR, but I still call it rWAR as an homage to creator Sean Smith) has become my go-to metric. Really, it has to be because FanGraphs' implementation (fWAR) of pitcher WAR only goes back to 1974 (for example, a pitcher as recent as Bert Blyleven doesn't have a career fWAR available). And because I can't use FanGraphs' pitcher WAR, I really can't use their position player WAR. You can't mix and match between rWAR and fWAR because they use different replacement levels. So, I just stick with rWAR.

I mean, come on. What's the point to baseball writing if you can't compare Kevin Brown to Jim McCormick?

So, how is Baseball-Reference's WAR calculated? I'm sure somewhere you can get a crazy mathematical formula that will give you the answer. But I'm dumb. I want to see it in plain English. So I ended up writing it. I hope this helps shed some light onto how rWAR is calculated, if you haven't taken the time to find out for yourself already.

Let’s look at both a position player and a pitcher. For a position player, let’s take Alan Trammell, who is something of a darling to stat geeks. The typical analytical baseball fan believes that Alan Trammell should be enshrined in the Hall of Fame. So why do we all feel so strongly for a shortstop who hit .285 with 185 home runs?

Star-divide

Trammell hit .285/.352/.415 for his entire 20-season career. While that may not look overwhelming by today’s standards, it was quite impressive for the 1980s. OPS+, a statistic that measures a player’s On-Base-Plus-Slugging (OPS) against the league average, puts Trammell at 110 (or 10% above the league average). While that also may not sound overwhelming, remember that this is a player who played over 18,000 innings at shortstop, one of the most demanding positions on the diamond. When you take Trammell’s offensive production and park- and era-adjust it (as well as a custom adjustment Smith says is "to ensure that total runs credited to players will equal the actual runs scored for that team"), it comes out to 124 batting runs above average. That puts Trammell's career offensive value in the same region as Cecil Fielder, Richie Sexson, Rusty Greer, or Mike Greenwell. Good hitters, all. But again, not overwhelming. However, we’ve only begun to tap into Trammell.

Trammell stole 236 bases in his career and was caught 109 times. While the caught stealing total is a bit high, he also rates well at taking extra bases in a variety of situations. For this reason, we add 21 more runs above average.

Trammell also tended to induce a lot of errors by the opposition. This likely has a lot to do with him being an excellent base runner. In this skill, he rates as 12 runs above average. For this reason, he also tended to ground into double plays far less often than the average player. In fact, he’s worth 19 runs above average here. Already we’re at 176 runs—just 124 of which were done with the bat.

Now, the glove. Trammell’s defense was worth 76 runs above average. This puts him in the neighborhood of Adam Everett, Tony Kubek, Mike Bordick, and John Valentin. Again, not among the all time greats, but still very, very good. The 76 defensive runs break down to 59 for range and 17 for turning the double play. Once defense is added, Trammell’s now at 252 runs above average.

When you play a full season of shortstop, you get a credit for an additional eight or nine runs per season because the position is much more difficult to fill with a quality player than others. Trammell played the position almost exclusively for 20 years, so that brought his team a ton of value. In fact, this is worth 118 runs. Suddenly Trammell is at 370 runs above average with just 124 coming with the bat.

These components all track how much above average Trammell was. But WAR is Wins Above Replacement, not Wins Above Average. Typically, the difference between an average player and a replacement level player is about 20 runs in a full season. An average player with Trammell’s playing time was worth 291 runs more than a replacement level player with Trammell’s playing time. Take this number and add it to his other components (370 runs above average) and Trammell  was worth 661 runs above replacement level.

Runs Above Replacement is then put into a runs-to-wins converter (typically, about ten runs is equal to one win). Trammell comes in at 66.9 WAR. How good is that? 66.9 WAR ranks 72nd all time for position players. He is surrounded by Brooks Robinson, Tony Gwynn, Eddie Murray, and Manny Ramirez.

So why don’t people tend to think of Trammell as a Hall of Famer? Players who are really good at everything—from offense to defense to baserunning, while playing a premium position—have traditionally been underrated.

Pitcher WAR is a bit more of a mystery to us mere mortals. Let’s give it a whirl anyway with Rick Resuchel as our sample. Why Mr. Reuschel? Because now that Bert Blyleven is in the Hall of Fame, Reuschel has the distinction of having the highest career WAR among eligible non-Hall of Famers. That took me by surprise and I'm sure I'm not alone. So, let's see what makes it so.

Reuschel was a big workhorse who threw over 3500 innings in his career. He won 210 games and lost 187 with just a .528 winning percentage. He posted a 3.37 ERA for his career, which gave him a 114 ERA+ (14% better than average). He was very good at avoiding home runs, allowing just 221 (0.6 per nine innings). In other words, he is 67th all time in innings pitched but just 156th all time in home runs.

Stat geeks love pitchers who don’t allow home runs. The FIP (Fielding Independent Pitching) methodology states that pitchers have complete control over just three in-game outcomes—the walk, the strikeout, and the home run. Everything else leaves a pitcher leaning on his defense to get the job done.

The fact is some pitchers play in front of better defenses than others. Over his carer, Reuschel allowed 1494 runs (total runs, not just earned runs). According to WAR, during his career his defense was worth –66 runs. It is much better to use total runs and then give the pitcher back what the defense actually cost him than to use earned runs. A run is only unearned if there is an error involved—and we all know that a fielder can’t make an error on a ball that he’s too slow to even reach.

Reuschel’s Runs expected for replacement level is 2106 runs. In other words, a replacement level pitcher who threw the same number of innings as Reuschel, against the same opponents as Reuschel, in the same ballparks as Resuchel, in the same league as Reuschel, in the same game situations as Reuschel (by spending the vast majority of his career as a starter, Reuschel’s leverage index was 1.1), and with the same defensive support as Reuschel would give up 612 runs more than Reuschel did.

Running that number through the wins converter brings us to 66.3 WAR, which is an exceptional total. That actually puts him in the Top 30 all time for pitchers. Some things just are not captured in ERA.

Comment 13 comments  |  2 recs  | 

Do you like this story?

Comments

Display:

Questions

1- how certain are we about the baserunning #s? I find it (very) hard to believe that, for example, Trammell created half as many runs with his baserunning as he did with his fielding. Especially since he wasn’t a good base stealer.
2- where’d the fielding #s come from? I thought UZR only went back a few years. How much confidence do we have in them?
3- Is there some method behind the position adjustments? Am I wrong that this is not much more than a WAG?
4- Does WAR value long-term consistency over short-term brilliance since teams typically very quickly find replacements that perform at a level above the replacement level defined?
5- does FIP unfairly penalize guys with great defenses who allowed a high BIP%? i.e. guys who “gave their fielders a chance” a la Jim Palmer.
6- As I understand it, FIP is based on event-specific run values from recent years, and given 3 or more years of data, is a worse predictor of ERA than is ERA. So, does it make sense to use FIP to look at the whole careers of pitchers from years gone by?
7- Is FIP the appropriate model to use for pitching? The assumptions that the run-producing impacts of HR, K, BB are neither linear and independent of one another. This probably doesn’t matter in most cases, but it much more likely to matter when great or unusual (i.e. interesting) pitchers are evaluated.
8- Do the various adjustments unfairly penalize players who performed in favorable circumstances, but could not realistically have done much more with what they were given? Think ‘70s Palmer or ’77-’79 Jim Rice.

I hope these questions are not troll-ish. I admit to being very skeptical of the new conventional wisdom that, for example, Catfish Hunter is taking up space in the HOF that rightfully belongs to Bobby Grich.

by peter745 on Feb 28, 2011 9:31 AM EST reply actions  

I'll let Adam Answer, HOWEVER

Adam is talking about Baseball Reference WAR (rWAR or bWAR), which doesn’t use FIP. Instead, it uses RA (ERA if you include unearned runs).

by garik16 on Feb 28, 2011 10:34 AM EST up reply actions  

Good questions... I don't have all the answers. Let's see what I have:

1. I’m really not sure how to tell how certain we are. Rally states:

Baserunning runs. Includes stolen bases, advancement on hits, outs, wild pitches, passed balls, and outs made on bases.

As Trammell had the reputation as a fundamentally sound player, it makes sense to me that he’d do pretty well in these categories. As for certainty? Well, I don’t know if that’s even possible.

2. The fielding numbers all come from Smith’s Total Zone system. Baseball-Reference doesn’t use UZR at all. As for confidence, TZ tells me Ozzie Smith, Brooks Robinson, Mark Belanger, and Andruw Jones were among the elite fielders in history. There are some interesting places where it disagrees with the conventional wisdom. But it seems to be a very good indicator of CAREER fielding value.

3. There’s a bunch out there discussing Positional Adjustments.

4. WAR does tend to reward long-term consistency, or “compilers”. For that reason, I’ve started looking at wWAR (WAR with a peak adjustment) to balance this.

5. I’m not sure of the answer to this because of the word “unfairly”. But rWAR doesn’t use FIP, so I’ll let that one go to someone else.

6. Again, no FIP expert here.

7. For HOF cases, I like the approach rWAR uses. It doesn’t use FIP. Instead, it takes actual runs allowed and adjusts for park, era, league, opponent, and defense. The defensive adjustment is how much above or below average a defense was. So, a pitcher really wouldn’t be hurt for letting an “average” defense take care of more balls in play. But if he is helped out by an amazing defense behind him, that value will instead be shifted from the pitcher to the fielders. I like this a lot.

8. I don’t think I really follow this question. Can you elaborate a bit?

Thanks!

On Twitter: @baseballtwit

by adarowski on Feb 28, 2011 11:03 AM EST up reply actions  

thank you

8. my thought was that perhaps it’s possible to bump up against a ceiling of how good it’s possible to be in the context of your league and era. Far-fetched, maybe.

Taking just batting,to minimize issues related to any measurement and position-type, and taking the ever-popular and controversial Mr. Rice as an example:
1977-1979
HR- Rice 124, Don Baylor 95
TB- Rice 1157, Eddie Murray 868
Hits- Rice 620, Ron Leflore 590
SLG- Rice .596, Freddie Lynn .526

So, I look at offensive WAR for this period, and see Rice trailing Ken Singleton. Understanding that Singleton walked 100 times/yr and Rice 50 and that this is significant, I still struggle with the judgement that falls out of WAR. So, I wonder:
- are walks subject to park effects? Or are they limited to batted balls, in which case park adjustments would distort this comparison.
- how individual are park effects? If a lot of your value is tied up in HR, and you park is great for singles and doubles, but not home runs, is it appropriate to apply the a general factor for your park?
- do I believe the blip in the park effect factor for Fenway for this period, or is it possible that the factors are skewed because the late ’70s Sox (e.g. Lynn, Yaz, Fisk) were uniquely suited to their home park?

More questions, I know, and probably stuff that’s already been asked and answered.

by peter745 on Feb 28, 2011 11:35 AM EST up reply actions  

Excellent questions

Especially #7. I don’t believe the current FIP formula is a good metric for evaluating pitchers because it only uses stats from about 30% of a pitchers performance. I want to see the other 70% of what the pitcher did, like how the pitcher worked the count. Other stats to use would include: 1st pitch strike %, 2 strike counts (incorporated with K’s), and 3 ball counts (incorporated with walks).

by Keith-Allen on Mar 1, 2011 11:12 PM EST up reply actions  

That's a lot of questions...

Great post, Adam. The explanation of position player WAR is really clear and easy to follow. Like you said, pitcher WAR is a bit more of a mystery, but nice job explaining that as well.

Left Field: Adventures in baseball fanaticism, music obsession, craft beer enthusiasm, and other stuff from out of left field.

by Dan McCloskey on Feb 28, 2011 10:28 AM EST reply actions  

Why "rWAR" is an homage to Sean Smith

In case that was hard to find in the opening of this post, Sean used to post as Rally or Rally Monkey at various baseball websites, so the “r” stands for Rally.

by Detroit Michael on Feb 28, 2011 11:58 AM EST reply actions  

In fact, we

rWAR is a truncation of Rally WAR, where fWAR is Fangraphs WAR.

Personally, I think the longer we have a difference between the two, the longer it’ll take for mainstream/scoreboard/baseball card acceptance.

by Justin Bopp on Feb 28, 2011 1:25 PM EST up reply actions  

here's one problem...

It’s the replacement that needs to be park and defense adjusted, not the player. Then you’d be comparing the player to what his replacement would have done under the same circumstances. For example, if Player A has a value of 50, a league-average replacement has a value of 10, and the adjustment is -20%, the WAR should be 50-10(1.2)= 38, not 50(.8)-10= 30. Adjusting the player instead of the replacement will over-rate the Jimmy Wynns and Rick Reuschels and under-rate the Jim Rices and Catfish Hunters.

by peter745 on Feb 28, 2011 8:37 PM EST reply actions  

The adjustment is additive, not multiplicative

So you can adjust either and get the same result.

Blogger and Editor, Rational Pastime Blog. Twitter: @RationalPastime.

by J-Doug on Jun 6, 2011 5:50 PM EDT up reply actions  

Comments For This Post Are Closed


User Tools

We use numbers and stuff.
Community Guidelines
Why be a member?

FanPosts

Community blog posts and discussion.

Recent FanPosts

Small
The Value of Playoff Games and "Championship Probability Added"
Small
Prince Fielder in Comerica Park
Crystal_ball_small
Sparky vs Buck
Img_3830_small
BtBS Fantasy League
Small
Context Neutral Run and RBI projections
Small
Free Agent Compensation
Img_0001_small
Value of Various Plate Approaches
Strike_three2_small
Effect of Foul Area on Strikeouts: AL 1954-68: Erratum
Small
Baseball on a stick
Small
Player Evaluating Statistic

+ New FanPost All FanPosts >

Follow us on Facebook!

Follow us on Twitter!

SaberGraphics

MLB Daily Dish

Get the latest MLB Trade Rumors, Transactions, and News at MLB Daily Dish!


Managing Editor:

Jbopp-kc_small Justin Bopp

Columnists:

Adam_small adarowski

Dme_small Satchel Price

Closeup4_small J-Doug

Carlosicon_small Julian Levine

Billy_and_daddy_4th_of_july_small Bill Petti

Featuring:

Dayton_small Jeff Zimmerman

12475953_small Jacob Peterson

Picture-6_small Chris St. John

Btbpro_small Dave Gershman

229331_10150183361996591_674441590_6760167_6637860_n3_small Lewie Pollis

Img_3830_small David Fung