Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: Celtics, Heat Score On Purpose In Super Sunday Wins

BABIP and DIPS 3.0

Marc wants me to introduce myself, so I guess I should start by saying that my name is David Gassko, as my handle here might have given away. I write a weekly column for The Hardball Times, and also have been blogging at Statistically Speaking for almost a year now. I'll be writing here at BtB now as well, semi-regularly. I hope you enjoy my writing, and if not, I hope you enjoy my wrath.

Star-divide

Last August, I published an article updating Voros McCracken's Defensive Independent Pitching Statistic (DIPS), calling my system DIPS 3.0. To recap quickly, what McCracken found was that individual pitchers seemed to show little control over what happens to balls put into play. Instead, he found, pitchers only seem to have much control over the defensive independent categories--strikeouts, walks, and to a lesser extent, home runs.

I took his idea one step further (a step that was actually originally suggested by Voros), and based DIPS on batted ball information. Basically, I take a pitcher's batted ball line (ground balls, line drives, bunts, and outfield and infield fly balls) and transform it into a "regular" line--singles, doubles, triples, home runs, reached on error, as well as, strike outs, walks, and hit-by-pitch.

Here's a very simply run-down of how I do it. I take the number of batted balls the pitcher allowed, and assign him a league average line drive percentage, because based on my research with JC Bradbury in the Hardball Times Annual, it seems that pitchers have little control over how many line drives they allow. I then split the rest of their batted balls based on their actual batted ball percentages. Take, for example, Jarrod Washburn. He allowed 586 batted balls last year, and since the average line drive percentage in the AL last year was 19.9%, I assign him 117 line drives. He actually allowed 120. Washburn also allowed 206 outfield flies. So his "new" outfield fly number would be 206/(586-120)*(586-117) = 208. And so on for every batted ball. A better explanation of the whole method is available here.

To transform this translated batted ball line into "normal" statistics, I just take the average outcome of each type of batted ball, and multiply it by the translated line. So, for example, 50.8% of all line drives became singles in the American League last year. Since Washburn had 117 translated line drives, based on his liner numbers alone, he would be expected to have 59 singles. I do this for every batted ball type, and every hit-type (as well as reached-on-errors) as well.

I then plug that all into BaseRuns, and find how many runs the pitcher would be expected to allow.

Okay, 375 words in, and I'm finally getting to the point of this post. One criticism my system has received is that it zeroes out too many things. To some extent, this is true, though the point of DIPS is to zero-out the things that don't really matter. But here's the real question: how great is that extent?

On this very site, John Beamer wrote an article arguing that there is some skill involved in preventing line drives, which is certainly true. However, John's argument (not to put words in his mouth!) seemed to extend beyond that: As many others have said, John was arguing that disregarding one year's worth of line drives is incorrect; that there is some information contained in that information. He would not be the only person for whom I have respect to have said, that, more so, specifically in regards to DIPS 3.0. A poster that goes by the tag GuyDM posted the following on the Strategy and Sabermetrics board awhile ago:

Not to quarrel with the central importance of K and BB rates, but to some extent the correlation of your metric and DIPS 3.0 is inevitable. David is imposing league average LD%, and standard run values for every BIP type. So the only source of variance left is GB/FB ratio, which translates into roughly +-.25 R/G given the range in GB/FB.

Essentially, his point was that the process I go through for DIPS 3.0 does not leave much room for variance, less than there should be. Is that true? Well, luckily, the process is set up in such a way that we can actually check whether or not that is true. Using my expected batting lines against, we can calculate pitchers' expected batting average on balls in play (BABIP). In this case, I include expected reached on error in the numerator, because they are defense independent, since they are just based on batted ball distribution.

For example, Johan Santana was expected to allow 127 singles, 39 doubles, 4 triples, and 6 reached on error last year, according to DIPS 3.0. He also had 601 BIP, for a .294 BABIP.

Doing the math for every pitcher with at least 350 BFP in 2005 (171 in all), how much variance is there among these players? Well, the answer is .009. That's our standard deviation, which is a measure of spread. It means that 68% of all the pitchers would be expected to be within +- .009 points of average, and 95% would be expected to be within .018 points of the mean. How much is that? Since the average pitcher in our sample had about 500 BIP, that would make one standard deviation +/- 4.5 hits, or about .15 points of ERA.

So is that a lot or a little or what? Actually, it's just right. According to a research paper by Erik Allen and Arvis Hsu, the true standard deviation of BABIP is supposed to be, you guessed it, .009 points. DIPS 3.0 is perfectly capturing the true spread in BABIP, further support for disregarding a pitcher's line drive percentage. That extra batted ball information actually plays a big role in understanding the subtler differences between pitcher seasons, because it allows us to capture the true spread in BABIP, which Voros' two versions of DIPS (which has a spread of zero, since they assumed every pitcher would have the same BABIP, though the second did make some small adjustments based on handedness) could not. And once again, we have even more reason to believe that line drive percentage really means nothing over the course of one season.

So vive DIPS 3.0! Oh, and it's nice to join the BtB staff.

Comment 3 comments  |  0 recs  | 

Do you like this story?

Comments

Display:

Welcome aboard David
And good stuff. At the SABR Convention in Boston on Saturday, there was a discussion on Bill James' response paper to the response to his original Underestimating the Fog piece (Got that?, heh). The panel included former advisor to Montreal and Boston, Mike Gimbel, a professor from Tufts Sabermetrics 101 course in Andy Andres, David Grabiner, Steven J. Miller, who has done a lot of work on Weibull distributions, as that link shows, and Tom Tippett, of Diamond Mind Baseball fame. The discussion swung to Voros McCracken's DIPS theory, and the consensus seemed to be that Voros and his original theory went much too far, although the entire panel did not really disclose their thoughts on the validity of DIPS.

So far I agree with DIPS theory, but I'd like for there to be more research done on it in order to prove or disprove its validity further. I see the occasional pro-DIPS article, but I haven't seen much around besides Tippett's piece in disagreement with the theory, although I'm most likely just not looking in the right places. It is a very interesting topic to me, but I don't really have the chops to research it myself methinks.

"I don't set the rosters, I just make fun of the guy who does" - Rob Neyer

by Marc Normandin on May 22, 2006 12:00 PM EDT reply actions  

Nice to have you on board David
Hi David,

Good to have you on board at BtB. Nice opening article by the way. I am a big fan of your work at THT and look forward to reading lots more of your analysis.

John

ps. my understanding was there was a potential issue with batted ball data from BIS for 2004? Is there more clarity as to how valid this data?

Another point which I haven't quite got my head around is to what extent (if any) is some of the data being manipulated by inconsistent scoring. One person's line drive could be another person's fly ball. For a 20% LD rate a few misclassifications could have an impact. I have yet to see a study on the potential impact of this though.

by John Beamer @ Beyond the Box Score on May 23, 2006 5:24 AM EDT reply actions  

Clarification
David:  My post that you cited was not really a criticism of DIPS 3.0.  I was just making an observation that Tango's quick-and-dirty metric based only on BBs and Ks had to correlate with DIPS 3.0, given all the things DIPS 3.0 standardizes.  

I think DIPS 3.0 is useful, as long as people understand its limitations.  DIPS assumes that any deviation from lg-average line drive % is luck.  Given a 1-year sample, that will be more right than wrong for most pitchers.  However, it's silly to say that Mariano Rivera is "lucky" in avoiding LDs every year, or that Pedro is "lucky" to have a career BABIP about 30 points below league average.  When we have more than one year's data on a pitcher, we can assess a pitcher's true BABIP ability better than DIPS does.  You understand that, but I'm not sure everyone who is a "DIPS believer" does.

As for "zeroing out" various elements, the very low y-t-y correlations you report for LD% and HR/FB provide good justification for those  decisions.  The one question I have there is whether those correlations would still be so low if the BIS data coding problems were accounted for?  Some other data seems to indicate these variables may have more y-t-y consistency than you found, which would suggest they are (somewhat) a skill.

The other question I have is whether you should estimate different run values for GBs and FBs, depending on a pitcher's GB/FB ratio?  (e.g. do groundballs surrendered by groundball pitchers do less harm than those given up by flyball pitchers?)  I've seen arguments made on both sides, and I'm not sure what the right conclusion is.  And the differences -- if they exist -- may not be big enough to matter a lot anyway. But it's an interesting issue.

BTW, I think a SD of 4.5 hits would be more like .21 R/G, meaning the difference between an above-average and a below-average BABIP pitcher is about .40 in ERA, not trivial.  I'd be interested in seeing the same calculations for HR/9, K/9 and BB/9 -- I think you would find that the talent range is wider, but not hugely so.

 

by GuyM on May 25, 2006 10:54 PM EDT reply actions  

Comments For This Post Are Closed


User Tools

We use numbers and stuff.
Community Guidelines
Why be a member?

FanPosts

Community blog posts and discussion.

Recent FanPosts

Small
Free Agent Compensation
Img_0001_small
Value of Various Plate Approaches
Strike_three2_small
Effect of Foul Area on Strikeouts: AL 1954-68: Erratum
Small
Baseball on a stick
Small
Player Evaluating Statistic
Baseball_small
Rays Outfield: Cheap but Extremely Productive
Small
A new xBABIP
Small
Jack Morris "pitching to the score"
Strike_three2_small
Foul Area and Differences in SO: AL vs NL
Baseball_small
Is there a Kuroda and Oswalt Alternative?

+ New FanPost All FanPosts >

Follow us on Facebook!

Follow us on Twitter!

SaberGraphics

MLB Daily Dish

Get the latest MLB Trade Rumors, Transactions, and News at MLB Daily Dish!


Managing Editor:

Jbopp-kc_small Justin Bopp

Columnists:

Adam_small adarowski

Dme_small Satchel Price

Closeup4_small J-Doug

Carlosicon_small Julian Levine

Billy_and_daddy_4th_of_july_small Bill Petti

Featuring:

Dayton_small Jeff Zimmerman

12475953_small Jacob Peterson

Picture-6_small Chris St. John

Btbpro_small Dave Gershman

229331_10150183361996591_674441590_6760167_6637860_n3_small Lewie Pollis

Img_3830_small David Fung