Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: The Most Dangerous Division in Sports

Inducing Whiffs: Breaking Balls

Shown here is the relationship of whiff rate and velocity, split by batter handedness where right-handed batter data is blue and lefties are red. Gray bands indicate confidence.

What makes a breaking ball nasty?

As a knowledgeable baseball fan, you would probably answer that a combination of movement, location, and velocity all contribute to making a breaking ball nasty. Today I will examine how accurately we can predict nastiness using these three variables, where nastiness is defined as whiffs per swing.

To test how much these three variables can tell us, I created a model that predicts whiff rate using only horizontal movement and location, vertical movement and location, and velocity. It should also be noted that looking at whiffs per swing as opposed to whiffs per pitch helps a lot. This is because whiff per pitch also measures which pitches batters like to swing at, and that's a question we are not interested in right now. The model was created using every breaking ball thrown in the 2011 season that was swung at. Only right-handed pitchers who released the ball above 5 feet were included. MLBAM classifications were used, only including sliders (SL) and curves (CU and KC). Great, boring stuff out of the way! Results after the break. 

Star-divide

Using data through the middle of June (last time my database was updated), the leaders among pitchers with at least 50 swings on their breaking balls  are:

Name Predicted whiff rate
Sergio Santos 53.1%
Mark Melancon 43.2%
Craig Kimbrel 42.4%
Al Alburquerque 

 41.9%

David Hernandez 41.5%
Matt Garza 40.8%
AJ Burnett 39.8%
Zack Greinke 39.1%
Kyle Drabek 38%
Josh Johnson 37.6%

For context, the average whiff rate on breaking balls is 27.4%. As you can see, most of these names are not very surprising. It's good to see guys like Greinke and Burnett who are known for their breaking balls on the list.

Here is how the model performed more generally, among all pitchers with at least 50 swings on their breaking balls:

Mean_whiff_mean_pred1_medium

It's important to note that this graph is for the mean predicted rates and the mean actual rates, and not for individual pitches. Individual pitches don't really have a probability - there either is a whiff or there isn't. There's also a lot of noise on a pitch by pitch basis, so no model is going to perform well at that kind of level. 

As you can see, the R -squared is 63.4%, which means that there is a strong relationship between mean predicted rates and mean actual rates. Here are the 5 pitchers that the model thinks are over-performing the most:

Sergio Santos

Al Alburquerque

Sergio Romo

Ramon Ramirez

Jose Veras

Of note is that all of these guys are relievers. Here are the 5 pitchers that the model thinks are under-performing the most:

Wade Davis

Mike Pelfrey

Bartolo Colon

Matt Belisle

Kyle Lohse

In statistics this is called "East coast bias." The model also does not account for the situation the pitch was used in, so it's likely that pitchers' differing usages of breaking balls hurts the accuracy of the model. 

Platoon Splits observations:

I should clarify that the model is actual two models, one for right-handed batters and one for left-handed batters. I chose to do this because of the platoon split of sliders and the reverse platoon split of slow curves. 

You can see this relationship in the following graph:

Whiff_velo1_medium

Shown here is the relationship of whiff rate and velocity, split by batter handedness where right-handed batter data is blue and lefties are red. Gray bands indicate confidence. 

Of interest here is the gap between the two predicted whiff rates when velocity is in the 80-90 mph range. This is because this is where we find mostly sliders, pitches that are known for having a large platoon split. How is that just a 10 mph difference can cause such different behavior among breaking balls? I don't know the answer, and this question is ultimately out of the scope of this post. Just something to keep in mind. 

Finishing Thoughts:

We were able to accurately predict whiff rates using just location, movement, velocity, and the knowledge that the batter had swung. This tells us two things. Firstly, it's not much of a mystery what makes a breaking ball hard to hit. Secondly, the more elusive skill seems to be getting batters to swing at pitches that are hard to hit in the first place. The value of models like this are that they can be used to evaluate players in small samples, assuming that the data that goes into the model stabilizes quickly. 

References and Resources

*PITCHf/x data from MLBAM through Darrel Zimmerman's pbp2 database.

*http://princeofslides.blogspot.com/ – used as reference for R code. 

Please welcome our newest contributor, Josh Weinstock! You might be familiar with his excellent pitch f/x work at It's About the Money, the SweetSpot network Yankees blog. Josh will be contributing feature-length content a couple times per month, and will expand upon his pitch f/x work. -jbopp

Comment 11 comments  |  5 recs  | 

Do you like this story?

Comments

Display:

Wow

That was a fascinating read. Great work.

"It wasn’t that it was slippery or anything like that. It was just, dadgum, my hands and the balls were so wet..." - Tim Hudson

by KoKo the Monkey (T-Bone) on Jun 28, 2011 11:40 AM EDT reply actions  

Muchos nice

Assuming the results are the same for lefties?

Columnist at Beyond the Box Score. Contributor at Amazin' Avenue.

by Bill Petti on Jun 28, 2011 11:43 AM EDT reply actions  

I have not checked for lefties

I probably will soon. I just usually don’t like to include lefties because the lefty-lefty match up is so different from the other three types of matchups.

by Josh Weinstock on Jun 28, 2011 12:20 PM EDT up reply actions  

Ah! I recognize those graphs!

You’re the guy who did the groundball one a little while ago. That one was amazing. This is just as good.

by danmerqury on Jun 28, 2011 11:53 AM EDT reply actions  

Looking at whiff rate by velocity

It rises as velocity increases, but then falls when the slider speed starts getting into the upper 80’s and low 90’s.

What’s probably happening here is simple: Those pitches aren’t really sliders. That’s a limitation of using MLBAM classifications.

"That boy is our last hope" - Obi Wan Scioscia, as Francisco Rodriguez left for the Mets. "No, there is another" - Yoda Reagins.

by RallyMonkey5 on Jun 29, 2011 9:38 AM EDT reply actions  

Definitely

Yea, not too many pitchers legitimately throw breaking balls that hard.

by Josh Weinstock on Jun 29, 2011 2:08 PM EDT up reply actions  

Comments For This Post Are Closed


User Tools

We use numbers and stuff.
Community Guidelines
Why be a member?

Follow us on Facebook!

Follow us on Twitter!

SaberGraphics

Yahoo_full_count

MLB Daily Dish

Get the latest MLB Trade Rumors, Transactions, and News at MLB Daily Dish!


Managing Editor:

Jbopp-kc_small Justin Bopp

Columnists:

Adam_small adarowski

Dme_small Satchel Price

Closeup4_small J-Doug

Carlosicon_small Julian Levine

Billy_and_daddy_4th_of_july_small Bill Petti

Featuring:

Dayton_small Jeff Zimmerman

12475953_small Jacob Peterson

Recent_pic_pg_small Patrick Gordon

Btbpro_small Dave Gershman

Me_small Bryan Grosnick

229331_10150183361996591_674441590_6760167_6637860_n3_small Lewie Pollis

Img_3830_small David Fung

30472_1481067225243_1190689185_1381415_997334_n_small Glenn DuPaul

1mnvxku7_small joshuaworn

Set_small MattFilippi18

Photo0011_small Nathaniel Stoltz