A Question of Regression: Heat Maps

I recently published an article using heat maps over at Fangraphs showing differing batter strike zones, in which a question arose about Brett Gardner vs. LHP. In particular, the question asked about a single way inside pitch that Brett swung at as seen in this image:

Gardl_medium

It looks like he swings at stuff way inside, but it was just a one time deal.

The best method I could think of to deal with this problem is to regress the data by the league average for each area. This would help smooth extreme and out of place values on the heat maps. I have found that I will need to add between 20 and 30 league weighted pitches to properly regress the data.

With this information, what do you think is the correct way to regress the data once other variables are used? For example, what if I want to look at how one player swung on 0-2 counts during 2010? Do I use the league average data for all counts since 2007 or should I just look at 0-2 counts in 2010? I think resetting the data for each scenario would be ideal, but then I run into another problem.

 

Currently, the process of creating the heat maps takes about 3 seconds over the internet. Figuring out the data on the fly will add anywhere from a few more seconds up to 15+ minutes per map. Also, once a second person starts a process, the heat maps will then take twice as long to produce for both people. If I pre-program in a set values for all processes to regress the output to, the heat map will be created in just seconds. This off season, I plan on making this application available to the public (some people already have access to it) and am wondering how people would feel about having a faster application or a more correct image.

 

Right now, I am thinking of doing a single adjustment for each of the counts and ignoring the dates. Does this seem like a reasonable middle ground or should I be more or less stringent with the data?

 

Let me know if you need more information or need any ideas cleared up. Thanks -Jeff

X
Log In Sign Up

forgot?
Log In Sign Up

Forgot password?

We'll email you a reset link.

If you signed up using a 3rd party account like Facebook or Twitter, please login with it instead.

Forgot password?

Try another email?

Almost done,

Join Beyond the Box Score

You must be a member of Beyond the Box Score to participate.

We have our own Community Guidelines at Beyond the Box Score. You should read them.

Join Beyond the Box Score

You must be a member of Beyond the Box Score to participate.

We have our own Community Guidelines at Beyond the Box Score. You should read them.

Spinner

Authenticating

Great!

Choose an available username to complete sign up.

In order to provide our users with a better overall experience, we ask for more information from Facebook when using it to login so that we can learn more about our audience and provide you with the best possible experience. We do not store specific user data and the sharing of it is not required to login with Facebook.

tracking_pixel_9351_tracker