Beyond the Box Score: An SB Nation Community

Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Around SBN: Baby Boilers no more: Michigan State-Purdue Preview

Determining a Player's True UZR

Question: How do I determines a player's real/predicted UZR including age adjustments?

 

Why I asked the question: I needed a way for my Manger's Scorecard to determine a player's defensive ability and I couldn't find a good method. I was told that an average player's UZR should decrease by ~1 UZR per year , but my work on CF UZR/150 showed different results.

 

Analysis:

 

Positional adjustments

I collected the UZR for all players since 2002 from Fangraphs that had over 100 innings at each position. I decided not use true UZR because a few bad plays in a small sample size can lead to a player with a fairly large positive or negative UZR. I needed to regress the data to a value that is more indicative of the player's actual ability. I used the value of 125 games (as recommended by Mitchel Lichtman) to be the point when 50% of player's UZR can determined from their current stats and the rest will be the league average. The value of 125 games is a little more conservative value than the 100 value mention in this article at Fangraphs (http://www.fangraphs.com/blogs/index.php/fielding-update-arms-and-double-plays), but I will use it for my analysis.

 

I assumed that a player regresses to a value of 0, so for each player I got their yearly regressed UZR (rUZR) with the following equation:

 

rUZR = (1-((125)/(number of games + 125))) * Yearly UZR)

 

I ran this formula on all players (~3200 players), except pitchers and catchers (who have no UZR data on them) born after 1968 (the data from Fangraphs doesn't have playerID, so I was getting too many duplicate names). I combined all the positions and got the following results:

 

It is tough to make too many conclusions from the data, but the following can be observed:

  • Players are significantly better when they are younger than 30 than when they are older than 30. Both of these age groups seem to have values grouped together.

  • Even though the general trend in downward, there seem to be a learning curve for a few years, a peak and then a slow decline.

Star-divide

After that, I summed up the data for each position and got the following average numbers for the yearly rUZR for 23 to 34 year olds (above and below these ages, the amount of data for certain position, shortstop or example, was severely lacking):

 

Yearly regressed numbers
Position Change in UZR per year R-squared
1B -0.042 0.155
2B 0.075 0.296
SS 0.054 0.161
3B 0.006 0.001
RF -0.124 0.458
CF -0.127 0.274
LF -0.127 0.732
All Positions -0.032 0.431

 

The r-squared shows the level of correlation in a direct change over time with the left field numbers actually being statically significant and the third base numbers not having any correlation.

Here is a chart of the actual rUZR values by year and position:

Age 1B 2B SS 3B RF CF LF
23 0.30 -1.04 -0.15 1.64 0.76 -0.17 0.50
24 0.44 -0.56 -0.11 -0.81 0.14 0.86 0.70
25 -0.23 0.15 -0.17 0.33 1.21 1.38 0.49
26 -0.46 0.00 0.39 -0.53 0.29 0.84 -0.07
27 0.01 0.55 0.74 -0.41 -0.03 -0.17 0.63
28 0.67 0.43 0.44 0.19 -0.36 1.27 0.41
29 0.02 0.63 -0.32 0.83 0.40 0.98 -0.44
30 0.36 0.27 0.27 0.71 0.46 0.54 -0.33
31 0.06 0.22 0.11 -0.06 0.01 -0.78 0.01
32 0.07 0.20 -0.25 0.63 -0.56 -0.60 -0.54
33 -0.40 -0.23 0.20 0.53 0.11 -0.60 -0.76
34 -0.56 0.57 1.36 -0.07 -1.38 0.08 -0.66

 

The preceding data seems to indicate that there are 3 categories for rUZR:

  • Outfielders – All three groups show about the same decrease of ~0.125 rUZR per year of age.

  • Corner Infielders – These both are pretty steady over the years with values close to 0, but 1B seems to decline a little more than 3B and its decline rate is closest of any position to the overall decline rate.

  • Middle Infielders – These two positions were a little confusing in that they had a trend of an increase in regressed UZR over the years. I needed to look into them a little more. I selected the players that played 4 straight seasons at SS and looked to see how much of change there was from year to year and the results were the same. The players seem to having an improving rUZR to a peak around age 27 and then decline after that. Here are the graphs for these 2 positions:

Once you look at the positional rUZR, the overall rUZR values makes a little more sense when the following 3 trends are combined:

  • General overall decline(outfielders)

  • Increase at the beginning (middle infielders)

  • General overall leveling (corner infielders)

Creating a prediction formula

After collecting the information on individual positions, I wanted to look at using the age adjustment to help predict the players actual rUZR scores. Through the discussion with Mitchel Litchman, he recommended the following formula which weights the recent years more than the previous years:

 

((5* # of games last year * UZR last year)+

(4 * # of games 2 years ago * UZR 2 years ago)+

(3 * # of games 2 years ago * UZR 3 years ago)+

(2 * # of games 2 years ago * UZR 4 years ago))/

((5 * # of games last year) +

((4 * # of games 2 years ago) +

((3 * # of games 3 years ago) +

((2 * # of games 4 years ago))

 

I ran this formula against all the players with over 100 games at a position for 5 straight years and using this system, it was a very good predictor. There was a slight decimal difference for perfect prediction, but it was a very good method. I also found out that using a weight of 2,1 (2 for last year, 1 for 2 years ago and 0's for the rest) was equally as good.

 

The one fact I did find was that -.03 was not a good indicator of the loss of defensive ability from these players. The main reason I saw a larger increase in the negative direction is that I was predicting using 4 years of data to predicted the 5th year, so the youngest players could not be predicted until they are ~25 years old. Here are the age adjustments and standard deviations on being able to predict a player's UZR using the 5,4,3,2 method and the 2,1 method:

2,1 Age Adjustment Standard Deviation for 2,1 5,4,3,2 Age Adjustment Standard Deviation for 5,4,3,2
Year's Regressed UZR -0.35 1.61 -0.70 1.87
Year's Actual UZR -1.00 4.44 -1.35 4.45

 

r-squared of rUZR vs predicted UZR = 0.65 and r-squared of UZR vs predicted UZR = .66

I might go back sometime and figure out the overall and positional age adjustment using the 2,1 formula. For now I feel the data is good enough for the Manager's Scorecard, but it does raises some questions on exact positional aging patterns.

 

I have put a spreadsheet on the web (with a web page coming soon) that a person can download and enter a player's Games/Chances, UZR, regression adjustment and age adjustment and the player's rUZR will be exported.

 

There is not great understanding of defensive metrics, but my hope is to help remove some of the old clutter (and possibly created new clutter) and help people interpret the defensive metrics like UZR better. As always, I open to comments and suggestions.

7 recs  |  Comment 11 comments

Story-email Email Printer Print

Comments

Display:

Great work, and rec'd

In the study, did you separate the “Range” component(s) of UZR from “arm” and “double plays?” It seems to me, intuitively (for whatever that’s worth), that that would be important, particularly the arm/range distinction for OFs.

Bringing you more-or-less replacement level analysis and commentary since sometime in 2008.

by devil_fingers on Mar 3, 2009 2:13 PM EST reply actions   0 recs

just a suggestion/question

I don’t know whether is should be done or not

Bringing you more-or-less replacement level analysis and commentary since sometime in 2008.

by devil_fingers on Mar 3, 2009 2:34 PM EST up reply actions   0 recs

Outstanding work!

I’d be interested to see if we get the exact same results with sUZR.

Proud adoptive parent of Tim Alderson.

by Anticon23 on Mar 4, 2009 10:51 AM EST reply actions   0 recs

Cool stuff, as always.

What about using a methodology where you use matched pairs of a players’ regressed UZR at a position over two consecutive years and find the average change in ability, weighting each player-season-pair by the lower number of defensive innings in the pair?

In your current structure, a player might be a SS at age 35, but a 3B at age 36 and a 1B at age 37, making the easier positions look better. That might also be one reason MI’s look like they age well: only good fielders remain there as they age.

Beyond the Boxscore // Calling BJ Upton lazy is lazy.

by Sky Kalkman on Mar 4, 2009 11:08 AM EST reply actions   0 recs

Matching the pairs will be a pain, but workable.

Sky — you are a contributor over at Fangraphs. Is there any way they can add the playerID to the download CVS or at least remove the link to the name in the CSV.

by Jeff Zimmerman (TucsonRoyal) on Mar 4, 2009 11:33 AM EST up reply actions   0 recs

yes!

Bringing you more-or-less replacement level analysis and commentary since sometime in 2008.

by devil_fingers on Mar 4, 2009 11:57 AM EST up reply actions   0 recs

Fangraphs' playerID is in the link.

With some Excel trickery (find/replace, etc.) you can get the player name alone and the ID in its own cell.

Tango’s posted a universal ID lookup table you can use to convert that to any ID system.

Whoila, playerIDs and a nice name format.

Beyond the Boxscore // Calling BJ Upton lazy is lazy.

by Sky Kalkman on Mar 4, 2009 12:38 PM EST up reply actions   0 recs

Remove link

I asked them to do that about a year ago. There is no reason to include a link with html code in a CSV download. I had to write a parsing program that extraced out the player ID and info in the HTML code and discard what I didn’t need. They said they’d get around to it, but I guess it’s not a high priority.
vr, Xei

by Xeifrank on Mar 4, 2009 4:52 PM EST up reply actions   0 recs

Comments For This Post Are Closed


User Tools

We use numbers and stuff.
Community Guidelines
Why be a member?
Start posting on Beyond the Box Score »

Join SB Nation and dive into communities focused on all your favorite teams.

FanPosts

Community blog posts and discussion.

Recommended FanPosts

770insig_small
BtB's "Ball On A Budget" Fantasy League - Discuss Participants, Payrolls and Position Eligibility

Recent FanPosts

Ds9_small
good graphing program?
Small
Predicting HR/FB Rates
Leopold_butter_scotch_southpark_small
Troy Tulowitzki vs Ryan Braun
Small
Pitchers batted ball observations
Small
Eric Byrnes: A player worth a look?
Small
Valverde Is Charging Detroit Double
Mukuro_small
Another question: About power rankings
Small
Why You Shouldn't Trade for Arroyo
Jinaz-reds-avatar_small
Last Call for BtB Sabermetric Writing Award Nominations

+ New FanPost All FanPosts >

FanShots

Quick hits of video, photos, quotes, chats, links and lists that you find around the web.

Recent FanShots

If you care about newspaper coverage of MLB, read this post
Visualizing the Difference Between Offensive and Defensive Value for Catchers
First B-Pro and now ESPN. Tommy, you're growing up so fast
THT - Advancing by ground
Negro League Museum Close to Folding
It is a capital mistake to theorize before one has data. Insensibly one...
Ranking Minor League Systems Using Victors Wang's Prospect Valuations
Pitch f/x on Ricky Nolasco Stretch vs. Windup again
Veron Wells the artist.  I never knew.

http://www.vwellsart.com/
A Dream Team... in honor of Dr. King

+ New FanShot All FanShots >

BtB on Twitter

Main Feed: @BtBScore

Jeff: @jeffwzimmerman
Steve: @steve_sommer
Sky: @BtB_Sky
Dan: @dturkenk
Harry: @harrypav
Jinaz: @jinazreds
Jack: @jh_moore
Erik: @Erik_Manning
Tommy R: @trancel
Justin: @justinbopp

Subscribe to BtB via Email

Enter your email address:

Delivered by FeedBurner

Most Commented

BtB Goes Social


Managers

Wbc_029_small Jeff Sullivan

Editors

Rawlings_baseball_bigger_small Dan Turkenkopf

Limes_125_small Sky Kalkman

770insig_small Jeff Zimmerman (TucsonRoyal)

Aviles_small Justin Bopp

Authors

Roots_game_small R.J. Anderson

Jinaz-reds-avatar_small JinAZ

Face_small Harry Pavlidis

1753738656_110919ebe9_o_small vivaelpujols

Ozzie_small erik

Raysring1_small Tommy Rancel

Redcap_small SFiercex4

St_louis_cardinals_ce1141_003263_small stevesommer05

Paige_small Satchel Price