Beyond the Box Score: An SB Nation Community

Navigation: Jump to content areas:


Sports blogs for fans, by fans.
Around SBN: Spencer Hall's Sports Meme Power Rankings

Using the TVC

General question for the members of Btb:  When using Sky Kalkman's Trade Value Calculator, during the offseason, do you include '09 numbers to determine the players value?

Josh Johnson for example, if you input his 5.5 WAR, and $1.4 million salary, and then input 60 and 80% for his '10 and '11 salary, with a 4.7 and 4.3 WAR for '10 and '11 respecitvely, his surplus value comes out to be a little over $41 million.  But this just doesnt seem right to me.  Why would a team pay for past performance?

If you eliminate the '09 information altogether, but leave the '10 and '11 the same, the surplus value comes out a little over $17 million.

Should you just split the difference to determine his value?  Or is the $17 mil number correct, and if so, it seems a little low.  Any advice?

5 comments  |  0 recs

Determining Batted Ball Rates using Pitch Type and Location

It is well-established that pitchers have control over their ground ball and fly ball rates--some pitchers, like Roy Halladay, are known for their extreme ground ball tendencies.  But what allows these pitchers to achieve a markedly different batted ball profile from the average pitcher?  I decided to use Pitch f/x data to determine whether batted ball rates depend on pitch type (as classified by Gameday) and location.

Continue reading this post »

1 comment  |  0 recs

a new xBABIP calculator

I've been a big fan of the hardball times xBABIP calculator over the last 6 months or so, but there were a couple of things that I didn't like about it.  The first thing I didn't like, was having to stick in exact numbers for AB's, HR's, etc.  When dealing with projections, I much prefer to work in percentages.  With percentages you can see what their BABIP for a partial season, or even a span of several years, or a career much easier.  I also am not so sure about the inclusion of stolen bases as a statistic.

I'm a big fan of the fangraphs website, and they provide a wide array of batted ball data for each player.  I determined that BABIP is very strongly determined by a combination of LD%, GB%, FB%, IFFB%, HR/FB%, and IFH%.  That is to say, as much as BABIP can be.  This is right along with what the hardball times uses, except in my case, I'm dealing strictly with percentages, and I've substituted in IFH% as opposed to SB's.  It's worth noting, that I'm not taking into account ballpark factors (which surely have some kind of effect on BABIP as well).

I came up with my numbers, plotting a large amount of data (3 years worth of individual player statistics), and doing a multi-variable regression analasys on it (I'm not sure if that's the right wording or not, I have no formal training in statistical analsys, just some stuff I've picked up).

Here's the equation I came up with:

xBABIP =0.391597252 + (LD% x 0.287709436 ) + ((GB% - (GB% * IFH%) ) x -0.151969035 ) + ((FB% - (FB% x HR/FB%) - (FB% x IFFB%)) x -0.187532776) + ((IFFB% * FB%) x -0.834512464) + ((IFH% * GB%) x 0.4997192 )

Here's a published view of a spreadsheet showing it in action:

http://spreadsheets.google.com/ccc?key=0AuaVTUnZda7fdFVpY2NoRC1zS1p0UlNPaDlVdlRhN1E&hl=en

Here's a download of the spreadsheet in open office (Forgive the lame hosting service, I wasn't sure where to upload):

http://www.filefactory.com/file/a1a2d5a/n/public_xBABIP_Calculator_ods

I've been using the following calculator (along with a number of other equations) to build my own projections for 2010, and here are a few of the interesting things I've noticed.

First off, LD% has a very strong correlation to BABIP (not exactly a revolutionary statement), but it's also very hard to project it seems.  There seems to be a lot of luck built into it, so even taking career LD% rates is still factoring in some luck, so I tend to trend them closer towards the league average (19.5).  

GB% is a little easier to predict  Higher GB% tend to yield higher BABIP's, but that's based on your IFH% as well.  A player who can post high IFH% with a lot of ground balls will greatly increase their BABIP, while a slow player with a terrible IFH% with a lot of GB% won't increase their BABIP nearly as much (makes sense).

FB% is again easier to predict then LD% typically, and high FB% tend to yield lower BABIP's, as they are more likely to record outs.  But you've got to look at HR/FB, and IFFB% as well to get an accurate picture.  A player who hits a ton of fly balls, but has a very high HR/FB rate, with a very low IFFB% (ryan howard), can post more respectable BABIP's (they have a better shot of landing if they are getting out of the in field)

HR/FB is also a little easier to predict, and doesn't directly effect your BABIP, it's only used to take the home runs out of your fly balls (which in turn helps your BABIP).  One thing that strikes me as problematic here, is line drive home runs.

IFFB% seems somewhat player controlled, but also has a large luck component to it  from year to year (probably largely due to sample size).  This has a definite impact on your BABIP, as fly balls on the infield are automatic outs.

IFH% seems very speed dependant.  The more in field hits you have, the higher your BABIP as well.  This can vary from year to year with luck, but generally speedy players will post better (there are a few notable exceptions, like jason bay's abnormally high IFH%, which I chalk up to some luck) numbers.  Ballpark factors play a role here I'm sure as well (which I'm not accounting for).

So in the end, what we get, is a way to take numbers directly from fangraph (over the course of a career, full season, or even partial season), and get a descent idea of what their BABIP should be like, and how lucky they have been.  As always, this will still vary a lot from year to year (and the BA, OBP, and SLG along with it), but this is an attempt at trying to get an idea of what that middle number, that the BABIP will fluctuate around is for a given player.  Outside of using a calculator like this one, or the hardball times, the next best way to evaluate BABIP is probably to look at a players career numbers, but even those are prone heavily to be skewed by some lucky streaks.

I'm very interested in any feedback/critique that anyone has to offer, or any ideas on improving it.  I've also got a number of other calculators (one that does batting average, xHR, xR, xRBI, xSB, xAvg, xOBP, xSLG, that I'd be willing to throw out there as well, but I figured before I went through the trouble, I'd see what kind of buzz I get from this one.

13 comments  |  1 recs

Top 15 high school MLB draft prospects

Greetings fellow Beyond the Box Score readers,

I write about the MLB Draft for Examiner.com and am hoping to direct some traffic to my first post on the top 15 high school MLB draft prospects.  Please feel free to comment (preferably on my site, but I'm happy to respond here as well).

Here is some information about me:

Jesse Burkhart, 23, is a North Carolina resident and a former correspondent for Baseball America.  He attended USA Baseball's Tournament of Stars in Cary, N.C. and the East Coast Pro Showcase in Lakeland, FL in 2009, and he regularly attends other high school and college baseball games along the east coast.  He recently graduated from the University of North Carolina at Chapel Hill with a degree in Journalism & Mass Communication.  You can contact him at jesse.burkhart@gmail.com.

 

Thanks for reading,

Jesse

                                                                                                                                                                                                        twitter | jesseburkhart     AIM | jb02186

0 comments  |  0 recs

PZR-based Win Values 2001-2006

When we use a DIPS-based pitching statistic like FIP to calculate win values, our intention is to isolate the pitcher's contributions from those of his defense.  FIP, however, is only a crude approximation of the truth: it assumes that all pitchers should have a league-average BABIP, it fails to correct for double plays and pitchers who struggle out of the stretch, and as a linear run estimator, it underestimates good pitchers and overestimates bad pitchersPZR does a much better job of isolating pitching from defense without removing the effects of timing or hit quality.  PZR is UZR, but from a pitcher's perspective.  The pitcher is credited the run value of each batted ball, based on its trajectory, location, and speed, regardless of whether or not the ball fell for a hit.  When these run values are totaled, we obtain a measure of a pitcher's "defensive support;" we can estimate his true performance by adding his defensive support to his actual runs allowed.  As a result, PZR is perfect for Win Values, since it separates defense from pitching without removing anything else.

Continue reading this post »

9 comments  |  5 recs

The "30 parks on a budget" challenge


The World Series is over.  You're looking at another off-season of trying to figure out if your team should sign some middle-of-the-road middle infielder who won't bring the pennant next year either.  And because you can only do that for about 15 minutes before crying, you need something to do that can keep your mind on baseball, so that you don't have to resort to *gasp!* watching the NFL, NBA, NHL, EPL (go Everton!), WWE, or SCOTUS.  (Tonight on Monday Night Litigation, it's Sotomayor vs. Alito.  Justice will prevail!)

No, let's do something baseball-related.  Let's plan a baseball road-trip, the best baseball road trip ever.  And let's do it on a budget.

Your mission is simple:  In 2010, you will (in your imagination) travel to all 30 MLB parks, see 30 games, and you will figure out how you would do it at the lowest possible cost.  At least in theory.  (It's a free country, so if you actually want to do this in real life, you can... but the point is that it's a fun mental exercise.)  Using the real 2010 MLB schedule, and some other web-based resources, you will put together an itinerary that would allow you to meet your goal.  The person who can do it in the lowest amount of cash will win a cookie and bragging rights.  (Note: no actual cookie will be awarded.)

I will tell you, this "game" is horribly addictive.  Worse than Farmville.  You won't think so at first, but it is.

Intrigued?  The rules are below...

Continue reading this post »

29 comments  |  5 recs

World Series Simulation, Game #6

I used my simulator to simulate 100000 baseball games between the Phillies and Yankees using what to the best of my knowledge will be close to today's starting lineups.  The simulator outputs a win probability for each team, along with the average total runs scored, a distribution matrix of all the final scores and how often they occurred, and an averaged box score tally of all games combined.  You may find it strange that there are so many one run games listed as the most likely score, and that the home team is always favored to win in the top couple of scores.  This is due to basic math and the way that baseball rules play out of having the home team bat last.  Here is a great article explaining this phenomenon.

Today's Results... (Last simulation ran on Monday at 930PM)

 

Visitors Home Pitching Matchup Favorite Vegas Win Prob Simulator Win Prob AccuScore
PHI
NYA
P.Martinez vs A.Pettitte NYA 64.97%
64.66%
61%

 Skinny:  Simulator Fun Facts... Vegas and my simulator are in almost perfect agreement on the Yankees win probability.  Once again the key for the Phillies will be to silence the heart of the Yankees order as much as possible.  Not an easy thing to do.  AccuScore has come in with the lowest win probability of the three for the Yankees.  Meanwhile, the Vegas line has moved up to 65.87% since the opening lines came out.

 

Top 100 Most Likely Scores

1 NYA 4-3
2 NYA 3-2
3 NYA 5-4
4 NYA 6-5
5 NYA 2-1
6 PHI 4-3
7 NYA 4-2
8 PHI 3-2
9 PHI 5-4
10 NYA 5-3
11 NYA 3-1
12 NYA 4-1
13 NYA 5-2
14 NYA 7-6
15 NYA 6-2
16 NYA 6-4
17 NYA 6-3
18 NYA 5-1
19 PHI 4-2
20 PHI 5-3
21 PHI 2-1
22 NYA 3-0
23 NYA 7-2
24 PHI 6-5
25 PHI 6-4
26 NYA 7-1
27 NYA 4-0
28 NYA 7-3
29 PHI 5-2
30 NYA 7-4
31 PHI 3-1
32 PHI 6-3
33 NYA 6-1
34 NYA 7-5
35 NYA 8-3
36 PHI 4-1
37 NYA 8-7
38 NYA 1-0
39 NYA 2-0
40 PHI 7-4
41 NYA 5-0
42 PHI 7-5
43 PHI 7-6
44 NYA 8-4
45 PHI 6-2
46 NYA 6-0
47 NYA 8-2
48 NYA 8-1
49 NYA 9-3
50 PHI 7-3
51 PHI 5-1
52 NYA 8-5
53 NYA 8-6
54 NYA 9-2
55 PHI 2-0
56 PHI 8-6
57 NYA 9-4
58 PHI 8-4
59 NYA 9-8
60 PHI 8-3
61 NYA 7-0
62 NYA 10-2
63 NYA 9-1
64 PHI 6-1
65 PHI 8-5
66 PHI 4-0
67 PHI 8-7
68 NYA 9-5
69 NYA 8-0
70 PHI 7-2
71 NYA 10-4
72 PHI 3-0
73 NYA 10-3
74 PHI 1-0
75 PHI 8-2
76 NYA 9-6
77 PHI 7-1
78 NYA 10-1
79 NYA 10-5
80 NYA 9-0
81 PHI 9-8
82 PHI 9-4
83 PHI 5-0
84 NYA 11-3
85 NYA 11-5
86 PHI 9-5
87 PHI 8-1
88 PHI 9-2
89 PHI 9-3
90 NYA 11-2
91 NYA 11-1
92 PHI 9-6
93 NYA 9-7
94 NYA 10-6
95 NYA 11-4
96 PHI 10-3
97 PHI 9-7
98 PHI 6-0
99 NYA 10-9
100 PHI 10-5

 

Game Pitching Results

Pitcher IP SO BB HR WHIP FIP
P.Martinez 6.4 3.994 2.521 1.17 1.465 5.524
A.Pettitte 7.0 5.164 2.746 0.78 1.255 4.360

 


Note: Keep in mind I did this simulation before knowing the actual starting lineups, so some of the players I used may not be starting.   Many of the "Players Most Likely To" stats depend on having the correct lineup.  The lineups  I used are listed below.  Chances are you won't see a big difference in win probability from having the lineup order slightly off.  Picking the correct starters is more important, especially if one of the better players is taking the day off.  But you still won't see a big change in win probability if the starter and backup are interchangeable.

Simulation Lineups
  Name wOBA Name wOBA
1 J.Rollins .2986 D.Jeter .3301
2 S.Victorino .2937 J.Damon .3450
3 C.Utley .3195 M.Teixeira .3843
4 R.Howard .3124 A.Rodriguez .3837
5 J.Werth .3282 H.Matsui .3533
6 R.Ibanez .3052 J.Posada .3590
7 B.Francisco .2921 N.Swisher .3454
8 P.Feliz .2810 R.Cano .3233
9 C.Ruiz .2821 B.Gardner .3003

 




3 comments  |  0 recs

JT20 Dynasty League

I have started started a free fun dynasty league in which each person takes control of a MLB roster & minor league system, drafts, signing free agents....

If this sounds interesting to you, go check out the league message board & league rules and sign up. There is no prize for the winner for each season, it is all just for fun, so join!

http://jt20.proboards.com/

thought people here would be interested sign up soon!

 




0 comments  |  0 recs

New Look

As part of SB Nation's visual refresh, Beyond the Box Score has a new look. Some widgets have moved around, but nothing should be too hard to find.

Like all upgrades, things may not work perfectly at first. If you encounter any issues, please let us know so the coders can address the problem. Thanks for reading.

15 comments  |  1 recs

Exploring Hit f/x, Albeit Badly

 

This summer, Major League Baseball revealed its newest statistical plaything, hit f/x, which records the speed of batted balls. This technological toy may not exactly revolutionize the way the game is played, but it does add a critical variable to a hitter’s value – how hard he can hit. Now teams can determine which batters truly hit the stuffing out of the ball.

Strangely, this hit-tracker data seems to go against conventional baseball wisdom. Hit-tracker results have recently revealed that players with lower batting averages actually hit the ball harder than those with high batting averages. Does that mean that to achieve a high degree of success, a hitter needs to hit the ball softer that he otherwise would? Looking beyond the numbers, we discover that this conundrum is an issue of logic.

It is generally accepted in the literary world that plot and character vary inversely; rarely do we find a book with an in-depth plot and intricately developed characters. Similarly, in the world of baseball, speed varies inversely with power; there are exceptions, but it is unusual for a power-hitter to steal 20 bases in a season, or for a lanky speed-demon to hit 30 homers.

A substantial percentage of the batters with batting averages over .300 in the 2009 season had around 15-18 infield hits. There were outliers, of course, but the trend stuck for the most part.  Let’s take a look at how some of baseball’s top hitters (in terms of batting average) might have fared without that extra bit of speed.  Obviously this is not completely indicative of how they actually would have done, but it’s an interesting experiment.

A table:

# of Infield Hits Batting Average Batting Average Minus Percentage of Infield Hits
Ichiro Suzuki 50 .352 .273
Jacoby Ellsbury 26 .301 .259
Pablo Sandoval 16 .330 .302
Denard Span 23 .311 .271
Jason Bartlett 15 .320 .290
David Wright 13 .307 .283
Scott Podsednik 27 .304 .254
Skip Schumaker 13 .303 .278
Hanley Ramirez 17 .342 .312
Nyjer Morgan 15 .307 .275
Derek Jeter 22 .334 .299
Erick Aybar 16 .312 .280

Notice that this trend only serves to prove the importance of speed in the batting averages of players who have an abundance of it.  David Wright and Pablo Sandoval still retain high batting averages despite when we subtract a portion of their speed.

Because power-hitters tend to be physically larger and thus slower than other hitters, they don’t beat out as many groundballs as quicker hitters, decreasing their batting average by approximately 17/550, or almost .031. For example, Wladimir Balentien, formerly of the Seattle Mariners, had the fastest average batted ball speed during April 2009 (according to Matthew Carruth of Lookout Landing http://www.lookoutlanding.com/2009/6/6/901100/mariner-hitters-batted-ball-speeds), but a paltry average. Ichiro, on the other hand, had the lowest average batted ball speed but his usual high average during that stretch.

In reality, batters that hit the ball hard often aren’t quick enough to reach first base on a weakly hit ground ball, knocking a significant amount off their average.

Contact rates actually player a larger role in this issue than does speed.  Players, like the aforementioned Balentien, who swing for the fences all the time usually have low contact rates.  And you’re certainly not going to get a hit if the ball doesn’t leave the catcher’s mitt.  Here I generalize quite a bit – not all power hitters are free-swingers, but those that do see that batting averages drop quite a bit despite hitting for power and a high hit f/x value.

What does this mean?  Honestly, I don’t know.  Infield hits are flawed because in using them to make a point, one must assume that the official made the right call (error vs. hit).  Plenty of power hitters are disciplined.

Any thoughts?  I'm really not sure what to take from this.

6 comments  |  1 recs


Managers

Nando_small R.J. Anderson

Limes_125_small Sky Kalkman

E52205a2_small Tommy Bennett

Editors

Face_small Harry Pavlidis

Rawlings_baseball_bigger_small Dan Turkenkopf

770insig_small Jeff Zimmerman (TucsonRoyal)

Aviles_small Justin Bopp

Authors

Banny_small erik

Raysring1_small Tommy Rancel

Jinaz-reds-avatar_small JinAZ

Jmlogo_small Jack Moore

1753738656_110919ebe9_o_small vivaelpujols

1_small Graham

Baseball_small Mike Rogers

Redcap_small SFiercex4

Small Patrick Clark

Walter_album_small Walter Fulbright