Daily Box Score 10/26: Guess Again
It is the sovereign right of every sports fan to question--loudly, if need be--the decisions of those who are lucky enough to coach professional teams. For the same reason I have little sympathy for celebrities who complain about the paparazzi even as they build their brands, I have little sympathy for managers whose bad decisions are second-guessed.
Without a doubt, the peak sofa-managing month is October. With seasons hanging in the balance, every pinch runner, relief pitcher, and bunt call is intensely scrutinized. With quantitative methods sufficiently developed, you'd think the answers would be as obvious to managers as they are to you and me.
And yet, the second-guessing continues.
Table of Contents
Clear!
Second Guesser
Defining the Class
Discussion Question of the Day
The foundation of much of the second guessing that happens among quantitatively minded baseball fans these days is FanGraphs. During each game, including the postseason, they update charts displaying the win probability of each team over the course of the game. Here's an example of an exciting one:
What I find amazing about these graphs is the degree to which they resemble an EKG of an Angels or Yankee fan over the course of a game. It's irregular, spiky, and ends either up or down. At the end of the game, one team is pronounced dead.
But it also gives a clear sense of the ups and downs of the game. If a manager calls for a sacrifice bunt and it is successful, the decreased win expectancy will materialize immediately on the live graph. Alternatively, if the bases are loaded with no outs, but a team has a five run lead, the win probability may still reflect the fact that the team with the lead is the overwhelming favorite. It almost takes the fun out of it. (I said almost.)
And now these graphs are everywhere! You can now purchase a FanGraphs application for your phone. Who's in his mother's basement now, HUH?!
But before we get too carried away, it's important to remember the limitations of these graphs. First, they are based only on the run scoring environment, the score, the base-out position, and the inning. That's it.
No consideration is given to the strength of the teams, let alone of individual players. Not only are the overall talents of individual players not accounted for, neither are the shapes of their talents. Over hundreds of games, these talents cancel out. But in a single playoff game, these differences make all the, well, difference.
So when the win expectancy goes down after a sacrifice bunt, what have we really learned? Anything more than the mostly uncontroversial observation that outs are bad?
Let's consider another example.
The USS Mariner warmachine rolls along with its newest offering: a $2.99 application, also in the iPhone app store, that explicitly promises to help you second-guess the decisions of managers in real-time. Derek Zumsteg writes of his new creation:
There’s also the managerial view, which allows you to game out whether it’s a good idea to steal, bunt, or intentionally walk the hitter in the current situation
You’ve probably seen me rant about this kind of in-game stuff here on USSM over and over, citing Tango’s Inside the Book, massive studies on when it makes sense to bunt, or how crazy it is to intentionally walk batters in most situations where it’s considered normal.
I really think using run expectations and WPA are key to understanding effective in-game strategies, and I hope that in offering a really easy way to experiment with tactics, especially as you follow a game, it’ll make all of this more relevant and understandable.
Now, in general, I'm all for bringing WPA to the masses. But, as they say, a little knowledge is a dangerous thing.
I must admit that I have not plunked down the cold, electronic cash for 2nd Guesser, as the app is known. But it does not appear that this app makes any allowances for the skill sets of individual players either. One thing it does do is allow you to adjust the break-even percentage for stolen base attempts, but even that appears to be a relatively unguided choice, delinked from the overall run-scoring environment.
But it really can't be stressed enough how much it matters how strong the players in question are. Here's an example, the numbers for which I have borrowed from The Book.
Imagine you are Joe Girardi. It is Game 4 of the World Series, and it is the bottom of the third inning. Having been chased from the game, C.C. Sabathia sits on the bench while Chad Gaudin stands on the mound (just play along). The Yankees are trailing by five runs, 6-1. There is currently a runner (Jayson Werth) on second base. There are two outs. Raul Ibanez is about to bat; Pedro Feliz stands in the on-deck circle.
What, if anything, should you call for? I'll give you a moment to think about it.
I am guessing your first reaction is to say, "do nothing and let Gaudin pitch."
But this would be the wrong answer! In this situation, an intentional walk will actually INCREASE win probability as long as the ratio of the wOBA of the batter at the plate to that of the batter in the on-deck circle is at least 1.25. Ibanez (.379 this year) is that much better than Feliz (.302 this year, for a ratio of 1.255) even before you consider the platoon advantage.
So Girardi should signal for Gaudin to walk Ibanez to get to Feliz. But I'm confident two things are true: 1) Girardi would almost never call for an intentional walk in this situation, and 2) every stat-minded fan would be talking about it for a week if he did.
If you're at all like me, I know what you're thinking. We can refine the methodology, so the application scrapes data in real time about the individual players and then inputs it into the program. We could do Markov chains 'til the cows come home; we could compare individual players to similar historical players. But at each step along the way, we are introducing assumptions into the analysis.
The eagle-eyed among you probably noticed one methodological flaw with my hypothetical scenario outlined above: I assumed that a player's seasonal wOBA was a good predictor of his wOBA in the next at-bat. And I did it without justification!
All models require simplifying assumptions. Those assumptions are what make them simple, and therefore useful. But if we pass along the models from one person to the next, we tend to forget the assumptions, which convey information about a model's weakness, and focus only on its predictive strengths.
(Perhaps you've heard about this problem?)
The only way to use a model effectively is to familiarize yourself with its strengths and weaknesses. It's something worth bearing in mind as quantitative analysis gains traction with a wider audience.
Discussion Question of the Day
How accurate of a picture of in-game probabilities do live WPA charts give? Are there specific ways we could improve them? Structural limitations that preclude perfection?
0 recs |
8 comments
| Add comment
|
Comments
WPA is very useful, but you're right that each situation requires talent adjustments
Where you get those adjustments is beyond me (your choice of projections, in-season, offseason, whatever you want to do). What I think WPA introduces is the idea of the generally good or bad move, which is the first step the hypothetical manager would have to take to figure out what to do. Once he knows what average teams do, he can then apply the appropriate adjustments accordingly.
Of course, there’s dispute about the source to use to apply these adjustments and how much time indeed the manager will have to make these decisions. But if managers know the average right move on the fly, like 2nd Guesser generally allows, the adjustments can be done much faster, or perhaps more efficiently. You don’t need to figure out if the move is good, just if the move in this case is good.
Marlin Maniac, a Florida Marlins blog
Come attend Intro to Sabermetrics 101!
Check me out at Beyond the Box Score as well.
by SFiercex4 on Oct 26, 2009 11:32 PM EDT reply actions 0 recs
Here is a nice attempt by the blog, Vegas Watch on starting with the Vegas line winning % and then going with an adjusted WPA
http://vegaswatch.net/2009/10/series-win-probability-graphs-stl-vs.html
Jeff Zimmerman - Protecting the world from RBI's and Wins from my mom's guest house.
by Jeff Zimmerman (TucsonRoyal) on Oct 27, 2009 2:03 AM EDT reply actions 0 recs
There has to be a relatively easy way to do this
at least on the most basic level. The Book’s calculations are based on running a simulation over and over where every batter has the same wOBA. Can it not be done for other wOBA’s based on platoon advantages or disadvantages? You’re right that a sac bunt w/ the average hitter on deck isn’t the same as one w/ Albert Pujols on deck w/ a platoon advantage but I don’t see why those simulations can’t be run the same way the authors of the book ran them.
by chuckb on Oct 27, 2009 10:12 AM EDT reply actions 0 recs
Potentially they could
But could they be done in real-time and in an automated fashion? If so, it would be extremely impressive.
by Tommy Bennett on Oct 27, 2009 7:30 PM EDT up reply actions 0 recs
“How accurate of a picture of in-game probabilities do live WPA charts give? Are there specific ways we could improve them? Structural limitations that preclude perfection?”
What if you were to take the data from a full year [or multiple years] and look at how close it is to predicting the game outcome on an inning by inning basis;
IE, if in games where the Win Probability says the home team has a 75% chance of winning the game at the end of the 5th are ultimately won by the home team about 75% of the time, then it would seem to suggest a reasonable amount of accuracy, no?
by erosen on Oct 27, 2009 12:53 PM EDT reply actions 0 recs
There's stuff done on that
I’m pretty sure there are simple WPA tools that are based on an era’s game results rather than on Markov chains.
Marlin Maniac, a Florida Marlins blog
Come attend Intro to Sabermetrics 101!
Check me out at Beyond the Box Score as well.
by SFiercex4 on Oct 27, 2009 1:32 PM EDT up reply actions 0 recs
They would almost certainly come out right in this sort of study,
because they are based on game data. Over the course of a season or so, the numbers would all come out right, because the matchups, etc. would all even out. What the real question is is how accurate are they on a game by game basis? Can we really second guess the manager based on a win expectancy chart?
My guess is that, although the manager has a lot of information that we don’t have, that it would be better to do what the win expectancy chart says every time, than to let the manager make subjective adjustments. What I’m saying is, that WPA would probably be a better manager than any of the current skippers, at least when looking at IBB and bunts. You could extend that to relievers, but WPA would probably put Miriano Rivera in every game of the season, so that might be a problem.
by lookatthosetwins on Oct 27, 2009 1:32 PM EDT up reply actions 0 recs
According to this,
http://www.fangraphs.com/blogs/index.php/were-the-yankee-sac-bunts-in-the-8th-inning-correct
I’m wrong. Using win expectancy would yield a too predictable result. Oh well. I’ll continue to second guess, as long as the managers are just as predictible.
by lookatthosetwins on Oct 27, 2009 1:47 PM EDT up reply actions 0 recs

by 











BtB on Facebook
















