Interleague Records Through 2010, and Implications for the AL vs. NL Disparity
The line is a two-year moving average.
This year, the National League had its best showing in interleague play since 2004, when the leagues almost exactly split the season series. The American League's overall record this year was 134-118 (.532 W%), which is almost exactly matching their PythagenPat record of .529 (1168 RS vs. 1098 RA). It is the second consecutive year that the NL improved its record vs. the AL, which hasn't happened since 2002-2003. As a National League fan, and as someone who tries to assess the league quality differences in the power rankings, this is terrific news. The ideal would be that the two leagues are equivalent; the NL may still be behind, but it seems to be making a comeback.
What's happening? We've talked before about some of the explanations for the American League's dominance over the past half-decade in interleague play. Once you dispense with the rule differences driving the entirety of the effect (please see this article first before you start ranting about DH advantages for AL teams), most explanations have converged on American League teams simply having better management. I think what we may be seeing is the National League teams starting to play catch-up, employing better use of fiscal resources via the draft and amateur free agent signings, making smarter moves based on better scouting and statistical information, etc. The Pirates and Reds are great examples of this shift, even if it hasn't paid off yet for their major league club.
...the rest of this is more geeky, for those interested in quantifying/estimating league disparities. Feel free to skip!
How does this news affect the power rankings? Over the past two years I've applied a 40-run per season boost to the run differential of AL teams (20 runs added to runs scored, 20 runs subtracted from runs allowed), and a 40-run penalty to NL teams when determining our Team Performance Index (TPI). This is based on several convergent sources of information, including interleague records as well as studies of players who switched teams. Do we need to reduce this league adjustment to account for this year's apparent NL resurgence?
Maybe. Let's just assume for a minute--and it's a big assumption--that the .532 W% posted this year represents the true talent gap between the two leagues. Using the odds ratio, we can estimate that a 0.532 W% would result when a true talent .516 W% team (the AL) played a true talent .484 W% team (the NL). Assuming a fairly typical MLB run environment, and using PythagenPat, this would mean that an AL team might be expected to score 760 runs and allow 734 runs when playing 0.500 teams, while the NL team would do the reverse. Based on this, and accounting for the fact that teams play 10% of their games in interleague and thus those games do not need adjusted in this manner, I estimate that we should apply a 23-run/season boost/penalty to the run differentials of teams instead of the 40-run adjustment I've been using in the power rankings. Pretty big difference!
But that might not be fair. This year might be an aberration, and we may see the AL return to its typical dominance next season. Maybe there were some match-up issues going on this year that tweaked the records to favor the NL more so than is typical? A more conservative approach would be to take a 2-, 3-, or even 5-year average of interleague records and use that as the basis for our adjustment. Here's a table showing how doing so would affect our league adjustment:
| Time Span | AL W% | Estimated AL True Talent | Estimated NL True Talent | Per-Season Adjustment to Run Differential |
| 2010 Only | 0.532 | 0.516 | 0.484 | 23 |
| 2009-2010 | 0.539 | 0.520 | 0.480 | 28 |
| 2008-2010 | 0.556 | 0.528 | 0.472 | 40 |
| 2006-2010 | 0.565 | 0.533 | 0.467 | 47 |
This means that the currently-used league adjustment of 40 runs is (exactly!) appropriate if we use a 3-year moving average of interleague records to estimate league talent levels. But it's not appropriate if we only look at this year or the past two years.
I'm honestly not sure what to do here. My inclination is to err toward the smallest league adjustment above, if only because using interleague records is an imperfect measure of league quality differences. It sort of builds in some "regression" (in the loosest possible sense), in that we're leaning toward evenly matched leagues in our decision making process. So, I'm leaning toward pulling the adjustment down to 23-runs instead of 40. That said, I've argued in the past for using multiple years as a more accurate, predictive measure of league differences, and so I'm reversing myself here because it gives me a more palatable result. That's not exactly a scientific approach.
...what do you folks think?
17 comments
|
0 recs |
Do you like this story?
Comments
I don't know too much about how to perform a regression
But could you use this year’s interleague result, and then regress it appropriately based on the year-to-year correlation of interleague winning percentage?
Hmm...
I dunno. Seems confounded with skill changes. I’m also not sure that it would be a particularly strong correlation. Good idea, though, let me think more on it.
-j
I write at:
Beyond the Boxscore | Red Reporter | Basement-Dwellers.com | Twitter: @jinazreds
How about including and weighting the past 3 or 4 years
such as 5-4-3 for 2010, 2009, and 2008?
Would you like to follow me on Twitter, Facebook, or my blog...well you can't.
by SagehenMacGyver47 on Jun 28, 2010 4:16 PM EDT reply actions
That was my initial thought
And I know those numbers are common, but do they have any real justification or are they just magic numbers that seem to work well?
I feel like this analysis should remain as “scientific” as possible.
I looked at a 5-4-3 weighting too.
It’s pretty close to the 3-year average. 0.551 W% instead of 0.556 W%.
The problem I have with using the 3-year average (or weighted average) is that it might be missing what seems to be an important event: the NL teams waking up and getting better. That could be a mirage, but I trust 250 games more than I trust a hitter’s single season hitting line. Maybe I’m misguided.
At this point, I’m leaning toward using the two-year average. It’s still a fairly large drop in the league adjustment, but it’s acknowledging that what we saw in interleague this year might not be what we can expect to see in the future.
-j
I write at:
Beyond the Boxscore | Red Reporter | Basement-Dwellers.com | Twitter: @jinazreds
My preseason forecasts
showed an AL advantage of 135-117
http://www.insidethebook.com/ee/index.php/site/comments/chone_team_forecasts/
See post #71. Off by one win. I blame Fernando Rodney for his 5/23 meltdown against the Cardinals for that inaccurate forecast.
"That boy is our last hope" - Obi Wan Scioscia, as Francisco Rodriguez left for the Mets. "No, there is another" - Yoda Reagins.
In all seriousness, though
The fact that this year’s performance is in line with a preseason projection bodes well for its validity as an indication of true talent. I still favor using the two-year average, but I think you can make a good case for using the single year outcome, this time, based on this info.
-j
I write at:
Beyond the Boxscore | Red Reporter | Basement-Dwellers.com | Twitter: @jinazreds
I don't think you can
That was one projection system…if a lot of projection systems agreed on the interleague number, then I’d agree. But the fact that Rally was so close could have just been a “lucky guess.”
My point is more...
…just that the fact that this lines up with a preseason projection is better than if it did not line up with this preseason projection.
-j
I write at:
Beyond the Boxscore | Red Reporter | Basement-Dwellers.com | Twitter: @jinazreds
I know this is kind of nitpicky, and I'm sorry, but
I don’t even think you can say that’s true. How do we know this isn’t a very inaccurate projection outside of interleague results?
You can trust the track record of the system
It’s not like folks are just putting projections out there and the saber community isn’t looking at them retrospectively. Can they miss on players/teams? Of course, but that doesn’t mean there isn’t value in using them in a context such as this.
Agree with your premise above that it would be ideal if all of the projections had a similar outcome re:interleague. I’m sure it’s a testable thing.
by stevesommer05 on Jun 29, 2010 9:26 PM EDT up reply actions
I'm not sure that all projections explicitly include league adjustments
CHONE does. MGL’s unpublished stuff does. I think ZiPS does, though I’m not positive. I’m not sure that PECOTA does, though it might. Marcel absolutely does not.
-j
I write at:
Beyond the Boxscore | Red Reporter | Basement-Dwellers.com | Twitter: @jinazreds
Right, I agree with Steve.
CHONE is probably the big dog among projection systems, and has been for a few years. It works well as an estimate of true talent. His team-level projections probably aren’t as good as his player projections, but I feel better knowing that the preseason stuff is in line with the actual results.
-j
I write at:
Beyond the Boxscore | Red Reporter | Basement-Dwellers.com | Twitter: @jinazreds
Maybe this is an almost non-factor
but re: AL dominance, I know the DH has largely been disregarded as a factor here, but I’m curious as to the fact that the talent pool is simply larger for AL teams than it is for NL teams, because AL teams can simply employ players with zero fielding skills, while in the NL that player would be almost useless except in a pinch hitting role. I’m sure this isn’t the main cause, but it seems significant enough that players who can hit but are a detriment in the field hold a lot of value for AL teams and almost no value to NL teams. Again, I’m sure this is pretty negligible, but I don’t think it should be disregarded, as AL teams have some sort of value advantage both in trades and FA signings of these players.
Your prototypical weak-hitting second baseman.
Doesn't this put AL teams at a disadvantage when playing under NL rules?
Often times, one of their main cogs in the lineup is either a) forced to play the field where his weakness is exposed and thus he loses value, or b) sit on the bench and get only one PA per game.
-j
I write at:
Beyond the Boxscore | Red Reporter | Basement-Dwellers.com | Twitter: @jinazreds





















