BtB Power Rankings: Through Tuesday, August 18th, 2009
Welcome to our weekly ranking of all the MLB teams! In this ranking, we use aggregate team hitting, pitching, and fielding statistics--not team wins, losses, runs scored, or runs allowed--to evaluate the performance of teams to date. You can think of the estimated winning percentage (eW%) below as how we'd expect things to fall out if we threw teams with these aggregate statistics into one big league and let them battle it out for thousands of games. They don't replace your actual standings, but they give you something different to consider when thinking about team performances. The table is sortable if you click in the header. All data are park-adjusted when possible. A legend is below the table, followed by commentary. There is also a table comparing actual vs. expected run scored and run allowed totals, as well as actual vs. expected winning percentages.
Beyond the Boxscore Power Rankings
| Rank | Chg | Team | wOBA | eRS | tERA | tRns | Fld | eRA | eW%lg | LgAdj | eW% |
| 1 | 0 | NYA | 0.362 | 704 | 4.43 | 572 | 8.0 | 564 | 0.608 | 14.9 | 0.630 |
| 2 | 0 | TB | 0.352 | 644 | 4.41 | 553 | 30.5 | 522 | 0.600 | 14.7 | 0.623 |
| 3 | 0 | BOS | 0.340 | 599 | 3.96 | 501 | -23.3 | 524 | 0.563 | 14.7 | 0.588 |
| 4 | 0 | LAN | 0.337 | 610 | 3.92 | 514 | 26.0 | 488 | 0.604 | -14.9 | 0.579 |
| 5 | +2 | CHA | 0.326 | 537 | 3.81 | 485 | -8.7 | 494 | 0.538 | 14.9 | 0.565 |
| 6 | 0 | COL | 0.328 | 542 | 3.57 | 450 | -1.5 | 451 | 0.584 | -14.8 | 0.557 |
| 7 | +1 | TEX | 0.334 | 556 | 4.40 | 555 | 32.2 | 522 | 0.530 | 14.7 | 0.555 |
| 8 | -3 | TOR | 0.331 | 566 | 4.20 | 533 | 0.7 | 532 | 0.529 | 14.6 | 0.554 |
| 9 | 0 | LAA | 0.353 | 645 | 4.82 | 603 | -5.0 | 608 | 0.529 | 14.6 | 0.552 |
| 10 | +1 | CLE | 0.339 | 600 | 4.68 | 588 | -21.9 | 609 | 0.492 | 14.7 | 0.516 |
| 11 | +2 | MIN | 0.337 | 594 | 4.55 | 578 | -26.7 | 605 | 0.491 | 14.8 | 0.515 |
| 12 | +2 | PHI | 0.338 | 585 | 4.43 | 559 | 17.2 | 542 | 0.537 | -14.4 | 0.512 |
| 13 | -3 | ARI | 0.321 | 531 | 3.90 | 506 | 16.5 | 490 | 0.537 | -14.9 | 0.510 |
| 14 | -2 | ATL | 0.326 | 543 | 3.86 | 494 | -10.0 | 504 | 0.535 | -14.8 | 0.508 |
| 15 | 0 | DET | 0.326 | 528 | 4.54 | 569 | 19.5 | 549 | 0.481 | 14.7 | 0.507 |
| 16 | 0 | OAK | 0.318 | 506 | 4.16 | 531 | -13.3 | 545 | 0.466 | 14.8 | 0.492 |
| 17 | 0 | CHN | 0.318 | 512 | 4.02 | 508 | 5.7 | 503 | 0.508 | -14.6 | 0.481 |
| 18 | +2 | FLA | 0.332 | 574 | 4.35 | 556 | -12.1 | 568 | 0.505 | -14.8 | 0.480 |
| 19 | -1 | STL | 0.327 | 542 | 4.11 | 529 | -8.2 | 537 | 0.504 | -15.1 | 0.478 |
| 20 | -1 | SEA | 0.316 | 495 | 4.74 | 606 | 44.1 | 562 | 0.441 | 14.8 | 0.467 |
| 21 | 0 | BAL | 0.327 | 544 | 4.69 | 593 | -30.1 | 623 | 0.435 | 14.8 | 0.459 |
| 22 | 0 | SF | 0.304 | 440 | 3.90 | 495 | 41.4 | 454 | 0.486 | -14.8 | 0.457 |
| 23 | +1 | MIL | 0.336 | 579 | 4.90 | 614 | 10.1 | 604 | 0.479 | -14.8 | 0.455 |
| 24 | -1 | KC | 0.313 | 476 | 4.21 | 527 | -43.5 | 570 | 0.417 | 14.8 | 0.443 |
| 25 | +1 | NYN | 0.322 | 519 | 4.32 | 546 | -27.0 | 573 | 0.453 | -14.8 | 0.428 |
| 26 | +1 | WAS | 0.332 | 566 | 4.96 | 619 | -19.1 | 638 | 0.442 | -14.8 | 0.418 |
| 27 | -2 | HOU | 0.321 | 516 | 4.47 | 568 | -21.6 | 590 | 0.437 | -14.8 | 0.412 |
| 28 | 0 | PIT | 0.315 | 484 | 4.63 | 575 | 14.4 | 561 | 0.431 | -14.7 | 0.406 |
| 29 | 0 | SD | 0.321 | 524 | 4.75 | 615 | -16.6 | 632 | 0.411 | -15.1 | 0.387 |
| 30 | 0 | CIN | 0.301 | 436 | 4.62 | 586 | 20.8 | 565 | 0.383 | -14.7 | 0.357 |
Offense = wOBA (park-corrected), eRS (estimated runs scored; wRC from FanGraphs, then park adjusted)
Pitching = tERA and tRns are a home-brew version of Graham MacAree's statistic.
Fielding = Fld: average of bUZR (from FanGraphs) and THT's batted balls statistic (converted to runs)
eRA (estimated runs allowed) = Pitching - Fielding
eW%lg = estimated winning percentage within the specific league (AL or NL)
LgAdj = league adjustment (bonus to AL teams, penalty to NL teams, because the AL has superior level of play)
eW% = estimated winning percentage if all teams were in one league (after league adjustment)
Methods provided in more detail in the first post in this series
American LeagueTeam Leaders (asterisks indicate teams improving in specific ranking):
Offense (wOBA): Yankees, Angels, Rays National League American League: E=Yankees, C=White Sox, W=Rangers, WC=Rays
Pitching (tERA): White Sox, Red Sox, Athletics*
Fielding (Fld): Mariners, Rangers, Rays
Offense (wOBA): Phillies, Dodgers, Brewers*
Pitching (tERA): Rockies, Braves, Diamondbacks
Fielding (Fld): Giants, Dodgers, Reds*"On Paper" Playoff Leaders (asterisks indicate new leaders):
National League: E=Phillies*, C=Cubs, W=Dodgers, WC=Rockies
This Week's Movers
The only change this week in terms of our on-paper playof leaders is that the Phillies unseated the Braves from their one-week stint at the top of the NL East. The Braves actually had the better week from the standpoint of wins, but the component statistics that are the basis of our rankings pushed the Phillies to the top. In reality, it's effectively tied--they're only separated by a 0.003 winning percentage. As mentioned in the comments last week, the Marlins may be closer to the Phillies in the actual standings, but both actual and estimated runs show that they've been outscored on the season. So far, they're hanging in there, but my feeling is that if anyone is going to close the gap, it'll be the Braves. They're going to have to make up some ground real quick, though...
Another mismatch occurs in the AL Central, where the White Sox are currently three games behind the Tigers in the actual standings. The BtB Power Rankings, on the other hand, has the White Sox up to #5 (expected W% in the AL of 0.538), while the Tigers lag behind at #15 (eW%lg of 0.481). If component statistics give you a better idea of a team's true talent level than their actual runs scored and allowed, the Tigers may be a bit of an overperformer thus far. Their wRC is 17 runs shy of their actual runs scored total. Similarly, their estimated runs allowed, based on tRA, is 20 runs higher than their actual total. As a result, while their PythagoPat record is 0.514 (essentially the same as the White Sox'), their estimated W% in the AL is just 0.481.
I thought it was also worth mentioning that the Holliday-lacking Athletics have climbed all the way to #16 in our rankings, largely on the backs of their pitching staff. I have their staff at a 4.16 tRA, and a 4.29 FIP. At 20 games behind the Angels, they're not going anywhere this season. But Beane has assembled a nice, young, cheap core of a pitching staff. If only they could find some offense...
0 recs |
26 comments
|
Comments
Yeah!
there’s no doubt the Cubs are going to win the Central!
by chuckb on Aug 19, 2009 10:03 PM EDT reply actions 0 recs
The Diamonbacks are bending my mind
into a pretzel!
godfather of futureredbirds.net
by erik on Aug 19, 2009 11:53 PM EDT reply actions 0 recs
Freakin' defense.
Gah.
@bs_uf15bosox9be The Original Gameday; Learn to use SB Nation
by bs.uf15bosox9bears23 on Aug 20, 2009 12:23 AM EDT reply actions 0 recs
Rally
I’m used to these sort of things showing the Angels lower than their actual record, but I can’t take it seriously until they at least move ahead of Toronto.
The HK-47 hitting droid is the finest line drive machine ever built
by RallyMonkey5 on Aug 20, 2009 9:34 AM EDT reply actions 0 recs
Don't expect to get any response to this. It's a little like Fight Club.
Refusing to address why our statistical models lack the proper inputs to describe the world-as-it-is (especially as it relates to the sabermetric black swan of Lanaheim) is the first law of BtB.
That pleasant plashing does not exist. There is no black swan. Don’t talk about the black swan!
by Turks Teeth on Aug 20, 2009 1:24 PM EDT up reply actions 0 recs
So glad you stopped by just for this comment.
I’d delete it, but it lets me make a point:
Justin and others are more than happy to discuss where their models and assumptions are incomplete. We know baserunning isn’t included. We know bullpen leverage isn’t included. We know we’re not accounting for style of offense. We know fielding metrics are painting with wide brushes. We know strength of schedule (other than league) isn’t included. Those have all been discussed before and we’re not trying to hide them. There are also inadequacies we’re not aware of. We’d love to find out about them and find solutions to fix them all.
Anything constructive you have to add to the conversation is more than appreciated. If you’d like to stay on topic, great. If you’d like to discuss with me any problems you have with BtB, my email is available, always has been.
Beyond the Boxscore Not a member? Sign up.
by Sky Kalkman on Aug 20, 2009 2:13 PM EDT up reply actions 0 recs
We are actually having a big discussion about the BTB power rankings over at VEB today
Obviously most of the Cardinals fans feel a little annoyed by the fact it only has us as the 19th best team in baseball and that it says we are a sub .500 team.
Stat Whore
by FlimtotheFlam on Aug 20, 2009 2:37 PM EDT up reply actions 0 recs
I'll head over, but which of Justin's main three pieces do fans most disagree with?
Or is it something else, like the way they’re combined?
Offense: a touch below average (.327 wOBA)
Fielding: a touch below average (-9 runs)
Pitching: solidly above average (4.11 tERA)
(All pre-league adjustment, natch.)
Beyond the Boxscore Not a member? Sign up.
by Sky Kalkman on Aug 20, 2009 3:01 PM EDT up reply actions 0 recs
I have seen tERA
Under estimate ground ball teams
Stat Whore
by FlimtotheFlam on Aug 20, 2009 4:20 PM EDT up reply actions 0 recs
I've also used FIP in the past
I moved to tRA because it takes just about everything into account, rather than just K, BB, and HR/9.
As I said below, though, using FIP instead of tRA still doesn’t get the expected runs allowed back to reality. It does help, but that’s to be expected when you’re dealing with one of the larger outliers.
-j
My blog: Basement-Dwellers.com
Also: Beyond the Boxscore
by JinAZ on Aug 20, 2009 10:00 PM EDT up reply actions 0 recs
How?
tERA includes batted ball data.
Thanks
by vivaelpujols on Aug 21, 2009 12:15 AM EDT up reply actions 0 recs
I posted this over there...
Wow, this is a pretty angry conversation about something that I’ve been doing most of the season. I will say that I have noted that the strongest critiques are almost always from fans of teams ranked poorly. FWIW, my Reds come in last, while my most-hated team, the Yankees come in first. ::sigh::
Anyway, some points:
1. The league adjustments are based on interleague records, and (loosely) the studies done on players swapping leagues. The two match up. And those league-change studies are the basis of the typical adjustment of a half-win a season for AL players vs. NL players most people use (though FanGraphs does nothing at this point, afaik).
2. If you don’t like the league adjustments, just look a the eW%lg column. You can click on it to sort by it. That column essentially assumes no difference in quality between leagues. I don’t think that’s true, but it makes some people happier.
3. If you’re going to compare actual records to expected records, you really should use the eW%lg column. It’s quite literally a component pythagenpat estimate of W%. The eW% column assumes league adjustments, which is much more hypothetical (what if you took all the AL and NL teams and threw them into one league).
4. Sorry to hear some folks don’t like the name of the list. This is the first time in 3 months someone has objected to it, so I think I’m going to stay with it.
5. As Sky mentioned at BtB, there are a lot of things not in the calculations yet: baserunning (beyond SB & CS), bullpen leverage, and strength of schedule being the biggest of them. And there are reasonable criticisms of how I’m implementing tRA, park effects, etc. Happy to talk about that. My feeling is that most of those things, if implemented, will have pretty small effects. But I don’t know that.
6. As for the Cardinals, through Tuesday, here’s what I have:
True W%: 0.562
Pythag W%: 0.548
eW% lg (without league adjustment): 0.504
So yes, definitely a “miss” low. In fact, it’s the second-largest “miss” in baseball, behind only the Giants. Why? Let’s look at runs scored and runs allowed:
Actual RS (after park adjustments): 556
Estimated RS (after park adjustments): 542
So eRS is 14 runs shy of the actual total. I don’t know why that is—I literally just take actual RS and wRC from fangraphs and park-correct both of them in the same way. Could be lucky, could be baserunning, could be something else. That’s not a small disparity, but it’s not huge either at this point in the season. We can add in some baserunning, but BPro has them (looking it up) at +1 run above average, so that doesn’t help much.
Actual Runs allowed (park adjusted): 502
Estimated RS (park adjusted): 537
So there’s the biggie. I’m estimating 35 more runs allowed than actually has happened. Looking quickly, the Cards tERA is 4.11 whereas their FIP (as I do it, anyway) is 4.04. Pretty close match there, but if I used FIP instead of tRA to estimated runs allowed it comes in at 523 RA. Definitely better.
Fielding-wise, UZR at fangraphs has them (or did through Tuesday) at -13 runs. THT’s team fielding stat, which is a mesh of DER and batted ball data, has them at -3 runs. I’m using the average of the two, which is -8 runs, in the power rankings. You could argue we should drop down -3 runs if you don’t like UZR, though I think UZR’s a better fielding estimate than THT’s (and studes has concurred with me on that point in the past). Thinking about it, though, I’m not including Molina’s defense in any of this, so I think that would help justify going to the -3 runs.
So if we do everything in a direction that makes the Cards look better (which is not exactly the objective way to make a decision, but whatever), we get an estimate of 518 runs allowed vs. 543 estimated runs scored. That would put the component pythag at 0.524. Closer to reality than the 0.504 I had before, but still shy of the actual W%. And again, I’m doing everything here to make the Cards look better, which almost certainly will have the opposite effect on other teams. In other words, these decisions aren’t necessarily the “correct” ones just because the make the Cardinals look better.
So, I guess what I’m saying is that why I can certainly sympathize with folks being upset that your team is ranked lower than you’d like, I think it is the case that, at least based on the component stats I’m using, that the team might be “overperforming.” Whether that’s due to lucky, leverage, or something else, I have no idea. But that’s what the BtB power rankings do—they give you something else to look at when evaluating the relative strength of teams.
Hope that helps. And look on the bright side—with all of your additions this season, the Cardinals are almost certainly a better team than their component stats would indicate! :) Enjoy the ride.
-j
My blog: Basement-Dwellers.com
Also: Beyond the Boxscore
by JinAZ on Aug 20, 2009 4:19 PM EDT up reply actions 0 recs
How do you do Park Adjustments
Based on data from this year or multiple years?
Stat Whore
by FlimtotheFlam on Aug 20, 2009 4:26 PM EDT up reply actions 0 recs
Depends on the stat
Though all are multi-year, regressed park factors.
For straight-up runs data—actual runs scored and allowed, or wRC from fangraphs, I use Patriot’s 5-year regressed park factors. Stadiums where we don’t have 5 years yet (like Busch, i think) get more severe regression.
For tRA, I use David Gassko’s batted ball park factors. If I remember right, they are based on 3-years of data and are regressed (though he didn’t elaborate on how he did the regression).
UZR has built-in park factors. THT’s team fielding stat does not have any AFAIK, and I’m doing nothing to do those data.
-j
My blog: Basement-Dwellers.com
Also: Beyond the Boxscore
by JinAZ on Aug 20, 2009 9:58 PM EDT up reply actions 0 recs
You know who Rally is, right?
He’s one of the original Project Mayhem members,basically, if you want to continue this metaphor.
@bs_uf15bosox9be The Original Gameday; Learn to use SB Nation
by bs.uf15bosox9bears23 on Aug 20, 2009 11:14 PM EDT up reply actions 0 recs
So good to see
You’ve read Nassim Nicholas Taleb. Sadly, I’m not sure you quite understood the thesis of his book.
According to Taleb, black swan events are statistical outliers, which is to say that they occur very infrequently. If you’re arguing that this ranking undervalues the Angels on a consistent basis, then that is most certainly not a black swan case. You might be arguing that it is a systematic error, which is an entirely different matter.
Furthermore, the reason black swan events are difficult to model is because the amount of time for which we have data in the financial world is limited compared to the frequency of such events. This is simply not the case when it comes to baseball statistics.
I would actually recommend the book “Fooled by Randomness,” also by Taleb, to you. I think you might find it enlightening.
by Tommy Bennett on Aug 22, 2009 7:01 PM EDT up reply actions 0 recs
Don't feel bad
The eW% of the Cardinals is only 22 games off the real record
Stat Whore
by FlimtotheFlam on Aug 20, 2009 1:28 PM EDT up reply actions 0 recs
don't use eW% when comparing to actual records
Use eW%lg. It doesn’t make sense to use eW%, as that assumes that all the NL and AL teams are in the same league (i.e. it’s a hypothetical estimate).
-j
My blog: Basement-Dwellers.com
Also: Beyond the Boxscore
by JinAZ on Aug 20, 2009 4:20 PM EDT up reply actions 0 recs
Rally
I will try to adjust all of my methods to make the Angels look better. Or at least make the Blue Jays look worse.
-j
My blog: Basement-Dwellers.com
Also: Beyond the Boxscore
by JinAZ on Aug 20, 2009 4:21 PM EDT up reply actions 0 recs
Quick question
And may have been answered, how are you accounting for the Yankees new ballpark, if at all?
by ETA on Aug 20, 2009 7:33 PM EDT reply actions 0 recs
Baserunning
I don’t mean anything personal here. I appreciate BTB and Justin’s work. Nothing came close to getting me angry, except for the “I’d delete it” crack.
Sometimes Ihave to take a step back from the model and wonder if it is helping understand reality a little better, or distorting. The Angels being essentially even with the Blue Jays is that sort of moment.
If you want to include a team baserunning metric, here’s a great place to start:
http://www.baseball-reference.com/leagues/AL/2009-baserunning-batting.shtml
Even cooler is how easy Sean Foreman has designed it for moving into a spreadsheet. Angels have scored from 2nd on singles 124 times, no other team even has 100 yet. Though they also lead in outs on the bases. My initial estimates show the Angels with a net of about 10 runs through baserunning, though I’d want to play around with it some more.
The HK-47 hitting droid is the finest line drive machine ever built
by RallyMonkey5 on Aug 21, 2009 9:03 AM EDT reply actions 0 recs
Baserunnning is on the to do list
I was just planning to pull BPro’s baserunning stat—it’s Dan Fox’s work, which should mirror your own and John Walsh’s, I think (?). Omitting stolen bases (since they’re already included in the model), the Angels are about +8 runs. Doesn’t help a ton, but it does help.
I will try to get this going soon. I’ve just been working on other things of late. I might be able to do it for next week, though.
FWIW, I don’t think the “I’d delete it” was directed in your direction…
-j
My blog: Basement-Dwellers.com
Also: Beyond the Boxscore
by JinAZ on Aug 22, 2009 6:11 PM EDT up reply actions 0 recs
Pretty sure it wasn't.
Sky was talking about the BtB-hater, it seems to me.
@bs_uf15bosox9be The Original Gameday; Learn to use SB Nation
by bs.uf15bosox9bears23 on Aug 24, 2009 1:56 PM EDT up reply actions 0 recs
Nice to see the White Sox up there in the rankings...
too bad they can’t take thier impressive stats to the field at the same time. Starting pitcher throws a gem = bull pen blows it and vise versa.
"you should go back to your cowardly practice of offering no opinion of your own."
-picktoclick
by Where Triples Go to Die on Aug 22, 2009 5:17 PM EDT reply actions 0 recs
Timing...
My blog: Basement-Dwellers.com
Also: Beyond the Boxscore
by JinAZ on Aug 22, 2009 6:11 PM EDT up reply actions 0 recs













BtB on Facebook

















