Bill James and the Cy Young Awards
Yesterday I decided to begin reading a few more of the essays in The Neyer/James Guide to Pitchers (I would highly recommend any book by either to anyone who even mildly likes baseball) so after I got through the Bob Friend and injury pieces I ran across one entitle "E=M CY Squared." (Page 467) Amazingly the piece is quite timely since one of the hot topics right now is the Cy Young races, and in this article, by Bill James, he created a formula that is about 80% accurate in predicting the Cy Young winners and does a nice job picking the top three overall.
Naturally I decided this could be worth replicating on site. If nothing else we have an 80% chance at nailing them. To begin here's the formula, per James:
Wins multiplied by six, then minus losses times two, then add strikeouts divided by 12, then add saves times 2.5, then add shutouts, then add runs saved versus a pitcher with a 5 ERA, then add 12 points for a first place team.
For runs saved I took the ERA, divided it by nine, then times it by the innings that pitcher pitched. I am assuming this is correct and there is no pre-set measure on innings for the 5.00 ERA pitcher. As for my sample group I simply went to THT and took the pitchers with 10 or more wins from each of the leagues (with one very special exception). Since simply giving the top three isn't fun I will provide the top five and then post the entire list at the conclusion of this article.
First up, due to alphabetical order and such, the American League.
| Pitcher | Team | Points |
|---|---|---|
| Lee | CLE | 205 |
| Halladay | TOR | 162 |
| Matsuzaka | BOS | 158 |
| Lester | BOS | 147 |
| Santana | LAA | 146 |
This probably is not too much of a surprise. Lee runs away with the award due to his wins and runs saved. Lee is having an incredible season in every regard. Even if most of his success has come against weaker foes I’m not sure you can discredit his victory. After all Lee’s tRA is almost a half run better than Halladay; 2.86 to 3.35. What’s amazing is that Lee was offered this past off-season for Delmon Young yet the Rays passed. There’s no guarantee he repeats this performance on any other team but talk about a diversity in seasons between those two.
The first place team bump puts Matsuzaka far too close to Halladay despite Halladay being the supreme pitcher in every regard except losses. That is perhaps the trickiest part about this. James knows and takes into account the mainstream voter’s mindset. The award is not truly about who is the best pitcher, otherwise win-loss records would obviously be the first thing discarded. To the voters Matsuzaka is a good pitcher, but to the more informed, he is okay, but not even a top 20 American League starter.
Onto the National League…
| Pitcher | Team | Points |
|---|---|---|
| Lincecum | SF | 176 |
| Webb | ARI | 160 |
| Sabathia | MIL | 152 |
| Dempster | CHN | 146 |
| Santana | NYN | 146 |
As is the case in the A.L. neither league has a pitcher on a first place team in the top two spots. Lincecum is incredible however that shutout was not the cake but rather the sprinkles. His strikeout numbers are beyond amazing and on an awful team, going 17-3 is quite the accomplishment.
Then there is Brandon Webb. His team has a legitimate shot at the playoffs which would help his case, yet even if the Diamondbacks were in first he would need to make up four points on Lincecum somehow. Perhaps winning out with Lincecum losing out and not striking anyone out would help. Of course relying on Lincecum to fail all but seals Webb as the runner up this year, but I hear Bruce Bochy wants to ensure that this will not be a problem in the future.
Now, Sabathia. With his N.L stats Sabathia finished with 102 points, blame only nine wins – despite the limited playing time – for that. I had a bit of a debate with someone when discussing whether I should include his Indians stats as well and we were split. He argued that voters should only talk about what he did in the N.L. due to the league specific award, I argue that you simply cannot ignore half of his season. Since I am the one writing this I included them. Whether the writers will or will not consider his Indians half is irrelevant to me.
The rest: American League
| Pitcher | Team | Points |
| Lee | CLE | 205 |
| Halladay | TOR | 162 |
| Matsuzaka | BOS | 158 |
| Lester | BOS | 147 |
| Santana | LAA | 146 |
| Burnett | TOR | 124 |
| Mussina | NYA | 123 |
| Saunders | LAA | 123 |
| Floyd | CHA | 123 |
| Shields | TB | 122 |
| Lackey | LAA | 110 |
| Buehrle | CHA | 104 |
| Danks | CHA | 101 |
| Kazmir | TB | 100 |
| Garza | TB | 99 |
| Garland | LAA | 98 |
| Sonnanstine | TB | 96 |
| Galarraga | DET | 95 |
| Beckett | BOS | 95 |
| Vazquez | CHA | 93 |
| Duchscherer | OAK | 92 |
| Greinke | KC | 87 |
| Slowey | MIN | 86 |
| Weaver | LAA | 83 |
| Litsch | TOR | 80 |
| Perkins | MIN | 80 |
| Padilla | TEX | 79 |
| Guthrie | BAL | 78 |
| Jackson | TB | 77 |
| Meche | KC | 76 |
| Pettitte | NYA | 75 |
| Blackburn | MIN | 72 |
| Verlander | DET | 45 |
| Hernandez | MIN | 41 |
The rest: National League
| Pitcher | Team | Points |
| Lincecum | SF | 176 |
| Webb | ARI | 160 |
| Sabathia | MIL | 152 |
| Dempster | CHN | 146 |
| Santana | NYN | 146 |
| Billingsley | LAN | 140 |
| Volquez | CIN | 138 |
| Zambrano | CHN | 130 |
| Hamels | PHI | 122 |
| Sheets | MIL | 122 |
| Haren | ARI | 118 |
| Oswalt | HOU | 117 |
| Lilly | CHN | 117 |
| Nolasco | FLA | 116 |
| Lowe | LAN | 112 |
| Cook | COL | 109 |
| Moyer | PHI | 106 |
| Lohse | STL | 100 |
| Jurrjens | ATL | 98 |
| Wellemeyer | STL | 94 |
| Pelfrey | NYN | 93 |
| Hudson | ATL | 89 |
| Arroyo | CIN | 83 |
| Maine | NYN | 79 |
| Perez | NYN | 78 |
| Looper | STL | 76 |
| Johnson | ARI | 71 |
| Myers | PHI | 71 |
| Moehler | HOU | 69 |
| Parra | MIL | 69 |
| Jimenez | COL | 65 |
| Marquis | CHN | 62 |
| Redding | WAS | 61 |
| Suppan | MIL | 52 |
| Kendrick | PHI | 46 |
References
The Hardball Times stats database
The Neyer/James Guide to Pitchers (buy it!) page 467.
0 recs |
9 comments
Comments
sounds good to me
go Timmy!
Still defending Rich Aurilia, and the Niners' classic unis
by wjackalope on Sep 16, 2008 1:07 PM EDT reply actions 0 recs
I am cautiously optimistic for Timmy. The final vote will seem to prove if September really is what the voter’s focus on or not.
"Ice is made for two things: injuries and my drinks." The Lincecums
by igotnothing on Sep 16, 2008 2:02 PM EDT reply actions 0 recs
standings on ESPN
ESPN keeps track of this too:
http://sports.espn.go.com/mlb/features/cy
Your list seems to be missing the relievers, but I’m not sure about the details of ESPN’s calculations.
by Trey Hillman's Chin on Sep 16, 2008 2:30 PM EDT reply actions 0 recs
Huh, I had no idea ESPN had that.
Thanks!
by R.J. Anderson on Sep 16, 2008 2:37 PM EDT up reply actions 0 recs
Sure thing,
The other thing I noticed that they’re doing something funky with the 12 point bonus. It seems that the Angels’ pitchers get the full bonus, presumably because they’ve clinched 1st place. The Cubs’ pitchers get the full 12, but I don’t think they have even clinched (at least) the wildcard (am I wrong here?). Other contending teams seem to get between 5 and 7 points. Presumably these will all be 0 or 12 in the end.
Also, for the runs saved vs. 5.00 ERA pitcher, they use ((5-ERA)*IP/9) which I think is the same thing you described.
As for the leaving out the relievers, I think I like your list better. I think the 2.5 weight of the save is a bit high. There are way too many closers on all of those lists on ESPN’s site. Does James say anything about that? I wonder what the accuracy would be if you simply ignored saves and conceded the error in years where a closer wins. Since 1990, a closer has only won once in each league.
by Trey Hillman's Chin on Sep 16, 2008 4:33 PM EDT up reply actions 0 recs
From what I can recall looking at his lists in the book
I didn’t remember seeing any closers listed. I’d have to check later later though, perhaps Gagne?
The bonus is supposed to be only for first place teams from my understanding. I assume that does not include the wild card because that team is obviously in second in their division. I don’t really know what they’re doing with the Cubs points. That’s odd.
The ERA formula would work out equal, I just got the runs amount and subtracted the allowed total by the pitcher. Their formula is less work, but I wanted the numbers for comparison sake.
I don’t particularly see any closer being good enough to win Cy Young, although I likely should’ve included them anyways, but oh well.
by R.J. Anderson on Sep 16, 2008 5:17 PM EDT up reply actions 0 recs
KRod's high save total could certainly make a big difference in voting
he’s second over at ESPN, closer to Cliff Lee than Roy Halladay. and that’s my guess how the voting turns out (not how i would vote, though):
Lee
KRod
Halladay
Beyond the Boxscore // Calling BJ Upton lazy is lazy.
by Sky Kalkman on Sep 17, 2008 9:57 AM EDT up reply actions 0 recs
My guess is that ESPN uses their playoff odds report to split the 12-point bonus between teams with a chance of making the playoffs
Beyond the Boxscore // Calling BJ Upton lazy is lazy.
by Sky Kalkman on Sep 17, 2008 9:55 AM EDT up reply actions 0 recs

by 













BtB on Facebook














