Manager Wins Above Expectancy

by adarowski on Mar 28, 2026 9:01 AM EDT in Visuals Above Replacement

Click to enlarge

Enough people have asked me if a "WAR for managers" exists that it's time to share some ideas I've been kicking around. I'll say up front—in this article I'm going to present more questions than answers. My main goal is to see if something like WAR for managers is even possible.

How do we evaluate a manager's career today? Honestly, it sounds a lot like how we used to evaluate pitchers—wins, winning percentage, championships. We found better ways to evaluate a pitcher's value. Can we do the same for managers?

Let me throw an idea at you—Wins Above Expectancy.

Wins and championships simply tell us that a manager had a lot of talent on his team. If he has a lot of talent, he is supposed to win. But how about when a talented team doesn't win? What about when a team wins more than it should, based on its runs scored and runs allowed? How much is actually luck? If it is all luck, wouldn't it even out over thousands of games? Is there more than luck involved? Is the manager responsible for some of this over- or under-performance?

Pythagorean Record

You're probably familiar with pythagorean record—the record a team is expected to finish with, based on its runs scored and allowed. If a manager's team over-performs its pythagorean record, is it all luck? Did the manager manage his bullpen well? Did the manager pinch hit and substitute well? What if that manager routinely out-performs his expected record? Is he just that good?

Here are the top 25 managers of all time, by Wins Above Expectancy (based on pythagorean record):

29.4 Joe Torre
28.6 Ralph Houk
25.5 Mike Scioscia
24.0 Bobby Valentine
23.3 Walter Alston
22.5 Dick Howser
21.6 George Gibson
21.3 Tony LaRussa
20.9 Felipe Alou
20.5 Ron Gardenhire
20.3 Earl Weaver
19.8 Bill McKechnie
19.5 Sparky Anderson
17.7 Frank Robinson
17.0 Wilbert Robinson
17.0 Chuck Dressen
16.9 Whitey Herzog
16.6 Don Zimmer
16.5 Bruce Bochy
15.7 Lum Harris
14.7 Ozzie Guillen
14.6 Bobby Cox
14.5 Jimmy Dykes
13.7 Jim Marshall
13.7 Ossie Bluege

Joe Torre finishes first. We also have several managers with excellent reputations like Mike Scioscia, Tony LaRussa, my beloved Earl Weaver, Sparky Anderson, and Bobby Cox on the list.

At the bottom, we have:

-48.0 Bucky Harris
-47.9 John McGraw
-36.0 Jimmie Wilson
-35.9 Fred Clarke
-30.5 Hugh Duffy
-27.3 Eric Wedge
-25.9 Connie Mack
-25.5 Jimmy McAleer
-25.0 Jim Riggleman
-24.5 Buddy Bell
-23.6 Tom Lasordaa
-21.9 Jimy Williams

I'm a little surprised to see Tom Lasorda and John McGraw here. Connie Mack, of course, managed some really lousy teams after selling off his stars.

Is there another way we can look at managers to see if their teams are consistently doing better than they should?

Wins Above Replacement

I love WAR, so why not a WAR-based approach? By Baseball-Reference's WAR, a replacement level team has a winning percentage of .320 (52–110). If all players on the team totaled 40 WAR, then that team should be expected to win 92 games. But what if they win 96? Let's give those wins to the manager. If they win 88? Take those wins away from the manager.

Here are the top 25 managers of all time, by Wins Above Expectancy (based on WAR):

84.1 Wilbert Robinson
80.7 Mike Scioscia
67.9 Connie Mack
67.4 Bruce Bochy
62.4 Bobby Cox
53.4 John McGraw
53.0 Sparky Anderson
52.8 Felipe Alou
51.5 Al Lopez
50.0 Frank Chance
44.6 Ossie Bluege
43.4 Bill McKechnie
41.7 Ralph Houk
40.7 Fred Clarke
37.5 Jimmy Dykes
37.4 Lou Boudreau
35.2 George Gibson
33.6 Pat Moran
32.3 Joe Cronin
31.0 Dick Howser
30.9 Tony LaRussa
29.3 George Stallings
29.1 Jim Tracy
29.0 Miller Huggins
28.1 Earl Weaver

And the bottom 12:

-63.1 Gene Mauch
-52.2 Eric Wedge
-47.0 Mike Hargrove
-45.0 Lou Piniella
-39.9 Buddy Bell
-38.6 Chuck Tanner
-36.7 Cito Gaston
-32.5 Stan Hack
-31.6 Tom Kelly
-28.8 Billy Meyer
-27.2 Phil Garner
-26.4 Walter Alston

There are some huge differences between the lists. Most noticably, Connie Mack and John McGraw jump from the bottom of the list to the top. Walter Alston makes the opposite leap. Also, our #1 manager by pythagorean record, Joe Torre, finishes 284th by WAR (–6.1 wins).

There is, however, quite a bit of consistency. Mike Scioscia, for example, ranks third by pythagorean record and second by WAR. In fact, he is one of nine managers to rank in the Top 20 in both lists. That list includes Hall of Famers Wilbert Robinson, Sparky Anderson, and Bill McKechnie. It also includes current or recent managers Bruce Bochy and Felipe Alou. Ralph Houk is another on the list (and he has some of the best Hall of Fame credentials of any manager on the outside of the Hall). Rounding out the list are who managers who didn't even reach 1000 games—Charles Gibson (who managed in the 1920s and 1930s) and Dick Howser (who tragically passed away at 51 less than two years after winning the 1985 World Series with the Royals).

You can see the numbers for all managers by visiting my Wins Above Expectancy page on the Hall of wWAR site. As I dig more into this, I'm sure this page will evolve. For now, it is a data dump.

Some caveats about the data:

I mostly ignored seasons with managerial changes mid-season. The exception is if one of the managers managed all but 10 games or less. In those cases, I just gave the entire season to the primary manager.
For all seasons through 2009, I pulled data from Sean Smith's historical WAR spreadsheets. There may be some minor inconsistencies with Baseball-Reference as the data is older.

Now is the part where you tell me what you think of this. It isn't really "WAR for managers". It's more like "These are the managers who often won more than expected and at least part of that is probably because they were good." Thoughts?

Tweet Comment 28 comments | Add comment | 1 recs |

Do you like this story?

Comments

Thoughts:

If a manager was able to manage close games better than “average” over a long period of time, he would rank quite well in this scale. If a manager won a lot of blowout games and lost more close ones than “average” he would do poorly. To me, this is a question of managing style and philosophy, rather than how good a manager actually is.

Let’s take a hypothetical example to illustrate what I mean. It’s the 7th inning and the visiting team is down 1 run. There’s 0 outs and a man on 1st and 2nd. Small-ball manager might be included to bunt and hope one of the next two players get a hit, going up by one with the idea of winning the game in the 8th and 9th with his relievers. Big-run manager decides to let his players swing away, with the idea that a 3 run HR or similar feats will lead to a big inning, which will make the home team unlikely to be able to score enough to win in general. Let’s also suppose that both are successful in their respective strategies 50% of the time (that is given 100 such games, both will win 50 and lost 50).

Small-ball manager will have far more 1 run wins and far fewer 5 run wins simply because of the style in which he used the outs and hitters he had, and Big-run manager would have many more big wins and far fewer 1 win wins of the 50 games each won. But both won 50 of the 100 games available to them, so each were equally successful in the job a manager actually has: winning ballgames.

My point is that both your systems seem to penalize the Big-run manager, because the big wins he is able to manage through his style will suggest the team should have actually won more games than 50, while the Small-ball manager will be awarded because due to his limiting the totally number of runs his team scores your systems is underrating how good his teams actually were.

Take this simple illustration and multiply it by the thousands of situations in which different philosophies might yield different inputs for your measure of “good vs. bad” managers, and you begin to see that any such simple system is going to only give a very very misleading at best picture of how good a manager actually is. My concern is that simply because you can use available data to rank managers, you will suppose the ranking actually means something in terms of how good the managers are at winning baseball games without considering the correlative aspects of the data you are comparing.

"Difficulties strengthen the mind, as labor does the body."
― Seneca

by NJBammer on Mar 28, 2026 10:15 AM EDT reply actions

Thanks for the comment

I can see how your concerns would apply to the pythagorean approach, though my gut says these types of things would even out over a large sample. Meaning when that big offense team isn’t losing by 3, they’re winning by 3, evening things out.

I don’t get how the same concern applies to the WAR approach, though. That’s not a strict runs scored and runs allowed approach. That’s taking the total value of player contributions, so it is far less situational. I may be missing something though.

Creator of the Hall of wWAR • @baseballtwit on Twitter

by adarowski on Mar 28, 2026 11:41 AM EDT up reply actions

Though my specific example

certainly concerned the Pythag flaw, my overall point was probably not communicated well. A manager can influence the outcome of the game in ways which are hidden by the WAR and the pythan approach both.

The address your first concern that these things would even out, I think the longer a manager works the more distorted the counting stat would be. As I illustrated, a manager specifically following a stretegy of winning by 1 run will not score the same number of runs as a manager trying to score as many runs as possible.

Stated bluntly, players DO NOT play in a vacuum. The decisions of the manager influences their conrtibutions, and if a manager consistently follows a strategy which influences the players’ total contributions the WAR itself will be influenced by the manager, and thus will be tainted as a means of evaluating the manager’s ability to win games independent of WAR.

Let’s take something like defense instead. Manager A never shifts his defense against players like Ryan Howard, and Manager B always does. The pitchers for manager A will have better numbers against Howard for reasons totally outside of their own control. How will your WAR system adjust for that? What about a manager who notices an extra bounce in the step of Derek Jeter, and concludes that it’s unsafe to pitch to him in a critical AB, and walks him instead of giving up a HR? How about if the manager knows his ace reliever went on a bender last night and won’t be available to face Prince Fielder in the 9th, and so manages his team with such information in mind? How can any system which only measures outcomes possibly evaluate the thousands of unknowns a manager has to judge to get the most out of his team?

We’re getting past the area where specific examples may make my point well enough. All I can say is I refer you to the excellent book “How we Decide” by Johan Lehrer. In it, he illustrates much more clearly what I’m trying to express: an experienced person can process and evaluate many variables which are unknown and unknowable to any computer system we currently have access to, including body language, how hard the wind is blowing, how sharp you think your closer looks, how long the season is, etc., etc. This is sometimes called “his gut” by old school managers and has been ridiculed by and large, but there’s actual science behind it. When you consider how complex the job of managing is, trying to break it down into “how many wins over some measurable mark” gives a quite misleading ability to truly evaluate a manager’s true contribution to winning and losing, to my way of thinking.

"Difficulties strengthen the mind, as labor does the body."
― Seneca

by NJBammer on Mar 28, 2026 12:07 PM EDT up reply actions

Thanks.

Just to clarify, regarding this:

How can any system which only measures outcomes possibly evaluate the thousands of unknowns a manager has to judge to get the most out of his team?

I did write:

It isn’t really “WAR for managers”. It’s more like “These are the managers who often won more than expected and at least part of that is probably because they were good.”

I know we won’t get all the way to a WAR for managers, for many of the reasons you give. To be honest, many are flaws with actual WAR for players, too. I mean, look at relievers. They don’t accumulate much WAR. Is it their fault? If their managers put them in as much as Goose Gossage pitched, they’d accumulate more WAR.

I’m just trying to see if these tables give us a little something to go by—do we notice any trends from the tops of the lists and the bottoms.

Thanks for the insight!

Creator of the Hall of wWAR • @baseballtwit on Twitter

by adarowski on Mar 28, 2026 3:09 PM EDT up reply actions

Thanks for the response

I share your worries about how to evaluate players such as relievers, pitchers, managers, etc. I was once a very dedicated stathead and great believer in the emperical approach. But I slowly developed a skepticism towards such attitutes as I began to see how easy it is to come to the wrong conclusions from any database by approaching it with the wrong questions.

I think a question like “How can we evaluate managers given the data we have” is really not a good question. Rather the question is “Can we evaluate managers given the data we have?” I think the answer is “Not really very well, at all.” My concern for efforts is sometimes we think because we can quantify one very small part of a manager’s job, that tends to give us the notion that means we can evaluate managers using those tools. We really can’t.

My illustration supports a very glossy look at your results. What do most of the modern managers we see have in common? Most are (I think) considered small ball managers, and it stands to reason they would manage in a style which would limit the number of runs as they sacrifice outs for runs in high leverage situations. Is this more successful than going for big innings late in the game when you are down by 1 run? I don’t think the data supports than.

I remember looking into this very question years ago when I noticed that Dusty Baker always seemed to have a record much better than his team’s Pythag record indicated. I was young and green and assumed this meant he was underrated as a manager. It was only when I realized the severe shortcomings he had as a manager that maybe he wasn’t really a good one after all. I also came to understand the very style which allowed his teams to outperform in the short run frequently ended up hurting them in the long run. Take his use of starting pitchers - by leaving in a great pitcher for a very high pitch count he was able to win more ballgames (especially close ones) but this style tended to land his best pitchers on the DL more, which hurt his team overall in a way which wouldn’t penalize him if all we looked at was his performance in close ball games.

I guess that was the beginning of my doubts about purely emperical approaches the analyzing baseball - what we don’t know is so much more than what we do it makes asking the right questions quite difficult, and makes even knowing if the question we ask is the right one or not a job nearly impossible, even in the few cases it is undertaken.

Thanks for indulging my own bloviating about this matter, I always read this site and often find a lot of interesting tidbits.

"Difficulties strengthen the mind, as labor does the body."
― Seneca

by NJBammer on Mar 28, 2026 3:25 PM EDT up reply actions

What if this was only the beginning?

In my data set, I have the runs scored and allowed by year for all years. I could conceivably compare these to the run environments of the league. I could also conceivably factor in 1-run games. This is just a first stab. We’ll never get 100% there, but I wouldn’t mind trying to get a little closer.

Even if we shift our goal from finding the best managers to finding those who’s teams constantly outperformed their numbers, I think that is still good to know.

Thanks again.

Creator of the Hall of wWAR • @baseballtwit on Twitter

by adarowski on Mar 28, 2026 4:11 PM EDT via iPhone app up reply actions

Yes,

I agree that what you are posting is interesting and that’s why I usually come here to read the things you guys come up with. I don’t usually have much to add which is why I usually don’t comment. As far as a beginning, there’s something to that idea, but I am nervous about drawing conclusions too quickly.

Take the Baker example: is it possible that playing too much for one run wins is actually hurtful in the long run? Or take the Torres example below as another example: is it possible that certain teams respond to big-run managers and certain teams respond to small-ball? If that is the case, what can we learn from a manager’s overall record vs. Pythan/WAR? Does whatever truth we find apply to certain eras but not to others?

As I said, the unknowns are so many and the knowns are so few it makes even knowing if we are asking the correct questions, much less answering them correctly, very difficult.

"Difficulties strengthen the mind, as labor does the body."
― Seneca

by NJBammer on Mar 28, 2026 4:20 PM EDT up reply actions

Managers decide playing time

If a manager chooses to play the old guy who no longer has it over the young star in the making, the metrics won’t show that kind of decision. The manager can choose to play a 0 WAR guy all season when a 3 WAR guy is available, but his wins will still match the “expectation” because it’s based on the contributions of the guys he picked.

So your system may measure some aspects of managerial skill, but we have no idea how much.

by Newcomer on Mar 29, 2026 2:09 PM EDT up reply actions

Right. I pointed that out in a comment below.

Again, the goal isn’t to learn EVERYTHING. Just to see if we can learn SOMETHING. Thanks for reading!

Creator of the Hall of wWAR • @baseballtwit on Twitter

by adarowski on Mar 30, 2025 7:54 AM EDT up reply actions

Love your article, research

I’ve done some research in this area, so that’s probably why I like your article so much. I did not do as extensive a research as you did regarding pythagorean because I didn’ t know how to grab all that data and automate it. I collected them by hand and found basically what you found, I only looked at recent managers, and found simlar names to your list, Torres, LaSorda, Bochy, Alou, and I had the same surprising fact that Lasorda was among the worse in the data set I collected. One name, however, that was on my list back then that you don’t have is Dusty Baker. He must have done pretty poorly after leaving the Giants.

I forgot how I did this (I did this about 8-10 years ago), but I did (or tried to do) the statistical testing to see how significantly away from the mean of 0, and none were at the 90% level, but a good number of them were in the 60-70% level, and that’s still pretty good.

However, one flaw that bugged me and hence why I never followed up on this research is that, for example, Joe Torres was below Pythagorean by a lot when he was managing horrible teams, but then he magically became a good manager when he was managing the Yankees. This lines up with your observation about managing good vs. bad teams. So perhaps Pythagorean is not the best way to measure a manager’s worth, there might be something about the Pythagorean that is under valuing losing teams and overvaluing winning teams.

Another flaw with this system, which is major for your purposes, is that perhaps the manager is doing something that helps the team score more runs and/or prevent more runs allowed. That is not measured at all by the Pythaorean methodology (as the team would win/lose more, in line with the formula), and that, I think, is what kills it as usefulness in measuring a manager’s worth. I think that this measures part of the picture, and thus is worth looking at, but it is not the whole picture and thus not definitive.

I’ve been meaning to post my research on 1-run win/loss record at Fangraphs, but lost my notes, but maybe you can do the more extensive research back into history necessary to check validity, since you seem to have access to automated data analysis. I was just checking Bruce Bochy’s record in 1-run games since both he and Gibson were compiling many more wins in that category in 2011, as the old rubric there is that any manager who wins a lot, will regress to the mean and likely swing back the next season. Since it was relatively easy using BB-Ref, I pulled down team’s record in 1-run games in the NL for a number of seasons, looking at how teams did, and he kept on showing up among the leaders, so I kept on compiling until I covered his whole career, finding that he kept on showing up among the leaders in most games above .500 in 1-run games.

I forgot how many games he was above .500 (my memory recalls something in the 60’s plus I recall him averaging roughly 4 more wins each season he managed), but assuming a .500 mean, he was statisically significant at the 90% level to be above .500 in 1-run games.

Also, I compiled all the leaders by season for his career, IDed the manager for each team, and Bochy was among the leaders each season by far the most of any NL manager during his career, compiling more 8 seasons or more than any other manager, I think he had 40% of all managerial seasons like that, despite only managing roughly 7-8 percent of the managerial seasons available in the NL during his career. The only managers to consistently show up among the leaders were Bobby Cox (early in Bochy’s career, he was done in the 2000’s; apparently Bochy and he are friends, maybe even close friends), Dusty Baker (he was like a Bochy lite, roughly half of what Bochy did), I think LaRussa showed up a bit, but not as much one would think as he’s been hyped as a manager. The only manager with an extended streak of high above .500 seasons was, if I recall right, Bobby Valentine (it was the Mets manager early in Bochy’s career as manager), who had a string of +10 seasons, unmatched by anyone during Bochy’s career, but then he stopped managing in the majors.

Given that your WAR methodology places Bochy in the Top 4 among managers in history, Bobby Cox too, that suggests to me that you might have something there with that methodology in analyzing managers. I agree with the other commenter that there are a lot of factors that play into where a manager contributes value, many very small. But ultimately, unless it contributes to wins, the factors don’t really matter, right? It has to matter at the win/loss level in order for it to matter for measuring this.

My only caveat on the WAR methodology is that I assume you are using defensive valuations of WAR in there, and we don’t really have a system yet that is THE method. There are too many players who look good in one system and bad in another. And given the lack of data going back historically, any analysis will be limited in that way. But I like the names on your list and it is matched up, as you noted, by many same names via the Pythagorean method, as well as my 1-run win/loss record sampling of the NL over the past almost 20 years, so I think that gives your list strong validity.

Adoptive parental unit of Ehire Adrianza.
Godfather of Travis Ishikawa.

"We deserve this" Sabean
"Not here to make friends, I'm here to win games" - Bruce Bochy
Q: "This doesn't happen every year." Posey: "Why not?"
"Do it again Baby!" Huff
"Let's get back to work and make another run at it" Posey

2010's will be known as "Decade of the Giants"

by obsessivegiantscompulsive on Mar 28, 2026 3:29 PM EDT reply actions

Baker rated well, I believe top 30 on both lists.

I noticed what you did about Torre, too. That was in regards to pyth. I’ll see if the same phenomenon happened for WAR.

Creator of the Hall of wWAR • @baseballtwit on Twitter

by adarowski on Mar 28, 2026 4:14 PM EDT via iPhone app up reply actions

Also

One other thing I noticed with the Pythag analysis was that some managers had a horrendous first season then was good afterward. I think Leyland was one of those, had about as bad a negative pythagorean as you can have in his first season, then was positive after that, but his total was masked by how bad he was in his first year, at least in regards to pythagorean differential. So it looks like he’s been a good manager for much of his career (at least up to when I looked at the numbers a long while ago) but it wasn’t reflected in my statistical analysis because his total was so low.

Again, it could have been the losing/negative connection (don’t recall his record), but just another thing I recall noticing.

Adoptive parental unit of Ehire Adrianza.
Godfather of Travis Ishikawa.

"Not here to make friends, I'm here to win games" - Bruce Bochy
Q: "This doesn't happen every year." Posey: "Why not?"
"Let's get back to work and make another run at it" Posey

2010's will be known as "Decade of the Giants"

by obsessivegiantscompulsive on Mar 28, 2026 4:32 PM EDT up reply actions

Addendums

I should have also noted that some other names who showed up poorly using Pythagorean when I did them also was on your list, Jimy Williams, Buddy Bell, and (I think) Jim Riggleman. I also recall Phil Garner on my list of poor managers, though he inexplicably got another managerial job, so maybe that improved his status a bit, like I noted about Leyland.

I also thought it was interesting that Gene Mauch and Walter Alston showed up on your worse list for WAR. Mauch I could understand, maybe that connection with losses might have contributed to that, but Alston (as well as Lasorda) were associated with winning Dodgers teams, so they are big surprises to be near the bottom. Obviously, the connection there is that their teams played in Dodger stadium. How are the stats for more recent Dodger managers?

Oh, and I just noticed Phil Garner there. :^)

Lastly, we share a love for Earl Weaver, so I assume you have the book, but if not, I recommend the book he published on managing. I also wonder if the Giants carry on any of his philosophies because Pat Dobson was one of Sabean’s most trusted and closest advisors until his early passing. The Giants pitching staff reminds me totally of Weaver’s rotation in the early 80’s, with 4 20-game winners, impossible today, but the Giants at least have a high caliber rotation 1-4, the past few years.

And I was going to reply above, but I would note that while Bochy is among the leaders in managerial WAR as well as 1-run games, he is not a classic small-baller. He does do a lot of hit-and-runs, but that is mostly to avoid double plays for the slower players. He hates to bunt, which is very Weaver, as he hates to give up outs. Similarly with stolen bases, even with Torres, he didn’t run much, though I do recall him allowing Roberts more leeway in that way, but he’s an excellent base-stealer. Oh yeah, he gave Randy Winn rope because Randy got really good at stealing bases in his last years with the Giants (and he’s coaching players on base stealing in spring training for the Giants this spring).

Also, here is an idea, though not ideal: Bill James have been collecting a number of stats on managerial moves for the past 10 or so years in his handbooks. Most have been consistently collected, though some have morphed (like games with pitchers over 120 pitches). Perhaps you can do some sort of regression (if you know how to do that) using those metrics and tying it to your WAR and Pythagorean stats. If you are lucky (and subscribe to his Gold Mine service), he has posted these stats across most of recent baseball history (should be easy to compile using any database) at his Gold Mine service, and you can do the analysis across at least a couple of generations of baseball, if not the history.

As you aptly noted, you are just trying to start the analysis of something like this. We are just taking the first steps of a possible journey of 1000 steps. But you have to start somewhere if we hope to get there.

Now that is lastly. :^)

Adoptive parental unit of Ehire Adrianza.
Godfather of Travis Ishikawa.

"Not here to make friends, I'm here to win games" - Bruce Bochy
Q: "This doesn't happen every year." Posey: "Why not?"
"Let's get back to work and make another run at it" Posey

2010's will be known as "Decade of the Giants"

by obsessivegiantscompulsive on Mar 28, 2026 5:19 PM EDT up reply actions

OK, maybe this lastly

Forgot one more idea: how would your list look like if you took average over their career, both top and bottoms. Obviously, many will stay in about the same spot, but it would be interesting to see who jumps up into the top and falls into the bottom.

One, OK, two more observations. I think Hargrove was on my list of poor managers as well, and the Giants were one homesick Lou Pinella away from having him instead of Bochy as manager. The rumors were that the Giants wanted Pinella, who Sabean knew from his Yankees days, I guess, but he chose the Cubs reportedly because he wanted to be closer to Tampa where his mom is/was (not sure now) and I think she was in poor health. But if you looked at the teams back then, the Cubs looked closer to winning than the Giants did, at that time, so that most probably was a big factor that Pinella couldn’t point out publicly without burning bridges (and friendships).

Adoptive parental unit of Ehire Adrianza.
Godfather of Travis Ishikawa.

"Not here to make friends, I'm here to win games" - Bruce Bochy
Q: "This doesn't happen every year." Posey: "Why not?"
"Let's get back to work and make another run at it" Posey

2010's will be known as "Decade of the Giants"

by obsessivegiantscompulsive on Mar 28, 2026 5:28 PM EDT up reply actions

Rate stats!

Top 21 by WAE/pyth per game (21 because Bochy is #21):

(1000 or more games)

Mike Scioscia 0.013
Ozzie Guillen 0.013
Ron Gardenhire 0.013
Bobby Valentine 0.012
Chuck Dressen 0.012
Don Zimmer 0.012
Herman Franks 0.012
Felipe Alou 0.011
Fred Hutchinson 0.010
Ralph Houk 0.009
Frank Robinson 0.009
Earl Weaver 0.009
Billy Martin 0.008
Whitey Herzog 0.008
Buck Rodgers 0.007
Joe Torre 0.007
Davey Johnson 0.007
Johnny Oates 0.007
Walter Alston 0.006
Wilbert Robinson 0.006
Bruce Bochy 0.006

Top 21 by WAE/war per game:

Mike Scioscia 0.042
Frank Chance 0.036
Wilbert Robinson 0.031
Felipe Alou 0.028
Pat Moran 0.025
Bruce Bochy 0.025
Al Lopez 0.022
Jim Tracy 0.020
Chuck Dressen 0.018
George Stallings 0.018
Lou Boudreau 0.017
Fred Clarke 0.017
Don Baylor 0.016
Steve O’Neill 0.015
Jimmy Dykes 0.015
Fred Mitchell 0.015
Patsy Donovan 0.015
Bobby Cox 0.014
Joe Cronin 0.014
Ralph Houk 0.014

Really tough to beat Scioscia.

Creator of the Hall of wWAR • @baseballtwit on Twitter

by adarowski on Mar 28, 2026 10:26 PM EDT up reply actions

More thoughts

Converting to 162 games, the top managers by pythag added 2 wins on average per 162 season.

For WAR, Scioscia added almost 7 (!) wins per season, Bochy about 4 wins (which is basically where my 1-run win differential is for him, so that basically matches), Don Baylor added 2.5 wins, and because we love him, Earl Weaver added almost 2 wins.

Scanning your list, I noticed that while not a preponderance, I find that the pythag and WAR figures are roughly in the same ballpark for a good number of managers. Have you tried to see if there is any correlation between the two figures?

Also, just curious, not sure where this leads, but have you tried adding all the manager Pythag/WAR, either by total over history and/or total over a season? I wonder if it zeroes or not, either historically or by season (which, of course, begs the question of whether all players WAR adds up to total team WAR on a seasonal basis, I’ve never seen any discussion about that; Bill James forces his Win Shares to add up, but I have been assuming that WAR methodology does not do such forcing, but then I’m not really aware of how WAR is calculated in the deepest parts of that black box). Scrolling down, it seems like there are equal numbers positive and negative, in terms of their summed total, so maybe.

Adoptive parental unit of Ehire Adrianza.
Godfather of Travis Ishikawa.

"Not here to make friends, I'm here to win games" - Bruce Bochy
Q: "This doesn't happen every year." Posey: "Why not?"
"Let's get back to work and make another run at it" Posey

2010's will be known as "Decade of the Giants"

by obsessivegiantscompulsive on Mar 29, 2026 10:59 AM EDT up reply actions

I'm thinking now about how to look for correlations between the two lists.

I also noticed that some managers seem to rate very similarly by the two metrics. That’s interesting too, since WAR seems to have a much bigger range of results.

Creator of the Hall of wWAR • @baseballtwit on Twitter

by adarowski on Mar 29, 2026 11:18 AM EDT up reply actions

Curious: where did Billy Martin end up?

Looking at your list, I saw that Frank Robinson was up there on one list, and I viewed his gruff style to be similar to Billy Martin in that they get results in their first year but then wear out their welcome.

I was also curious how Ted Williams did in his short stint as manager. I remember a quote from a player noting how Williams love to work on hititng with everyone, but then that gave short shrift to everything else. But looking at the stats showed that a lot of hitters improved greatly under his wing as manager. Maybe look only at the change in offensive WAR for his teams in the season before and his first season? That’s something that can’t really be measure either by WAR in your methodology, because the pitching could go up or down no matter what Williams did, maybe there were trades or an injury or poor performance just due to randomness, whatever, plus, as noted, the defensive WAR varies widely sometimes for even the same player, season to season, though generally consistent.

Adoptive parental unit of Ehire Adrianza.
Godfather of Travis Ishikawa.

"Not here to make friends, I'm here to win games" - Bruce Bochy
Q: "This doesn't happen every year." Posey: "Why not?"
"Let's get back to work and make another run at it" Posey

2010's will be known as "Decade of the Giants"

by obsessivegiantscompulsive on Mar 28, 2026 5:37 PM EDT reply actions

Billy Martin

I’ve actually got all the stats here: http://darowski.com/hall-of-wwar/expectancy/

Martin was +13.1 by pyth and +6.7 by WAR.

Ted WIlliams was -7.9 by pyth and -8.0 by WAR.

Creator of the Hall of wWAR • @baseballtwit on Twitter

by adarowski on Mar 28, 2026 10:28 PM EDT up reply actions

Thanks!

Wow, BillyBall was actually not that bad, particularly Pythagorean, though as we’ve discussed, perhaps that is a function of getting the reins of winning teams.

But TeddyBall was pretty bad, as he is about as negative as Billy was positive, but his managerial career was very short, suggesting that his per-season figures are REALLY bad.

Speaking of which, thanks for the per-season figures above as well.

Yeah, hard to beat Scioscia, though Frank Chance is up there (of Evers to Tinkers to Chance fame) and I don’t recall his managerial career being that heralded. Don Baylor also stands out to me, as I never got the impression he was that good a manager, from the press. Then again, never thought much about Bochy until 2010 when he was doing moves that you have to do - which could ruffle feathers - to win it all, then discovered his great record in 1-run games in 2011.

Thanks as well for the link.

Adoptive parental unit of Ehire Adrianza.
Godfather of Travis Ishikawa.

"Not here to make friends, I'm here to win games" - Bruce Bochy
Q: "This doesn't happen every year." Posey: "Why not?"
"Let's get back to work and make another run at it" Posey

2010's will be known as "Decade of the Giants"

by obsessivegiantscompulsive on Mar 29, 2026 10:36 AM EDT up reply actions

Question

I was looking at your stats site - thanks again! - and am wondering if you should be adding the two metrics together, since there is probably some element of double counting happening.

Though, as you noted, this is all preliminary playing with trying to do something like this.

Thanks again for sharing your research.

Adoptive parental unit of Ehire Adrianza.
Godfather of Travis Ishikawa.

"Not here to make friends, I'm here to win games" - Bruce Bochy
Q: "This doesn't happen every year." Posey: "Why not?"
"Let's get back to work and make another run at it" Posey

2010's will be known as "Decade of the Giants"

by obsessivegiantscompulsive on Mar 29, 2026 10:45 AM EDT up reply actions

Combining them doesn't really mean anything there.

I was really just using it as a default sort for players who do well on both lists. I’m certain there’s plenty of double counting there.

Creator of the Hall of wWAR • @baseballtwit on Twitter

by adarowski on Mar 29, 2026 11:16 AM EDT up reply actions

Very Nice

This is a very cool jumping off point. I’ve been recently thinking about trying to do something similar to the WAR analysis, only using historical Marcels (and I guess TZ for D) weighted by actual playing time. Seeing if there are managers that have players that consistently beat their projected WAR. We’d need huge sampe sizes, but hey we’re looking on the career level….

Gas House Graphs | Twitter

by stevesommer05 on Mar 29, 2026 10:09 AM EDT reply actions

Thanks so much, Steve.

I actually had that thought this morning in the car. How else could we look at this? Perhaps wins above what the team was projected to do. The reason I was thinking about that is that a manager doesn’t really get hurt (using the system above) if he chooses to play a player capable of 1.5 WAR vs a player capable of 4.0 WAR. Projections might help with that.

Creator of the Hall of wWAR • @baseballtwit on Twitter

by adarowski on Mar 29, 2026 11:15 AM EDT up reply actions

A problem with projections

While looking at projected has some value, my worry there is that injuries often hurts a player’s performance and that’s not the manager’s fault that he’s injured (except, perhaps, as noted above, for Dusty Baker :^).

Adoptive parental unit of Ehire Adrianza.
Godfather of Travis Ishikawa.

"Not here to make friends, I'm here to win games" - Bruce Bochy
Q: "This doesn't happen every year." Posey: "Why not?"
"Let's get back to work and make another run at it" Posey

2010's will be known as "Decade of the Giants"

by obsessivegiantscompulsive on Mar 29, 2026 1:46 PM EDT up reply actions

Nice article

Btw, I found this article that may interest some..

"We are all agreed that your theory is crazy. The question that divides us is whether it is crazy enough to have a chance of being correct."
- Niels Bohr

Sorry, unauthorized hotlinking of copyrighted material not permitted.

by Frag on Mar 29, 2026 11:59 PM EDT reply actions

It isn't hard to explain Torre

being on the Pythagorean list. It’s called Mariano Rivera.

Any other managers on here with long-term dominant closers (or the reverse?)

by Xave on Mar 30, 2025 1:21 AM EDT reply actions

Oh gosh, that's interesting. Let's take Trevor Hoffman and Bruce Bochy, who also performs well.

The two overlapped with the Padres from 1995 to 2006, a very long stretch.

Bochy’s WAE/pyth is +16.5. With the Padres it was +13.5 (12 seasons) and with the Giants it was +3.0 (five seasons). So, no huge finding here. Bochy’s first year with the Giants was -6.1, so that really drags it down, too.

Bochy’s WAE/war split is +57.9 for the Padres and +9.5 with the Giants. That’s +4.8 per year for the Padres and +1.9 for the Giants. A much bigger split, but of course that isn’t related to pyth.

How about LaRussa/Eckersley from 1987 to 1995?

LaRussa was +15.7 by pyth in those seasons. He was +1.6 for the White Sox and +4.1 for the Cardinals. So, he did do considerably better with Eck on hand.

Should I look anyone else up?

Creator of the Hall of wWAR • @baseballtwit on Twitter

by adarowski on Mar 30, 2025 8:02 AM EDT up reply actions