The Most Biased Fans in Baseball (Are Not Who You Think)

Last week, I introduced a new novelty statistic called Star Power as part of an objective framework for the MLB All-Star vote that accounts for empirical, often-illogical contrarianism (surely there are people out there who think Daric Barton is better than Adrian Gonzalez) but not hometown biases or systematic over- or underratedness. As a reminder, the formula for Star Power is (numbers taken at the All-Star Break):

[Image: Star Power formula]

and a player's "expected Vote Share" is his Star Power divided by the sum of the Star Power of every candidate in his category. So, for example, the xVoteShare for a non-outfield AL player (Player n) would be:

[Image: xVoteShare formula]
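For anyone who wants to play along at home, here is a minimal Python sketch of the share calculation as described in the text: each candidate's Star Power divided by the total Star Power in his category. The function name, the player labels, and the Star Power values are all made up for illustration; none of them come from the actual data.

```python
def expected_vote_shares(star_power):
    """Return each candidate's Star Power as a fraction of the category total.

    `star_power` maps candidate name -> Star Power for one ballot category
    (for example, AL first basemen).
    """
    total = sum(star_power.values())
    if total <= 0:
        raise ValueError("total Star Power must be positive")
    return {name: sp / total for name, sp in star_power.items()}


# Hypothetical Star Power values for three candidates at one position.
example = {"Player A": 6.0, "Player B": 3.0, "Player C": 1.0}
shares = expected_vote_shares(example)

for name, share in shares.items():
    print(f"{name}: {share:.1%}")
```

By construction the shares in a category add up to 1, so a weak field makes every candidate in it look better.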

With this in our toolbox, we can compare the actual 2011 All-Star vote totals with the expected results to find the impact of each team's fans' biases on the balloting. To do this, I invented another tchotchke statistic called "BIAS In All-Star voting," or "BIAS" for short (the acronym is part of the name, like a guy looking at a picture of himself looking at a picture). Essentially, it's the number of votes a team's players received (using my estimates for players whose totals were not released) divided by the number of votes my Star Power-based model projected, then scaled like OPS+ so a 100 BIAS indicates a normal amount of homerism, while a 110 BIAS means the team's fans are 10 percent more biased than the league average. A 90 BIAS doesn't mean fans are unbiased, just that their biases are 10 percent weaker than average.
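A rough Python sketch of that scaling might look like the following. It assumes the league baseline is the simple average of the team-level actual-to-projected ratios; the text only says the number is "scaled like OPS+," so that choice, the function name, and the team vote totals below are assumptions for illustration, not the real figures.

```python
def bias_scores(actual_votes, projected_votes):
    """Scale each team's actual/projected vote ratio so the league average is 100.

    Both arguments map team name -> total votes for that team's candidates.
    Assumption: the league baseline is the unweighted mean of the team ratios.
    """
    ratios = {
        team: actual_votes[team] / projected_votes[team]
        for team in actual_votes
    }
    league_average = sum(ratios.values()) / len(ratios)
    return {team: 100 * ratio / league_average for team, ratio in ratios.items()}


# Hypothetical vote totals (in millions) for a three-team league.
actual = {"Team X": 1.5, "Team Y": 1.0, "Team Z": 0.5}
projected = {"Team X": 1.0, "Team Y": 1.0, "Team Z": 1.0}
scores = bias_scores(actual, projected)

for team, score in scores.items():
    print(f"{team}: {score:.0f} BIAS")
```

In this toy league, Team X's players got 50 percent more votes than projected while the average team got exactly what was projected, so Team X lands at 150 and Team Z at 50.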

Is your curiosity sufficiently piqued? Here are the numbers for each fanbase's BIASes:

[Chart: BIAS score by team]

Anything surprise you here? Red Sox fans, notorious for their fervor, are relatively objective with their All-Star ballots according to BIAS. Cubs faithful, famed for their loyalty, are among the least blindly supportive of their players. Even the Yankees, the kings of media domination around whom the TV world revolves, fall well short of having the most biased fanbase. Meanwhile, Braves and Mariners players got more than twice as many votes as they should have? Sure, the Phillies are a big-market club and the Giants and Rangers just made the World Series, but when did Brewers and Reds fans become so passionate?

If you look at some of the individual players and how many votes my framework projected for them, it starts to make some sense. Sure, J.D. Drew (1.86 million votes over his projection) and Carl Crawford (3.13 million over) got too many votes, but at the same time Adrian Gonzalez (4.02 million under his projection) and Jacoby Ellsbury (4.61 million under) didn't run away with starting spots like they were projected to. Cubbies fans didn't exactly stuff the ballot boxes for Starlin Castro (estimated .78 million votes too low) or Aramis Ramirez (2.11 million under). They weren't necessarily deserving of votes, but given Castro's popularity and the dearth of good NL third basemen this year, it's safe to say fans underperformed with them. And in the Bronx, Alex Rodriguez (2.48 million under) wasn't supported like the clear best third baseman in the league that he was.

Meanwhile, Atlanta fans helped to give all eight Brave players more votes than they were projected to receive, including, most egregiously, Chipper Jones (2.00 million over) and Jason Heyward (1.44 million over). Ichiro (2.44 million over) was the only Top 8 finisher for the Mariners, but his overratedness combined with the rest of his teammates' poor play was enough to give Seattle fans a high BIAS. Casey McGehee (1.77 million over) and Yuniesky Betancourt (1.65 million over) can tell you all you need to know about Milwaukee fans' passions, while Jay Bruce (2.20 million over) and Paul Janish (1.11 million over) were the biggest beneficiaries of Cincinnatians' fervor.

Another interesting thing to note: the three teams whose bars appear in black above (the Marlins, Padres, and Athletics) have BIAS scores based solely on my replacement-level estimations because none of their players cracked the Top 8 vote-getters at their respective positions. No Fish or Friar or A even made the list for outfielders, where the Top 24 candidates are revealed. But that's not the saddest part—based on my meager 457,546-vote minimum estimate for bottom-finishing candidates, Oakland players still got 28 percent too many votes. It's a rough time to be an A's fan.

So where do these biases come from? Local demographics don't seem to have much to do with it. BIAS scores and the populations of teams' cities and surrounding metropolitan areas actually had very slight negative correlations (R²=.011 and .002, respectively), right around the significance of alphabetical order (R²=.005 for team names, .010 for team locations). Interestingly (or maybe not), teams' BIAS scores and the vote percentages Barack Obama received in their states in 2008 had an inverse correlation with R²=.021.

It seems that partisanship comes not from external factors but from the teams themselves. The age of a franchise correlated with BIAS scores for an R² of .103. Payroll (R²=.088) appeared to have some small impact, while this year's attendance (R²=.120) didn't do as well as I would have thought. Preseason expectations might have something to do with it: The correlations between BIAS and 2010 wins (.080) and PECOTA's February projections (.082) were at least enough to be interesting.

Overall, though, the clear most important factor is current team performance. BIAS and wins at the All-Star Break correlated at R=.466 for an R² of .217 (some of the above relationships are probably just functions of this). That's still not enough to explain the bias, though: there is a large intangible je ne sais quoi in play here that I cannot quantify (though given that I think most would put Boston and Chicago fans near the top if they compiled a gut-feeling list like this, I'm not sure whether anecdotal observations would really help much here).
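For readers less familiar with the notation: R is the Pearson correlation coefficient, and squaring it gives the share of the variation in one variable that a straight-line fit on the other accounts for. The Python sketch below checks the arithmetic on the reported figure and shows one way the correlation itself could be computed. The `pearson_r` helper and the win and BIAS numbers fed into it are hypothetical, not the actual team data.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length, at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a sequence is constant")
    return covariance / math.sqrt(var_x * var_y)


# Squaring the reported R reproduces the reported R-squared (to three places).
r_reported = 0.466
r_squared = r_reported ** 2
print(f"R = {r_reported}, R^2 = {r_squared:.3f}")

# Hypothetical wins at the break and BIAS scores for five made-up teams.
wins = [38, 42, 45, 50, 55]
bias = [70, 110, 85, 120, 140]
r_example = pearson_r(wins, bias)
print(f"example R = {r_example:.3f}, R^2 = {r_example ** 2:.3f}")
```

Put another way, an R² of .217 leaves close to four-fifths of the team-to-team variation in BIAS unaccounted for by wins alone.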

It may be that, like UZR and BABIP, BIAS scores need more than one season's worth of data before they stabilize. This year's totals correlated with the results of the (much more simplistic) study I did last year at R=.370, so the leaderboards aren't a model of stability. Then again, if current-season performance is the biggest factor getting fans to the All-Star polls, perhaps the inconsistency is not a failing of the statistic but a reflection of changing real-world conditions. Interestingly, some of this year's surprises placed similarly last year, too—the Braves were near the top, while Red Sox (90) and Cubs (56) fans got almost exactly the same scores.

It's worth noting that some teams' scores are artificially inflated by having overrated, popular players on their rosters who other teams' fans voted for too: the Mariners wouldn't be at the top if not for Ichiro, and Twins fans would be below-average without Joe Mauer. I considered dropping each team's biggest outlier in each direction to balance it out, but that seemed unfair to the teams who don't have a Mauer or Ichiro. Especially since much of their popularity is tied up in their team—even outside of Minnesota, Mauer probably wouldn't have gotten as many votes if he had left as a free agent last winter.

One final thing to keep in mind: having a biased fanbase isn't necessarily good or bad. If you care about preserving the integrity of the All-Star Game then no, you really shouldn't be voting for Derek Jeter or Jarrod Saltalamacchia. At the same time, I don't think this speaks well for the fans on the lower end—how depressed does a fanbase have to be to not get a single player on a Top 8 list? A little BIAS is healthy. Just take it in moderation.

Comments

Good stuff once again

This is theoretical bias, right, not actual. Since we don’t know who fans actually voted for we are extrapolating based on an assumption that the majority of “incorrect” votes would come from homer fans blinded by their allegiance.

But here are two other hypotheses:

1) It’s all about exposure: The strongest correlation you found was between wins and vote share. Possibly those teams that are winning garner greater national coverage and, therefore, the leading players on those teams get a boost in votes, regardless of fan affiliation. Now, there are obviously some outliers (e.g. the Red Sox). Maybe you could look at the correlation between national games and bias? ESPN, FOX, TBS, MLB, etc. Before the All-Star break, who the heck saw Andrew McCutchen all that much?

2) It’s about what stats fans look at: This is more of a stretch, but maybe those players that received a disproportionate number of votes relative to their expectation tended to have better traditional stats (i.e. HR, AVG, RBI). For RBI, that would make sense since the best hitters on good teams tend to have higher RBI totals simply because there are better players in front of them.

Just some thoughts. Again, very cool stuff.

Columnist at Beyond the Box Score. Contributor at Amazin' Avenue.

by Bill Petti on Aug 31, 2011 12:51 PM EDT

Right

Sorry, I guess I should have explained that better. Yes, the assumption I was making is that teams getting more votes than expected is due to that team’s fans voting for their players. For individual players like Ichiro there’s more to it than that, but I figured that for most teams individual over/underratedness would be more or less cancelled out.

1. Interesting idea, I’ll look into that. There’s definitely a weight towards well-known and bigger-market teams here, but I was thinking about it in more abstract terms (payroll, age of franchise).

2. I’ll be getting into that once I’m done overanalyzing team/player biases. I don’t want to give it away yet, but suffice to say I don’t think that’s really a factor.

Contributor @ Beyond the Box Score. Lead Blogger @ Wahoo Blues. Sophomore @ Brown University. Twitter: @LewsOnFirst
"Baseball, it is said, is only a game. True. And the Grand Canyon is only a hole in Arizona."—George Will

by Lewie Pollis on Aug 31, 2011 1:12 PM EDT

"Ichiro (2.44 million over) was the only Top 8 finisher for the Mariners"

I would imagine that this is because Ichiro is an easy vote for most people, especially since he’s run a lengthy consecutive streak of making all-star teams. Otherwise, for Mariner fans, there is this other guy called “Felix Hernandez.”

by ThundaPC on Aug 31, 2011 1:53 PM EDT

If fans voted for pitchers in the All-Star Game, I’m sure Felix Hernandez would’ve been a top 8 finisher.

Let's go eat...a triceratops. /velociraptor'd

by LeepinLizardz on Aug 31, 2011 6:21 PM EDT

Not really surprised about the Braves. I mean their fans call the Braves “America’s Team”

I am not a Leader, and I am not a Legend.

by Aaron Go Bragh on Aug 31, 2011 7:42 PM EDT

Wikipedia calls them that...

Do actual Braves fans call them that?

by SS451 on Aug 31, 2011 8:25 PM EDT

not really

it comes from the Ted Turner days where they always played on TBS prior to all the regional networks and all the ESPN games

by Delaware Boiler on Aug 31, 2011 8:31 PM EDT

I’ve heard it a couple times, usually followed by “look at how big the Braves’ radio network is”

I am not a Leader, and I am not a Legend.

by Aaron Go Bragh on Aug 31, 2011 11:29 PM EDT

No.

Never did.

"It breaks your heart. It is designed to break your heart. The game begins in the spring, when everything else begins again, and it blossoms in the summer, filling the afternoons and evenings, and then as soon as the chill rains come, it stops and leaves you to face the fall alone." A. Bartlett Giamatti

by sddbaker on Sep 1, 2011 2:47 AM EDT

I believe old, old, old TBS promos once did

…but like sdd stated, Brave fans never made such a claim. The Braves do have a widespread fan base with or without TBS, though, based on their history in three regions of the country.

Hey! I’m new.
by ChopMaster on Jul 7, 2011 10:24 PM CDT (joined Jul 19, 2010)

Twitter: @biggentleben

by biggentleben on Sep 5, 2011 6:48 AM EDT

Cincy fans ranking that high doesn't surprise me

Weren’t like 8 Reds elected as starters in 1958 or something, because Reds fans were literally stuffing the ballot box using highly questionable means?

by jhitts08 on Sep 1, 2011 4:28 AM EDT

Not a fan at all of the methodology

Really doesn’t prove any bias in any fanbase, honestly. What it shows is that the Braves and Mariners and other top-rated teams in this study have “name” players that get a number of all-star votes. I’m a lifelong Braves fan, but I’ve turned in All-Star ballots a number of times in my lifetime without a Brave, but guys like Brian McCann and Dan Uggla have become very well-respected nationally for how they play the game, Chipper Jones is considered an active future HOFer, which always garners more votes, and guys like Jason Heyward and Freddie Freeman have garnered national attention for their early exploits. However, that is not why the methodology has issues. It is because you use actual All-Star votes as a function of current season statistics. People vote for a myriad of reasons. This year, you could make a very good argument to have a guy like Chipper Jones because it could be his last season of a HOF career. Likewise, I’d say when it gets to be the final season of Jeter or ARod’s careers, they’ll get extra national votes as a tribute to career accomplishments (okay, maybe not ARod as much, but you get the point).

I personally typically vote for the best player at each position. If that means I vote for Mark Teixeira in the midst of a .220/.340/.420 first half, then so be it. When I move away from that is when a guy is having a statistical performance so tremendous that it cannot be ignored. Also, I typically vote very early in the process. A whole month of statistical accumulation happens between the opening of voting and closing of voting. Heck, typically a week of statistical accumulation happens between the close of voting and the All-Star game itself, so you could have a guy who was the borderline #3 guy at a position when voting closed now as the clear #1 guy with a torrid week that adds 5 HR and 11 RBI to his midseason totals. It’s the basic issue with using any fixed-point analysis done during the season to make overarching assumptions or to serve as proof: baseball can change so quickly during the season.

Hey! I’m new.
by ChopMaster on Jul 7, 2011 10:24 PM CDT (joined Jul 19, 2010)

Twitter: @biggentleben

by biggentleben on Sep 5, 2011 6:46 AM EDT
