Beyond the Box Score: An SB Nation Community

Navigation: Jump to content areas:


Sports blogs for fans, by fans.
Around SBN: SB Nation NFL Power Rankings for Week 11

File

Percent Chance of Select Royals Hitters Hitting a Certain Number of Homers given 650 PAs in 2009, drawing on CHONE projections. If you need a closer look, click here.

I already posted this at Royals Review, but given the recent fun with Excel's BINOMDIST function and the accompanying charts (not to mention how long it took it to get through my thick head), I thought I'd post it here.

Those more in the know: is something like a binomial distribution roughly how stuff in projections systems like CHONE's percentiles and ZiPS' "Oddsinator" are generated?

10 months ago Newavatar_tiny devil_fingers 15 comments 0 recs  | 

Story-email Email Printer Print

Comments

Display:

no, actually, I didn't

Would that have given similar results?

What I did was simply go to the main projection page for the team, and took the number of number runs/number at bats as the %, then 650 as the number of trials, so, for Alex Gordon, CHONE projects 15 HR in 548 ABs, so the equastion (excel) is:

=(1-(BINOMDIST))

Where B3 is the number of home runs. So for the chart, I did 1, 5, 10, 15, 25… you get the picture.

Make sense?

by the way, using the “FALSE” Cumulative leads to some cool results when running this with win%

Bringing you more-or-less replacement level analysis and commentary since sometime in 2008.

by devil_fingers on Jan 14, 2009 7:54 PM EST up reply actions   0 recs

WFB, baby

it looks like the curve dips below 0% for a bit – that’s true grit

---
Juuuust a bit outside!!
http://www.rightfieldbleachers.com

by Jack Moore on Jan 14, 2009 7:12 PM EST reply actions   1 recs

yeah, that's just ... something

Bringing you more-or-less replacement level analysis and commentary since sometime in 2008.

by devil_fingers on Jan 14, 2009 7:54 PM EST up reply actions   0 recs

Not sure on whether they use binomial distributions

They might binomial, they might be normal, not sure. But, the key to any “odds” projection is the standard error of estimatation. It works much like a standard deviation does. So, if my projection for (name here) is .10 HR/FB, with an SEE of .01, then my model says there is a 68% chance (1 standard deviation either way) of the result being .09 or .11, and a 95% chance of it being between .08 and .12

http://mvn.com/mlb-stats

by pizzacutter on Jan 14, 2009 9:05 PM EST reply actions   0 recs

I like the non-cum graphs for comparing two teams or players.

The area of the graphs overlapping is related to the probability that the worse team will outperform the better team.

Beyond the Boxscore // Calling BJ Upton lazy is lazy.

by Sky Kalkman on Jan 14, 2009 9:06 PM EST reply actions   0 recs

can you show us an example of said graph?

Bringing you more-or-less replacement level analysis and commentary since sometime in 2008.

by devil_fingers on Jan 14, 2009 9:23 PM EST up reply actions   0 recs

Not right now.

Just imagine two bell curves side by side with the right tail overlapping the left tail of the other one. The more they overlap, the better chance the worse team has of outperforming the better team (or worse HR hitter has of out-performing the better HR hitter).

Beyond the Boxscore // Calling BJ Upton lazy is lazy.

by Sky Kalkman on Jan 14, 2009 9:39 PM EST up reply actions   0 recs

yeah, just did one

I did one earlier today for win % for the Royals, where I overlapped the cum and non-cum win curve, thanks to one of the brainiac economists who frequent RR…

Here’s the rough version, too small, of course:

Bringing you more-or-less replacement level analysis and commentary since sometime in 2008.

by devil_fingers on Jan 14, 2009 9:45 PM EST up reply actions   0 recs

Binomial distributions

I do something to this effect, but the results are going to be slightly too narrow and frequently too symmetrical if you don’t take the error of the probability into account.

As an example let’s say that a player is projected to hit 20 homers in 650 PA. The problem we run into is that the distribution is based off our best guess of the probability, not the actual probability.

The player’s “true” homer distribution is going to look different based on the probability of the projection itself. With a coin, you’re reasonably sure that the it’s a 50/50 bet to get heads or tails. With a 20-homer hitter, you’re not entirely sure if he’s not really an 18-homer or a 22-homer hitter and there’s a small chance that he’s actually a 10-homer or a 30-homer hitter.

Luckily, my projection for Guillen seems to also be 20 homers per 650 PA, which makes it easier for me. He’s more likely to “truly” be a 15 homer hitter than a 25 home run hitter (per 650 PA), so you get a more skewed distribution.

Once everything’s combined (I use statistica to simulate), i get

(per 650 PA)

<5 HR – 0.2%
<10 HR – 3.4%
<15 HR – 18.4%
<20 HR – 47.2%
<25 HR – 76.3%
<30 HR – 91.5%
<35 HR – 97.7%
<40 HR – 99.5%

--
Dan Szymborski
dan@baseballprimer.com

by D.Szymborski on Jan 14, 2009 10:35 PM EST reply actions   0 recs

thanks, Dan

even though my head now hurts

Bringing you more-or-less replacement level analysis and commentary since sometime in 2008.

by devil_fingers on Jan 14, 2009 10:55 PM EST up reply actions   0 recs

Maybe I can put it simpler

Think of it this way.

We’re going to do 100 coin flips. Coins are typical 50/50 propositions. Let’s say we want to know the odds of flipping 45 heads or more. Assuming that 50/50 split, we’d guess that it would happen 86% of the time.

But what if someone tells us that the coin is weighted and will never come up heads? In that case, we would never flip 45 heads or more out of 100.

Then again, we don’t know this guy too well and think he might be lying. We’re evenly split between whether he’s lying or telling the truth. So, depending on what’s true, we have a good chance of getting 0 coin flips. But if we just took the average of the two probabilities of the coin, 0.5 and 0.0, our coin would simply “project” to being a 1-in-4 heads coin.

If we have a coin that comes up heads 1 time in 4, the odds of flipping tails 100 times in a row is 3.2 trillion-to-1. But in this scenario, our chances of flipping tails 100 times in a row is actually over 50%!

Does this make more sense?

--
Dan Szymborski
dan@baseballprimer.com

by D.Szymborski on Jan 15, 2009 12:29 AM EST up reply actions   0 recs

Comments For This Post Are Closed


User Tools

We use numbers and stuff.
Community Guidelines
Why be a member?
Start posting on Beyond the Box Score »

Join SB Nation and dive into communities focused on all your favorite teams.

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Small
PZR-based Win Values 2001-2006

Recent FanPosts

Small
Determining Batted Ball Rates using Pitch Type and Location
Small
a new xBABIP calculator
Img587561916661595
Top 15 high school MLB draft prospects
Small
The "30 parks on a budget" challenge
Sunflower_small
World Series Simulation, Game #6
Small
JT20 Dynasty League
E52205a2_small
New Look
Sth70021_small
Exploring Hit f/x, Albeit Badly
Redcap_small
Ricky Nolasco: 4 WAR or 1 WAR?

+ New FanPost All FanPosts >

FanShots

Quick hits of video, photos, quotes, chats, links and lists that you find around the web.

Recent FanShots

Primer on BaseRuns
Cool Baseball Infographics
ESPN's Jerry Crasnick on defensive metrics
I’m also a follower, since Brian Bannister’s on our team, of sabermetric st...
Top Ten Baseball-Reference.com's Sponsorships
Primer on Linear Weights
JC Bradbury on "Hot Stove Myths"
Everyone Should Learn to Throw a Cutter
Criminals of WAR
Ten statisticians you should know about

+ New FanShot All FanShots >

BtB on Twitter

Main Feed: @BtBScore

Tommy B: @tommy_bennett
Sky: @BtB_Sky
Dan: @dturkenk
Harry: @harrypav
Jinaz: @jinazreds
Jack: @jh_moore
Erik: @Erik_Manning
Tommy R: @trancel
Justin: @justinbopp

Subscribe to BtB via Email

Enter your email address:

Delivered by FeedBurner

BtB Goes Social


Managers

Nando_small R.J. Anderson

Limes_125_small Sky Kalkman

E52205a2_small Tommy Bennett

Editors

Face_small Harry Pavlidis

Rawlings_baseball_bigger_small Dan Turkenkopf

770insig_small Jeff Zimmerman (TucsonRoyal)

Aviles_small Justin Bopp

Authors

Banny_small erik

Raysring1_small Tommy Rancel

Jinaz-reds-avatar_small JinAZ

Jmlogo_small Jack Moore

1753738656_110919ebe9_o_small vivaelpujols

1_small Graham

Baseball_small Mike Rogers

Redcap_small SFiercex4

Small Patrick Clark

Walter_album_small Walter Fulbright