Beyond the Box Score: An SB Nation Community


[Chart: Percent chance of select Royals hitters hitting a certain number of homers given 650 PAs in 2009, based on CHONE projections.]

I already posted this at Royals Review, but given the recent fun with Excel's BINOMDIST function and the accompanying charts (not to mention how long it took it to get through my thick head), I thought I'd post it here.

Those more in the know: is something like a binomial distribution roughly how the output of projection systems, like CHONE's percentiles and ZiPS' "Oddsinator", is generated?

Posted by devil_fingers about 1 year ago (15 comments)

Comments

no, actually, I didn't

Would that have given similar results?

What I did was simply go to the main projection page for the team and take home runs divided by at-bats as the %, then 650 as the number of trials. So, for Alex Gordon, CHONE projects 15 HR in 548 ABs, and the equation (Excel) is:

=(1-(BINOMDIST(B3,650,15/548,TRUE)))

Where B3 is the number of home runs. So for the chart, I did 1, 5, 10, 15, 25… you get the picture.
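
For anyone who wants to check the spreadsheet numbers outside Excel, here is a minimal Python sketch of the same calculation. It assumes the setup described above (15 HR in 548 AB as the per-trial rate, 650 trials); the function and constant names are made up for illustration. Note that one minus the cumulative binomial gives the chance of hitting strictly more than the given number of homers.

```python
import math

def binom_pmf(k, n, p):
    """P(exactly k successes in n trials), computed in log space; needs 0 < p < 1."""
    log_pmf = (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
               + k * math.log(p) + (n - k) * math.log1p(-p))
    return math.exp(log_pmf)

def prob_more_than(hr, n, p):
    """P(X > hr), the same quantity as 1 - BINOMDIST(hr, n, p, TRUE) in Excel."""
    # Summing the upper tail directly avoids tiny negative values from 1 - cdf.
    return sum(binom_pmf(k, n, p) for k in range(hr + 1, n + 1))

GORDON_RATE = 15 / 548   # CHONE: 15 HR in 548 AB, as quoted in the comment
TRIALS = 650             # plate appearances used for the chart

for hr in (1, 5, 10, 15, 25):
    print(f"more than {hr:2d} HR: {prob_more_than(hr, TRIALS, GORDON_RATE):.1%}")
```

Passing FALSE instead of TRUE in Excel corresponds to calling `binom_pmf` on its own, which gives the chance of exactly that many homers.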

Make sense?

By the way, using “FALSE” for the cumulative argument leads to some cool results when running this with win%.

Bringing you more-or-less replacement level analysis and commentary since sometime in 2008.

by devil_fingers on Jan 14, 2009 7:54 PM EST

WFB, baby

it looks like the curve dips below 0% for a bit – that’s true grit

---
Juuuust a bit outside!!
http://www.rightfieldbleachers.com

by Jack Moore on Jan 14, 2009 7:12 PM EST

yeah, that's just ... something

by devil_fingers on Jan 14, 2009 7:54 PM EST

Not sure on whether they use binomial distributions

They might be binomial, they might be normal, not sure. But the key to any “odds” projection is the standard error of estimation (SEE). It works much like a standard deviation does. So, if my projection for (name here) is .10 HR/FB, with an SEE of .01, then my model says there is a 68% chance (1 standard deviation either way) of the result being between .09 and .11, and a 95% chance of it being between .08 and .12.
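
A quick way to see where the 68% and 95% figures come from is sketched below. It assumes the projection error is normally distributed, with the SEE playing the role of the standard deviation; that is an assumption made for illustration, not something the comment pins down, and `chance_between` is a made-up helper name.

```python
from statistics import NormalDist

# Assumption: projection error is normal, with the SEE used as sigma.
projection = NormalDist(mu=0.10, sigma=0.01)   # .10 HR/FB, SEE of .01

def chance_between(low, high):
    """Probability that the result lands between low and high."""
    return projection.cdf(high) - projection.cdf(low)

print(f"within 1 SEE (.09 to .11): {chance_between(0.09, 0.11):.1%}")
print(f"within 2 SEE (.08 to .12): {chance_between(0.08, 0.12):.1%}")
```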

http://mvn.com/mlb-stats

by pizzacutter on Jan 14, 2009 9:05 PM EST

I like the non-cum graphs for comparing two teams or players.

The area of the graphs overlapping is related to the probability that the worse team will outperform the better team.

Beyond the Boxscore // Calling BJ Upton lazy is lazy.

by Sky Kalkman on Jan 14, 2009 9:06 PM EST

can you show us an example of said graph?

by devil_fingers on Jan 14, 2009 9:23 PM EST

Not right now.

Just imagine two bell curves side by side with the right tail overlapping the left tail of the other one. The more they overlap, the better chance the worse team has of outperforming the better team (or worse HR hitter has of out-performing the better HR hitter).
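
To put a number on that overlap, here is a rough Python sketch. It assumes both hitters' home run totals are independent binomials over 650 PA; the 25-homer and 18-homer rates and the function names are hypothetical, chosen only to illustrate the idea.

```python
import math

def binom_pmf(k, n, p):
    """P(exactly k successes in n trials), computed in log space; needs 0 < p < 1."""
    log_pmf = (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
               + k * math.log(p) + (n - k) * math.log1p(-p))
    return math.exp(log_pmf)

def prob_a_beats_b(p_a, p_b, n):
    """P(A > B) for independent A ~ Binomial(n, p_a) and B ~ Binomial(n, p_b)."""
    total = 0.0
    b_below = 0.0                        # running P(B < a)
    for a in range(n + 1):
        total += binom_pmf(a, n, p_a) * b_below
        b_below += binom_pmf(a, n, p_b)  # now P(B < a + 1)
    return total

PA = 650
better = 25 / PA   # hypothetical 25-homer hitter
worse = 18 / PA    # hypothetical 18-homer hitter

print(f"worse hitter out-homers the better one: {prob_a_beats_b(worse, better, PA):.1%}")
```

The closer the two rates are, the more the curves overlap and the closer this probability gets to a coin flip (minus the chance of a tie).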

by Sky Kalkman on Jan 14, 2009 9:39 PM EST

yeah, just did one

I did one earlier today for win % for the Royals, where I overlapped the cum and non-cum win curve, thanks to one of the brainiac economists who frequent RR…

Here’s the rough version, too small, of course:

by devil_fingers on Jan 14, 2009 9:45 PM EST

Binomial distributions

I do something to this effect, but the results are going to be slightly too narrow and frequently too symmetrical if you don’t take the error of the probability into account.

As an example let’s say that a player is projected to hit 20 homers in 650 PA. The problem we run into is that the distribution is based off our best guess of the probability, not the actual probability.

The player’s “true” homer distribution is going to look different based on the uncertainty in the projection itself. With a coin, you’re reasonably sure that it’s a 50/50 bet to get heads or tails. With a 20-homer hitter, you’re not entirely sure if he’s not really an 18-homer or a 22-homer hitter, and there’s a small chance that he’s actually a 10-homer or a 30-homer hitter.

Luckily, my projection for Guillen seems to also be 20 homers per 650 PA, which makes it easier for me. He’s more likely to “truly” be a 15 homer hitter than a 25 home run hitter (per 650 PA), so you get a more skewed distribution.

Once everything’s combined (I use Statistica to simulate), I get

(per 650 PA)

<5 HR – 0.2%
<10 HR – 3.4%
<15 HR – 18.4%
<20 HR – 47.2%
<25 HR – 76.3%
<30 HR – 91.5%
<35 HR – 97.7%
<40 HR – 99.5%
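
The widening effect described here can be sketched in a few lines of Python. This is not the simulation behind the table above: the spread of “true” talent levels and their weights are invented for illustration, and the result is an exact mixture of binomials rather than a simulation.

```python
import math

def binom_pmf(k, n, p):
    """P(exactly k successes in n trials), computed in log space; needs 0 < p < 1."""
    log_pmf = (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
               + k * math.log(p) + (n - k) * math.log1p(-p))
    return math.exp(log_pmf)

def prob_fewer_than(hr, n, p):
    """P(X < hr) for X ~ Binomial(n, p)."""
    return sum(binom_pmf(k, n, p) for k in range(hr))

PA = 650
# Hypothetical belief about true talent (HR per 650 PA -> weight); mean is 20.
TALENT = {15: 0.2, 20: 0.6, 25: 0.2}

def mixture_fewer_than(hr):
    """P(X < hr) after averaging over the uncertain true talent level."""
    return sum(w * prob_fewer_than(hr, PA, t / PA) for t, w in TALENT.items())

for hr in (5, 10, 15, 20, 25, 30, 35, 40):
    plain = prob_fewer_than(hr, PA, 20 / PA)
    print(f"<{hr:2d} HR  plain binomial {plain:6.1%}   with talent uncertainty {mixture_fewer_than(hr):6.1%}")
```

Both tails get fatter once the uncertainty is included, which is the “too narrow” problem with the plain binomial. Making the weights lopsided (more weight on 15 than on 25, say) would produce the skew mentioned for Guillen.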

--
Dan Szymborski
dan@baseballprimer.com

by D.Szymborski on Jan 14, 2009 10:35 PM EST

thanks, Dan

even though my head now hurts

by devil_fingers on Jan 14, 2009 10:55 PM EST

Maybe I can put it simpler

Think of it this way.

We’re going to do 100 coin flips. Coins are typical 50/50 propositions. Let’s say we want to know the odds of flipping 45 heads or more. Assuming that 50/50 split, we’d guess that it would happen 86% of the time.

But what if someone tells us that the coin is weighted and will never come up heads? In that case, we would never flip 45 heads or more out of 100.

Then again, we don’t know this guy too well and think he might be lying. We’re evenly split between whether he’s lying or telling the truth. So, depending on what’s true, we have a good chance of getting 0 heads. But if we just took the average of the two probabilities of the coin, 0.5 and 0.0, our coin would simply “project” to being a 1-in-4 heads coin.

If we have a coin that comes up heads 1 time in 4, the odds against flipping tails 100 times in a row are about 3.1 trillion-to-1. But in this scenario, our chance of flipping tails 100 times in a row is actually just over 50%!
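
The coin arithmetic is easy to check directly. The sketch below is plain Python with made-up variable names; the only inputs are the ones in the example (100 flips, a fair coin, a never-heads coin, and an even split between the two stories).

```python
import math

def prob_at_least(k_min, n, p):
    """P(X >= k_min) for X ~ Binomial(n, p)."""
    return sum(math.comb(n, k) * p**k * (1 - p)**(n - k) for k in range(k_min, n + 1))

FLIPS = 100

# Fair coin: chance of 45 or more heads.
fair_45_plus = prob_at_least(45, FLIPS, 0.5)

# "Projected" 1-in-4 coin: chance of 100 straight tails.
all_tails_blended = (1 - 0.25) ** FLIPS

# Actual situation: 50% the coin never lands heads, 50% it is fair.
all_tails_mixture = 0.5 * (1 - 0.0) ** FLIPS + 0.5 * (1 - 0.5) ** FLIPS

print(f"45+ heads with a fair coin: {fair_45_plus:.1%}")
print(f"100 tails, 1-in-4 coin: about 1 in {1 / all_tails_blended:.2e}")
# The fair-coin term is so small that floating point rounds the sum to 0.5.
print(f"100 tails, 50/50 mix of the two coins: {all_tails_mixture:.1%}")
```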

Does this make more sense?

by D.Szymborski on Jan 15, 2009 12:29 AM EST
