Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: Trent Richardson Interviews Fellow Brown Brandon Weeden

Yanks vs Sox Preview and Discussion Thread - Updated


Hey folks, right now this is mainly a placeholder for a series preview post that will be finished up after my little one goes to sleep tonight.  I'd like to start a thread like this each week where we preview one series from the upcoming week using a some sabermetric principles and a simple simulation tool I've been working on developing.  This week I picked the Yanks-Sox series, but I think in coming weeks I'll let you, the reader, vote on the series you want to see previewed.  Ok, enough introductions.  To hold you over until I get the full preview I present a graphic courtesy of our resident graphic guru Justin Bopp

Ny_vs_bos_05-06-10_medium

The graph compares the lineups of the two teams using my home-brewed version of an updated CHONE projection.  Sorry to have to cut this short right now, but feel free to use the comments as an open discussion thread for all things baseball, and I'll get back with the rest of the preview in a bit.

Ok the rest of the article will describe what the simulation tool does, and then look at the inputs and outputs for the Yanks - Sox series.

Star-divide

The first goal of the process is to take a team's lineup and compute an expected runs scored / game.  To do this I calculate Runs Created using wOBA (those Justin graphed above) and scale it to a single game using plate appearances per lineup slot.  You could also use a lineup simulator to get runs scored/game.  The next part of the equation is runs allowed per game.  Clearly runs allowed has two components, pitching and defense.  For defense I took my defensive projections if available for the player in question, if not I used CHONE as most of the unavailable players in my data set are catchers or young players.  The following two tables summarize both the offensive projections used in Justin's chart and the defensive projections used for the two teams

 

Representative Yankees Lineup
Name wOBA DEF/150
1 Jeter 0.36 -1
2 Johnson 0.359 0
3 Teixeira 0.388 1.5
4 Rodriguez 0.399 -1.5
5 Cano 0.367 -1
6 Posada 0.356 -3
7 Granderson 0.357 0.5
8 Swisher 0.362 -3
9 Gardner 0.342 3

 

and the Sox

Representative Red Sox Lineup
Name wOBA DEF/150
1 Scutaro 0.331 3.5
2 Pedroia 0.365 6.5
3 Martinez 0.355 -1
4 Youkilis 0.379 5.5
5 Drew 0.358 5
6 Ortiz 0.359 0
7 Beltre 0.329 8.5
8 Hermida 0.341 -6
9 McDonald 0.323 3

 

For the pitching side I take my home-brewed version of an updated CHONE ERA and prorate it to inning pitched / game start for the stater and the remainder of the game to the bullpen (aggregated over the top 5 relievers).  Once I get the game ERA I back that out to "theoretical" total runs allowed without defense.  After subtracting the defense I'm at actual runs allowed per game.  The pitching values look like the following for this series

 

Yankees Pitching
Starter Starter ERA IP/GS Bullpen ERA
Game 1 Hughes 4.05 6 3.8
Game 2 Sabathia 3.48 6.66 3.8
Game 3 Burnett 4.31 6.33 3.8

 

and for the Sox

Red Sox Pitching
Starter Starter ERA IP/GS Bullpen ERA
Game 1 Beckett 3.77 6.66 3.8
Game 2 Buchholz 4.21 6 3.8
Game 3 Lester 3.46 6.33 3.8

 

Now I have runs scored and runs allowed for each team in a vacuum.  That means I can run each through pythagenpat and get theoretical winning percentages (again in a vacuum).  To get actual game win percentages I plug the individual winning percentages into the log5 formula.  Once I have those, a simple monte carlo simulation can provide a distribution of series results (as could just plain old statistics using the binomial distribution, but I love me a monte carlo for some reason).  The results for this series (given the above projections of true talent combined with HFA for the Red Sox) are 

Yanks_sox_medium

 

So according to the above projections the Red Sox have a slight edge (some of which is due to having the home field advantage adjustment) but it looks like a tight series in that either team has a decent chance of winning the series. Next time I do one of these I'll go more in depth on how I'm doing my home-brew updating of projections (unless I do the smart thing and just use ZIPs updated and therefore don't have to update myself. I just had CHONE handy when I did the first one of these)

Comment 1 comment  |  1 recs  | 

Do you like this story?

Comments

Display:

Why are you using ERA?

That will just double count the defensive advantage of the Sox…

Also, what is t he homefield adjustment, and what’s the justification?

by themastah on May 9, 2010 3:34 AM EDT reply actions  

Comments For This Post Are Closed


User Tools

We use numbers and stuff.
Community Guidelines
Why be a member?

Follow us on Facebook!

Follow us on Twitter!

SaberGraphics

Yahoo_full_count

MLB Daily Dish

Get the latest MLB Trade Rumors, Transactions, and News at MLB Daily Dish!


Managing Editor:

Jbopp-kc_small Justin Bopp

Columnists:

Adam_small adarowski

Dme_small Satchel Price

Closeup4_small J-Doug

Carlosicon_small Julian Levine

Billy_and_daddy_4th_of_july_small Bill Petti

Featuring:

Dayton_small Jeff Zimmerman

12475953_small Jacob Peterson

Recent_pic_pg_small Patrick Gordon

Btbpro_small Dave Gershman

Me_small Bryan Grosnick

229331_10150183361996591_674441590_6760167_6637860_n3_small Lewie Pollis

Img_3830_small David Fung

30472_1481067225243_1190689185_1381415_997334_n_small Glenn DuPaul

1mnvxku7_small joshuaworn

Set_small MattFilippi18

Photo0011_small Nathaniel Stoltz