Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: Celtics, Heat Score On Purpose In Super Sunday Wins

Value of OBP and SLG by Lineup Position, Part 2

UPDATE ON FEB. 22-I CAN SEND MY DATA TO ANYONE WHO WANTS IT. JUST SEND ME AN EMAIL (cyrilmorong@sbcglobal.net).

Last week I posted some regression results in which team runs per game was the dependent variable (DV) and the OBP and SLG of each lineup postion were the independent variables (IVs). In some cases, the coefficient values were very different. I only looked at teams from 1989-2002. Retrosheet has how each team's lineup position did from 1959-2004 (but not the NL in 1959). So I went back and used all of the data that they have.

Here are the results:

Again, the coefficient values are not the same and vary quite a bit. I also ran the regression using only teams that had the DH (yes, sometimes the NL gets to use the DH-I mean AL teams only from 1973-2004). Here are those results:

You can judge for yourself if there are any differences when using only DH teams (where a real hitter bats 9th and not a pitcher).

Then I went back to all teams and put it SB and CS per game. Here are the results:

I know, in some cases things don't make sense. We see some negative values for SBs and positive values for CSs. I mentioned this last week. But I just wanted people to get a chance to look at this.

Then I did the SB/CS regression for DH teams only. Here are the results:

Again, judge for yourself if the differences are meaningful.

Now there could be collinearity between the IVs. I discussed this a little last week. I did not run any test yet for it this time. If I do, I will update this story. I ran a regression with some different variables to avoid or lessen this problem. Each lineup slot had 3 variables: walk percentage, hit percentage and extra-base percentage. For walks, hits, and extra-bases, the denominator was plate appearances (PAs). This is a little different than comparing OBP and SLG since OBP has PAs as the denominator and SLG has ABs. Also, by using extra-bases, it is a little like isolated power. SLG is not always as good measure of power because a guy who hits a single drives up his SLG. Isolated power is SLG - AVG, or extra-bases divided by ABs. Of course, here, I am using PAs. H1 is the hit% of the leadoff man, W1 is the walk% of the leadoff man, XB1 is the extra-base% of the leadoff man, etc. Here are the coefficient estimates:

Then I added SB and CS in. Here are those results:

Comment 16 comments  |  0 recs  | 

Do you like this story?

Comments

Display:

Hmm
As always, I can't tell if that is sarcasm or not.
"I don't set the rosters, I just make fun of the guy who does" - Rob Neyer

by Marc Normandin on Feb 19, 2006 10:11 PM EST reply actions  

My guess is sarcasm...
But I dunno.

by Dan Scotto @ Beyond the Box Score on Feb 19, 2006 10:14 PM EST up reply actions  

Power problems (the stats kind)
This is really interesting stuff.
One concern: given your sample size (big) small variances may have a trendency to come out as more significant than they really are.  This does not appear to be a problem for your H, W, XB terms, but I would take a hard look at the SB and CS results.  
I'm just speculating, but maybe stolen bases shouldn't be assigned to batting order positions.  I see the theoretical basis for doing so in the H, W, XB case (if only because your leadoff guy is more likely to be followed by a power threat); do you have a reason to think that a stolen base by a given batting order position would be worth more?
Have you tried using a more aggregate SB term?

by sunandrain on Feb 20, 2006 11:11 AM EST reply actions  

SBs, etc
Thanks for your comments.

I only put the SB/CS results there in case anyone was interested. I really don't know what to make of them.

It is possible that a SB in front of a guy who gets alot of hits (or just more singles, which I did not include) is more valuable than a SB in front of a guy who walks alot. That is the theoretical justification. Maybe it is a weak one.

Can you give me an idea of an aggregate SB term to try? Should I just try total SB/CS for each team instead of each lineup slot? Or maybe for the top 3, the middle 3 and last three batters?

by Cyril Morong on Feb 20, 2006 11:39 AM EST up reply actions  

This may be a repost - if so, ignore
Well, hmm...
If I were to test your theory, I'd do it a little differently.  The question is whether an attempted steal changes the number of outs produced by the baserunner/batter combination, right?  We can assume that fewer outs = more runs.  Could your dataset isolate this kind of combination (OBP of batter at the time of attempted steal, and outcome)?  That would be kind of cool to know.
In terms of an aggregate, I had thought of team aggregate and top and bottom of lineup.  Thirds as you suggest make more sense.  Basically, this becomes a control variable for what you demonstrate with the other variables.
Incidentally, your blog is really great.  How long have you been doing this?

by sunandrain on Feb 20, 2006 1:02 PM EST up reply actions  

Stealing
Actually, this blog was started and is run by Marc Normandin. I agree he does a great job.

I am not trying to see whether whether an attempted steal changes the number of outs produced by the baserunner/batter combination. Some of my earlier posts touch on that. Three people who have done good work on this are Tom Ruane, Ted Turocy and Mark Pankin. Also, I think this issue is discussed in "The Book" that is coming out soon by Tangotiger and others (mine has not come yet). This data set will not  isolate this kind of combination (OBP of batter at the time of attempted steal, and outcome). But you should search for those three guys on Google to see what they have come up with.

by Cyril Morong on Feb 20, 2006 3:01 PM EST up reply actions  

The Book
I need to order a copy of that next paycheck. Thanks for the compliment Cy and Sunandrain. Always glad to see new readers and users around these parts.
"I don't set the rosters, I just make fun of the guy who does" - Rob Neyer

by Marc Normandin on Feb 20, 2006 3:08 PM EST up reply actions  

Quick request
Cy, I hate to send you back to look at more work, but what does it look like for just the past 25 years? I'm currently reading Dayn Perry's new book, and he analyzes the playoff teams from the past quarter century to do his analysis on what exactly makes a team great nowadays. Just thought I'd ask, to see how different the data may look from the other two presented samples.
"I don't set the rosters, I just make fun of the guy who does" - Rob Neyer

by Marc Normandin on Feb 20, 2006 2:59 PM EST reply actions  

Quick request
That probably should not take too long once I get home. Just a matter of sorting, copying, pasting and telling the program to run the regression.

What is this new book?

by Cyril Morong on Feb 20, 2006 3:02 PM EST up reply actions  

Re: book
"Winners: How Good Baseball Teams Become Great Ones (and It's Not the Way You Think)"

It has been really interesting thus far, and I'll most likely throw an advertisent on the sidebar for it when I am finished. Dayn has some interesting ideas that he backs up with plenty of stat analysis, and he's entertaining just speaking in terms of his writing style as well.

"I don't set the rosters, I just make fun of the guy who does" - Rob Neyer

by Marc Normandin on Feb 20, 2006 3:06 PM EST up reply actions  

Advertisement*
"I don't set the rosters, I just make fun of the guy who does" - Rob Neyer

by Marc Normandin on Feb 20, 2006 3:07 PM EST up reply actions  

DH SLG3?
Is the SLG3 of the DH-only leagues really only 0.83?  That seems intuitively wrong to me.

I ran this on the A's using my script over on Catfish Stew, and it produces a lineup that consistently puts Mark Kotsay batting third.  That doesn't seem right.

by kenarneson on Feb 21, 2006 4:39 PM EST reply actions  

This is pretty radical, actually.
If these numbers are correct, the #3 slot is less important than the #7 and #8 slots in the order when there's a DH.

Here are the types it suggests are ideal.  I'll add up the OBP and SLG scores, just as a quick and dirty way to compare each slot's relative importance using this data:

#1: High OBP, Low SLG (3.55)
#2: Mid OBP, High SLG (3.34)
#3: Mid OBP, Low SLG (2.48)
#4: High OBP, Mid SLG (3.38)
#5: Low OBP, High SLG (2.44)
#6: Low OBP, High SLG (2.41)
#7: High OBP, Low SLG (3.16)
#8: High OBP, Low SLG (3.06)
#9: Mid OBP, Low SLG (2.22)

by kenarneson on Feb 21, 2006 7:21 PM EST up reply actions  

DH SLG3?
Ken, you raise a good point and I don't really have an answer. I can run the numbers again when I get home. It does seem like a strange result. Maybe it is because the cleanup batter usually has a high SLG, too. So if the third guy hits a single instead of a double and the cleanup guy hits a HR, the third place guy scores no matter what. That's the best I can come up with right now.

My main concern was seeing if the values for OBP and SLG would differ with lineup position. I think this shows that they do even if what I found are not the "true" values. You might take a look at what Mark Pankin found on this using Markov chains.

by Cyril Morong on Feb 22, 2006 11:58 AM EST up reply actions  

DH SLG3?
I reran the numbers after I made some corrections in the data (some team runs per game figures were misaligned). The value of SLG for the third place hitter changed in the DH only cases. It went from about .8 to 1.3. Sorry for any trouble this caused.

by Cyril Morong on Feb 22, 2006 10:44 PM EST up reply actions  

Check your e-mail
"I don't set the rosters, I just make fun of the guy who does" - Rob Neyer

by Marc Normandin on Feb 25, 2006 1:24 PM EST reply actions  

Comments For This Post Are Closed


User Tools

We use numbers and stuff.
Community Guidelines
Why be a member?

FanPosts

Community blog posts and discussion.

Recent FanPosts

Small
Free Agent Compensation
Img_0001_small
Value of Various Plate Approaches
Strike_three2_small
Effect of Foul Area on Strikeouts: AL 1954-68: Erratum
Small
Baseball on a stick
Small
Player Evaluating Statistic
Baseball_small
Rays Outfield: Cheap but Extremely Productive
Small
A new xBABIP
Small
Jack Morris "pitching to the score"
Strike_three2_small
Foul Area and Differences in SO: AL vs NL
Baseball_small
Is there a Kuroda and Oswalt Alternative?

+ New FanPost All FanPosts >

Follow us on Facebook!

Follow us on Twitter!

SaberGraphics

MLB Daily Dish

Get the latest MLB Trade Rumors, Transactions, and News at MLB Daily Dish!


Managing Editor:

Jbopp-kc_small Justin Bopp

Columnists:

Adam_small adarowski

Dme_small Satchel Price

Closeup4_small J-Doug

Carlosicon_small Julian Levine

Billy_and_daddy_4th_of_july_small Bill Petti

Featuring:

Dayton_small Jeff Zimmerman

12475953_small Jacob Peterson

Picture-6_small Chris St. John

Btbpro_small Dave Gershman

229331_10150183361996591_674441590_6760167_6637860_n3_small Lewie Pollis

Img_3830_small David Fung