Beyond the Box Score: An SB Nation Community

Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: College Football Rankings - BlogPoll Top 25

A Little Saber-Rant: Or Dan's Demented Ramblings

On Monday, John Sickels ignited a mini firestorm in the sabermetric community by admitting he's becoming somewhat overwhelmed and tired with the recent advances in sabermetrics.  He still enjoys baseball, but he's beginning to question the relevance of increasing the complexity of our measures for small gains in accuracy.  I'm paraphrasing here, but I don't think I'm too far off from what John is trying to say. 

There have been numerous well-written responses, ranging from Dan Novick and Pat Andriola at The Hardball Times, to Tango over at The Book Blog.  Most seem to grant John's point that at least some of sabermetric study is becoming extremely specialized, and go on to point out how the field in general is becoming more accessible to the average person.

This is my foray into the discussion.  I don't know if I have anything interesting to add, but since it's a topic that's been kicking around my head for a while, you get to hear from me whether you like it or not.

Star-divide

Pushing the envelope in sabermetrics has indeed become the realm of specialists.  I don't think it's possible to argue that (of course someone will, I'm sure).  The last few years have seen the advent of Pitch F/X and the tease of Hit F/X and Field F/X - all of which require some fairly in-depth knowledge in physics to really make sense of.  Many of the advances that aren't physics related are deeply statistical (like SIERA), or deeply computational (like UZR).

This isn't necessarily a bad thing.  And it's certainly not an unexpected thing.  I'll get back to my thoughts on the topic at hand in a moment, but first a historical diversion.

Throughout history, as cultures mature, specialization follows.  The advent of agriculture led to food surpluses, which allowed specialization of labor and the growth of villages and cities. 

During the Renaissance, gentleman scholars like Leonardo Da Vinci or William Herschel, who discovered Uranus and wrote 24 symphonies, led the way - making breakthroughs in many fields.  Fast forward to today.  You need many years of schooling and many years of work to become an expert in just a single field.  Occasionally a newcomer may make an impact on the field, but that seems to becoming more and more rare.

We're at a transition point in sabermetrics.  I'm just too young to have participated in the first big wave of popular sabermetrics - the USENET forums whose discussions were inspired by Bill James' work. (And no, that's not an age joke, Szymborski)  But I have been active in the field for almost a decade now - following Rob Neyer, visiting Baseball Think Factory (back before registration was required), and for the last few years making my own contributions through writing.  I've seen sabermetrics progress from a place where a talented dabbler could make a mark to a place where advanced statistical methods are being used to move the needle slightly forward.

Along with the increase in specialization, we're seeing a tremendous amount of popularizers. Those writers who get the hard stuff and make it accessible to the masses.  These types of writers are essential for the sabermetric community, just as they are for the scientific community.  For every Nobel prize winning physicist, there's the journalist who translates her work for the common people.  For every Patriot, there's a Dave Cameron. Both types of analysts are needed for the field to progress.

And progress it has.  In 2000, Baseball Prospectus published its version of the Hilbert problems for sabermetrics.  For those of you unfamiliar with the reference, at the beginning of the 20th Century, Hilbert, a mathematician, published a list of 23 unsolved problems that he deemed the most important and interesting left to be studied.  Out of the list of 23 unsolved sabermetric problems that BP raised, I count at least 10 that are either "solved" or have had substantial progress made on them.

On the other end of things, sabermetrics has never been more mainstream. Whether it's advanced fielding analysis showing up in the pages of the New York Times, or the MLB Network discussing WPA, saber topics have made inroads into places it's never been before. 

So the combination of specialist and popularizers is a successful one for the field of sabermetrics as a whole.

Why, then, do I feel some of the same ennui as John Sickels?  The easiest answer is that, to some extent, sabermetrics has passed me by.  I'm the dabbler I mentioned up above.  I have the occasional creative idea, and just enough computational and statistical ability to get the idea across. 

But I don't have the knowledge to run a LOESS regression, nor can I calculate the Magnus force on a fastball.  I suppose I could learn how to do those things if I invested the time, but I have a job I like, and a family I love, and I don't have the interest I once did.

Beyond that though, I think there's some staleness in the community right now.  There seem to be three major trends in sabermetrics at the moment: small advances in areas (projections, for example), trying to figure out how to make Pitch F/X data useful, and valuing a player's contribution using WAR. 

WAR may in fact be the holy grail, the one measure to rule them all, but it makes for boring reading when it's the beginning, the middle and the end of the analysis.     

As I'm writing this, I'm in the middle of a Twitter conversation about the value of the Pitch F/X work that's been done to date.  There have been some great pieces, and some flashes of brilliance, but I think we're still looking for the "Eureka" moment that opens up new avenues of study.  Josh Kalk's look at predicting injuries is one of the most innovative things I've seen, and I'm disappointed that he was hired by the Rays and unable to move further on it (it's great for him of course).

Anyway, this has been a long semi-rant, and I do feel very much like an old fogey here - at the age of 30 no less.  After all this, you probably feel entitled to some recommendations.

Unfortunately,  I don't have many to offer.  I'm slightly disillusioned and I don't really know why.  I can tell you what I'd like to see the sabermetric community focus on, but I'm just one man, and my recommendations don't necessarily carry much weight.

I'd like to see a re-emphasis on creativity; creativity while asking the question, creativity while studying the data, and creativity while writing.  Don't just ask how much movement this pitcher's fastball has.  Instead, wonder whether he can control the amount of movement based on his pitch sequence.  Don't just run a regression.  Instead, try to understand the theoretical basis for component interrelationships.  And don't just say Player A had 2 more WAR than Player B so he's the better player.  Examine what A did differently to earn those extra wins.

I'd like to see us focus more on the field of play rather than just the numbers.  That's a strange thing for me to say, because I probably watch a lot less baseball than most of you due to my family commitments.  But we, at times, get so caught up in our calculations that we forget there's a lot outside those calculations too.  Some of those things might be meaningless to our evaluation of the game, but some of them might not be (catcher game-calling for instance) and we just haven't measured them yet.


Finally, I'd like to see a sabermetric community that doesn't become so technically advanced that the average person can't participate to a fairly deep level.  This might be a pipe dream, but I'll say it anyway.  If all of our measures are so far divorced from what the average person can understand, we've lost the battle.  We haven't reached that point yet, but we're within sight of the cliff. We may be forced off that cliff in order to make future advances, but it's not a step we should take lightly.

I know the field is going to continue to move forward, and is likely going to become more and more specialized. It's just the nature of the beast.  We've seen it time and time again in other arenas. And if it leaves me behind, so be it.  I've made my choices and I don't regret not spending more time studying statistics or physics (if only I were still in college).  I know this sounds like a retirement letter, but it's not that.  And it's not a excuse for why I haven't been writing as much over the past 6 months (the twins would be that excuse).  I've had these thoughts for a while but reading Sickel's article made me want to get them out of my head and into a forum for discussion. 

For those who actually made it through these 1500 rambling words, I'd love to hear your thoughts.

4 recs  |  Comment 20 comments |

Story-email Email Printer Print

Comments

Display:

I by and large agree with what you have to say

But I do think that there’s going to be a huge leap in analysis once Hit F/X and, more importantly, Field F/X become figured out. Even if the data themselves aren’t easily understood, they can be incorporated into metrics that tell us more than we’ve ever known about how players perform. First of all, Hit F/X+Field F/X will give us the fielding metric to end all fielding metrics; rather than relying on a stringer to classify balls in play as fliner, liner, grounder, etc., and mark where it lands, we’ll have exceedingly specific data on what exactly each sort of ball in play is. Perhaps more importantly, we’ll finally be able to incorporate data on fielder positioning, and see how much that contributes. Second of all, imagine Hit F/X creating, for example, a wOBA that gives a player extra credit for hitting a hard lineout. We’d be more able than ever to separate out performance and true talent. From what I understand, this technology is coming, and when it does, it’s going to shake everything up.

Brewers Baseball and other assorted nonsense (mostly the assorted nonsense) at my blog, What's a Tararrel?

by Lefti on Feb 16, 2010 11:28 PM EST reply actions  

Well that's kind of Dan's point isn't it?

Even if Hit f/x and Field f/x are released to the public, they will be needed to be made useful by a very specialized group of analysts. Guys who have at least a good understanding of Physics, statistics, writing code, etc. are really the only ones who will have the ability to make use of the data, and it becomes specialized because of that.

I think want Dan wants is for people to be looking for new ways to analyze the old questions. Since the F/X’s are a long way from being useful to the public, we need to be thinking of new and creative ways to look at baseball.

by vivaelpujols on Feb 16, 2010 11:42 PM EST up reply actions  

Pretty much agree with you.

There’s a way in which we’ve pretty much done the easy stuff. And so it is going to take folks that either have the patience to do really labor intensive heavy lifting, or people who have technical skills to make advances that aren’t easy for mere mortals to do. And often both will be true. This is why, I think, we’re seeing a lot of people with fairly advanced training making some of the bigger splashes these days (though, it should be said, that someone like Victor Wang made a big splash as a freshman/sophomore in college, so clearly it’s still possible to do important work without graduate-level training).

However, at the same time, I think the place where someone like myself can make a contribution is as an intermediary. I’m not a Rob Neyer or Dave Cameron communicator, necessarily. But pretty much everything I do is an application of what the first-tier researcher people are discovering and inventing. For example, I took the established player valuation work that folks like Tango and Patriot had worked out and put it together in 2008 to produce total value rankings—essentially WAR. This was a decent little contribution—gave people an accessible tool to evaluate players that was better on many levels than what BPro had to offer. Essentially zero of it was novel, but it was something I could do with pivot tables and vlookups in Excel.

FanGraphs was kind enough to do this themselves this past season (not saying they were copying—it was a void that needed to be filled, and they did a great job filling it), so I didn’t have to. But this freed me up to do the power rankings. I expect that at some point someone will come along and do that too, at which point I’ll cheer and probably stumble upon something else that I can do.

Anyway, the point is that the community is large enough that we don’t all have to be cutting-edge researchers these days (and you’re far more of a genuine researcher). There’s room for people like that; guys who have the training, time, and creativity to make real advances. But there’s a lot of readership for someone like me who does applied stuff too. And there’s still-more readership for people who take the applied stuff and communicate it to those who aren’t well-versed in what’s going on in this little field of ours.
-j

by JinAZ on Feb 16, 2010 11:39 PM EST reply actions  

I've given up on actually being much of a researcher

It is highly specialized and while I like to think I understand a good deal about math and statistics, a lot that’s said over on The Book Blog goes right over my head. And I love looking at really cool charts and heat maps, but I’m not nearly good enough with computers to make it happen.

Recently, I’ve started to view my role as one of those intermediaries. I’m not a Carson Cistulli or Dave Cameron, but I like to try my hand at being creative while using sabermetrics and logic. Sabermetrics are going more and more mainstream and as a result, this gives us more chances to show off our writing skills and to make people interested.

But yeah, it is a weird line to walk and I can understand your sentiments for sure.

"I never threw an illegal pitch. The trouble is, once in a while I toss one that ain't never been seen by this generation." - Satchel Paige

by Steve Slowinski on Feb 16, 2010 11:56 PM EST reply actions  

I think your work linking the FSR and UZR was actually needed.

People, including myself have been saying it for a while, but someone just has to do it.

- .-. ..- … – / – …. . / .—. .-. - .. . … …

by Jeff Zimmerman on Feb 17, 2010 12:11 AM EST up reply actions  

Man, where to start without writing 1500 words

I feel John’s frustration with Sabermetrics, but also with other aspects of baseball. Truthfully, the ease of getting data out to the public is the problem. I could really care less about the 50 different trade rumors a day or top 200 prospect lists and why number 102 should be at 98. I think everyone has what makes them tick.

I try to write about what I would like to read and if everyone doesn’t like it, too bad. I am usually trying to answer some question I have in the back of my mind that others aren’t answering. Now-a-days, most easy questions are answered. It will take quite a bit of time to get the new data (Victor Wong’s Work) or computer knowledge to bring data together.

If someone wants to make an impact, go to B-Pro’s list and try to answer the questions. I would love for a group of people some to do:

11) Creating a way to better analyze mechanics.

Get videos, collect comments, rate motion and see if the pitcher breaks down. Again, a lot of work, but would be pretty sweet.

- .-. ..- … – / – …. . / .—. .-. - .. . … …

by Jeff Zimmerman on Feb 17, 2010 12:10 AM EST reply actions  

I didn't read the Sickles article.

I value his contributions as a talent evaluator. And, frankly, he doesn’t need to know a whole lot about sabermetrics to do that. He needs to know about scouting, projection, and basically what makes a valuable player (i.e. he needs to know that making outs is bad and that a player’s BABIP fluctuates a lot), he doesn’t need to be keeping up with the latest advances in pitch F/X findings or the new SIERA formula to do his job, though. I really don’t care how involved he is with sabermetrics. That’s not to say I don’t value his thoughts on the matter—and I will eventually read the article—but as long as he doesn’t do a complete 180 and start saying things like “RBI ROOLZ!”, I’ll still value his contributions as a talent evaluator, no matter if he decides to quit following sabermetrics or not.

I think these two paragraphs:

I’d like to see a re-emphasis on creativity; creativity while asking the question, creativity while studying the data, and creativity while writing. Don’t just ask how much movement this pitcher’s fastball has. Instead, wonder whether he can control the amount of movement based on his pitch sequence. Don’t just run a regression. Instead, try to understand the theoretical basis for component interrelationships. And don’t just say Player A had 2 more WAR than Player B so he’s the better player. Examine what A did differently to earn those extra wins.

I’d like to see us focus more on the field of play rather than just the numbers. That’s a strange thing for me to say, because I probably watch a lot less baseball than most of you due to my family commitments. But we, at times, get so caught up in our calculations that we forget there’s a lot outside those calculations too. Some of those things might be meaningless to our evaluation of the game, but some of them might not be (catcher game-calling for instance) and we just haven’t measured them yet.

Are exactly right.

http://www.capitolavenueclub.com/

by PWHjort on Feb 17, 2010 12:27 AM EST reply actions  

I don't see how anyone could stop respecting Sickels' work for this

He’s still clearly one of the better prospect evaluators out there, and his stance is quite understandable given the direction of sabermetrics.

I really liked the post, Dan, and I agreed with a whole lot of it. I get concerned sometimes that sabermetrics is becoming so bogged down into the complex statistics required to find the accuracy we desire that it’s losing touch with really interacting with the game itself on anything other than a numeric level. I love the numbers, but there’s a point where posts hardly sound like they’re about baseball at times.

Obviously the dirty work needs to be done, but being able to present it on a level that’s helpful and understandable to the common fan is going to become increasingly important. Because most fans either don’t have the time or aren’t interested in trying to really understand many of the more complex statistics at anything more than face value.

I’m not really worried because they’re are so many talented and smart guys contributing to the effort, but I think that it’s good time to maybe take a step back and reexamine exactly what the goals of the community are going to be going forward.

I like baseball.
I write for Beyond the Box Score and The Hardball Times Fantasy

by Satchel Price on Feb 17, 2010 9:33 AM EST up reply actions  

I thought for a second you were talking about my demented ramblings

by Dan Novick on Feb 17, 2010 1:49 AM EST reply actions  

Kinda sorta

I’ve commented a few times in the past, but I was met with huge amounts of frustration. I commented this one time on Driveline Mechanics (coincidentally), and immediately after doing so, tons of my saved info on the web was deleted. Facebook wouldn’t sign in automatically, my info on other blogs was deleted, and even some passwords became unsaved. I still haven’t figured out what I did, or how to fix it, but I’m over it now.

Anyway, I just made a new account using the facebook connect thing, which was ridiculously easy to do.

http://www.hardballtimes.com/

by Dan Novick on Feb 17, 2010 2:46 AM EST up reply actions  

...

“SBNation: if you try to comment, your saved passwords and info on other sites will be deleted!”

by vivaelpujols on Feb 17, 2010 3:17 AM EST up reply actions  

pretty sure

Dayton Moore had his techies come up with they virus

No word on how much they offered john, though.

I'm not a sabermetrician, but I do play one at FanGraphs.

Can't get enough of me? Check out my Twitter feed.

by Matt Klaassen on Feb 17, 2010 7:43 AM EST via mobile up reply actions  

I'm new.

So this is interesting to see.

I don’t know what a linear weight is. I don’t really understand regression. At least I don’t think I do. Things like Pitch F/X trajectories scare the crap out of me.

I think one of the reasons I focus on analyzing past performance is that it is what it is. It’s kind of comforting. It’s not too often that a brand new stat comes along and changes everything (like WAR), so when it happens it is exciting. In the field of projecting performance, it seems there’s a new metric every few hours.

I’m a daddy of three kids five and under. Writing about baseball is my therapy. I can’t keep up with all the new metrics. Don’t get me wrong, I love reading about them, but I don’t have the time or ability to apply them.

This one stood out:

Finally, I’d like to see a sabermetric community that doesn’t become so technically advanced that the average person can’t participate to a fairly deep level. This might be a pipe dream, but I’ll say it anyway. If all of our measures are so far divorced from what the average person can understand, we’ve lost the battle.

WAR and it’s components seem to help this, actually. I love showing a Cardinals fan who doesn’t know advanced stats beyond OPS+ that YES, Jim Edmonds was as good in the field as you thought he was… YES, Yadier Molina is a monster behind the plate… YES, Ozzie Smith made up for his offensive shortcomings—and then some—with legendary defense.

It can be made even more accessible—especially the whole part about positional adjustments and replacement value. I’m hoping to simplify that in the near future.

Great post, Dan.

by adarowski on Feb 17, 2010 7:42 AM EST reply actions  

Well Thought Out

Dan,

Thanks for the great post here. While you claim it’s a rant, I think it’s one of the most well thought out posts on this subject (others have been fairly offputting or rude, despite Sickel’s polite remarks simply about his opinion/preferences in baseball analysis). I love the lead in as well, and you seem to recognize where the problem could lie.

I find almost all analysis informative in some way. Some is boring (short little studies I’ve done myself are admittedly boring as hell), but just skimming those pieces and looking at the conclusion usually points me in some new direction. However, I think you hit it on the nose when you discuss the creativity issue. Because Sabermetrics is often performed as a hobby, it is in many instances used to feed intellectual curiosity. This is great, because I have my own curiosities; however, in the scheme of the reason for sabermetrics and applying it to baseball, the ‘WHY’ seems to have been lost in many articles. Convince me that you research is the most important thing to come about in the last 100 years of baseball.

We have this really neat data Pitch F/X. We can plot it and see what’s going on. But I have yet to see a great reason WHY we do this communicated to readers (I can think of plenty of my own reasons, but focusing on this part of the analysis would be very beneficial). In most places, especially in a business where application is king, the WHY is the most important part. If it cannot lead to being useful in the workplace/ballfield, then people are going to ask what the point is. I think that effectively communicating this simple point would go a long way in disillusioning some from the area. It’s difficult to spark new ideas without knowing why we would want to do them. Everyone has bad ideas, and many more have great ones. Knowing the reason for the analysis may help to dissimenate one from the other.

If I’m readings new things about PCA on batting and pitching skills, I want to know why this is helpful (I actually really enjoy these multivariate techniques). I would really like to know how Pitch F/X can be applied and integrated into coaching and on field performance. I want to use UZR to position my players correctly. Once I do this, the technical stuff becomes much more interesting to me. Once I know the question (what), I want to know how it applies (why), and if I’m convinced, I definitely want to know the ways in which it was calculated (how).

Again, thanks for the level headed and polite post. I truly enjoyed it.

by BMMillsy on Feb 17, 2010 9:45 AM EST reply actions  

Not sure how much I can contribute here

Though I agree the field is getting much more nuanced and specialized and thus loses some of it’s charm and accessibility.

I will say one thing however – if anything is going to jump start the field and force it to evolve, it’s going to be a concert of 1500-word rants venting frustration over the current trajectory of sabermetric advancement.

"These are thin mints. I put them in the freezer. My favorites. So good."
--Reds outfielder Adam Dunn, on the girl scout cookies he keeps in his locker

by Resolution on Feb 17, 2010 12:54 PM EST reply actions  

Comments For This Post Are Closed


User Tools

We use numbers and stuff.
Community Guidelines
Why be a member?
Start posting on Beyond the Box Score »

Join SB Nation and dive into communities focused on all your favorite teams.

Connect_with_facebook

FanPosts

Community blog posts and discussion.

Recent FanPosts

Small
Real Dirty Mets Looking for a saberminded author
Baseball_small
WAR By Decade: 1871-1879
Prosser_small
Cliff Lee: No longer invincible
Paige_small
Kelly Johnson Cleared Waivers; I Think That's Weird
Jeter_06_world_series_small
Top 10 players to start a franchise with revised.
Ballgame_2006_vs_texas_revised_small
The Myth of the Spoiler Returns
Small
Denard Span's Strikezone
Small
Matusz: Danks 2.0
Paige_small
I Think I Offended Juan Pierre
Leopold_butter_scotch_southpark_small
HOF/PED Quandry

+ New FanPost All FanPosts >

Sign up for the BtB Newsletter!

FanShots

Quick hits of video, photos, quotes, chats, links and lists that you find around the web.

Recent FanShots

Of course, historical trends do not always hold, as has been the case for Secret Sauce since ~2002. Take a look at the success of the Secret Sauce favorites in head-to-head competition since the playoffs expanded to eight teams in 1995.

Continue reading Weak Sauce? Secret Sauce's Predictive Capacity Wanes in Recent Years
2010 Fans Scouting Report
Plate discipline trends
What's Wrong With Mike Pelfrey?
Lightest Players in History (min 1000 PA or 500 IP)
Statistical Head Scratchers: The Sacrifice Fly
Adam Wainwrights Curve
Jose Batista Facts
A PitchFX look at how R.A. Dickey is able to change speeds with his knuckleball to be so effective
Out Rate: a simple new upgrade on OBP

+ New FanShot All FanShots >

BtB on Facebook

BtB on Twitter

RSS Feed: @BtBScore

Sky: @BtB_Sky

Jeff: @jeffwzimmerman
Steve: @steve_sommer
Dan: @dturkenk
Harry: @harrypav
Jinaz: @jinazreds
Jack: @jh_moore
Tommy R: @trancel
Justin: @justinbopp
Satchel: @SatchelPrice
Adam: @baseballtwit
Larry: @wezen_ball
Peter: @CapitolAvenue
Paul: @TheDiaTribe
Daniel: @CamdenCrazies
Matt: @devil_fingers

SBNation.com Recent Stories

SEATTLE - JULY 10:  Starting pitcher Felix Hernandez #34 of the Seattle Mariners celebrates after defeating the New York Yankees 4-1 at Safeco Field on July 10 2010 in Seattle Washington. (Photo by Otto Greule Jr/Getty Images)

MLB Power Rankings: On The Challenge Of Identifying A League's Best Pitcher

ANAHEIM CA - SEPTEMBER 08:  Jeff Mathis #5 of the Los Angeles Angels of Anaheim is mobbed by teammates after hitting a  walk off sacrifice fly to score Torri Hunter form third base against the Cleveland Indians in the 16th inning on September 8 2010 at Angel Stadium in Anaheim California.   The Angels won 4-3 in 16 innings.  (Photo by Stephen Dunn/Getty Images)

Mathis' 16th-Inning Sac Fly Lifts Angels Over Indians 4-3

Philadelphia Phillies' Jimmy Rollins, left, slides into home to score past the tag of Florida Marlins catcher Brad Davis on a single by Carlos Ruiz in the third inning of a baseball game, Wednesday, Sept. 8, 2010, in Philadelphia. (AP Photo/Matt Slocum) +2 updates

Phils Top Marlins 10-6, Jimmy Rollins Leaves Game With 'Hamstring Tightness'

More from SBNation.com >


Managers

Limes_125_small Sky Kalkman

Wbc_029_small Jeff Sullivan

Editors

Rawlings_baseball_bigger_small Dan Turkenkopf

Dayton_small Jeff Zimmerman

Aviles_small Justin Bopp

Paige_small Satchel Price

Authors

Jinaz-reds-avatar_small JinAZ

Face_small Harry Pavlidis

Newavatar_small Matt Klaassen

Wezenball-logo_small lar

Big_pun--300x300_small Tommy Rancel

Adam_small adarowski

Redcap_small SFiercex4

St_louis_cardinals_ce1141_003263_small stevesommer05

Small garik16

Julio_teheran_2_small PWHjort

Cclogo_small Daniel Moroz

Closeup4_small J-Doug

Nick_cage_small The DiaTriber