A few months ago, I introduced the first iteration of CAL, the comparison and likeness prospect classification system I derived based off Bill James’ Similarity Score. In a nutshell, the algorithms try to provide a better context to evaluate minor leaguers by determining similarly performing players (based on age, position, level of competition, specific points of production, etc.). Since then I’ve tweaked the formulas, made a few bold claims – such as Maikel Franco’s High Bust Potential — and rolled out the updated results in my book, The 2015 Prospect Digest Handbook.
With the handbook firmly out of the way, and the subsequent writer’s burnout squarely in my rearview mirror (I think), I’ve spent the last couple of months developing not only a MLB-version of CAL, but a projection system based squarely off of a player’s top comparables. And the initial results — and I say initial because I’m sure I’ll continue to tweak the formulas in an effort to improve the results — appear to be encouraging.
First, though, the process, some notes, and a little extra commentary. And, of course, I’d love to hear any feedback — positive or negative. If you’d like to skip through the verbage just jump to the last paragraph and click on the hyperlink to view all the projections.
• The CAL database was constructed from FanGraphs MLB statistics, beginning with the start of the Live Ball Era (1920).
• The system uses a player’s previous three seasons. Or in the case of younger players, whatever the available sample size (i.e. if a player is entering his third season, it will use the previous two years)
• After CAL determines a player’s top comparables, I developed a system of projection algorithms of varying weights.
• And, finally, to check the accuracy of the projection system I used statistics from 2011-to-2013 to project what they would have done in 2014. One additional note: the only 2014 statistic I did not project was plate appearance totals in order to correctly gauge the system’s accuracy.
Now for some of CAL’s biggest hits and, of course, digging into some of the bigger misses as well.
Heading into the 2014 season, the All Star center fielder’s top comparables, according to CAL, were a pair of Hall of Famers (Dave Winfield and Goose Goslin), another pair of high-performing All Stars (Roy White and Bobby Abreu), and former Pittsburgh outfielder Jim Russell. And here are McCutchen’s 2014 projections based off those five players:
Heading into last season Carpenter was coming off back-to-back seasons in which he hit a combined .310/.383/.475 through more than 1,000 plate appearances. And yet, his production took a rather noticeable downturn in 2014 — he batted .272/.375/.375. CAL was able to sniff out his decline:
CAL linked the former can’t-miss-superstar-turned-bust-turned-one-of-the-better-players-in-baseball to four All Stars (Al Smith, Hunter Pence, Sam West, and Ken Griffey Sr.) and Derek Bell, who hit .334/.384/.442 in 1995 but failed to make the Midsummer Classic. For his part, Gordon was coming off of a bit of down season after two stellar campaigns; he slugged .298/.372/.478 in 2011-2012 and followed that up by hitting .265/.327/.422. CAL, however, wasn’t too fond of Gordon’s bounce-back potential:
And here are two of CAL’s biggest misses:
The Orioles slugger set the world ablaze by bashing 95 extra-base knocks en route to slugging .286/.370/.634 in 2013. He promptly followed that up with a paltry .196/.300/.404 triple-slash line. His top comparables heading into his disappointing season: Richie Sexson, Danny Tartabull, Craig Wilson, Mo Vaughn, and Nate Colbert.
Ouch. CAL didn’t think Davis would be able come close to even matching his career year, but it definitely didn’t think he’d crash as far as he did.
It took a long time for the former Johan Santana-trade-centerpiece to develop into the perennial All Star candidate that we recognize today. Gomez batted just .247/.294/.379 through his first 2,100+ plate appearances, including his .260/.305/.463 triple-slash line in 2012. His top comparables heading into last season: Claudell Washington, Andy Van Slyke, B.J. Upton, Sammy Sosa, and Mike Davis. And here’s his projection:
Obviously, the CALs range from very, very good (Sammy Sosa) to very, very poor (B.J. Upton). Gomez was, in CAL’s opinion, pretty volatile.
For the rest of the projections click HERE. I’d love to hear some feedback — good, bad, or indifferent — so let me know what you think.
All statistics courtesy of FanGraphs.
For more analysis check out Joe Werner's site: ProspectDigest.com. You can follow him on Twitter at @JoltinJoey.
Check out Joe's book, The 2015 Prospect Digest Handbook, click HERE.