The Curse of Jerry Hairston, Jr./Eric Hinske:
 








RSS 2.0 Atom

*ADVERTISEMENT*
Our new URL is: http://www.rlyw.net
*ADVERTISEMENT*

*ADVERTISEMENT*

image
Way back in the 20th century, Bill James wrote the first essential book about baseball managers. Chris Jaffe has just written the second.
- Rob Neyer, ESPN.com

From now on, whenever I have a question about a manager, Jaffe's book will be the first and last one I reach for.
- Sean Forman, Baseball-Reference.com


*ADVERTISEMENT*

*ADVERTISEMENT*
John Brattain Memorial Fund

The Hardball Times has set up a memorial fund for John Brattain's family. He left behind a wife and two teenage daughters.

Four years ago, I found from personal experience how generous the online community can be to its own in their hour of need. I am now literally begging you to be even more generous than you were to me.


*ADVERTISEMENT*

*ADVERTISEMENT*

*ADVERTISEMENT*

*ADVERTISEMENT*

*ADVERTISEMENT*


Does Robinson Cano’s Approach Change With Men on Base?
(50 Comments - 1/26/2010 10:44:25 am)

2010 CAIRO Projections v0.2
(14 Comments - 1/25/2010 10:56:33 pm)

One Of The Following Stories May or May Not Be True
(26 Comments - 1/25/2010 1:51:23 pm)

What Happened to Wang?
(13 Comments - 1/24/2010 11:53:14 pm)

NY Times - Glanville: Seeing is Disbelieving
(62 Comments - 1/24/2010 9:27:27 pm)

RealGM Baseball: Yankees Among Teams Interested In Edmonds
(3 Comments - 1/23/2010 4:52:40 pm)

Should Jesus Montero Be an Option for Left Field?
(65 Comments - 1/22/2010 10:24:20 am)

CAIRO Projected 2010 AL East Standings as of January 16
(35 Comments - 1/21/2010 2:53:01 pm)

MLB.com - Bauman: Yankees appear stronger
(18 Comments - 1/21/2010 5:21:26 am)

TSBG Versus High and Low Fastballs
(5 Comments - 1/20/2010 9:00:27 am)



Player

Current Projected

Look what people have to say about Larry Mahnken's commentary!

"Larry, can you be any more of a Yankee apologist?.... Just look past your Yankee myopia and try some objectivity."
- Bernal Diaz

"Mr. Mahnken is enlightened."
- cordially, as always,
rm

"Wow, Larry. You've produced 25% of the comments on this thread and said nothing meaningful. That's impressive, even for you."
- Anonymous

"After reading all your postings and daily weblog...I believe you have truly become the Phil Pepe of this generation. Now this is not necessarily a good thing."
- Repoz

"you blog sucks, it reeds as it was written by the queer son of mike lupica and roids clemens. i could write a better column by letting a monkey fuk a typewriter. i dont need no 181 million dollar team to write a blog fukkk the spankeees"
- yan

"i think his followers have a different sexual preference than most men"
- bob

"Boring and predictable."
- No Guru No Method

"Are you the biggest idiot ever?"
- Randal

"I'm not qualified to write for online media, let alone mainstream media."
- Larry Mahnken



This site is best viewed with a monitor.

Disclaimer: If you think this is the official website of the New York Yankees, you're an idiot. Go away.


Monday, September 15, 2008

Putting Linear Weights in Context

As most of the readers here are probably aware, my favorite method for evaluating offensive performance is using batting runs as calculated by linear weights.  Linear weights are flexible, thorough, and can be easily manipulated to compare players to average players, replacement-level players, and players who play the same position.

The key to linear weights is understanding that every positive or negative outcome has a commensurate run value.  Since runs are the key to winning baseball games, linear weights gives us what I think is the bulk of the information we need to look at why teams score or don’t score, which then tells us why they win or lose.

However, this year, the Yankees have exposed some of the flaws in using a context-neutral statistic like linear weights when looking back at past performance.  The normal linear weights formula assume that every single is worth a certain amount, every double is worth a certain amount, every out is worth a certain amount, etc., However, we all know this isn’t really true in real life. A single with runners on second and third is worth more than a single with the bases empty, but in the normal linear weights formula, they’re treated equally.

That’s fine if you are trying to assess the skill level of players, but it’s not quite as useful when looking back at results retroactively.  I messed around with the Hardball Times’s version of clutch as defined by Bill James but reader feedback and some more thinking led me to realize that it’s fairly limited.  So then I looked at some situational splits using wOBA which I liked better, but that’s more of a rate stat and less useful when trying to look at what it means as far as runs.

I am still morally opposed to WPA because I think it’s far too timing and teammate dependent.

Thankfully, there’s a way to make linear weights more useful in retroactive analysis by contextualizing them.  To do this, I’m going to use the information from this study by Tango Tiger called linear weights by men on base.  The subject here is coincidentally enough how linear weights value differ based on the different baserunner states.  You could take this out to a more precise extreme by factoring in the out states but I think that’s a lot of extra work for very little additional payoff.

So what I did is use the matrix from the article linked above and split out every hitter’s performances in the eight different baserunner states.

-No one on
-Runner on first
-Runner on second
-Runner on third
-Runners on first and second
-Runners on first and third
-Runners on second and third
-Bases loaded

If you look at the chart in the linked article, you can see for example that a single with no one on base is worth .29 runs, and a single with the bases loaded is worth 1.38 runs.  So I ran the splits for each of the eight situations above for 2008, and calculated the batting runs per out for each of the eight splits for each player as well as for the average player. Then players are compared to the average player in each splits and their batting runs above average for each of the eight splits are added to get their contextual linear weights value.  This is compared to their context-neutral linear weights to get a difference.  The bigger the difference, the more productive a player in situations with runners on base.  I’m going to use comparisons to average here instead of comparisons to replacement level, and I’m not position-adjusting for now. 

One other thing I like about this is it accounts for the fact that strikeouts are more damaging in certain situations.  With none on, a strikeout is no worse than any other out (-0.2 runs).  Put a runner on third though, and a strikeout is worth -0.48 runs compared to -0.29 for other types of outs.

One issue I have with this data is the data source I’m using (David Pinto’s Day by Day Splits Database) does not differentiate performance for players who’ve played on multiple teams, so I am going to exclude people like Xavier Nady and Richie Sexson in this initial run through, although I can modify my start and end dates to include just their Yankee tenure later on.

So, without further ado, here’s what this measure says for the players who have only seen MLB time with the Yankees this year.

None on: Batting runs above average with no one one base.
1—: Batting runs above average with a runner on first.
-2-: Batting runs above average with a runner on second.
—3: Batting runs above average with a runner on third.
12-: Batting runs above average with runners on first and second.
1-3: Batting runs above average with runners on first and third.
-23: Batting runs above average with runners on second third.
123: Batting runs above average with the bases loaded.
cTotal: contextual linear weights total (all 8 of the above added together)
cnTotal: context-neutral linear weights batting runs above average.
Diff: cTotal - cnTotal

A positive difference between cTotal and cnTotal indicates a player was more productive with runners on base.  Alex Rodriguez has been less productive with runners on base this year, and that’s indisputable. However, he’s still been the most valuable offensive player on the Yankees.  His critics will ignore that and cherry-pick the numbers that “prove” their point, but they’re wrong. 

Bobby Abreu has been more productive this season than his raw numbers show. His defense still stinks though.

Jason Giambi’s hit poorly with runners in scoring position, but he’s done well with runners on first and overall he’s not been as bad with runners on as his RISP numbers make him look.

Johnny Damon, Hideki Matsui, and Derek Jeter haven’t been much different with runners on vs. not.

Jose Molina, Melky Cabrera, and Robinson Cano have been equal opportunity suckers for the most part, although Cano’s actually been about five runs worse than his context-neutral numbers show.

Just looking at this set of Yanks, we can see that they’ve fallen around 19 runs worse than they should have so far this season given their actual YTD performance and how it’s translated to runs in linear weights.  Their context-neutral linear weights runs above average are 44.5, but when you contextualize it they drop to 25 runs above average.

In actuality the Yankees have been only 8 runs better than average, so we still have to account for 17 runs.  If we add in Pudge Rodriguez (-6), Alberto Gonzalez (-7), and Xavier Nady (+5), that’s another -8.  So we still have nine or so missing runs, which could just be double plays and/or baserunning issues, or a limitation in this model.

Bear in mind, this is above and beyond their actual underperformance compared to their pre-season projections, particularly for Cano and Cabrera.

The other thing this may be useful for is MVP balloting, if I had a vote.  Which I don’t.  Here’s a look at the MLB leaders in contextual linear weights.

Remember that this is strictly offense and not position-adjusted.  Albert Pujols is just a monster.  What’s interesting is the gap between him and Lance Berkman narrows if we contextualize it, although Pujols is still having the better season and this ignores Pujols’s significant defensive edge.

Manny and Teixeira’s numbers are spread across leagues, so they’re not realistic MVP candidates.

The name that pops out to me here is Justin Morneau.  His raw stats make him look like a bad MVP candidate, but a lot of intelligent Twins fans insist he’s the Twins’ MVP, and if you look at the shape of his production it makes sense.  He’s a barely average performer with no one base, but get runners on and he’s above average in just about every split.  He’s neck and neck with Kevin Youkilis, who had a slight edge defensively last I checked.

As with other methods I’ve mentioned or read about, I don’t necessarily think this is an ideal way to look at the issue of clutch either, but it may add something to the discussion.

--Posted at 2:05 pm by SG / 64 Comments | - (396)



Page 1 of 1 pages: