A Survey of Baseball Player Performance Evaluation Measures

John F. Jarvis

Assigning numerical measures to various aspects of a ball player's performance is a convenient and perhaps useful, if not completely defensible, practice. A numerical measure certainly eases the problem of rank ordering players. Hitting measures such as batting average and it's cousins are the best known known of these measures. The ease and finality of such comparisons guarantees that these measures will always be with us even if what is being ranked is not entirely clear. So, if we are going to do this, we should use the best measures available. My purpose with this paper is to to systematically compare a number of well known and a few less well known measures that have been proposed to summarize a player's offensive capabilities. This is not a new activity. Both the Bennett&Flueck paper and HGB book contain such analysis.

The starting point for any player evaluation measure are the counting statistics collected over some period of time, perhaps a season or a career. Examples of counting statistics include the number of stolen bases, number of hits and number of assists on defense. Any event that be adequately described in baseball can be counted and the resulting number potentially can be used as a component part of the calculation of a player evaluation measure.

Intuitively, if a measure does a good job of rating the offense, it should also work well on defense. A secondary purpose of the paper is to show that offensive evaluation measures are equally applicable to the defense, if the same counting statistics are provided.

There are two types of measures in common use. The first form is the linear weights method which is a linear combination of basic counting statistics using empirically determined coefficients or weights. The second, and more common method is to define a complex and usually nonlinear combination of the basic quantities (various sums, products and ratios) that empirically reproduces individual or team runs.

The data used in the following discussions are the 1286 team season records derived from full season event files for both leagues for the 1956 to 2007 seasons obtained from Retrosheet, Inc. I have written a parser for these data sets and it is used to provide the team summary data used in the following calculations.

A formula where each coefficient appears linearly can have these coefficients determined by the linear regression procedure (Numerical Recipes in C, 2nd Edition, 1992, Chapter 15). As an example, consider the following simple linear weights formula:

RUNS = W0*TotalBases + W1*(Walks+HitByPitch)+ W2*(Ab-hits)+W3*Errors

This particular linear weights formulation will be referred to as BWOE which is suggested by its component parts, Bases, Walks plus hit by pitch, Outs and Errors.

The linear regression procedure minimizes the sum of squared differences between the actual runs scored by a team and the value estimated from the formula, the chi-square value, by determining the 4 coefficients, W0 - W3. For obvious reasons this procedure is also called Least Squares Fitting. The regression coefficients, Wi, effectively give the number of runs each single event contributes to the team runs. The following table shows the results of the regression based on these four terms.


param weight average contrib 0 BASES 0.3761 2141.9 805.53 1 BB+HP 0.3125 513.7 160.53 2 AB-HITS -0.0943 4008.6 -378.09 3 ERRS 0.6222 179.3 111.57 Table 1. Four Parameter Linear Weights Formula, "BWOE"

The square root of Chi Square (the sum of the squares of differences between the data value and calculated value from the regression) divided by the number of data points (1286) is the Standard Deviation and is abbreviated STD. Smaller values of the STD or Chi Square indicate a better representation of the actual runs scored. The standard deviation for this regression is 22.35 runs. Column "average" is the per season value for the parameter averaged over the entire data set. "contrib" is the weight for the parameter times the average number thus represents the number of runs due to the particular parameter. Table 4, which is a summary of the various runs estimators, also includes this result.

To achieve the best results, defined here as the smallest STD (18.8 runs) for the regression, a number of additional counting statistics are used in Table 2. One of the advantages of using the full season events files is the ease of determining non traditional events counting statistics. Three such statistics have been defined and determined. ROH is the count of runners out on a hit, either base runners or the batter trying unsuccessfully for an additional base. RAO is runner advance on an out which includes the traditional sacrifice. Official scoring only credits a sacrifice when there is intent on the part of the hitter. Advances on an out are much more common than official sacrifices suggest. Any base advance increases the potential for scoring thus all such events must be counted. Similarly RSO, Runs Scored on an Out, generalizes the sacrifice fly. ER_BF and ER_RA are the number of times the batter is safe on first and the number of times a runner was able to advance on an error. The OUT category includes outs that are not counted in other categories (CS, ROH, second out on a GDP, K). The meaning of the other counting statistics should be obvious.


param weight average contrib 0 OUTS -0.0997 3030.6 -302.27 1 SNGL 0.4768 982.1 468.29 2 DOUB 0.6309 245.8 155.07 3 TRIP 1.0034 34.9 34.97 4 HRUN 1.4404 140.9 202.98 5 BB+HP 0.3327 513.7 170.93 6 IBB 0.1562 48.7 7.62 7 SB 0.0625 97.7 6.11 8 CS -0.1701 56.6 -9.63 9 ROH -0.3239 26.3 -8.52 10 GDP -0.4404 149.2 -65.71 11 ER_BF 0.5976 73.5 43.91 12 ER_RA 0.3062 105.8 32.41 13 RAO 0.0203 196.8 3.99 14 RSO 0.6656 82.1 54.66 15 K -0.1046 910.2 -95.23 Table 2. The 16 Parameter Linear Weights Formula, "ALL"

Table 2 provides considerable insight into the value of these events. It is a reasonably complete list of offensive events. Stolen bases only make a modest contribution to team runs. Each caught stealing event has about three times the influence as a stolen base. Errors of any kind contribute significantly to the offense. Errors allowing the batter to advance to first are somewhat more beneficial to the offense compared to errors that allow a base runner to advance. Fielding outs and strike outs, K, have approximately the same influence on the outcome of a game. BB includes only unintentional bases on balls as intentional bases on balls are included as a separate term (IBB). The much smaller weight for IBBs compared to BBs suggests that the practice is a useful one for the defense even if the net results is to slighly increase runs scored. The category OUT does not include outs included in other categories: K, CS, RSO and the second out in GDB. The weight of RSO (Run Scored on an Out) is less than one as the negative cost of the out must be accounted for. A linear weights formula can provide great insight into the importance of particular events as well as providing a means for measuring the offense.

Table 3 shows the regression results for a group of commonly available statistics. As might be expected the STD for this lists is between what is shown in Tables 1 and 2.


param weight average contrib 0 SNGL 0.4821 982.1 473.49 1 DOUB 0.7463 245.8 183.42 2 TRIP 1.1479 34.9 40.01 3 HRUN 1.4170 140.9 199.68 4 BB+HP 0.3354 513.7 172.31 5 AB-HITS -0.1143 4008.6 -458.31 6 ERRS 0.4963 179.3 89.00 Table 3. The 7 Parameter Linear Weights Formula, "Basic"

The methodology for this offensive measure survey, following closely the HGB, pp 57-61, and Bennett&Flueck presentations on the same topic, is to determine the constants SLOPE and INTERCEPT in the following linear regression formula:

RUNS = SLOPE*MEASURE + INTERCEPT

The resulting formula is used to evaluate the goodness of the fit by evaluating the Chi-square statistic over the entire team season data set. The direct linear weights measures ("ALL", "Palmer+errors", "BASIC", "BWOE") are also subjected to the linear fitting procedure even though they are, by definition, already linear regression formulas. BASIC is defined in Table 3. This results in slopes near 1 and intercepts near 0. If a constant term had been included in the linear weight formulas, the slope would be exactly 1 and the intercept would be 0. The linear fit also allows the calculation of the correlation coefficient which is useful for the comparisons. The linear fit can be considered a conversion of the measure into runs. The correlation and standard deviation results are intrinsic to the measure. The linear fit makes the relationship explicit.


rank measure slope intrcpt R^2 chisq std 1 All (Table 2) 0.9985 1.05 0.96602 455688 18.8 2 Palmer+errors 0.9976 1.68 0.96147 516666 20.0 3 Basic (Table 3) 0.9999 0.06 0.95761 568461 21.0 4 EqR(Davenport) 0.9788 13.99 0.95550 596740 21.5 5 LWTS(unnormed) 0.9976 2.19 0.95355 622977 22.0 6 XR(Furtado) 0.9781 27.90 0.95258 635888 22.2 7 BWOE (Table 1) 0.9959 2.94 0.95211 642185 22.3 8 OERA+EFBF 0.8765 16.00 0.95081 659659 22.6 9 ERP(Johnson) 0.9745 19.72 0.94869 688072 23.1 10 ERPA(B&F) 0.9585 45.29 0.94772 701159 23.4 11 RC(1998) 0.9327 54.35 0.94724 707554 23.5 12 RC(Tech-1) 0.9311 52.06 0.94704 710296 23.5 13 XRR(Furtado) 0.9746 33.60 0.94698 711105 23.5 14 Base Runs(Smyth) 0.9725 21.95 0.94672 714505 23.6 15 ERP-2(Johnson) 0.9619 33.57 0.94575 727597 23.8 16 LWTS(KenSchmidt) 0.8950 -49.49 0.94437 746060 24.1 17 OERA(Cover) 0.8971 56.07 0.94258 770004 24.5 18 OPA3(Pankin) 0.3961 5.92 0.94249 771226 24.5 19 LWTS(fixed) 0.9681 21.28 0.94023 801547 25.0 20 RC(basic) 0.9502 37.10 0.93906 817302 25.2 21 BOP(Codel) 0.2279 -115.92 0.93512 870070 26.0 22 AB*TA(Boswell) 0.2273 -74.22 0.92593 993326 27.8 23 AB*DX(Cook) 1.0186 35.28 0.92420 1016538 28.1 24 AB*BRA(OBP*SLG) 1.0478 29.13 0.92223 1042928 28.5 25 BWA(Cramer) 0.8558 12.49 0.91747 1106856 29.3 26 BASES+BB+HP 0.3234 -175.09 0.90876 1223655 30.8 27 OBP+(3/4)ISP(Rickey) 0.3975 -228.26 0.89667 1385705 32.8 28 AB*PRO 0.2436 -255.70 0.87792 1637251 35.7 29 BASES 0.3826 -119.90 0.87244 1710640 36.5 30 HITS+BB+HP 0.5054 -293.98 0.79233 2785052 46.5 31 AB*OBP 0.5822 -337.00 0.77103 3070726 48.9 32 ISP 0.6028 254.61 0.74852 3372555 51.2 33 OPS=SLG+OBP 1857.3121 -645.60 0.72660 3666465 53.4 34 TBR(O'Reilly) 2623.4273 -477.72 0.70977 3892244 55.0 35 HITS 0.6737 -246.09 0.69965 4027915 56.0 36 SLG 2512.9088 -294.04 0.68231 4260418 57.6 37 Palmer(normed) 0.0000 -2.31 0.00000 5642908 66.2 38 BA 5660.2216 -767.72 0.52688 6344919 70.2 39 AVERAGE 0.0000 699.60 0.00000 13410768 102.1 Table 4. A Survey of Baseball Offense Evaluation Measures.

It is necessary to distinguish between COUNTING stats, the number of times the particular event being counted occurred and RATE stats, a counting stat that has been divided by the number of chances used to get the particular counted value. The best known rate stat is BA, the batting average, defined as hits/atbats. Since a rate stat does not reflect the number of chances used it will not be a good predictor of counting stats. Since team runs scored is the counting stat being estimated in Table 4 the poor showing of the rate stats (BA, SLG, OPS and TBR) is not unexpected. The counting version of BA is total hits (HITS) which is a somewhat better predictor of team runs. Similarly compare SLG with BASES and TBR with BASES+BB+HP. In Table 4 several other rate stats have been multiplied by AB indicated by AB* in the measure column.

The simple AVERAGE of all the runs scored is given as this establishes a maximum for both the Chi-square and Standard Deviation thus provides a reference for evaluating the other measures. The column R^2 is the square of the linear correlation coefficient between the measure and observed runs. The conventional interpretation is that the correlation coefficient squared is the fraction of variance (Chi-square) in runs explained by the measure.

The first seven entries in Table 4 are linear weights formulas. ALL and BASIC are defined in the discussions of Tables 2 and 3. "Palmer+errors" is a linear regression that uses the same terms as the Palmer linear weights formula and an additional errors term. "BWOE", is a four parameter linear weights formula that summarizes all hitting with total bases instead of the four individual hit type counts (Table 1). Total bases in combination with at bats, errors and walks plus hit by pitch is a simple and effective measure for estimating team runs from basic hitting statistics. When used as an offensive runs measure for a player, the error term would be dropped. The component of these linear weights formulas leading to their excellent performance is the inclusion of a negative weight for outs made. The negative value of an out can be explicitly seen in Tables 1 - 3.

The four linear weights formulas that had their weights explicitly determined by regressions clearly provide the best representation of runs scored from the available team season data. Not surprisingly, including a larger selection of counting statistics results in a more accurate formula.

A Linear Weights formula determined from team season data should include an team errors term. This is evident as error contributions to runs scored are estimated to be around 10% of total runs allowed depending on the details of the particular formula being used. The effect of errors is larger than several other terms often used. The use of an explicit error term preserves the accuracy of the other terms in the regression.

OERA is the Cover and Keilers Offensive Earned Run Average. Their method determines the expected future runs for all 24 combinations of outs and base runners in an inning. OERA is 9 times the future runs expectation for no outs and no one on base. A separate document Implementing the Cover-Keilers Offensive Earned Run Average provides additional details of this measure including source code implementing its calculation. Expected Future Runs have been determined from the data used in this study and are compared to the OERA predictions. Since OERA was explicitly designed to be a player evaluator it did not include the effect of defensive team errors. Errors allowed a batter to reach first have a very similar effect as singles. Adding the number of such errors to the number of singles then doing the calculations is represented by the entry OERA+ER_BF. This combination provides a larger overestimate of team runs than the basic OERA but also correlates better with them.

Pete Palmer has introduced a method for evaluating the offensive contribution of a ball player known as linear weights. Three variations of this method are in Table 4, each starting LWTS. A key feature of Palmer's linear weights system is normalization. An average player or team will have a rating of 0 runs. This is accomplished by adjusting the weight of the AB-Hits term so that a league will have 0 runs for the season. A separate weight is required for each league and each season. This calculation is the (normed) variant in Table 4. The (unnormed) variant adjusts the AB-Hits weight for each league season to obtain agreement with total runs for the season. The (fixed) variant uses a single AB-Hits weight obtained by averaging the individual AB-Hits weights from the (unnormed) calculation. The (unnormed) calculation ranks among the better ones and (fixed) is only slightly inferior. The real problem is the very poor standard deviation shown by the (normed) calculation. The likely cause of this problem is the normalization step. The small standard deviations shown by linear regressions result from a very careful balance between each of the weighted terms. The normalization process requires a large adjustment of the AB-Hits weight which appears to corrupt the balancing need to produce a small standard deviation. In fairness to Palmer, he does not claim that his linear weights system is a simple linear regression. Still, the comparisons given here suggest that a different route to normalization is needed if an accurate assessment of player or team contributions is desired.

The other measures in the survey include Paul Johnson's ERP, Estimated Runs Produced, BJBA85 pp 277-281 which is also a linear formula. The Bill James Runs Created formulas Tech-1, RC(tech1), and basic, RC(basic), are defined in BJHBA(1987) pp 279 and 281. A newer version of Runs Created from the Bill James ML Handbook for 1998 is given as RC98 and is slightly more accurate than the older versions. BOP is Base Out Percentage by Barry Codell and is defined in BRJ79 pp 35-39. BWA is Richard Cramer's Batter Win Average, BRJ77 pp 74-79. TA is Tom Boswell's Total Average and is defined in TB5. Earnshaw Cook introduced gave us "Cooks Scoring index" abbreviated DX, HGB p 45. BRA is Dick Cramer's Batter Run Average defined as the product of On Base Percentage and Slugging Average. PRO is production which is defined as the sum of On Base Percentage, OBP, and Slugging Average. ISP is isolated power and is the difference between slugging average and batting average. Total Bases, total hits and OBP are also shown as hitting evaluation measures. Curiously, batting average and slugging average are not as effective a predictor of runs as are total hits and total bases. ERPA is from Bennet&Flueck. EqR is Clay Davenport's Equivalent Runs and is documented in the Baseball Prospectus. Jim Furtado's extended runs formulas are XR and XRR. OPA3 is from Mark Pankin's article in Operations Research. TBR = (BASES+BB+HP)/(AB+BB+HP+SF). While I have tried to find and cite the original references on the measures in Table 4,the best current source for such information is the Glossary of Statistical Terms in TB7.

Since the original posting of this paper, Ken Schmidt and David Smyth have sent me their evaluators which are identified by their names.

Specifically excluded from this analysis are any formulas that include runs batted in (RBI) or runs. Since runs are to be explained or predicted by the measures, using runs or RBIs in a measure is essentially circular reasoning (violates the need for linear independence of the quantities used in the regression).

Evaluating the offensive component of a baseball player appears to be straight forward. The Tech-1 version of Runs Created is an excellent measure and deserves the increasing usage it is receiving. However, a well crafted linear weights formula can provide the most accurate measure relating team counting statistics to team runs scored. A linear weights measure also allows evaluation of the same formula using individual player statistics without increasing uncertainty by more than the effects caused by the size of the statistical sample.

Application of the Offensive Evaluation Measures to the Defense

At present, the official statistical record for the defense is not kept to the same level of detail as for the offense. However, any quantity that can be defined and counted for offensive evaluation purposes can be defined for the defense. The very great advantage of working with full season events files is the ability to define and determine non traditional statistics. All the team related counting statistics used in the preparation of Tables 1 - 4 also have been determined for the defense. The same measure survey, using exactly the same methodology as described for the offense, was done on the defensive statistics with the results shown in Table 5.


rank measure slope intrcpt r^2 chisq std 1 All 0.9972 1.97 0.96597 470719 19.1 2 Palmer+Errors 0.9972 1.97 0.96191 526864 20.2 3 Basic 1.0000 -0.02 0.95666 599537 21.6 4 BWOE 0.9970 2.13 0.95337 645080 22.4 5 EqR(Davenport) 0.9826 11.23 0.95113 676023 22.9 6 XR(Furtado) 0.9879 21.12 0.94932 701070 23.3 7 LWTS(unnormed) 1.0174 -11.69 0.94923 702327 23.4 8 OERA+EFBF 0.8629 26.18 0.94827 715621 23.6 9 RC(1998) 0.9359 52.07 0.94491 762114 24.3 10 RC(Tech-1) 0.9345 49.64 0.94442 768930 24.5 11 ERP(Johnson) 0.9783 17.08 0.94385 776816 24.6 12 XRR(Furtado) 0.9811 29.18 0.94328 784671 24.7 13 Base Runs(Smyth) 0.9829 14.43 0.94245 796171 24.9 14 ERPA(B&F) 0.9664 39.92 0.94235 797480 24.9 15 ERP-2(Johnson) 0.9715 26.93 0.94229 798369 24.9 16 LWTS(KenSchmidt) 0.8966 -50.83 0.93873 847646 25.7 17 OERA(Cover) 0.8931 58.60 0.93822 854614 25.8 18 OPA3(Pankin) 0.3987 1.40 0.93782 860178 25.9 19 LWTS(fixed) 0.9857 8.92 0.93533 894675 26.4 20 RC(basic) 0.9478 38.56 0.93215 938637 27.0 21 BOP(Codel) 0.2304 -124.69 0.92865 987008 27.7 22 AB*TA(Boswell) 0.2327 -92.79 0.92641 1017966 28.1 23 BWA(Cramer) 0.8451 20.98 0.92293 1066096 28.8 24 AB*DX(Cook) 1.0193 34.68 0.91649 1155227 30.0 25 AB*BRA(OBP*SLG) 1.0452 30.67 0.91475 1179347 30.3 26 BASES+BB+HP 0.3297 -192.12 0.90512 1312547 31.9 27 OBP+(3/4)ISP(Rickey) 0.4034 -241.84 0.88987 1523463 34.4 28 AB*PRO 0.2453 -262.58 0.87156 1776827 37.2 29 BASES 0.3902 -136.19 0.85901 1950358 38.9 30 HITS+BB+HP 0.5068 -296.84 0.81357 2579005 44.8 31 AB*OBP 0.5870 -345.47 0.79327 2859775 47.2 32 ISP 0.6587 213.28 0.75078 3447608 51.8 33 HITS 0.6882 -266.38 0.73064 3726146 53.8 34 OPS=SLG+OBP 1849.0146 -639.59 0.72470 3808380 54.4 35 TBR(O'Reilly) 2699.3363 -511.79 0.71335 3965343 55.5 36 SLG 2579.9550 -320.60 0.67758 4460200 58.9 37 Palmer(normed) 0.0000 -2.31 0.00000 5868301 67.6 38 BA 5671.1721 -770.57 0.57312 5905231 67.8 39 AVERAGE 0.0000 699.60 0.00000 13833490 103.7 Table 5. Standard Evaluation Measures Applied to the Defense

All the discussions following Table 4 apply to Table 5. The standard deviations achieved and the relative rankings of the different measures are essentially the same using either offensive or defensive statistics.

There are two additional counting statistics available for defensive play analysis: put outs and assists. Put outs are redundant in the regressions since each out must be included in only one linear weights term. Assists are new information thus potentially could be expressed as a runs value by the regression. Table 6 shows the results obtained by adding assists to the set of parameters defining the linear regression, ALL.


param weight average contrib 0 OUTS -0.1024 3030.6 -310.46 1 SNGL 0.4785 982.1 469.95 2 DOUB 0.6752 245.8 165.97 3 TRIP 1.0912 34.9 38.03 4 HRUN 1.3969 140.9 196.84 5 BB+HP 0.3459 513.7 177.70 6 IBB 0.1113 48.7 5.43 7 CS -0.3025 56.6 -17.13 8 ROH -0.6138 26.3 -16.15 9 GDP -0.4422 149.2 -65.98 10 SB 0.1038 97.7 10.14 11 ER_BF 0.6289 73.5 46.22 12 ER_RA 0.3100 105.9 32.82 13 RAO -0.0701 196.8 -13.80 14 RSO 0.5734 82.1 47.08 15 K -0.1050 910.2 -95.55 16 ASSISTS 0.0167 1704.8 28.45 Table 6. A Linear Weights Determination for Defensive Statistics

Compared to the entry for ALL in Table 5, the Standard Deviation has been reduced, but only slightly (19.09 compared to 19.13). The total runs value of assists is +28.4 in a defensive linear weights measure they would be expected to lead to reduced runs allowed. Consequently, assists do not appear to be significant at this level of analysis and demonstrably can not be associated with a runs value in a linear weight formula.

It is possible to combine both the offensive and defense data sets into a single one and perform the same survey calculation. The relative rankings and standard deviations are essentially the same providing another demonstration that all events have the same meaning to both the offense and defense.

Conclusions

Linear weights is best considered a technique for generating an evaluation measure. Different combinations of parameters and differing sets of data will lead to different values for individual parameters in these formulas. When a judicious set of parameters is used, linear weights formulas appear to provide the most accurate evaluation measures available. In addition, a linear formula can be developed using one set of data, team season results for example, and be applied meaningfully to other tasks such as evaluating individual players.

The four term formula, BWOE, is a particularly good combination of accuracy and simplicity. In spite of its simplicity it is more accurate than any of the existing measures for predicting team runs for the data available to this study. When used for player evaluation only three coefficients and three readily available statistics are needed.

Errors must be accounted for when determining the coefficients in any, but especially a linear weights, evaluation measure. When the measure is applied to an individual player the error term is simply not used.

The relative ranking of offensive runs estimation measures from the basic counting statistics is essentially the same as given by Thorn and Palmer (HGB pp 58-59). The only significant difference between HGB and this paper is their linear weights result. In this investigation their formula yields standard deviations of about 24 runs when applied to single or groups of seasons having the same number of games. Offensive player evaluation is in good shape with several strong techniques vying for general acceptance.

When the same counting statistics are available for the defense as for the offense, the hitting measures do an equally good job of estimating runs allowed. All the methods used to rate offense are applicable to defensive evaluations. Assists do not improve the accuracy of linear weights formulas when they are applied to the defense.

References

"An Evaluation of Major League Baseball Offensive Performance Models", Jay M. Bennett & John A. Flueck, American Statistician Vol 37 No, 1 (Feb 1983), pp 76-82
Baseball Prospectus (Brasseys, 1998)
BJBAyy is the Bill James Baseball Abstract for year yy. (Balantine Books)
BJHBA is the Bill James Historical Baseball Abstract. (Villard Books, 1986)
BRJyy is the SABR Baseball Research Journal for year yy.
"Evaluating Offensive Performance in Baseball", Mark D. Pankin, Operations Research 26, #4, Jul-Aug 1978, pp 610-619
HGB is the Hidden Game of Baseball by John Thorn and Pete Palmer. (Doubleday, 1984)
"An Offensive Earned-Run Average for Baseball", Thomas M. Cover & Carroll W. Keilers, Operations Research, Vol 25, No 5, Sep. 1977
TB7 is Total Baseball, Seventh Edition. John Thorn, Pete Palmer and Michael, (Total Sports Publishing, 2001)

Revision History

June 13, 1998: Original Posting to the Web

August 6, 1998: 1980 season statistics added to the analysis. A completely new treatment of the Palmer Linear Weights system was used. A paper copy of this version of the document is on file with the SABR Archives.

January 3, 2000: Data from 1980 -1998 seasons was used in the analysis. A few additional evaluators were included and several references were added to the bibliography.

June 2, 2000. Data from the 1979 - 1999 seasons has been used in the analysis.

June 6, 2001. Data from the 1978 - 2000 seasons has been used in the analysis. The linear weights formula BASIC was added to the analysis.

July 4, 2002. Data from the 1969 and 1974-2001 seasons has been used in the analysis.

September 29, 2003. Data from 1963, 1967 and 1968 AL and both leagues in 1969, 1972-2002 has been used in the analysis.

August 7, 2006. All RETROSHEET available seasons from 1957 to 2005 are included in the analysis.

January 28, 2008. All RETROSHEET available seasons from 1956 to 2007 are included in the analysis.

April 4, 2008. An error is the counting of CS in OUTS was corrected. Only the OUTS and CS lines of Tables 4 and 6 are changed. No changes are made to the conclusions.

Back to the J. F. Jarvis Baseball page.


Copyright © 1998-2008, John F. Jarvis