Journal of the American Statistical Association
In this article, a nonparametric correlation coefficient is defined that is based on the principle of maximum deviations. This new correlation coefficient, RgB is easy to compute by hand for small to medium sample sizes. In comparing it with existing correlation coefficients, it was found to be superior in a sampling situation that we call "biased outliers," and hence appears to be more resistant to outliers than the Pearson, Spear- man, and Kendall correlation coefficients. In a correlational study not included in this article of some social data consisting of five variables for each of 51 observations, Rg was compared with the other three correlation coefficients. There was agreement on 8 of the 10 possible correlations, but in one case, Rg was significant when the others were not, and in yet another case, Rg was not significant when the others were. A further analysis of this data set indicated that there were three to six data points that were anomalies and had a severe effect on the other correlations but not Rg. Apparently, the statistic Rg measures association in a unique fashion. This different measure of association for real data is extended to a population interpretation and expressed in terms of the copula function.
In consideration of ties, this article suggests a randomization method and a computation of the minimum and maximum possible correlation values when ties are present. These ideas are illustrated with an example.
Critical values of Rg and enough examples are included so that this new statistic can be applied to data. The success that we have had with the use of Rg in hypothesis testing suggests that Rg may have important applications wherever robustness is desired.
© 1987 American Statistical Association