[R] 1-Pearson's R Distance

William Revelle lists at revelle.net
Thu Nov 27 19:13:21 CET 2008


At 5:00 PM +0100 11/27/08, Claudia Beleites wrote:
>Hi Rodrigo,
>
>afaik, (1 - r_Pearson)/2 is used rather than 1 - r_Pearson. This gives a
>distance measure ranging between 0 and 1 rather than 0 and 2. But after all,
>dies does not change anything substantial.
>see e.g. Theodoridis & Koutroumbas: Pattern Recognition.
>
>I didn't know of the proxy package, but the calculation it straightforward
>(though a bit wasteful I suspect: first the whole matrix is produced, and
>as.dist cuts it down again to a triangular matrix):
>
>as.dist (0.5 - cor (t(x) / 2))
>
>Take care wheter you want to use x or t(x).
>
>HTH Claudia
>

 From the law of cosines, d = sqrt(2(1-r)) is a somewhat more 
appropriate transformation of a Pearson correlation to a distance.

Although this is monotonically related to the (1-r)/2, by taking the 
square root  it will lead to somewhat different solutions in 
clustering.

Bill

-- 
William Revelle		http://personality-project.org/revelle.html
Professor			http://personality-project.org/personality.html
Department of Psychology             http://www.wcas.northwestern.edu/psych/
Northwestern University	http://www.northwestern.edu/
Attend  ISSID/ARP:2009               http://issid.org/issid.2009/



More information about the R-help mailing list