[R] Unique.data.frame...still getting duplicates

F Z gerifalte28 at hotmail.com
Fri Jun 25 04:12:28 CEST 2004


Hi there

I have a data frame with about 65,000 rows and 8 variables.  I am trying to 
get rid of the duplicate entries of a factor variable "ID" so that I get a 
single observation for each ID.

I tried:

>dupl <- unique.data.frame(data[ID,])  # I obtain a data frame with 21,547 observations

So far so good, but then when I check for duplicates:

>d <- duplicated(dupl2$ID)
>summary(as.factor(d))
FALSE  TRUE
  6836 14711

Meaning that I am still getting 14,711 duplicates!

I tried changing the ID type to integer and repeated the process, but I got 
identical results. What am I missing?
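
To make the question concrete, here is a minimal sketch on a made-up data frame (the name "toy" and its values are hypothetical, not my real data). It assumes the difference that matters is between dropping rows that are identical in every column, which is what unique() does on a data frame, and keeping one row per ID, which would need duplicated() on the ID column. I am not certain this is what is happening with my data.

```r
# Toy data (hypothetical): IDs repeat, but some rows differ in another column
toy <- data.frame(ID = factor(c("a", "a", "b", "b", "c")),
                  x  = c(1, 2, 3, 3, 4))

# unique() on a data frame drops only rows that are identical in every column
by_row <- unique(toy)
nrow(by_row)                 # 4: only the repeated ("b", 3) row is dropped
any(duplicated(by_row$ID))   # TRUE: "a" still appears twice

# Keeping the first row for each ID instead
by_id <- toy[!duplicated(toy$ID), ]
nrow(by_id)                  # 3
any(duplicated(by_id$ID))    # FALSE
```

If that is the explanation, then the rows left over after unique() would share an ID while differing in at least one of the other variables.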

Thanks!



