[R] delete duplicated from data.frame

Uwe Ligges ligges at statistik.uni-dortmund.de
Tue May 18 16:39:59 CEST 2004


Christian Schulz wrote:

> Hi,
> 
> ?unique
> unique returns a vector, data frame or array like x * but with duplicate 
> elements removed *
> 
> what i'm doing wrong delete duplicated rows with same MEMEBRNO.  
> 
> februar <-  dmsegment[unique(dmsegment$MEMBERNO),]

unique() returns values, but not indices nor logicl values.
Try instead:

februar <-  dmsegment[!duplicated(dmsegment$MEMBERNO),]

Uwe Ligges

> This reduce from 197.188 rows to 184.199  but duplicated MEMBERNO  didn't left 
> all what a Primary Key setting in mysql me say and with a fix(februar) could 
> recognize .
> 
> Courious why MEMEBRNO 4,5 ,6 and 11  are left !
> dmsegment$MEMBERNO[1:10]
> [1] 1  4  5  6  7  9  10  11  16  21
> 
> februar$MEMBERNO[1:10]
> [1] 1  6  7  9  10  16  21  26  53  72 
> 
> Using unique with a single vector it works like i expect.
> 
> 
> P.S.
> i try -duplcated but get not better succes?
> 
> Many Thanks,
> Christian
> 
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://www.stat.math.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html




More information about the R-help mailing list