[R] unique/subset problem
lalithaviswanath at yahoo.com
Fri Jan 26 16:20:44 CET 2007
The pruned dataset has 8 unique genomes in it while
the dataset before pruning has 65 unique genomes in
However calling unique on the pruned dataset seems to
return 65 no matter what.
Any assistance in this matter would be appreciated.
--- Weiwei Shi <helprhelp at gmail.com> wrote:
> Even you removed "many" genomes1 by setting score<
> -5; it is not
> necessary saying you changed the uniqueness.
> To check this, you can do like
> p0 <- unique(dataset[dataset$score< -5, "genome1"])
> # same as subset
> p1 <- unique(dataset[dataset$score>= -5, "genome1"])
> setdiff(p1, p0)
> if the output above has NULL, then it means even
> though you remove
> many genomes1, but it does not help changing the
> On 1/25/07, lalitha viswanath
> <lalithaviswanath at yahoo.com> wrote:
> > Hi
> > I am new to R programming and am using subset to
> > extract part of a data as follows
> > names(dataset) =
> > c("genome1","genome2","dist","score");
> > prunedrelatives <- subset(dataset, score < -5);
> > However when I use unique to find the number of
> > genomes now present in prunedrelatives I get
> > identical to calling unique(dataset$genome1)
> > subset has eliminated many genomes and records.
> > I would greatly appreciate your input about using
> > "unique" correctly in this regard.
> > Thanks
> > Lalitha
> > TV dinner still cooling?
> > Check out "Tonight's Picks" on Yahoo! TV.
> > ______________________________________________
> > R-help at stat.math.ethz.ch mailing list
> > https://stat.ethz.ch/mailman/listinfo/r-help
> > PLEASE do read the posting guide
> > and provide commented, minimal, self-contained,
> reproducible code.
> Weiwei Shi, Ph.D
> Research Scientist
> GeneGO, Inc.
> "Did you always know?"
> "No, I did not. But I believed..."
> ---Matrix III
Bored stiff? Loosen up...
More information about the R-help