[R] Repeated factor levels - inconsistency of factor and levels<- functions?

Peter Dalgaard P.Dalgaard at biostat.ku.dk
Thu Sep 25 15:59:22 CEST 2008


Honza Hucin wrote:
> Hello,
>
> I have a vector x containing letters ("a", "b" etc.). Now I want to
> convert it to factor and group some letters into one common level. If I do
> it by factor function, giving the same label names for all values I want
> to group, it doesn't work:
>
>   
>> x<-letters[1:5]
>> x
>>     
> [1] "a" "b" "c" "d" "e"
>   
>> f<-factor(x,levels=letters[1:5],
>>     
>          labels=c("vowel","consonant","consonant","consonant","vowel"))
>   
>> levels(f)
>>     
> [1] "vowel"     "consonant" "consonant" "consonant" "vowel"
>
> But, after it, if I update level names by a single assignment, levels with
> the same names will group, even when I don't change all of them:
>
>   
>> levels(f)[1]<-"vowel" #changing only one vector item will make ALL
>>     
> levels to group
>   
>> levels(f)
>>     
> [1] "vowel"     "consonant"
>
> I'm rather confused! I think this behavior is double inconsistent. First,
> the labeling in factor function should work similarly as in levels<- ,
> i.e. they should group levels with the same names either BOTH or NONE.
> Second, if I change only one vector item, it should not change anything
> else, especially it should not make any "invisible" grouping.
>
> Or am I wrong? Or is it a bug?
>
>   
I asked Brian Ripley the same thing half a year ago and his answer was:
"Back compatibility ...."

I'm at a loss trying to figure out what kind of code would depend on
current behaviour, but the workaround is rather obvious, so the
motivation for fixing (changing!) it is not too great.


-- 
   O__  ---- Peter Dalgaard             Øster Farimagsgade 5, Entr.B
  c/ /'_ --- Dept. of Biostatistics     PO Box 2099, 1014 Cph. K
 (*) \(*) -- University of Copenhagen   Denmark      Ph:  (+45) 35327918
~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk)              FAX: (+45) 35327907



More information about the R-help mailing list