[R] tapply error svyby function "survey" package

Martin Canon martin.canon at gmail.com
Wed Nov 12 13:59:17 CET 2014


Hi.


I'm trying to calculate the weighted mean score of a quality of life
measure (ovt) in patients with irritable bowel syndrome by their
marital status (d7).

This is a summary of the structure of the dataset:

> str(sii.tesis)
'data.frame':    1063 obs. of  75 variables:
 $ id         : int  51 52 53 54 55 56 57 58 59 60 ...
 $ stratum    : Factor w/ 6 levels "MEst","MAcad",..: 1 4 NA 4 4 1 6 NA 4 4 ...
 $ expfc      : num  22.8 17.1 NA 17.1 17.1 ...
 $ d6         : Factor w/ 3 levels "Estudiante","Profesor",..: 1 1 NA
1 1 1 3 NA 1 1 ...
 $ d7         : Factor w/ 6 levels "Soltero","Casado",..: 1 1 NA 1 1 1
1 NA 1 1 ...
 $ d7c        : Factor w/ 2 levels "No estable","Estable": 1 1 NA 1 1
1 1 NA 1 1 ...
 $ s1cm       : Factor w/ 2 levels "No","Si": 1 2 NA 1 1 1 2 NA 1 1 ...
 $ ovt        : num  NA 93.4 NA NA NA ...

I declared the sampling design:

> sii.design <- svydesign(
  id = ~1,
  strata = ~stratum,
  weights = ~expfc,
  data = subset(sii.tesis, !is.na(stratum)))

Then I tried to get the result:

> svyby(~ovt, ~d7, sii.design, svymean, na.rm = TRUE, level = 0.95)

but i get the error:

Error in tapply(1:NROW(x), list(factor(strata)), function(index) { :
  arguments must have same length


The length of both variables is the same. If the variable ovt exists,
there is a d7 match in the data frame.

I try the same thing using another variable instead - "role" (d6) -
and it works.

> svyby(~ovt, ~d6, sii.design, svymean, na.rm = TRUE, level = 0.95)
                           d6      ovt       se
Estudiante         Estudiante 71.01805 1.370569
Profesor             Profesor 72.30923 6.518378
Administrativo Administrativo 75.69102 3.715050

If I use the recategorized d7 variable (d7c,  two levels only) it works too:

> svyby(~ovt, ~d7c, sii.design, svymean, na.rm = TRUE, level = 0.95)
                  d7c      ovt      se
No estable No estable 70.92344 1.37460
Estable       Estable 74.53719 4.16954


What could be the problem?


Regards.


Martin Canon
Colombia, South America



More information about the R-help mailing list