[R] Calculation of Age heaping

Richard O'Keefe r@oknz @end|ng |rom gm@||@com
Tue Aug 10 09:33:02 CEST 2021


If you want to look at each digit, you should take a step back and
think about what the
Whipple index is actually doing.  Basically, the model underlying the
Whipple index is
that Pr(age = xy) = Pr(age = x*)Pr(age = *y) if there is no age
heaping.  Or rather,
since the age is restricted to 23..62 (a whole number of decades), it is that
Pr(age - 23 = xy) = Pr(age - 23  = x*)Pr(age - 23 = *y) for 0 <= x <=
3, 0 <= y <= 9
and the "nothing to see here" case is Pr(age = *y) = 1/10.

I wasted way too much time trying to find a free age data set where
age *wasn't* already
grouped into 5 year bands.

So what's wrong with a chi-square test?
I would certainly want to check whether the high and low digits of age
- 23 were in fact independent.

On Mon, 9 Aug 2021 at 23:48, Md. Moyazzem Hossain <hossainmm using juniv.edu> wrote:
>
> Dear Greg,
>
> Thank you very much for your suggestion. I will try it and follow your
> advice.
>
> Actually, I want to find out the index for each digit like 0, 1, ..., 9.
>
> Thanks in advance. Take care.
>
> Md
>
>
>
> On Mon, Aug 9, 2021 at 12:05 PM Greg Minshall <minshall using umich.edu> wrote:
>
> > Md,
> >
> > if this is what you are looking for:
> > ----
> > https://en.wikipedia.org/wiki/Whipple%27s_index
> > ----
> >
> > then, the article says the algorithm is
> > ----
> > The index score is obtained by summing the number of persons in the age
> > range 23 and 62 inclusive, who report ages ending in 0 and 5, dividing
> > that sum by the total population between ages 23 and 62 years inclusive,
> > and multiplying the result by 5. Restated as a percentage, index scores
> > range between 100 (no preference for ages ending in 0 and 5) and 500
> > (all people reporting ages ending in 0 and 5).
> > ----
> >
> > that seems fairly straight forward.  if you are trying to learn R,
> > and/or learn programming, i might suggest you *not* use a package, and
> > rather work on coding up the calculation yourself.  that would probably
> > be a good, but not too hard, exercise, of some interest.  enjoy!
> >
> > cheers, Greg
> >
> >
>
> --
> Best Regards,
> Md. Moyazzem Hossain
> Associate Professor
> Department of Statistics
> Jahangirnagar University
> Savar, Dhaka-1342
> Bangladesh
> Website: http://www.juniv.edu/teachers/hossainmm
> Research: *Google Scholar
> <https://scholar.google.com/citations?user=-U03XCgAAAAJ&hl=en&oi=ao>*;
> *ResearchGate
> <https://www.researchgate.net/profile/Md_Hossain107>*; *ORCID iD
> <https://orcid.org/0000-0003-3593-6936>*
>
>         [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list