[Rd] meaning of "trim" in mean()

Prof Brian Ripley ripley at stats.ox.ac.uk
Sun Oct 28 07:54:58 CET 2007


There is only one _range_ mentioned, (0, 0.5).  I don't see how you can 
construe 'that range' to be a reference to anything other than (0, 0.5).

And why do you suppose the description for argument 'trim' is referring to 
'values' of a different argument?

It is telling you what happens for values of trim < 0 or > 0.5: that is 
not information that it is appropriate to excise.


On Thu, 25 Oct 2007, Peter Dalgaard wrote:

> Liaw, Andy wrote:
>> (I see this in both R-patched r43124 and R-devel r43233.)
>> In the Argument section of ?mean:
>>
>> trim     the fraction (0 to 0.5) of observations to be trimmed from each
>> end of x before the mean is computed. Values outside that range are
>> taken as the nearest endpoint.
>>
>> Then in the Value section:
>>
>> If trim is non-zero, a symmetrically trimmed mean is computed with a
>> fraction of trim observations deleted from each end before the mean is
>> computed.
>>
>> The description in "trim" to me sounds like Windsorizing, rather than
>> trimming.  Should that be edited?
>>
>>
> I think so:
>
>> x <- sort(rnorm(10))
>> mean(x,trim=.1)
> [1] -0.6387413
>> mean(x[2:9])
> [1] -0.6387413
>> mean(x[c(2,2:9,9)]) # Winsorizing
> [1] -0.6204222
>
> So yes, it is trimming, not Winsorizing, and the last sentence in the
> description of "trim" is misleading and should be, well..., trimmed.
>
>

-- 
Brian D. Ripley,                  ripley at stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595



More information about the R-devel mailing list