[Rd] meaning of "trim" in mean()
Peter Dalgaard
p.dalgaard at biostat.ku.dk
Sun Oct 28 14:42:49 CET 2007
Patrick Burns wrote:
> If the sentence in question were amended to:
>
> Values of trim outside that range ...
>
> then I think it would rule out the misinterpretation of
> the sentence.
Yup. And I maintain that although the wording may not be technically
ambiguous, the whole operation is about restricting the range of x, and
therefore is easy to interpolate "the range of x after trimming from
each end".
>
> Pat
>
>
> Prof Brian Ripley wrote:
>
>> There is only one _range_ mentioned, (0, 0.5). I don't see how you
>> can construe 'that range' to be a reference to anything other than
>> (0, 0.5).
>>
>> And why do you suppose the description for argument 'trim' is
>> referring to 'values' of a different argument?
>>
>> It is telling you what happens for values of trim < 0 or > 0.5: that
>> is not information that it is appropriate to excise.
>>
>>
>> On Thu, 25 Oct 2007, Peter Dalgaard wrote:
>>
>>
>>
>>> Liaw, Andy wrote:
>>>
>>>> (I see this in both R-patched r43124 and R-devel r43233.)
>>>> In the Argument section of ?mean:
>>>>
>>>> trim the fraction (0 to 0.5) of observations to be trimmed from
>>>> each
>>>> end of x before the mean is computed. Values outside that range are
>>>> taken as the nearest endpoint.
>>>>
>>>> Then in the Value section:
>>>>
>>>> If trim is non-zero, a symmetrically trimmed mean is computed with a
>>>> fraction of trim observations deleted from each end before the mean is
>>>> computed.
>>>>
>>>> The description in "trim" to me sounds like Windsorizing, rather than
>>>> trimming. Should that be edited?
>>>>
>>>>
>>>>
>>> I think so:
>>>
>>>
>>>> x <- sort(rnorm(10))
>>>> mean(x,trim=.1)
>>>>
>>> [1] -0.6387413
>>>
>>>> mean(x[2:9])
>>>>
>>> [1] -0.6387413
>>>
>>>> mean(x[c(2,2:9,9)]) # Winsorizing
>>>>
>>> [1] -0.6204222
>>>
>>> So yes, it is trimming, not Winsorizing, and the last sentence in the
>>> description of "trim" is misleading and should be, well..., trimmed.
>>>
>>>
>>>
>>
>>
>>
--
O__ ---- Peter Dalgaard Øster Farimagsgade 5, Entr.B
c/ /'_ --- Dept. of Biostatistics PO Box 2099, 1014 Cph. K
(*) \(*) -- University of Copenhagen Denmark Ph: (+45) 35327918
~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk) FAX: (+45) 35327907
More information about the R-devel
mailing list