[Rd] (PR#13487) Segfault when mistakenly calling [.data.frame

Tony Plate tplate at acm.org
Fri Jan 30 18:38:14 CET 2009


Simon Urbanek wrote:
>
> On Jan 30, 2009, at 10:30 , Christian Brechbühler wrote:
>
>> On Thu, Jan 29, 2009 at 4:44 PM, Prof Brian Ripley 
>> <ripley at stats.ox.ac.uk>wrote:
>>
>>> What did your actual application do?  This seems a very strange 
>>> thing to
>>> do, and the segfault is in trying to construct the traceback.
>>>
>>> Only by using do.call on the object (and not even by name) do I get 
>>> this
>>> error.  E.g.
>>>
>>> `[.data.frame`(1:10, 3)
>>>>
>>> Error in NextMethod("[") : object not specified
>>>
>>>> do.call("[.data.frame", list(1:10, 3))
>>>>
>>> Error in NextMethod("[") : object not specified
>>>
>>> are fine.
>>>
>>> Obviously it would be nice to fix this, but I'd like to understand the
>>> actual circumstances: there is more to it than the subject line.
>>
>>
>> Yes, there is more.  For reporting the problem, we tried to pare it 
>> down to
>> a concise, self-contained test case.
>>
>> My boss was debugging an issue in our R code.  We have our own "[...."
>> functions, because stock R drops names when subscripting.
>
> ... if you tell it to do so, yes. If you tell it to not do that, it 
> won't ... ever tried drop=FALSE ?
The common situation I have (which might be the same as the OP's) is 
wanting to get a vector from a data frame, and having the rownames of 
the dataframe become the names on the vector.

With matrix, the behavior I want is the default behavior, e.g.,
 > x <- cbind(a=c(x=1,y=2,z=3),b=4:6)
 > x
  a b
x 1 4
y 2 5
z 3 6
 > x[,1]
x y z
1 2 3

But with a data frame, subscripting returns a vector with no names:
 > xd <- as.data.frame(x)
 > xd[,1]
[1] 1 2 3

One can use drop=FALSE, but then you've still got a data frame, not a 
vector:
 > (xd1 <- xd[,1,drop=FALSE])
  a
x 1
y 2
z 3

The simplest way I know to get a named vector is to use as.matrix on the 
one-column dataframe:
 > as.matrix(xd1)[,1]
x y z
1 2 3
 >
(Which works fine except in the case where xd1 has only one row...)

And BTW, am I missing something, or does the behavior of drop() not 
conform to the description in ?drop:
 > Value:
 >     If 'x' is an object with a 'dim' attribute (e.g., a matrix or
 >     'array'), then 'drop' returns an object like 'x', but with any
 >     extents of length one removed.  Any accompanying 'dimnames'
 >     attribute is adjusted and returned with 'x': if the result is a
 >     vector the 'names' are taken from the 'dimnames' (if any).  If the
 >     result is a length-one vector, the names are taken from the first
 >     dimension with a dimname.

How is this last sentence consistent with the following behavior?
 > dimnames(x[1,1,drop=F])
[[1]]
[1] "x"

[[2]]
[1] "a"

 > drop(x[1,1,drop=F])
[1] 1
 >
 From the description in "Value:" in ?drop, I would have expected above 
result to have the name "x" (the name from the first dimension with a 
dimname).

 > sessionInfo()
R version 2.8.1 (2008-12-22)
i386-pc-mingw32

locale:
LC_COLLATE=English_United States.1252;LC_CTYPE=English_United 
States.1252;LC_MONETARY=English_United 
States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base    
 >

-- Tony Plate
>
> Cheers,
> S
>
>
>> To bypass our
>> now-suspect functions and get the "real" subscripting method, he used 
>> "get"
>> from package:base.  He was examining a large object, and believing it 
>> was a
>> data frame, chose "[.data.frame".  As it turns out, that object was 
>> not a
>> data frame, and he got an unexpected segfault.  I think it was a matrix.
>> But it doesn't matter -- a vector as in the test case will give the 
>> same.
>>
>> We have since fixed the bug in our replacement subscripting function, 
>> so the
>> issue might not affect us any more.
>>
>> Thanks,
>> /Christian
>>
>>
>>> On Thu, 29 Jan 2009, brechbuehler at gmail.com wrote:
>>>
>>> Full_Name: Christian Brechbuehler
>>>> Version: 2.7.2, 2.8.1
>>>> OS: linux-gnu
>>>>
>>>> If we mistakenly believe the object is a data frame (as we did in a 
>>>> much
>>>> more
>>>> complicated real situation), this happens:
>>>>
>>>>> do.call(get("[.data.frame",pos="package:base"),list(1:10,3))
>>>> Error in NextMethod("[") :
>>>>   no calling generic was found: was a method called directly?
>>>>
>>>>  *** caught segfault ***
>>>> address (nil), cause 'unknown'
>>>>
>>>> Process R:2 segmentation fault (core dumped) at Thu Jan 29 09:26:29 
>>>> 2009
>>>>
>>>> The Error message is appropriate.  But the segmentation fault is
>>>> unexpected.
>>>>
>>>
>>
>>     [[alternative HTML version deleted]]
>>
>> ______________________________________________
>> R-devel at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-devel
>>
>>
>
> ______________________________________________
> R-devel at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-devel
>



More information about the R-devel mailing list