[Rd] corruption of data with serialize(ascii=TRUE)

Roger D. Peng rpeng at jhsph.edu
Thu Feb 9 14:10:55 CET 2006


Okay, I just wasn't sure of the source of the changes.  In retrospect, character 
and other vectors did serialize/unserialize to the original objects.

-roger

Prof Brian Ripley wrote:
> It is known (happens with save() too and did in earlier save formats). 
> Nothing particularly clever is done (the format is "%.16g\n") and 
> similarly as.character/parse are not inverses.
> 
> Perhaps more relevant is
> 
>> b/x -1
>  [1]  0.000000e+00 -1.110223e-16  2.220446e-16  0.000000e+00  0.000000e+00
>  [6]  2.220446e-16  4.440892e-16  0.000000e+00  2.220446e-16  0.000000e+00
> 
> so the error (on my system) is about what you would expect from 
> floating-point computations.
> 
> There is a comment in serialize.c
> 
>         /* 16: full precision; 17 gives 999, 000 &c */
> 
> which suggests that the format is optimized for size not maximal 
> possible accuracy.
> 
> Really all you have said is `floating point operations are subject to 
> rounding error'.
> 
> 
> On Wed, 8 Feb 2006, Roger D. Peng wrote:
> 
>> I noticed the following peculiarity with `serialize()' when `ascii = 
>> TRUE' is
>> used.  In today's (svn r37299) R-devel, I get
>>
>> > set.seed(10)
>> > x <- rnorm(10)
>> >
>> > a <- serialize(x, con = NULL, ascii = TRUE)
>> > b <- unserialize(a)
>> >
>> > identical(x, b)  ## FALSE
>> [1] FALSE
>> > x - b
>>  [1] -3.469447e-18  2.775558e-17 -4.440892e-16  0.000000e+00  
>> 5.551115e-17
>>  [6] -5.551115e-17 -4.440892e-16  0.000000e+00  2.220446e-16 
>> -5.551115e-17
>>
>>
>> I expected `x' and `b' to be identical, which is what I get when 
>> `ascii = FALSE':
>>
>> > a <- serialize(x, con = NULL, ascii = FALSE)
>> > b <- unserialize(a)
>> >
>> > identical(x, b)  ## TRUE
>> [1] TRUE
>>
>>
>> The same phenomenon occurs with `.saveRDS(ascii = TRUE)',
>>
>> > .saveRDS(x, file = "asdf", ascii = TRUE)
>> > d <- .readRDS("asdf")
>> >
>> > identical(x, d)  ## FALSE
>> [1] FALSE
>> >
>>
>> Has anyone noticed this before?  I didn't see anything in the docs for
>> `serialize()' that would indicate this behavior should be expected.
>>
>> I'm on Linux Fedora Core 4.
>>
>> -roger
>>
> 

-- 
Roger D. Peng  |  http://www.biostat.jhsph.edu/~rpeng/



More information about the R-devel mailing list