[Rd] package installation fails when symlink of same name exists

Martin Maechler maechler at stat.math.ethz.ch
Thu Jul 21 18:03:34 CEST 2016


>>>>> Kevin Ushey <kevinushey at gmail.com>
>>>>>     on Wed, 20 Jul 2016 10:19:33 -0700 writes:

    > On Wed, Jul 20, 2016 at 3:52 AM, Martin Maechler
    > <maechler at stat.math.ethz.ch> wrote:
    >>>>>>> Jeroen Ooms <jeroenooms at gmail.com>
    >>>>>>> on Wed, 20 Jul 2016 10:26:19 +0200 writes:
    >> 
    >> > On Tue, Jul 19, 2016 at 6:46 PM, Kevin Ushey <kevinushey at gmail.com> wrote:
    >> >> R fails to install a package from source over a pre-existing package
    >> >> when the path to that package is a symlink, rather than a directory.
    >> >> ...
    >> >> I don't think anyone's reported this being an issue before
    >> 
    >> > I ran into this as well a while back:
    >> >   https://bugs.r-project.org/bugzilla3/show_bug.cgi?id=16725
    >> 
    >> I've now at least "acknowledged" that bug report,
    >> and have looked into changing the  is_subdir() function so it
    >> returns TRUE in the case of a symlink [on those platforms where
    >> Sys.readlink() "works", i.e., supposedly not on Windows; however
    >> that may be sufficient to close that bug report and also Kevin's
    >> issue, right ?]
    >> 
    >> However, Kevin, in his posting, continues
    >> 
    >> > I guess my wish here would be that R would check if any file already
    >> > existed at the 'instdir' path, and if it existed and was a symlink, R
    >> > would remove that symlink before install.
    >> 
    >> are you sure?
    >> I think ... and from what you mention below ("packrat") it would
    >> rather be important to *keep* the symlink, and install to
    >> wherever the symlink is pointing,  no ?

    > For packrat's case at least, removing the symlink and installing to a
    > newly-created directory within the library would be fine -- later,
    > when a user wants to 'save the state' of their library, they would
    > call 'packrat::snapshot()', and that call would take care of moving
    > the newly-installed package to the cache and restoring the symlink as
    > required.

    > That said, installing within the symlinked directory would definitely
    > be nice :-) I just thought the request might be out of scope.

    >> > It could happen before creating the directory, e.g. here:
    >> 
    >> > https://github.com/wch/r-source/blob/62f5acbdbdf36e1fc618510312125d1677d79941/src/library/tools/R/install.R#L277-L281
    >> 
    >> > One thing that was a bit surprising to me -- R does not remove a
    >> > pre-existing package installation if it exists (when installing from
    >> > source), it merely installs over it, so files / artifacts from a
    >> > previous package installation could be left over after installing a
    >> > new package. It seems this is not a problem in practice since I don't
    >> > think anyone's reported this being an issue before, but for hygiene it
    >> > seems like a pre-existing directory could / should be removed when
    >> > installing a new package. (It appears that R does clear out a
    >> > pre-existing directory when downloading and installing a package
    >> > binary directly from CRAN.)
    >> 
    >> Well, at least with  update.packages()  it seems natural to me
    >> that R would not just remove all previous parts there ..
    >> 
    >> > For motivation: I bumped into this when attempting to implement a
    >> > package caching feature with packrat. A packrat project using a global
    >> > cache will have a (private) R library containing symlinks to R package
    >> > installations in a separate, global library. This allows projects to
    >> > effectively be isolated from one another, while avoiding duplication
    >> > of packages used across multiple projects.
    >> 
    >> Yes, I found this a nice feature when I heard about packrat.
    >> 
    >> But then, really R should *not* remove the symlink and create a
    >> regular subdirectory in that library there !

    > I agree this would be ideal, I just thought this request might be out
    > of scope, since the typical use case for R libraries is a
    > directory-of-directories, not a directory-of-symlinks-to-directories
    > (although packrat has had a lot of success with the second scenario!)

    > Thanks, Martin!

You are welcome.
I have committed a change (svn rev 70955) which no longer "errors out" on
symlinks {{and the same change improves debugging: you can turn
off the "dreaded" q(), and that's done by default if(interactive())}}

However, that change was mainly to  is_subdir(),
and the code later still *does* replace the package-name symlink by a newly
created directory <lib>/<package>  rather than leaving the
symlink in place.  I continue to find leaving the symlink *the* correct
action, but that would need changes in other places of the code.

 [tested (and "minimal")  patches are welcome for that other goal ..]
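
To make that other goal concrete, here is a rough sketch of what "keep the
symlink and install to wherever it points" could look like at the R level.
It is only an illustration and not the code in install.R: the helper names
is_symlink() and resolve_instdir() are hypothetical, and it assumes a
platform where Sys.readlink() works (so presumably not Windows).

```r
## Hypothetical helpers for illustration; not part of R's install code.

## TRUE if 'path' is a symbolic link.  Sys.readlink() gives "" for a
## non-link and NA on error (e.g. a non-existent path); both count as FALSE.
is_symlink <- function(path) {
    target <- Sys.readlink(path)
    !is.na(target) && nzchar(target)
}

## Directory an installation of 'pkg' into 'lib' would have to write to
## if a package-name symlink <lib>/<pkg> is to be left in place.
resolve_instdir <- function(lib, pkg) {
    instdir <- file.path(lib, pkg)
    if (is_symlink(instdir))
        normalizePath(instdir, mustWork = TRUE)  # follow the link to its target
    else
        instdir                                  # regular directory, or not there yet
}
```

For a library whose <lib>/<package> entry is a symlink into a separate cache
directory, resolve_instdir(lib, "<package>") would return the cache location,
and the symlink itself would stay untouched; for an ordinary entry it simply
returns <lib>/<package>.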

Martin

    >> > Unfortunately, some packrat
    >> > users bump into this when attempting to update a package that has
    >> > entered the cache (and so is a symlink in their R library).
    >> 
    >> > Thanks for your time,
    >> > Kevin
    >>


