[Rd] bug fix for R, and speed gain (was Re: R build failure under gcc 4.9's link time optimization)
htl10 at users.sourceforge.net
Tue Dec 23 21:38:34 CET 2014
Switching on link time optimization in R gives very impressive speed gain:
the 1000 Genome-related code in snpMatrix is more than 10% faster.
There were 3 sets of changes, a bug fix, a command line option, and
a code change to make it happen.
AR=gcc-ar is needed by both R itself, Matrix, (and also R2SWF from my routine upgrade).
The fix to https://bugs.r-project.org/bugzilla/show_bug.cgi?id=15011
was wrong and ineffective, and needs to be re-opened. I posted my fix there.
And the code change below - an alternative change is to modify
EXTRA_STATIC_LIBS a few lines below in "src/main/Makefile.in" to
start with "-Wl,--whole-archive" and ends with "-Wl,--no-whole-archive".
(I filed this as https://gcc.gnu.org/bugzilla/show_bug.cgi?id=64390)
All three of these are gcc-specific; but then both R for windows and mac OS X are
built with gcc. Together with linux, that covers all the major platforms. For 10% speed
gain, I'll let somebody else worry about machines that are not MS windows, not mac OS X
On Mon, 15/12/14, Hin-Tak Leung <htl10 at users.sourceforge.net> wrote:
R fails to build with visibility on
and gcc 4.9's link time optimzation, because
of its practice of building part of it as archive first.
it builds some bundled libraries as archive first, the
symbols of which
are then entirely invisible in gcc 4.9.
The Matrix package also does this awful practice, with
CHOLMOD.a COLAMD.a AMD.a SuiteSparse_config.a .
One way of fixing R is:
diff --git a/src/main/Makefile.in b/src/main/Makefile.in
index 908b7ab..ce015b4 100644
@@ -86,7 +86,7 @@ ALL_CPPFLAGS = $(ZLIB_CPPFLAGS)
$(BZLIB_CPPFLAGS) $(PCRE_CPPFLAGS) \
R_ZLIBS = @BUILD_ZLIB_TRUE@ ../extra/zlib/libz.a
R_BZLIBS = @BUILD_BZLIB_TRUE@ ../extra/bzip2/libbz2.a
R_PCRE = @BUILD_PCRE_TRUE@ ../extra/pcre/libpcre.a
-R_TRE = @BUILD_TRE_TRUE@ ../extra/tre/libtre.a
+R_TRE = @BUILD_TRE_TRUE@ `ls ../extra/tre/*.o`
R_XDR = @BUILD_XDR_TRUE@ ../extra/xdr/libxdr.a
R_XZ = @BUILD_XZ_TRUE@ ../extra/xz/liblzma.a
R_LIBINTL = @BUILD_LIBINTL_TRUE@ ../extra/intl/libintl.a
According the gcc changes doc, setting $AR is probably
Here is the relevant excerpt from https://gcc.gnu.org/gcc-4.9/changes.html
Link-time optimization (LTO) improvements:
When using a linker plugin, compiling with the -flto option
now generates slim object files (.o) which only contain
intermediate language representation for LTO. Use
-ffat-lto-objects to create files which contain additionally
the object code. To generate static libraries suitable for
LTO processing, use gcc-ar and gcc-ranlib; to list symbols
from a slim object file use gcc-nm. (This requires that ar,
ranlib and nm have been compiled with plugin support.)
More information about the R-devel