[R] issue with encoding in R-2.8.1 invalid multibyte character
Prof Brian Ripley
ripley at stats.ox.ac.uk
Wed Dec 31 10:09:07 CET 2008
On Wed, 31 Dec 2008, Wijffels, Jan wrote:
> Yes, it was the section sign (double s) symbol that I was trying to
> print connecting from a Windows machine with Latin1 encoding to a UTF-8
> Linux machine.
> I changed the translation behaviour in my Putty SSH from Latin1 to UTF-8
> and now the interactive R programming works.
> My scripts which I run with Rscript my_script.r contain quite some
> Latin-1 characters. These ran ok in R2.7.0 but not any more in R2.8.1
> but I presume this is because in 2.7.1 the changes made to the system
> indicated 'The parser sometimes accepted invalid quoted strings in a
> UTF-8 locale'.
> So this means for me I need to change the scripts I develop in Latin1 on
> Windows to UTF-8 before I upload them to our server.
Or, as I suggested below, run the R session on the server in Latin1.
% LC_ALL=nl_BE R
(guessing, or use en_US) should do it.
> Thanks for the help.
> -----Oorspronkelijk bericht-----
> Van: Prof Brian Ripley [mailto:ripley at stats.ox.ac.uk]
> Verzonden: woensdag 31 december 2008 9:22
> Aan: Peter Dalgaard
> CC: Wijffels, Jan; r-help at r-project.org
> Onderwerp: Re: [R] issue with encoding in R-2.8.1 invalid multibyte
> On Wed, 31 Dec 2008, Peter Dalgaard wrote:
>> Prof Brian Ripley wrote:
>>> Well, we don't see what you see. but if ? was hex a7, the message is
>>> entirely correct. If you want to enter that, use "\xa7".
>> We see different things.
> Right, and my point is that we do not know what he actually sees.
>> I see a section sign (double s) symbol. From the
>> symptoms, I would suspect that the terminal is set to latin-1 or -15
>> have the section sign at 0xa7) even though the system (and thus R) is
> I thought of that, but if the system is in UTF-8, so would its keyboard
> be. Perhaps this is a remote session from a Windows system to a UTF-8
> one? (In which case set the remote locale appropriately.)
> The issue seemed to be about entering Latin characters (-1 or -9, I
> latin-9 is ISO 8859-15), and that is what I tried to answer.
> Brian D. Ripley, ripley at stats.ox.ac.uk
> Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/
> University of Oxford, Tel: +44 1865 272861 (self)
> 1 South Parks Road, +44 1865 272866 (PA)
> Oxford OX1 3TG, UK Fax: +44 1865 272595
Brian D. Ripley, ripley at stats.ox.ac.uk
Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel: +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UK Fax: +44 1865 272595
More information about the R-help