[Rd] UTF-8 and .Rd files

Prof Brian Ripley ripley at stats.ox.ac.uk
Tue Jun 27 20:22:58 CEST 2006


We describe how to use \enc for possible transliterations for exactly this 
purpose in the `Writing R Extensions' manual.

In answer to Göran's question, yes latin1 is safer than UTF-8 for HTML 
browsers but neither are guaranteed to contain a glyph for ö in a font 
used e.g. in a Russian locale.

On Tue, 27 Jun 2006, Spencer Graves wrote:

> Hello, Göran:
>
> 	  Have you considered the German solution:  "Goeran"?  (e.g., Wuertz 
> for Würtz)?
>
> 	  Be thankful that you aren't Russian or Greek or Arabic or Chinese, 
> etc., for which there may be no standard transliteration into the Latin 
> alphabet.
>
> 	  Sorry I can't be more helpful.
>
> 	  Spencer Graves
> p.s.  When I'm with native Spanish speakers who don't know English, I 
> pronounce my name very differently, like "Espencer Gra-ve", to match how they 
> would pronounce my name when they see it written.  Similarly, I once heard a 
> French Canadian take about his young son, Guillaume.  If you ask him in 
> English, "What's your name?" he replies, "Bill".  If you ask the same 
> question in French, he replies, "Guillaume".
>
> Hin-Tak Leung wrote:
>> Göran Broström wrote:
>>> On 6/27/06, Prof Brian Ripley <ripley at stats.ox.ac.uk> wrote:
>>>> On Tue, 27 Jun 2006, Göran Broström wrote:
>>>> 
>>>>> I have been converting to utf8 from latin1, and this gives me
>>>>> problems, some solved, but here is one unsolved: In my .Rd files, I
>>>>> have included '\encoding{UTF-8}' at the top. Despite this, the HTML
>>>>> help pages contains 'content="text/html; charset=iso-8859-1"', and my
>>>>> name is mangled. What can I do about this?
>>>> Reproducible example, please!  (I've just tried this and it works for 
>>>> me.)
>>>> 
>>>> As described in my talk at UseR 2006, you may well not want to do this if
>>>> you intend to distribute the package.  Your name contains characters that
>>>> are not in the fonts used in UTF-8 in non-European locales, and Windows
>>>> users do no have ready access to UTF-8 viewers (even if they know the
>>>> files are UTF-8).
>>> Thanks for your answer! So this means that 'latin1' does not cause
>>> problems for non-European locales and Windows users, I take it.
>>> 
>>> I really only need non-ascii to write the name ot the author (me)
>>> correctly. I tried LaTeX code ({\"o}), but that didn't work. Is there
>>> a way around this?
>>> 
>>> Göran
>> 
>> The \"o character in my latin1 (iso 8859-1) man page says it is 0xF6
>>   F6 - LATIN SMALL LETTER O WITH DIAERESIS
>> The capital version is
>>   D6 -  LATIN CAPITAL LETTER O WITH DIAERESIS
>> 
>> in html I think you need to do &#F6; or something for that character to 
>> appear?
>> 
>> HTH
>> 
>> HTL
>> 
>> ______________________________________________
>> R-devel at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-devel
>
>

-- 
Brian D. Ripley,                  ripley at stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595


More information about the R-devel mailing list