Dear Kurt,

At 03:21 PM 26/05/2003, Kurt Hornik wrote:
>>>>> Gordon Smyth writes:

> At 05:22 AM 26/05/2003, Kurt Hornik wrote:
>> >>>>> Gordon Smyth writes:
>>
>> > I've noticed a curious interpretation of escaped characters in \examples{}
>> > in .Rd files.
>>
>> > For example, if I type
>>
>> > files <- dir(pattern="\\.txt")
>>
>> > at the R prompt, I will get a vector containing all file names in the
>> > current directory containing the string ".txt". If I put
>>
>> > \examples{ files <- dir(pattern="\\.txt") }
>>
>> > in an .Rd file of a package, then the generated documentation will replace
>> > my command with
>>
>> > Examples
>>
>> > files <- dir(pattern="\.txt")
>>
>> > i.e., the escaped backslash has been interpreted into a single
>> backlash. If
>> > a user types example(myfun) at the R prompt, where myfun is the topic
>> alias
>> > of the .Rd file, then they will actually get
>>
>> > files <- dir(pattern=".txt")
>>
>> > i.e., the backslash is interpreted a second time.
>>
>> > This seems an undesirable feature. What is in the .Rd file, what is
>> > displayed in the online help, and what is generated by example() are
>> > all different commands. I had expected anything in \examples{} to be
>> > reproduced in the online help and by \example{} entirely literally
>> > with no intepretation.
>>
>> > I am using R 1.7.0pat under Windows 2000 Professional.
>>
>> R-exts clearly says
>>
>> The "comment" and "control" characters `%' and `\' _always_
>> need to be escaped. Inside the verbatim-like commands (`\code'
>> and `\examples'), no other(1) characters are special. Note that
>> `\file' is *not* a verbatim-like command.
>>
>> -k


> Dear Kurt,

> Thanks very much for this reference from the R-exts document. You
> don't address though the main point of my post, which is that the code
> displayed the online help and executed and checked by R CMD check is
> different from the code extracted and executed by the function
> 'example'.

> Suppose I have an .Rd file for a function 'readFiles' containing the
> text '\examples{dir(pattern="\\\\.txt")}'. The quadruple-backlash will
> ensure that 'help' displays and R CMD checks the correct command
> 'dir(pattern="\\.txt")'. However, if a user types
> 'example("readFiles")' at the R prompt, the command that R will
> execute is 'dir(pattern=".txt")'.  This is *different command* and
> will generally give *wrong* results. There is no way that I can see to
> ensure that R CMD check and 'example' execute the same code.

Gordon,

I may be missing something obvious here but if e.g. look at strsplit.Rd
in the base package, this has

     unlist(strsplit("a.b.c", "."))
     ## [1] "" "" "" "" ""
     ## Note that `split' is a regexp!
     ## If you really want to split on `.', use
     unlist(strsplit("a.b.c", "\\."))
     ## [1] "a" "b" "c"

in the rendered version and

strspl> unlist(strsplit("a.b.c", "."))
[1] "" "" "" "" ""

strspl> unlist(strsplit("a.b.c", "\\."))
[1] "a" "b" "c"

and I also get

R> unlist(strsplit("a.b.c", "\\."))
[1] "a" "b" "c"

so these seem rather consistent to me?

No I'm the one who was missed something. As the strsplit example shows, I could have got "*\\.txt" as I wanted in the help file by putting four backlashes, "*\\\\.txt", in the .Rd file.


There are still aspects though to the substitution of backlashes when going from the help file to running examples which seem unexpected to me. I tried out the following example.

In the example in the .Rd file:

c("\\\\","\\\","\\\\#","\\#","\#")
c("\\+","\\?","\\.","\\*","\\^","\\$","\\(","\\)","\\[","\\]","\\{","\\}","\\|","\\_")
c("\+","\?","\.","\*","\^","\$","\(","\)","\[","\]","\{","\}","\|","\_")
c("\\w","\\W","\\s","\\S","\\d","\\D","\\A","\\Z","\\b","\\B","\\G","\\n","\\r","\\f","\\t")
c("\w","\W","\s","\S","\d","\D","\A","\Z","\b","\B","\G","\n","\r","\f","\t")

This results in the help file:

c("\\","\\","\\#","\#","\#")
c("\+","\?","\.","\*","\^","\$","\(","\)","\[","\]","\{","\}","\|","\_")
c("\+","\?","\.","\*","\^","\$","\(","\)","\[","\]","{","}","\|","\_")
c("\w","\W","\s","\S","\d","\D","\A","\Z","\b","\B","\G","\n","\r","\f","\t")
c("\w","\W","\s","\S","\d","\D","\A","\Z","\b","\B","\G","\n","\r","\f","\t")

Executed by example():

c("\\", "\\", "\\#", "#", "#")
c("+", "?", ".", "*", "^", "$", "(", ")", "[", "]", "{", "}", "|", "_")
c("+", "?", ".", "*", "^", "$", "(", ")", "[", "]", "{", "}", "|", "_")
c("w", "W", "s", "S", "d", "D", "A", "Z", "\b", "B", "G", "\n", "\r", "\f", " ")
c("w", "W", "s", "S", "d", "D", "A", "Z", "\b", "B", "G", "\n", "\r", "\f", " ")


When going from the .Rd file to the help file, single backslashes are left as-is but double backslashes are converted to singles. So to get a double backslash one has to start with four backslashes. This is fine.

When going from the help file to rendered code, double backlashes are now left as-is. Single backslashes are removed, *except* for "\b", "\n", "\r" and "\f". Escaped t's, "\t" are converted to actual tabs.

Is this all as it is supposed to be? The conversion of \t to <tab> does seem a potential problem.

The conversion of \t to <tab> occurs in other examples of rendering code also, for example:

> myfun <- function(x,pattern="\t") "hello"
> args(myfun)
function (x, pattern = "        ")

Best
Gordon

> My concern here is the with actual behavior of R rather than what R-exts
> says but, since you are emphasising the clarity of the R-exts document, it
> may be worth pointing out that R-exts also says

>         \examples{...}
>         Examples of how to use the function. These are set verbatim in
> typewriter font.

> This seems to say that \examples{} is a verbatim environment rather
> than merely verbatim-like. This statement comes 4 pages earlier in the
> pdf version than the paragraph you quote.

And the same is said for \usage.  Perhaps the part about being set
verbatim should be removed then.

Thanks
-k

______________________________________________ [EMAIL PROTECTED] mailing list https://www.stat.math.ethz.ch/mailman/listinfo/r-devel

Reply via email to