[R] scanning a pdf scan

2006-10-27 Thread roger koenker
I have a pdf scan of several pages of data from a quite famous old paper by C.S. Pierce (1873). I would like (what else?) to convert it into an R dataframe. Somewhat to my surprise the pdf seems to already be in a character recognized form, since I can search for numerical strings and they

Re: [R] scanning a pdf scan

2006-10-27 Thread Gabor Grothendieck
I don't have specific experience with this but strapply of package gsubfn can extract information from a string by content as opposed to delimiters. e.g. library(gsubfn) strapply(abc34def56xyz, [0-9]+, c)[[1]] [1] 34 56 On 10/27/06, roger koenker [EMAIL PROTECTED] wrote: I have a pdf scan of

Re: [R] scanning a pdf scan

2006-10-27 Thread roger koenker
Thanks for your suggestions. Trial and error experimentation with adobe acrobat produced the following method: It looks like it is possible to highlight the numerical part of the table in Acrobat and then copy/paste into a text file, with about 98 percent accuracy. Wonders never cease. url: