I have a pdf scan of several pages of data from a quite famous old
paper by
C.S. Pierce (1873). I would like (what else?) to convert it into an
R dataframe.
Somewhat to my surprise the pdf seems to already be in a character
recognized
form, since I can search for numerical strings and they
I don't have specific experience with this but strapply
of package gsubfn can extract information from a string by content
as opposed to delimiters. e.g.
library(gsubfn)
strapply(abc34def56xyz, [0-9]+, c)[[1]]
[1] 34 56
On 10/27/06, roger koenker [EMAIL PROTECTED] wrote:
I have a pdf scan of
Thanks for your suggestions. Trial and error experimentation
with adobe acrobat produced the following method:
It looks like it is possible to highlight the numerical part of the
table in Acrobat and then copy/paste into a text file, with about
98 percent accuracy. Wonders never cease.
url: