Re: [R] Extracting the first currency value from PDF files

2020-05-13 Thread John Kane
It looks like you are using the str_nth_currency() function from the strex package but we have no idea of what the pdf files are or how you are importing them is to R. We need a lot more information on what you are doing "before" you use the function. Have a look at

Re: [R] Extracting the first currency value from PDF files

2020-05-13 Thread Rasmus Liland
On 2020-05-13 06:44 -0700, Jeff Newmiller wrote: > On May 13, 2020 6:33:03 AM PDT, Manish Mukherjee wrote: > > > > How to extract this value from a number > > of PDF files and put it in a data frame. > > they could be part of embedded bitmaps. Dear Manish and Jeff, I recently found the

Re: [R] Extracting the first currency value from PDF files

2020-05-13 Thread Jeff Newmiller
PDF files are actually "programs" that place graphic symbols on pages, and the order in which those symbols are placed (the order in which most pdf-to-text conversions return characters) may have nothing to do with how they appear visually. There is not even a guarantee that those symbols are

[R] Extracting the first currency value from PDF files

2020-05-13 Thread Manish Mukherjee
Hi All, Need some help with the following code , i have a number of pdf files , and the first page of those files gives a currency value $xxx,xxx,xxx . How to extract this value from a number of PDF files and put it in a data frame . I am able to do it for a single file with the code where