Pinky Iyer wrote:
Hi!
I am trying to use xpdf as a PDF parser. The problem is that when I encounter a file
with a .pdf extension, I call the pdftotext tool to convert it to text, which in turn
uses the file system and leaves a file with the same name and a .txt extension in the
same directory.
How can I get
This means that I have to use the htmlparser again on the converted document. Is that
right? Also, is there a way to use these tools without going through the file system,
for example by way of streams?
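
The stream-based approach asked about here can be sketched roughly as follows. This is a minimal Python sketch, not the setup used in this thread. It assumes the pdftotext binary from xpdf is on the PATH and relies on its documented behaviour of writing to standard output when the output file name is given as - (a dash). The function name pdf_to_text and its command parameter are hypothetical names introduced only for this illustration.

```python
import subprocess


def pdf_to_text(pdf_path, command=("pdftotext",)):
    """Return the text extracted from pdf_path without creating a .txt file.

    `command` is a hypothetical hook for this sketch; it defaults to the
    pdftotext binary, which is assumed to be installed and on the PATH.
    """
    # Passing "-" as the output file name asks pdftotext to write the
    # extracted text to stdout instead of a file next to the PDF.
    result = subprocess.run(
        [*command, pdf_path, "-"],
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        check=True,  # raise CalledProcessError if the converter fails
    )
    # The encoding of the output depends on pdftotext's settings; UTF-8 is
    # an assumption here, and undecodable bytes are replaced, not fatal.
    return result.stdout.decode("utf-8", errors="replace")
```

A call such as pdf_to_text("report.pdf") (the file name is only an example) would then return the extracted text as a string. By default the output is plain text, so it can be handed to the indexing code directly.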
Michael Wechner [EMAIL PROTECTED] wrote:
Pinky Iyer wrote:
Hi!
I am trying to use xpdf for pdf parser, the
Thanks Bruce!
I don't know how I missed that! Thanks anyway! It works now, though I am still stuck
on the title and summary...
P Iyer
Bruce Ritchie [EMAIL PROTECTED] wrote:
Pinky,
If you had actually read the documentation that came with pdftotext you would know
that if you pass in a - (dash) as the