Re: xpdf parser usage for lucene

2003-02-25 Thread Michael Wechner
Pinky Iyer wrote: Hi ! I am trying to use xpdf for pdf parser, the problem i encounter is when i encounter a file with .pdf extension, i call the pdftotext script to convert to text, which in turn uses the file system and leaves the same file with .txt extension in same dir. How can i get

Re: xpdf parser usage for lucene

2003-02-25 Thread Pinky Iyer
THis means that i have to use the htmlparser again on the converted document. Is that right? Also is there a way to use these without utilizing the filesystem, by way of streams or so. Michael Wechner [EMAIL PROTECTED] wrote:Pinky Iyer wrote: Hi ! I am trying to use xpdf for pdf parser, the

Re: xpdf parser usage for lucene

2003-02-25 Thread Pinky Iyer
Thanks Bruce! I dont know how i missed that! Thanks anyway! It works nowthough stuck with title and summary... P Iyer Bruce Ritchie [EMAIL PROTECTED] wrote:Pinky, If you had actually read the documentation that came with pdftotext you would know that if you pass in a - (dash) as the