Richard,
 
It could be a faulty .PDF file, a bug in pdftotext, or a PDF version too recent for your pdftotext to read.  Acrobat Reader does seem able to cope with .PDF files that produce error messages from pdftotext.
 
From what you write it appears that pdfinfo can read the file, but that pdftotext can't.  You should be able to confirm that quite easily.
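
One way to make that check is to run both tools on the same file and compare what comes back. The Python sketch below is only an illustration and assumes pdfinfo and pdftotext are on your PATH; the function names (run_tool, summarize, check_pdf) are made up for this example. It relies on pdftotext writing to standard output when the output file is given as "-".

```python
import shutil
import subprocess


def run_tool(args):
    """Run a command and return (exit_code, stdout, stderr).

    Returns None when the executable is not found on PATH.
    """
    if shutil.which(args[0]) is None:
        return None
    proc = subprocess.run(args, capture_output=True, text=True, errors="replace")
    return proc.returncode, proc.stdout, proc.stderr


def summarize(name, result):
    """Turn the result of run_tool into a one-line summary."""
    if result is None:
        return f"{name}: not found on PATH"
    code, out, err = result
    words = len(out.split())          # whitespace-separated words on stdout
    messages = len(err.splitlines())  # lines of error/warning output
    return f"{name}: exit={code}, {words} words on stdout, {messages} lines on stderr"


def check_pdf(path):
    """Run pdfinfo and pdftotext on the same file and summarize both."""
    return [
        summarize("pdfinfo", run_tool(["pdfinfo", path])),
        # "-" asks pdftotext to write the extracted text to stdout
        summarize("pdftotext", run_tool(["pdftotext", path, "-"])),
    ]
```

Calling check_pdf on the problem file (for instance check_pdf("problem.pdf"), a placeholder name) and printing the two lines should show whether pdfinfo succeeds while pdftotext returns few or no words.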
 
If pdftotext is producing no output, there is little that can be done apart from checking that you have the latest version of xpdf.
 
If pdftotext is producing some output which conv_doc.pl is failing to use, then you should get better results using doc2html.
 
David Adams
Corporate Information Services
Information Systems Services
University of Southampton
 
----- Original Message -----
Sent: Friday, February 11, 2005 4:22 PM
Subject: Re: [htdig] error message from rundig

Thanks for the reply.
On Feb 11, 2005, at 5:42 AM, David Adams wrote:

This message is coming from the utility you are using to read the .pdf file, though I couldn't say what it means.   Does Acrobat Reader display the file without problems?
Acrobat Reader correctly displays the file. 
Even when pdftotext produces such an error message it usually outputs some text from the .pdf file and it is indexed.  Exactly what method are you using to handle .pdf files?
The converter is conv_doc.pl. That Perl program calls pdftotext.

 
The strange thing here is that only the document title is indexed, no words internal to the file. A similar pdf file indexes correctly.

--dick peskin

____________________________________
Richard L. Peskin, RLP Consulting, Londonderry, VT
http://www.rlpcon.com
http://www.caip.rutgers.edu/~peskin
