Re: [htdig] Problem with PDF files....

2001-01-12 Thread Elijah Kagan

Gilles,

1. I run htdig with an explicit -c option, so it uses the correct conf
file.
2. I rewrote the external_parsers so it includes only one line...
3. ..and it is the first line in the file

Results are the same! It is still looking for an acroread!

Please, help. I am getting desperate...

-- elijah


On Thu, 11 Jan 2001, Gilles Detillieux wrote:

 According to Elijah Kagan:
  
  Dear Everyone
  
  Hope this is the correct list to send such questions. If not, accept my
  apologies.
  
  When I run htdig on my files I get the following message when it comes to
  a PDF document:
  
  41:41:3:http://myserver/~elijah/document.pdf: PDF::parse: cannot find pdf
  parser /usr/local/bin/acroread  size = 1965732 
  
  For some reason htdig looks for an Acrobat while its config file clearly
  states:
  
  external_parsers: application/msword-text/html /usr/local/bin/conv_doc.pl \
application/postscript-text/html /usr/local/bin/conv_doc.pl \
application/pdf-text/html /usr/local/bin/conv_doc.pl
  
  The conv_doc.pl exists and working and the content type received from the
  server is application/pdf.
  
  Any ideas?
 ...
  P.S.  I am running htdig 3.1.5 on a Debian system.
 
 There are a few possibilities:
 
 1) htdig isn't looking at this config file, but another one, without
 the external_parsers definition;
 2) there's a typo in the external_parsers definition that isn't showing up 
 in the text you e-mailed above, e.g. a misspelled word or a space after
 one of the backslashes at the end of the first two lines; or
 3) there's a definition right above your external_parsers definition that
 mistakenly ends with a backslash at the end of the line, causing your
 external_parsers definition to be swallowed up by the previous line.
 
 That htdig is attempting to invoke acroread confirms two things:  a)
 the PDF file is correctly being tagged by the server as application/pdf,
 and b) htdig is not seeing a usable definition of an external parser
 for that content-type, for any of the reasons outlined above.
 
 -- 
 Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
 Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
 Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
 Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930
 



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




[htdig] Problem with PDF files....

2001-01-11 Thread Elijah Kagan

Dear Everyone

Hope this is the correct list to send such questions. If not, accept my
apologies.

When I run htdig on my files I get the following message when it comes to
a PDF document:

41:41:3:http://myserver/~elijah/document.pdf: PDF::parse: cannot find pdf
parser /usr/local/bin/acroread  size = 1965732 

For some reason htdig looks for an Acrobat while its config file clearly
states:

external_parsers: application/msword-text/html /usr/local/bin/conv_doc.pl \
  application/postscript-text/html /usr/local/bin/conv_doc.pl \
  application/pdf-text/html /usr/local/bin/conv_doc.pl

The conv_doc.pl exists and working and the content type received from the
server is application/pdf.

Any ideas?


Thanks,

Elijah Kagan


P.S.  I am running htdig 3.1.5 on a Debian system.






To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html