When I run "htdig -i" I get this error for every single .pdf file:

Error: Page count in top-level pages object is incorrect


I'm using external parsers to search through PDF, Word and Excel files.

--- in htdig.conf ---
external_parsers: application/pdf->text/html /opt/www/bin/doc2html.pl \
                  application/msword->text/html /opt/www/bin/doc2html.pl \
                  application/msexcel->text/html /opt/www/bin/doc2html.pl
---------------------

----- in doc2html.pl ------
my $PDF2HTML = '/opt/www/bin/pdf2html.pl';

...

  # Adobe PDF file using Perl script
  if ($PDF2HTML) {
    $mime_type = "application/pdf";
    $cmd = $PDF2HTML;
    # Replace default title (if used) with filename:
    $cmdl = "$cmd $Input $mime_type $name";
    $magic = '%PDF-|\0PDF CARO\001\000\377';
    &store_html_method('PDF (pdf2html)',$cmd,$cmdl,$mime_type,$magic);
  }
-----------------------------------------------------------------------------

I've upgraded to the latest version of xpdf (xpdf-1.01) and still get the
error.
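
To check whether the message comes from the xpdf tools themselves rather than from htdig, something like the following could be run directly against the PDF files. This is only a sketch: check_pdfs is a made-up helper name, and it assumes that the tool passed in (for example xpdf's pdfinfo, if it is on the PATH) prints the message shown above to its standard output or standard error.

```shell
#!/bin/sh
# check_pdfs TOOL FILE...
# Hypothetical helper: run TOOL on each FILE and print the names of the
# files for which TOOL's output (stdout or stderr) contains the
# page-count error quoted in this message.
check_pdfs() {
    tool=$1
    shift
    for f in "$@"; do
        # Merge stderr into stdout so the message is caught on either stream.
        if "$tool" "$f" 2>&1 |
            grep -q 'Page count in top-level pages object is incorrect'; then
            printf '%s\n' "$f"
        fi
    done
}
```

For example, "check_pdfs pdfinfo *.pdf", run in a directory of indexed documents, would list the files that trigger the message outside of htdig. If every file is listed, the problem is more likely in xpdf or in the files than in the htdig configuration.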

Any idea where to look to fix the problem?

Thanks
Peter


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
