Dear Franck,

The URLs below look suspicious.  Make sure that AUTOINDEX is turned on in
your server; otherwise a URL of that form will get you a 404.

Try those URLs from a browser and see what happens.

Those are not .PDF files.  They can only be directories (which contain .PDF
files?).  Unless AUTOINDEX is on, requesting one of them gives a 404.  With
AUTOINDEX on, you'll get a directory listing, and HTDIG may follow the links
in it.
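
If a browser is not handy, the same check can be scripted.  What follows is
only a minimal Python sketch using the standard library; the names probe and
describe are my own and are not part of ht://Dig.  I have also assumed that
some servers answer 403 rather than 404 for a directory they will not list.

```python
import sys
import urllib.error
import urllib.request


def describe(status, content_type):
    """Interpret one HTTP response along the lines of the note above."""
    if status == 404:
        return "not found: no index page and no directory listing"
    if status == 403:
        # Assumption: some servers refuse to list a directory with 403.
        return "forbidden: the server will not list this directory"
    if status == 200 and content_type.startswith("text/html"):
        return "HTML page: possibly a directory listing with links to follow"
    if status == 200 and content_type.startswith("application/pdf"):
        return "PDF document"
    return "status %d, type %s" % (status, content_type or "unknown")


def probe(url, timeout=10):
    """Fetch url and return (status, content_type) without raising on 4xx/5xx."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status, resp.headers.get("Content-Type", "")
    except urllib.error.HTTPError as err:
        ctype = err.headers.get("Content-Type", "") if err.headers else ""
        return err.code, ctype


if __name__ == "__main__":
    # Pass the suspicious URLs on the command line.
    for url in sys.argv[1:]:
        status, ctype = probe(url)
        print(url, "->", describe(status, ctype))
```

Run it with the directory URLs from the rundig output as arguments.  If every
one comes back "not found", the problem is on the web server side and not in
the external_parsers setup.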

Dave.

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of Franck Collineau
Sent: Tuesday, November 27, 2001 10:19 AM
To: [EMAIL PROTECTED]
Subject: [htdig] Indexing PDF Files


Hi!

I have read the 4.9 FAQ and I have followed the instructions:

I have copied conv_doc.pl into my /usr/local/bin folder

I have put in my configuration file the lines:

external_parsers: application/msword->text/html /usr/local/bin/conv_doc.pl \
                  application/postscript->text/html /usr/local/conv_doc.pl \
                  application/pdf->text/html /usr/local/bin/conv_doc.pl

I have configured conv_doc.pl.

I have set the max_doc_size attribute to 5000000

It doesn't work!

When I launch rundig I get the following messages for the PDF folders:

728:728:3:http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0145/: not found
729:729:3:http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0146/: not found
730:730:3:http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0147/: not found
731:731:3:http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0140/: not found
732:732:3:http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0141/: not found
733:733:3:http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0142/: not found
734:734:3:http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0143/: not found
735:735:3:http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0136/: not found


Can you help me, please?

Thanks

Regards,

Franck

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to
<[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

