According to Franck Collineau: > I have 2 questions: > > 1)I don't know where is AUTOINDEX parameters; i have made a grep on my PC and > edited commonhttpd.conf and httpd.conf. Didn't find
Different web servers will call this parameter by different names. Essentially, you need to figure out how to enable your web server to turn on the automatic generation of indexes for directories. In Apache, you need to use "Options Indexes" in a .htaccess file, or within a <Directory ...> block in your access.conf or httpd.conf file. Either that, or you must ensure that directories have an index.html file in them, or whatever file is designated as the default directory index file on your web server. E.g. in Apache, there's usually a directive in srm.conf or httpd.conf like this: DirectoryIndex index.html index.shtml index.cgi indicating any of these files will be served to the browser or other web client when it requests a directory URL. Some other servers may use index.htm or default.htm, or other file names. We don't know what web server you have or how it's configured, so you need to figure this out for yourself. > 2) I have tried > http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0147/index.html > from a browser and it works. > But when i set start_url with this URL i have the following message: > New server: r-pc-vpc-svr.rd.francetelecom.fr, 80 > 0:0:0:http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0147/: > not found > htmerge: Sorting... > htmerge: Removing doc #0 > DB2 problem...: missing or empty key value specified > > Deleted, no excerpt: > 0/http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0147/ > > > I don't understand Well, there is a difference between http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0147/index.html and http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0147/ Do both work from your browser? If the second one doesn't, it's because your server doesn't do automatic generation of indexes, and doesn't treat index.html as the default directory index file. You need to find out what your web server uses as an index file name, and use that, as well as configure htdig's remove_default_doc to agree with what your server uses, or you need to change what your server uses if you want to stick with index.html. If you don't change remove_default_doc, htdig will always strip the "index.html" off the end of URLs, because it assumes that's the standard directory index file. In any case, as David correctly pointed out, none of the URLs you reported as not working are URLs for .pdf files, so it's misleading to report this as a problem with PDFs. > Le Mercredi 28 Novembre 2001 00:41, David T. Ashley a �crit : > > Those URL's below are suspicious. Make sure that AUTOINDEX is turned on in > > your server, else a URL of that form will get you a 404. > > > > Try those URLs from a browser and see what happens. > > > > Those are not .PDF files. They can only be directories (which contain .PDF > > files?). Unless AUTOINDEX is on, that is a 404. With AUTOINDEX on, you'll > > get a directory listing and HTDIG may follow the links. -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

