According to Franck Collineau:
> I have 2 questions:
> 
> 1)I don't know where is AUTOINDEX parameters; i have made a grep on my PC and 
> edited commonhttpd.conf and httpd.conf. Didn't find

Different web servers call this parameter by different names.
Essentially, you need to find out how to make your web server generate
directory indexes automatically.  In Apache, you need to use
"Options Indexes" in a .htaccess file, or within a <Directory ...>
block in your access.conf or httpd.conf file.
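
As a rough sketch of the <Directory> form, assuming your documents live
under /var/www/html (a made-up path; substitute your own document root):

```
# httpd.conf (or access.conf) fragment; the path is only an assumed example
<Directory "/var/www/html/cnet-hebdo-technique">
    # Let Apache generate a listing when no directory index file is present
    Options Indexes
</Directory>
```

In a .htaccess file, the single line "Options Indexes" should do the
same, provided AllowOverride permits Options for that directory.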

Either that, or you must ensure that directories have an index.html file
in them, or whatever file is designated as the default directory index
file on your web server.  E.g. in Apache, there's usually a directive
in srm.conf or httpd.conf like this:

  DirectoryIndex index.html index.shtml index.cgi

indicating that the first of these files found in a directory will be
served to the browser or other web client when it requests that
directory's URL.  Some other servers may use index.htm or default.htm,
or other file names.  We don't know what web server you have or how
it's configured, so you need to figure this out for yourself.

> 2) I have tried 
> http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0147/index.html 
> from a browser and it works.
> But when i set start_url with this URL i have the following message:
> New server: r-pc-vpc-svr.rd.francetelecom.fr, 80
> 0:0:0:http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0147/:  
> not found
> htmerge: Sorting...
> htmerge: Removing doc #0
> DB2 problem...: missing or empty key value specified
> 
> Deleted, no excerpt: 
> 0/http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0147/
> 
> 
> I don't understand

Well, there is a difference between

http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0147/index.html

and 

http://r-pc-vpc-svr.rd.francetelecom.fr/cnet-hebdo-technique/sem0147/

Do both work from your browser?  If the second one doesn't, it's because
your server doesn't generate indexes automatically and doesn't treat
index.html as the default directory index file.  You then have two
options.  Either find out what your web server uses as an index file
name, use that, and set htdig's remove_default_doc to match; or change
what your server uses, if you want to stick with index.html.  If you
don't change remove_default_doc, htdig will always strip "index.html"
off the end of URLs, because it assumes that's the standard directory
index file.  That is why your log shows htdig requesting the URL ending
in sem0147/ even though your start_url ends in index.html.
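
As a sketch of the htdig side, assuming (purely for illustration) that
your server's index file turns out to be default.htm, the attribute in
your htdig configuration file would look something like this:

```
# htdig.conf fragment; default.htm is an assumed example, not your
# server's known setting
remove_default_doc: default.htm
```

The value should list whatever file names your server really treats as
directory indexes, so that the stripped URL and the full one fetch the
same page.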

In any case, as David correctly pointed out, none of the URLs you reported
as not working are URLs for .pdf files, so it's misleading to report this
as a problem with PDFs.

> On Wednesday 28 November 2001 00:41, David T. Ashley wrote:
> > Those URL's below are suspicious.  Make sure that AUTOINDEX is turned on in
> > your server, else a URL of that form will get you a 404.
> >
> > Try those URLs from a browser and see what happens.
> >
> > Those are not .PDF files.  They can only be directories (which contain .PDF
> > files?).  Unless AUTOINDEX is on, that is a 404.  With AUTOINDEX on, you'll
> > get a directory listing and HTDIG may follow the links.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html