According to Stephen Yeoh:
> After searching the htdig site for information on indexing an SSL site, I have
> set up indexing by using the local_url setting.
>
> htdig does to go the file system and grabs index.html, but none of the other
> files in the directory. I have tried with and without the local_default_doc
> setting.
>
> I touch index.html and then I run with the -vvv option, I get the message while
> parsing index.html for every single link that is found:
>
> start_url: https://foo.com/
> limit_urls_to: ${start_url}
> local_urls: https://foo.com/=/www/foo/
>
> Rejected: not an http or relative link
>
> on every single link, even if they are on the same https://foo.com/ site. My
> site uses all relative links except for external references.
Currently, htdig will not support URLs that begin with https://, even when
using local_urls to bypass the server. A trick that might work would be
to index using http:// instead, but use local_urls to point to the directory
that contains the contents of the secure server. You'd need to use separate
configuration files for digging and searching, and use url_part_aliases in
each of these configuration files to rewrite the http:// into https:// in the
search results.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.