According to Gabriele Bartolini:
> > o Using valid_extensions also allows URL's with no extension
> > (after my patch to Retriever.cc). This is as it should be,
> > since many URL's with no extension are subdirectories,
> > which we need to index. But other URL's with no extension
> > are binary executables or heaven knows what.
>
> Yes, but I think with a more accurate control on the 'stat' result, we may
> recognize if we are managing a directory or not. Not only byu controlling
> if it's a regular file or not. And, what if we encounter a symbolic link? I
> am not sure, but 'stat' doesn't return a IS_REG result. If so, this is a
stat() follows a symlink to the target file or directory, and reports on
that target. lstat() will report on the symlink itself. This is all
beside the point...
> problem, because we have to retrieve it too ... and to obtain info on it. I
> tried to implement some code for it ... but now I don't remember how I did
> it. :-P If this problem really exists for you too, let me know.
>
> > o Users can't be relied on to use a trailing slash in links
> > that point to a directory, e.g. <A HREF="subdirectory/">.
> >
>
> Yes, but I can only understand if it's a directory by controlling the stat
> result. And if it's a directory I should control for the default document,
> for example index.htm. Or not?
That may work for local_urls files accessed via the local filesystem, but
you can't stat files remotely through an HTTP server.
> P.S.: Why don't we use Map Files to improve performances instead of
> <ifstream>? I know it may not be portable, but we can do it maybe with an
> appropriate 'configure' script. But, it's just a proposal ... I have a
> simple class ready, so let me know. I am curious of your opinion too.
ifstreams aren't used that heavily by htdig/htsearch, so there'd be almost
no performance improvement.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.