According to Gabriele Bartolini:
> >    o Using valid_extensions also allows URL's with no extension
> >      (after my patch to Retriever.cc).  This is as it should be,
> >      since many URL's with no extension are subdirectories,
> >      which we need to index.  But other URL's with no extension
> >      are binary executables or heaven knows what.
> 
> Yes, but I think with a more accurate control on the 'stat' result, we may
> recognize if we are managing a directory or not. Not only by controlling
> if it's a regular file or not. And, what if we encounter a symbolic link? I
> am not sure, but 'stat' doesn't return a IS_REG result. If so, this is a

stat() follows a symlink to the target file or directory, and reports on
that target.  lstat() will report on the symlink itself.  This is all
beside the point...
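
For reference, here is a minimal sketch of the stat()/lstat() difference,
assuming a POSIX system. The PathKind enum and both function names are
hypothetical, made up for illustration; none of this is ht://Dig code.

```cpp
#include <sys/stat.h>
#include <string>

// Hypothetical classification, for illustration only (not ht://Dig code).
enum class PathKind { Missing, Directory, Regular, Symlink, Other };

// Classify what the path ultimately refers to.
// stat() follows symbolic links, so a link to a directory is
// reported as a directory and a link to a file as a regular file.
PathKind classify_target(const std::string &path) {
    struct stat st;
    if (stat(path.c_str(), &st) != 0) return PathKind::Missing;
    if (S_ISDIR(st.st_mode)) return PathKind::Directory;
    if (S_ISREG(st.st_mode)) return PathKind::Regular;
    return PathKind::Other;
}

// Classify the path entry itself.
// lstat() does not follow symbolic links, so a link is reported
// as a link regardless of what it points to.
PathKind classify_link(const std::string &path) {
    struct stat st;
    if (lstat(path.c_str(), &st) != 0) return PathKind::Missing;
    if (S_ISLNK(st.st_mode)) return PathKind::Symlink;
    if (S_ISDIR(st.st_mode)) return PathKind::Directory;
    if (S_ISREG(st.st_mode)) return PathKind::Regular;
    return PathKind::Other;
}
```

For a symlink that points at a directory, classify_target() would return
Directory while classify_link() would return Symlink. A dangling symlink
makes stat() fail, so classify_target() reports it as Missing.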

> problem, because we have to retrieve it too ... and to obtain info on it. I
> tried to implement some code for it ... but now I don't remember how I did
> it. :-P If this problem really exists for you too, let me know.
> 
> >    o Users can't be relied on to use a trailing slash in links
> >      that point to a directory, e.g. <A HREF="subdirectory/">.
> >
> 
> Yes, but I can only understand if it's a directory by controlling the stat
> result. And if it's a directory I should control for the default document,
> for example index.htm. Or not?

That may work for local_urls files accessed via the local filesystem, but
you can't stat files remotely through an HTTP server.
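
For the local filesystem case only, the check described above could look
roughly like this. It is a sketch under assumptions: the function name
resolve_local_document and the caller-supplied list of default document
names (e.g. "index.htm", as in the example above) are hypothetical and not
taken from the ht://Dig sources, and it assumes a POSIX stat().

```cpp
#include <sys/stat.h>
#include <string>
#include <vector>

// Hypothetical helper, for illustration only (not ht://Dig code).
// Given a local path, return the file that should be indexed:
//   - the path itself if it is a regular file,
//   - the first existing default document if it is a directory,
//   - an empty string otherwise (missing, device, no default document...).
std::string resolve_local_document(const std::string &path,
                                   const std::vector<std::string> &defaults) {
    struct stat st;
    // stat() follows symlinks, so a link to a directory counts as one.
    if (stat(path.c_str(), &st) != 0) return "";
    if (S_ISREG(st.st_mode)) return path;
    if (!S_ISDIR(st.st_mode)) return "";

    // Directory: make sure there is exactly one slash before the name.
    std::string base = path;
    if (base.back() != '/') base += '/';

    // Try each candidate default document in the order given.
    for (const std::string &name : defaults) {
        const std::string candidate = base + name;
        if (stat(candidate.c_str(), &st) == 0 && S_ISREG(st.st_mode))
            return candidate;
    }
    return "";
}
```

This only helps when the URL maps to a path on the machine doing the
indexing. For anything fetched over HTTP there is no path to stat, so the
same test can't be applied.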

> P.S.: Why don't we use Map Files to improve performances instead of
> <ifstream>? I know it may not be portable, but we can do it maybe with an
> appropriate 'configure' script. But, it's just a proposal ... I have a
> simple class ready, so let me know. I am curious of your opinion too.

ifstreams aren't used that heavily by htdig/htsearch, so there'd be almost
no performance improvement.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] 
You will receive a message to confirm this. 