Ciao amici,

        as 2 years ago I programmed a link checker running on a locale system (cos
at that age I had only one server running ... yes time change !!!) probably
I can help you. I had a fast look at the HtFile code and I noticed you use
the mime. OK. So, I think no problem for recognizing content-type
(obviously, myme-types has to be written right :-)). I repeat, I only had a
quick look at the code, so I'm certain to be wrong with some of my argues ...

>    o We must index only valid_extensions, since we have no
>      control over what individual users put in their web
>      directories, and some are ...uhm... indiscriminate.

I agree.

>    o Using valid_extensions also allows URL's with no extension
>      (after my patch to Retriever.cc).  This is as it should be,
>      since many URL's with no extension are subdirectories,
>      which we need to index.  But other URL's with no extension
>      are binary executables or heaven knows what.

Yes, but I think with a more accurate control on the 'stat' result, we may
recognize if we are managing a directory or not. Not only byu controlling
if it's a regular file or not. And, what if we encounter a symbolic link? I
am not sure, but 'stat' doesn't return a IS_REG result. If so, this is a
problem, because we have to retrieve it too ... and to obtain info on it. I
tried to implement some code for it ... but now I don't remember how I did
it. :-P If this problem really exists for you too, let me know.

>    o Users can't be relied on to use a trailing slash in links
>      that point to a directory, e.g. <A HREF="subdirectory/">.
>

Yes, but I can only understand if it's a directory by controlling the stat
result. And if it's a directory I should control for the default document,
for example index.htm. Or not?

I hope I've been useful ... A warm greeting from Italy.
Ciao Ciao
-Gabriele


P.S.: Why don't we use Map Files to improve performances instead of
<ifstream>? I know it may not be portable, but we can do it maybe with an
appropriate 'configure' script. But, it's just a proposal ... I have a
simple class ready, so let me know. I am curious of your opinion too.



-------------------------------------------------

Gabriele Bartolini
Computer Programmer (are U sure?)
U.O. Rete Civica - Comune di Prato
Prato - Italia - Europa

e-mail: [EMAIL PROTECTED]
http://www.po-net.prato.it

"Life teaches you never stop learning ..."

-------------------------------------------------

------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] 
You will receive a message to confirm this. 

Reply via email to