Ciao amici,
as 2 years ago I programmed a link checker running on a locale system (cos
at that age I had only one server running ... yes time change !!!) probably
I can help you. I had a fast look at the HtFile code and I noticed you use
the mime. OK. So, I think no problem for recognizing content-type
(obviously, myme-types has to be written right :-)). I repeat, I only had a
quick look at the code, so I'm certain to be wrong with some of my argues ...
> o We must index only valid_extensions, since we have no
> control over what individual users put in their web
> directories, and some are ...uhm... indiscriminate.
I agree.
> o Using valid_extensions also allows URL's with no extension
> (after my patch to Retriever.cc). This is as it should be,
> since many URL's with no extension are subdirectories,
> which we need to index. But other URL's with no extension
> are binary executables or heaven knows what.
Yes, but I think with a more accurate control on the 'stat' result, we may
recognize if we are managing a directory or not. Not only byu controlling
if it's a regular file or not. And, what if we encounter a symbolic link? I
am not sure, but 'stat' doesn't return a IS_REG result. If so, this is a
problem, because we have to retrieve it too ... and to obtain info on it. I
tried to implement some code for it ... but now I don't remember how I did
it. :-P If this problem really exists for you too, let me know.
> o Users can't be relied on to use a trailing slash in links
> that point to a directory, e.g. <A HREF="subdirectory/">.
>
Yes, but I can only understand if it's a directory by controlling the stat
result. And if it's a directory I should control for the default document,
for example index.htm. Or not?
I hope I've been useful ... A warm greeting from Italy.
Ciao Ciao
-Gabriele
P.S.: Why don't we use Map Files to improve performances instead of
<ifstream>? I know it may not be portable, but we can do it maybe with an
appropriate 'configure' script. But, it's just a proposal ... I have a
simple class ready, so let me know. I am curious of your opinion too.
-------------------------------------------------
Gabriele Bartolini
Computer Programmer (are U sure?)
U.O. Rete Civica - Comune di Prato
Prato - Italia - Europa
e-mail: [EMAIL PROTECTED]
http://www.po-net.prato.it
"Life teaches you never stop learning ..."
-------------------------------------------------
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.