According to Joe R. Jah:
> Here is some statistics that may address your questions;)
> 
> ___________________ 112501 + parsedate.0 + ssl.6 ___________________
> Without any change.
> htdig: Start digging:   Thu Nov 29 23:27:58 PST 2001
> htmerge: Start merging: Thu Nov 29 23:33:35 PST 2001    337 seconds
...
> ___________________ 112501 + parsedate.0 + ssl.6 ___________________
> remove reference to regex.o in htlib/Makefile
> 
> htdig: Start digging:   Thu Nov 29 23:42:15 PST 2001
> htmerge: Start merging: Thu Nov 29 23:44:38 PST 2001    143 seconds
...

OK, obviously removing the bundled regex and making sure it doesn't
get into htlib.a has a big impact.

> ___________________ 112501 + parsedate.0 + ssl.6 ___________________
> rm htlib/regex.*
> remove reference to regex.o in htlib/Makefile
> #undef HAVE_BROKEN_REGEX in include/htconfig.h
> 
> htdig: Start digging:   Thu Nov 29 22:22:32 PST 2001
> htmerge: Start merging: Thu Nov 29 22:24:14 PST 2001  104 seconds
...
> ___________________ 112501 + parsedate.0 + ssl.6 ___________________
> rm htlib/regex.*
> remove reference to regex.o in htlib/Makefile
> #define HAVE_BROKEN_REGEX in include/htconfig.h
> 
> htdig: Start digging:   Thu Nov 29 22:25:33 PST 2001
> htmerge: Start merging: Thu Nov 29 22:27:12 PST 2001   99 seconds
...

I don't think the difference between 99 and 104 seconds is significant.
This confirms my suspicion that the HAVE_BROKEN_REGEX doesn't do a
whole lot.  To be sure, though, I think we'd need timings for 112501 +
parsedate.0 + ssl.6, remove reference to regex.o in htlib/Makefile, #undef
AND #define HAVE_BROKEN_REGEX (i.e. two tests) in include/htconfig.h
(but don't remove htlib/regex.h).  I suspect the timings for both will
be like the 2nd test above, around 143 sec.

I suspect the difference between the 143 and the 99-104 sec is due
to the inclusion of the bundled regex.h even though you're using
the C library regex.o code.  It's a wonder this works at all, but
there does seem to be some impact on performance.

> ____________________ 092301 + Armstrong + ssl.4 ____________________
> htdig: Start digging:   Fri Nov 30 00:18:06 PST 2001
> htmerge: Start merging: Fri Nov 30 00:18:44 PST 2001     38 seconds
...

This is the part I find a bit troubling, but I don't know what we
can do about it.  I don't know why Armstrong's patch, which uses rx
instead of regex, causes htdig to run 2-3 times faster, unless there
are other changes between 092301 and 112501 that account for much of
this, but it could well be just implementation efficiencies in one
library and not in the other.

In your tests above, do you make use of url_rewrite_rules?  If so,
how do the timings change if you don't use it?

Thanks for the feedback.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to