On Fri, 30 Nov 2001, Gilles Detillieux wrote:

> Date: Fri, 30 Nov 2001 12:06:04 -0600 (CST)
> From: Gilles Detillieux <[EMAIL PROTECTED]>
> To: Joe R. Jah <[EMAIL PROTECTED]>
> Cc: "ht://Dig developers list" <[EMAIL PROTECTED]>
> Subject: Re: [htdig-dev] to-do list for 3.1.6
> 
> > ___________________ 112501 + parsedate.0 + ssl.6 ___________________
> > rm htlib/regex.*
> > remove reference to regex.o in htlib/Makefile
> > #undef HAVE_BROKEN_REGEX in include/htconfig.h
> > 
> > htdig: Start digging:   Thu Nov 29 22:22:32 PST 2001
> > htmerge: Start merging: Thu Nov 29 22:24:14 PST 2001        104 seconds
> ...
> > ___________________ 112501 + parsedate.0 + ssl.6 ___________________
> > rm htlib/regex.*
> > remove reference to regex.o in htlib/Makefile
> > #define HAVE_BROKEN_REGEX in include/htconfig.h
> > 
> > htdig: Start digging:   Thu Nov 29 22:25:33 PST 2001
> > htmerge: Start merging: Thu Nov 29 22:27:12 PST 2001         99 seconds
> ...
> 
> I don't think the difference between 99 and 104 seconds is significant.
> This confirms my suspicion that the HAVE_BROKEN_REGEX doesn't do a
> whole lot.  To be sure, though, I think we'd need timings for 112501 +
> parsedate.0 + ssl.6, remove reference to regex.o in htlib/Makefile, #undef
> AND #define HAVE_BROKEN_REGEX (i.e. two tests) in include/htconfig.h
> (but don't remove htlib/regex.h).  I suspect the timings for both will
> be like the 2nd test above, around 143 sec.

___________________ 112501 + parsedate.0 + ssl.6 ___________________
remove reference to regex.o in htlib/Makefile
#define HAVE_BROKEN_REGEX in include/htconfig.h

htdig: Start digging:   Sat Dec 1 00:10:58 PST 2001
htmerge: Start merging: Sat Dec 1 00:12:44 PST 2001     106
htmerge: Total word count: 13159
htmerge: Total documents: 163
htmerge: Total size of documents (in K): 1904
___________________ 112501 + parsedate.0 + ssl.6 ___________________
remove reference to regex.o in htlib/Makefile
#undef HAVE_BROKEN_REGEX in include/htconfig.h

htdig: Start digging:   Sat Dec 1 00:18:55 PST 2001
htmerge: Start merging: Sat Dec 1 00:20:38 PST 2001     103
htmerge: Total word count: 13159
htmerge: Total documents: 163
htmerge: Total size of documents (in K): 1904
____________________________________________________________________

> I suspect the difference between the 143 and the 99-104 sec is due
> to the inclusion of the bundled regex.h even though you're using
> the C library regex.o code.  It's a wonder this works at all, but
> there does seem to be some impact on performance.

I am not sure how that 143 came about last time; I can't reproduce it any
more;-/

> > ____________________ 092301 + Armstrong + ssl.4 ____________________
> > htdig: Start digging:   Fri Nov 30 00:18:06 PST 2001
> > htmerge: Start merging: Fri Nov 30 00:18:44 PST 2001     38 seconds
> ...
> 
> This is the part I find a bit troubling, but I don't know what we
> can do about it.  I don't know why Armstrong's patch, which uses rx
> instead of regex, causes htdig to run 2-3 times faster, unless there
> are other changes between 092301 and 112501 that account for much of
> this, but it could well be just implementation efficiencies in one
> library and not in the other.

I reported the difference in indexing time to the list the very first time
url_rewrite_rules was integrated in the code.  I don't believe at that
time anything else had changed in the code.

> In your tests above, do you make use of url_rewrite_rules?  If so,
> how do the timings change if you don't use it?

___________________ 112501 + parsedate.0 + ssl.6 ___________________
remove reference to regex.o in htlib/Makefile
#define HAVE_BROKEN_REGEX in include/htconfig.h
no url_rewrite_rules

htdig: Start digging:   Sat Dec 1 00:40:09 PST 2001
htmerge: Start merging: Sat Dec 1 00:40:34 PST 2001      25 seconds
htmerge: Total word count: 13159
htmerge: Total documents: 163
htmerge: Total size of documents (in K): 1904
rundig: end rundig: Sat Dec 1 00:40:39 PST 2001
___________________ 112501 + parsedate.0 + ssl.6 ___________________
remove reference to regex.o in htlib/Makefile
#undef HAVE_BROKEN_REGEX in include/htconfig.h
no url_rewrite_rules

htdig: Start digging:   Sat Dec 1 00:28:50 PST 2001
htmerge: Start merging: Sat Dec 1 00:29:10 PST 2001      20 seconds
htmerge: Total word count: 13159
htmerge: Total documents: 163
htmerge: Total size of documents (in K): 1904
____________________________________________________________________

Regards,

Joe
-- 
     _/   _/_/_/       _/              ____________    __o
     _/   _/   _/      _/         ______________     _-\<,_
 _/  _/   _/_/_/   _/  _/                     ......(_)/ (_)
  _/_/ oe _/   _/.  _/_/ ah        [EMAIL PROTECTED]


_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to