>> * find_start_sequence() is the main scanning function within
>>   mod_include.  There's some research in progress to try to speed
>>   this up significantly.
> 
> Based on the patches you submitted (and my quasi-errant formatting
> patch), I had to read most of the code in mod_include, so I'm more 
> familiar with mod_include now.  I do think there are some obvious 
> ways to optimize find_start_sequence.  I wonder if we could apply 
> a KMP-string matching algorithm here.  I dunno.  I'll take a look 
> at it though.  Something bugs me about the restarts.  I bet that 
> we spend even more time in find_start_sequence when a HTML file 
> has lots of comments.  =-)

I suggested to Ian yesterday night that I'd look into a Boyer-Moore
matching algorithm.  I'll work on that too.

Sander

Reply via email to