On Sat, 2008-10-18 at 12:58 +0300, Henrik K wrote:

> By doing it correctly, using ^hostname$ instead of plain hostname in regex
> results in 1.2 seconds, that's 80000+ hosts/sec..

The more interesting pattern match to compare against is this: first apply

s/^www\.//

to the hostnames before building the patterns, then for each hostname use

(\.|^)hostname$

or, expanded into two patterns, depending on how well Regexp::Assemble handles
this case:

   \.hostname$
   ^hostname$
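
As a rough illustration of the combined form, here is a minimal sketch in
Python's re module (not Regexp::Assemble, which is Perl). The helper names
strip_www and domain_pattern and the example.com hostnames are made up for
this example only:

```python
import re

def strip_www(host):
    # Equivalent of s/^www\.// : drop one leading "www." label if present.
    return re.sub(r'^www\.', '', host)

def domain_pattern(host):
    # Build (\.|^)hostname$ so the host itself and any subdomain match.
    return re.compile(r'(\.|^)' + re.escape(strip_www(host)) + r'$')

pat = domain_pattern('www.example.com')
print(bool(pat.search('example.com')))       # True: matched via ^
print(bool(pat.search('mail.example.com')))  # True: matched via \.
print(bool(pat.search('notexample.com')))    # False: no label boundary
```

The last case is why the leading (\.|^) matters: without it, a plain
hostname$ pattern would also hit unrelated names that merely end in the
same characters.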

Blacklists contain a fairly large proportion of domain entries, each of which
matches a complete domain.

Quite likely the regex engine will handle this much better if you reverse the
hostnames, resulting in patterns of the form

 ^emantsoh(\.|$)
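
A minimal sketch of the reversed variant, again in Python's re rather than
Perl, with hypothetical helper names (reversed_pattern, is_listed) and
made-up hostnames. Both the blacklist entry and the candidate hostname are
reversed character by character, so the fixed part of the pattern sits
directly after the ^ anchor:

```python
import re

def reversed_pattern(host):
    # "example.com" becomes "moc.elpmaxe"; the pattern is ^moc\.elpmaxe(\.|$)
    return re.compile('^' + re.escape(host[::-1]) + r'(\.|$)')

def is_listed(pat, candidate):
    # The candidate must be reversed the same way before matching.
    return pat.search(candidate[::-1]) is not None

pat = reversed_pattern('example.com')
print(is_listed(pat, 'example.com'))       # True: pattern ends at $
print(is_listed(pat, 'mail.example.com'))  # True: followed by a dot
print(is_listed(pat, 'notexample.com'))    # False: followed by "t"
```

Whether this is actually faster would need measuring against the same host
list; the sketch only shows that the two forms accept the same names.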

Regards
Henrik
