On Fri, 2010-03-12 at 08:15 +0200, Henrik K wrote:

> Why don't you simply maintain your wordlists in some files and use a script
> to generate portmanteau.cf? You could use Regexp::Assemble module to
> optimize also. Who cares what the actual rules look like? The more words
> (simple alternations) there are in a single RE, the better it performs. If
> you want clarity in the cf, keep the original words listed in a comment
> block.
>
...because that didn't occur to me.

It's a good idea. Better yet, my rule development & test environment can
be easily extended to incorporate it. Thanks.
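
A rough sketch of what such a generator might look like, in Python
rather than Perl and without the Regexp::Assemble optimisation step.
The rule name PORTMANTEAU_WORDS and the function build_rule are made up
for illustration, and it assumes every word is purely alphanumeric, so
no regex escaping is needed:

```python
def build_rule(words, name="PORTMANTEAU_WORDS"):
    """Return .cf text for one body rule matching any of the given words.

    Hypothetical helper: the rule name and layout are illustrative only.
    """
    # Normalise, drop blanks and duplicates, and sort alphabetically.
    unique = sorted({w.strip().lower() for w in words if w.strip()})
    if not unique:
        raise ValueError("empty word list")
    for w in unique:
        # Assumption: plain alphanumeric words, so nothing needs escaping.
        if not w.isalnum():
            raise ValueError("unsupported characters in word: %r" % w)

    alternation = "|".join(unique)
    lines = ["# Generated file; edit the word list, not this rule."]
    # Keep the original words in a comment block for readability.
    lines += ["# word: " + w for w in unique]
    lines.append("body %s /\\b(?:%s)\\b/i" % (name, alternation))
    lines.append("describe %s Matches a word from the generated list" % name)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Example run with a throwaway list; real input would come from files.
    print(build_rule(["beta", "Alpha", "beta"]), end="")
```

Reading the words from one file per list and writing the result to
portmanteau.cf would be a thin wrapper around this.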

Your comment about a single regex containing many alternations being
more efficient than several smaller ones raises two questions:

- what is the maximum line length for such a rule?
- does the order of alternations have any effect on performance or
  is alphabetic order good enough? It would certainly make rule
  generation simpler.

 
Martin

