Luis Lopez created NUTCH-2035:
---------------------------------

             Summary: Regex filter using case sensitive rules.
                 Key: NUTCH-2035
                 URL: https://issues.apache.org/jira/browse/NUTCH-2035
             Project: Nutch
          Issue Type: Improvement
          Components: plugin
    Affects Versions: 1.10
            Reporter: Luis Lopez
            Priority: Minor
             Fix For: 1.11


Regular expressions are computationally expensive, and having to list every
case variant (“EXE|exe|JPG|jpg”, etc.) adds up when we use complex rules.

The regex filter should use case-insensitive rules, to make the rules more
readable and to improve performance.
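
To illustrate the difference, here is a minimal sketch using plain
java.util.regex. It is not Nutch's filter code: the class name, method names
and example URL are made up for illustration, and it only shows the
readability and coverage side of the request; it does not measure
performance.

```java
import java.util.regex.Pattern;

// Hypothetical demo class; not part of Nutch.
class SuffixFilterSketch {
    // Case-sensitive rule: every case variant has to be listed by hand.
    static final Pattern CASE_SENSITIVE =
        Pattern.compile("\\.(EXE|exe|JPG|jpg)$");

    // Case-insensitive rule: one alternative per extension.
    static final Pattern CASE_INSENSITIVE =
        Pattern.compile("\\.(exe|jpg)$", Pattern.CASE_INSENSITIVE);

    // True if the URL ends in one of the explicitly listed variants.
    static boolean rejectedSensitive(String url) {
        return CASE_SENSITIVE.matcher(url).find();
    }

    // True if the URL ends in one of the extensions, in any letter case.
    static boolean rejectedInsensitive(String url) {
        return CASE_INSENSITIVE.matcher(url).find();
    }

    public static void main(String[] args) {
        // Mixed case is not among the listed variants of the first rule.
        String url = "http://example.com/photo.Jpg";
        System.out.println(rejectedSensitive(url));
        System.out.println(rejectedInsensitive(url));
    }
}
```

Note that the case-sensitive rule also misses mixed-case suffixes such as
".Jpg" unless each one is added explicitly, while the case-insensitive rule
covers them with a single alternative.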



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
