Ferdy Galema created NUTCH-1441:
-----------------------------------

             Summary: AnchorIndexingFilter should use plain HashSet
                 Key: NUTCH-1441
                 URL: https://issues.apache.org/jira/browse/NUTCH-1441
             Project: Nutch
          Issue Type: Bug
            Reporter: Ferdy Galema
            Priority: Minor
             Fix For: 2.1
         Attachments: NUTCH-1441.patch

AnchorIndexingFilter should use a plain HashSet, instead of WeakHashMap. 
WeakHashMap is unnecessary and can perhaps even cause bugs. (A WeakHashMap get 
its entries removed when the gc notices the keys are not elsewhere in use.)

This patch also makes the filter a bit faster by lazy instantiating the set. 
(No need to create one everytime when deduplication is off).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to