[ https://issues.apache.org/jira/browse/NUTCH-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lewis John McGibbney resolved NUTCH-1451.
-----------------------------------------

    Resolution: Fixed

Committed @revision 1408282 in trunk
Committed @revision 1408289 in 2.2-SNAPSHOT

I didn't upload patches for these fixes as the generated patches contained loads of non-UTF-8 characters which corrupted the file.
The fixes remove our dependency upon shipping with the automaton.jar and licenses. The automaton deps are now pulled by ivy.

> Upgrade automaton jar to 1.11-8
> -------------------------------
>
>                 Key: NUTCH-1451
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1451
>             Project: Nutch
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.6, 2.1
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.6, 2.2
>
>
> The latest version 1.11-8 was released September 7, 2011.
> This library is significantly faster than the default regex parsing. I
> haven't got a clue what version we currently use but the license states 2005
> so I'm guessing it's been a long time since it was upgraded.
> I'll get a patch together and for completeness run independent tests to
> compare results pre and post upgrade. It would be nice to see marginal
> improvements :0)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira