[ https://issues.apache.org/jira/browse/NUTCH-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lewis John McGibbney resolved NUTCH-1451.
-----------------------------------------
Resolution: Fixed
Committed @revision 1408282 in trunk
Committed @revision 1408289 in 2.2-SNAPSHOT
I didn't upload patches for these fixes because the generated patches contained
loads of non-UTF-8 characters, which corrupted the file.
With these fixes we no longer need to ship automaton.jar and its licenses
ourselves. The automaton dependencies are now pulled in by Ivy.
> Upgrade automaton jar to 1.11-8
> -------------------------------
>
> Key: NUTCH-1451
> URL: https://issues.apache.org/jira/browse/NUTCH-1451
> Project: Nutch
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.6, 2.1
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Priority: Minor
> Fix For: 1.6, 2.2
>
>
> The latest version, 1.11-8, was released on September 7, 2011.
> This library is significantly faster than the default regex parsing. I
> haven't got a clue what version we currently use, but the license states 2005,
> so I'm guessing it's been a long time since it was upgraded.
> I'll get a patch together and, for completeness, run independent tests to
> compare results pre and post upgrade. It would be nice to see more than
> marginal improvements :0)