[ 
https://issues.apache.org/jira/browse/NUTCH-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney resolved NUTCH-1451.
-----------------------------------------

    Resolution: Fixed

Committed @revision 1408282 in trunk
Committed @revision 1408289 in 2.2-SNAPSHOT

I didn't upload patches for these fixes as the generated patches contained
loads of non-UTF-8 characters which corrupted the file.
The fixes remove the need to ship automaton.jar and its licenses with the
source. The automaton dependency is now pulled in by Ivy.
                
> Upgrade automaton jar to 1.11-8
> -------------------------------
>
>                 Key: NUTCH-1451
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1451
>             Project: Nutch
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.6, 2.1
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.6, 2.2
>
>
> The latest version 1.11-8 was released September 7, 2011.
> This library is significantly faster than the default regex parsing. I
> don't know which version we currently use, but the license states 2005,
> so I'm guessing it's been a long time since it was upgraded.
> I'll get a patch together and, for completeness, run independent tests to
> compare results pre- and post-upgrade. It would be nice to see more than
> marginal improvements :0)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira
