Re: Automaton improvements

2011-07-25 Thread Dawid Weiss
I don't think this will make it into a separate library, Julien. It's a port of brics and done specifically so that it fits Lucene's internal needs. If anything, I would just make Nutch require Lucene as a dependency -- this would provide more stable updates. Dawid On Mon, Jul 25, 2011 at 10:35

Re: Automaton improvements

2011-07-25 Thread Julien Nioche
Hi Dawid, This was a bit of wishful thinking indeed :-) With a bit of luck the improvements will be added to brics, but as you pointed out we can always use the lucene jar anyway. BTW you are too modest, you should have pointed to the video of your talk in Berlin http://vimeo.com/26517310 which

Re: Automaton improvements

2011-07-25 Thread Dawid Weiss
It is actually Robert Muir and Mike McCandless doing the heavy lifting here, so modesty has nothing to do with it :) I just think it'll stay inside Lucene because it is often tweaked and tuned. Plus, there is the FSTBuilder and associated classes which provide yet another way to build and traverse

[jira] [Commented] (NUTCH-1065) New mvn.template

2011-07-25 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13070453#comment-13070453 ] Julien Nioche commented on NUTCH-1065: -- +1 thanks New mvn.template

Re: Automaton improvements

2011-07-25 Thread Kirby Bohling
https://issues.apache.org/jira/browse/NUTCH-1068 Issue created, patch attached. Once I hear back from the author about getting it included in the upstream library, I'll update the issue. I'm really not able to pursue directly, as I'm not much of a Nutch user at the moment. I've lurked on the

[jira] [Commented] (NUTCH-1034) Create Solr Velocity templates

2011-07-25 Thread Umar Shah (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13070696#comment-13070696 ] Umar Shah commented on NUTCH-1034: -- use doc.vm.patch and facets.vm.patch to get search

[jira] [Created] (NUTCH-1069) readlinkdb throws exception

2011-07-25 Thread Markus Jelsma (JIRA)
readlinkdb throws exception --- Key: NUTCH-1069 URL: https://issues.apache.org/jira/browse/NUTCH-1069 Project: Nutch Issue Type: Bug Affects Versions: 1.4 Reporter: Markus Jelsma Assignee:

[jira] [Commented] (NUTCH-1045) MimeUtil to rely on default config provided by Tika

2011-07-25 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13070930#comment-13070930 ] Hudson commented on NUTCH-1045: --- Integrated in Nutch-trunk #1557 (See