Re: Update schema to get solrdedup working again

2011-05-11 Thread Julien Nioche
Resending to dev@nutch - had sent to markus only We still need to do something about the moreindexing filter. https://issues.apache.org/jira/browse/NUTCH-985 For now a quick fix for the moreindexingfilter would be OK, but we can maybe create a new issue for 1.4 and rely on Date objects

[jira] [Commented] (NUTCH-985) MoreIndexingFilter doesn't use properly formatted date fields for Solr

2011-05-11 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13031682#comment-13031682 ] Markus Jelsma commented on NUTCH-985: - From dev@nutch For now a quick fix for the

[jira] [Commented] (NUTCH-937) When nutch is run on hadoop 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967)

2011-05-11 Thread Viksit Gaur (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13031939#comment-13031939 ] Viksit Gaur commented on NUTCH-937: --- A workaround for this is: - Set the following in

[jira] [Updated] (NUTCH-961) Expose Tika's boilerpipe support

2011-05-11 Thread Gabriele Kahlout (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabriele Kahlout updated NUTCH-961: --- Attachment: NUTCH-961-1.3-tikaparser1.patch Same as NUTCH-961-1.3-tikaparser.patch by Markus

[jira] [Updated] (NUTCH-961) Expose Tika's boilerpipe support

2011-05-11 Thread Gabriele Kahlout (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabriele Kahlout updated NUTCH-961: --- Attachment: NUTCH-961-1.3-tikaparser1.patch Modified to include necessary changes to