[jira] [Commented] (NUTCH-1344) BasicURLNormalizer to normalize https same as http

2012-10-10 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473066#comment-13473066 ] Julien Nioche commented on NUTCH-1344: -- Good catch, Sebastian. Please commit to both

Re: patches to parse-metatag plugin to save mutliValues

2012-10-10 Thread Lewis John Mcgibbney
Hi Kiran, There is an issue open in Jira for this [0]. It would be really appreciated if you could add your observations/discoveries to it so we can get it logged and hopefully fixed. Thanks again, Lewis [0] https://issues.apache.org/jira/browse/NUTCH-874 On Thu, Oct 4, 2012 at 7:20 PM, kiran

Re: patches to parse-metatag plugin to save mutliValues

2012-10-10 Thread Lewis John Mcgibbney
Hi Kiran, On Wed, Oct 10, 2012 at 12:53 PM, kiran chitturi chitturikira...@gmail.com wrote: This is the problem I observed with a few of the plugins, as I explained in my last email. They use code which is compatible with 1.5 but not with 2.0. Right now, I am almost done with porting

[jira] [Updated] (NUTCH-706) Url regex normalizer: default pattern for session id removal not to match newsId

2012-10-10 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-706: -- Fix Version/s: 2.2 Summary: Url regex normalizer: default pattern for session id

[jira] [Resolved] (NUTCH-706) Url regex normalizer: default pattern for session id removal not to match newsId

2012-10-10 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-706. --- Resolution: Fixed committed to trunk (revision 1396796) and 2.x (revision 1396795)

[jira] [Resolved] (NUTCH-1344) BasicURLNormalizer to normalize https same as http

2012-10-10 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-1344. Resolution: Fixed Fix Version/s: 2.2 1.6 committed to trunk

Re: patches to parse-metatag plugin to save mutliValues

2012-10-10 Thread kiran chitturi
Hi Lewis, This is the problem I observed with a few of the plugins, as I explained in my last email. They use code which is compatible with 1.5 but not with 2.0. Right now, I am almost done with porting parse-metatags and index-metadata to Nutch 2.x. I can look into other plugins after this to

[jira] [Commented] (NUTCH-706) Url regex normalizer: default pattern for session id removal not to match newsId

2012-10-10 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473599#comment-13473599 ] Sebastian Nagel commented on NUTCH-706: --- The first commit was erroneously made with the wrong patch.

[jira] [Commented] (NUTCH-706) Url regex normalizer: default pattern for session id removal not to match newsId

2012-10-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473620#comment-13473620 ] Hudson commented on NUTCH-706: -- Integrated in nutch-trunk-maven #449 (See

[jira] [Commented] (NUTCH-1344) BasicURLNormalizer to normalize https same as http

2012-10-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473621#comment-13473621 ] Hudson commented on NUTCH-1344: --- Integrated in nutch-trunk-maven #449 (See

[jira] [Updated] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2012-10-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-874: --- Attachment: NUTCH-874.patch trivial patch to remove unused classes brought to our

Re: patches to parse-metatag plugin to save mutliValues

2012-10-10 Thread Lewis John Mcgibbney
Hi Kiran, I made the patch to remove the classes you highlighted. The patch passes tests, so I will commit to 2.x head. Thank you for your contrib, Lewis On Wed, Oct 10, 2012 at 3:01 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Kiran, On Wed, Oct 10, 2012 at 12:53 PM, kiran

[jira] [Commented] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2012-10-10 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473654#comment-13473654 ] Lewis John McGibbney commented on NUTCH-874: part 1 e.g. removal of unused

Re: patches to parse-metatag plugin to save mutliValues

2012-10-10 Thread kiran chitturi
Thank you for the help. I am almost done patching up the parse-metatags plugin. I made another post about the plugin and multipleValues in metadata. I will also check other plugins and see if they need any fixes. The patch you made might be enough. I will check it out again in Eclipse. Regards,

[jira] [Created] (NUTCH-1477) NPE when injecting with DataFileAvroStore

2012-10-10 Thread Mike Baranczak (JIRA)
Mike Baranczak created NUTCH-1477: - Summary: NPE when injecting with DataFileAvroStore Key: NUTCH-1477 URL: https://issues.apache.org/jira/browse/NUTCH-1477 Project: Nutch Issue Type: Bug

[jira] [Commented] (NUTCH-1477) NPE when injecting with DataFileAvroStore

2012-10-10 Thread Mike Baranczak (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473756#comment-13473756 ] Mike Baranczak commented on NUTCH-1477: --- I tried upgrading the Avro library to the

[jira] [Commented] (NUTCH-706) Url regex normalizer: default pattern for session id removal not to match newsId

2012-10-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473825#comment-13473825 ] Hudson commented on NUTCH-706: -- Integrated in Nutch-nutchgora #375 (See

[jira] [Commented] (NUTCH-874) Make sure all plugins in src/plugin are compatible with Nutch 2.0 and Gora

2012-10-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473827#comment-13473827 ] Hudson commented on NUTCH-874: -- Integrated in Nutch-nutchgora #375 (See

[jira] [Commented] (NUTCH-1344) BasicURLNormalizer to normalize https same as http

2012-10-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473826#comment-13473826 ] Hudson commented on NUTCH-1344: --- Integrated in Nutch-nutchgora #375 (See

Jenkins build is back to normal : Nutch-trunk #1984

2012-10-10 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-trunk/1984/