[jira] [Updated] (NUTCH-1467) nutch 1.5.1 not able to parse mutliValued metatags

2012-10-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1467: Attachment: NUTCH-1467-trunk.patch Hi Kiran. I've attached a unified patch for

[jira] [Commented] (NUTCH-585) [PARSE-HTML plugin] Block certain parts of HTML code from being indexed

2012-10-02 Thread Iwan Luijks (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13467595#comment-13467595 ] Iwan Luijks commented on NUTCH-585: --- I can confirm the plugin provided in

[jira] [Commented] (NUTCH-1467) nutch 1.5.1 not able to parse mutliValued metatags

2012-10-02 Thread kiran (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13467598#comment-13467598 ] kiran commented on NUTCH-1467: -- Thank you for the unified patch. I did not know much about

[jira] [Commented] (NUTCH-706) Url regex normalizer

2012-10-02 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13467990#comment-13467990 ] Sebastian Nagel commented on NUTCH-706: --- Are there objections to apply and commit the

[jira] [Commented] (NUTCH-706) Url regex normalizer

2012-10-02 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13468120#comment-13468120 ] Markus Jelsma commented on NUTCH-706: - If tests pass and this solves the problem and