[jira] [Commented] (NUTCH-2723) Indexer Solr not to decode URLs before deletion
[ https://issues.apache.org/jira/browse/NUTCH-2723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16883709#comment-16883709 ] Hudson commented on NUTCH-2723: --- FAILURE: Integrated in Jenkins build Nutch-trunk #3628 (See [https://builds.apache.org/job/Nutch-trunk/3628/]) NUTCH-2723 Indexer Solr not to decode URLs before deletion (markus: [https://github.com/apache/nutch/commit/5150c442a78d15c042ee6fb12e6dbea8ec5341e6]) * (edit) src/plugin/indexer-solr/src/java/org/apache/nutch/indexwriter/solr/SolrIndexWriter.java > Indexer Solr not to decode URLs before deletion > --- > > Key: NUTCH-2723 > URL: https://issues.apache.org/jira/browse/NUTCH-2723 > Project: Nutch > Issue Type: Bug > Components: indexer >Affects Versions: 1.14 >Reporter: Markus Jelsma >Assignee: Markus Jelsma >Priority: Major > Fix For: 1.16 > > Attachments: NUTCH-2723.patch > > > URLs are indexed in their raw encoded form. But indexer-solr incorrectly > decodes them just before they are sent for deletion, leading to a state where > a bunch of URLs are never deleted. -- This message was sent by Atlassian JIRA (v7.6.14#76016)
[jira] [Commented] (NUTCH-2723) Indexer Solr not to decode URLs before deletion
[ https://issues.apache.org/jira/browse/NUTCH-2723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16882905#comment-16882905 ] Sebastian Nagel commented on NUTCH-2723: Hi [~markus17], good catch, please commit! Thanks. > Indexer Solr not to decode URLs before deletion > --- > > Key: NUTCH-2723 > URL: https://issues.apache.org/jira/browse/NUTCH-2723 > Project: Nutch > Issue Type: Bug > Components: indexer >Affects Versions: 1.14 >Reporter: Markus Jelsma >Assignee: Markus Jelsma >Priority: Major > Fix For: 1.16 > > Attachments: NUTCH-2723.patch > > > URLs are indexed in their raw encoded form. But indexer-solr incorrectly > decodes them just before they are sent for deletion, leading to a state where > a bunch of URLs are never deleted. -- This message was sent by Atlassian JIRA (v7.6.14#76016)