[jira] [Comment Edited] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16388650#comment-16388650 ] Lewis John McGibbney edited comment on NUTCH-2517 at 3/6/18 10:50 PM: --

[jira] [Comment Edited] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16388650#comment-16388650 ] Lewis John McGibbney edited comment on NUTCH-2517 at 3/6/18 10:49 PM: --

[jira] [Commented] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16388650#comment-16388650 ] Lewis John McGibbney commented on NUTCH-2517: - I cannot reproduce this... see below for tests

[jira] [Commented] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16388718#comment-16388718 ] Lewis John McGibbney commented on NUTCH-2517: - Should be noted that I didn't run this from the

[jira] [Assigned] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2517: --- Assignee: Lewis John McGibbney > mergesegs corrupts segment data >

[jira] [Comment Edited] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16388650#comment-16388650 ] Lewis John McGibbney edited comment on NUTCH-2517 at 3/6/18 11:09 PM: --

[jira] [Commented] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Marco Ebbinghaus (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389143#comment-16389143 ] Marco Ebbinghaus commented on NUTCH-2517: - I can also reproduce this when NOT running this from a

[jira] [Updated] (NUTCH-2517) mergesegs corrupts segment data

2018-03-06 Thread Marco Ebbinghaus (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Ebbinghaus updated NUTCH-2517: Attachment: Screenshot_2018-03-07_07-50-05.png > mergesegs corrupts segment data >

[jira] [Created] (NUTCH-2524) Crawl Script , if file exists in HDFS doesnt work.

2018-03-06 Thread Semyon Semyonov (JIRA)
Semyon Semyonov created NUTCH-2524: -- Summary: Crawl Script , if file exists in HDFS doesnt work. Key: NUTCH-2524 URL: https://issues.apache.org/jira/browse/NUTCH-2524 Project: Nutch Issue

[jira] [Commented] (NUTCH-2524) Crawl Script , if file exists in HDFS doesnt work.

2018-03-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387569#comment-16387569 ] ASF GitHub Bot commented on NUTCH-2524: --- okedoki opened a new pull request #291: NUTCH-2524 URL:

[jira] [Commented] (NUTCH-2522) Bidirectional URL exemption filter

2018-03-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387496#comment-16387496 ] ASF GitHub Bot commented on NUTCH-2522: --- okedoki opened a new pull request #290: NUTCH-2522 URL:

[jira] [Commented] (NUTCH-2522) Bidirectional URL exemption filter

2018-03-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387536#comment-16387536 ] ASF GitHub Bot commented on NUTCH-2522: --- sebastian-nagel commented on a change in pull request #290:

[jira] [Commented] (NUTCH-2522) Bidirectional URL exemption filter

2018-03-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387538#comment-16387538 ] ASF GitHub Bot commented on NUTCH-2522: --- sebastian-nagel commented on a change in pull request #290:

[jira] [Commented] (NUTCH-2522) Bidirectional URL exemption filter

2018-03-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387537#comment-16387537 ] ASF GitHub Bot commented on NUTCH-2522: --- sebastian-nagel commented on a change in pull request #290:

[jira] [Commented] (NUTCH-2519) Log mapreduce job counters in local mode

2018-03-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387440#comment-16387440 ] ASF GitHub Bot commented on NUTCH-2519: --- sebastian-nagel closed pull request #287: NUTCH-2519 Log

[jira] [Resolved] (NUTCH-2519) Log mapreduce job counters in local mode

2018-03-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2519. Resolution: Fixed Assignee: Sebastian Nagel Committed to 1.x and 2.x. Thanks for the

[jira] [Resolved] (NUTCH-2521) SitemapProcessor to use property sitemap.redir.max

2018-03-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2521. Resolution: Fixed Fixed for 1.x, thanks! > SitemapProcessor to use property

[jira] [Commented] (NUTCH-2521) SitemapProcessor to use property sitemap.redir.max

2018-03-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387465#comment-16387465 ] ASF GitHub Bot commented on NUTCH-2521: --- sebastian-nagel closed pull request #289: NUTCH-2521

[jira] [Assigned] (NUTCH-2521) SitemapProcessor to use property sitemap.redir.max

2018-03-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2521: -- Assignee: Sebastian Nagel > SitemapProcessor to use property sitemap.redir.max >

Jenkins build is back to normal : Nutch-nutchgora #1602

2018-03-06 Thread Apache Jenkins Server
See

[jira] [Commented] (NUTCH-2519) Log mapreduce job counters in local mode

2018-03-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387472#comment-16387472 ] Hudson commented on NUTCH-2519: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3504 (See

[jira] [Commented] (NUTCH-2520) Wrong Accept-Charset sent when http.accept.charset is not defined

2018-03-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387473#comment-16387473 ] Hudson commented on NUTCH-2520: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3504 (See

[jira] [Updated] (NUTCH-2520) Wrong Accept-Charset sent when http.accept.charset is not defined

2018-03-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2520: --- Fix Version/s: 2.4 > Wrong Accept-Charset sent when http.accept.charset is not defined >

[jira] [Updated] (NUTCH-2520) Wrong Accept-Charset sent when http.accept.charset is not defined

2018-03-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-2520: --- Affects Version/s: 2.4 > Wrong Accept-Charset sent when http.accept.charset is not defined >

[jira] [Commented] (NUTCH-2519) Log mapreduce job counters in local mode

2018-03-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387463#comment-16387463 ] Hudson commented on NUTCH-2519: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1602 (See

[jira] [Commented] (NUTCH-2520) Wrong Accept-Charset sent when http.accept.charset is not defined

2018-03-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387461#comment-16387461 ] ASF GitHub Bot commented on NUTCH-2520: --- sebastian-nagel closed pull request #288: NUTCH-2520 Use

[jira] [Resolved] (NUTCH-2520) Wrong Accept-Charset sent when http.accept.charset is not defined

2018-03-06 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2520. Resolution: Fixed Assignee: Sebastian Nagel Fixed in 1.x and 2.x. Thanks! > Wrong

[jira] [Commented] (NUTCH-2520) Wrong Accept-Charset sent when http.accept.charset is not defined

2018-03-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387464#comment-16387464 ] Hudson commented on NUTCH-2520: --- SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1602 (See

[jira] [Commented] (NUTCH-2522) Bidirectional URL exemption filter

2018-03-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387539#comment-16387539 ] ASF GitHub Bot commented on NUTCH-2522: --- sebastian-nagel commented on a change in pull request #290:

[jira] [Commented] (NUTCH-2521) SitemapProcessor to use property sitemap.redir.max

2018-03-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16387541#comment-16387541 ] Hudson commented on NUTCH-2521: --- SUCCESS: Integrated in Jenkins build Nutch-trunk #3505 (See