[jira] [Commented] (NUTCH-1733) parse-html to support HTML5 charset definitions

2014-03-18 Thread lufeng (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938867#comment-13938867 ] lufeng commented on NUTCH-1733: --- +1 pass all tests parse-html to support HTML5 charset

[jira] [Created] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread ysc (JIRA)
ysc created NUTCH-1739: -- Summary: ExecutorService field in ParseUtil.java not be right use and cause memory leak Key: NUTCH-1739 URL: https://issues.apache.org/jira/browse/NUTCH-1739 Project: Nutch

[jira] [Updated] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread ysc (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ysc updated NUTCH-1739: --- Attachment: nutch1.7.patch This patch is produced in the environment of nutch1.7. You can reference this patch to

[jira] [Commented] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938922#comment-13938922 ] Alparslan Avcı commented on NUTCH-1739: --- It seems there is no problem for 2.x since

[jira] [Comment Edited] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938922#comment-13938922 ] Alparslan Avcı edited comment on NUTCH-1739 at 3/18/14 7:32 AM:

[jira] [Updated] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread ysc (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ysc updated NUTCH-1739: --- Affects Version/s: (was: 2.2.1) (was: 2.2) (was: 2.1)

[jira] [Comment Edited] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938930#comment-13938930 ] Sebastian Nagel edited comment on NUTCH-1739 at 3/18/14 7:43 AM:

[jira] [Commented] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938930#comment-13938930 ] Sebastian Nagel commented on NUTCH-1739: Thanks, [~yangshangchuan]. But isn't this

[jira] [Updated] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread ysc (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ysc updated NUTCH-1739: --- Affects Version/s: 2.1 2.2 2.2.1 ExecutorService field in

[jira] [Commented] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread ysc (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938941#comment-13938941 ] ysc commented on NUTCH-1739: Thanks, [~alparslan.avci] , you are right. Nutch2.1 hasn't

[jira] [Comment Edited] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread ysc (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938892#comment-13938892 ] ysc edited comment on NUTCH-1739 at 3/18/14 8:04 AM: - This patch is

[jira] [Updated] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread ysc (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ysc updated NUTCH-1739: --- Attachment: nutch2.2.1.patch This patch is produced in the environment of nutch2.2.1. You can reference this patch

[jira] [Commented] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread ysc (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938945#comment-13938945 ] ysc commented on NUTCH-1739: Thanks, [~wastl-nagel] , you are right, i just now saw it. In

[jira] [Comment Edited] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-18 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938960#comment-13938960 ] Talat UYARER edited comment on NUTCH-1738 at 3/18/14 8:29 AM: --

[jira] [Updated] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-18 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1738: Attachment: NUTCH-1738.patch Hi [~lewis] , I attached a patch for this information. Can you

[jira] [Commented] (NUTCH-1739) ExecutorService field in ParseUtil.java not be right use and cause memory leak

2014-03-18 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938962#comment-13938962 ] Alparslan Avcı commented on NUTCH-1739: --- Hi [~yangshangchuan], and thanks for the

[jira] [Updated] (NUTCH-1740) BatchId parameter is not set in DbUpdaterJob

2014-03-18 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alparslan Avcı updated NUTCH-1740: -- Attachment: NUTCH-1556-batchId.patch This is fixed for 2.x in NUTCH-1556. Uploading the

[jira] [Created] (NUTCH-1740) BatchId parameter is not set in DbUpdaterJob

2014-03-18 Thread JIRA
Alparslan Avcı created NUTCH-1740: - Summary: BatchId parameter is not set in DbUpdaterJob Key: NUTCH-1740 URL: https://issues.apache.org/jira/browse/NUTCH-1740 Project: Nutch Issue Type: Bug

[jira] [Commented] (NUTCH-1733) parse-html to support HTML5 charset definitions

2014-03-18 Thread John Lafitte (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939540#comment-13939540 ] John Lafitte commented on NUTCH-1733: - It might just be specific to my files or

[jira] [Commented] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-03-18 Thread Shanaka Jayasundera (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939553#comment-13939553 ] Shanaka Jayasundera commented on NUTCH-1478: Hi All, I've downloaded latest

[jira] [Updated] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1738: Assignee: (was: Lewis John McGibbney) Expose number of URLs generated per

[jira] [Updated] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1738: Assignee: Talat UYARER Expose number of URLs generated per batch in GeneratorJob

[jira] [Updated] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1738: Patch Info: Patch Available Expose number of URLs generated per batch in

[jira] [Commented] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939873#comment-13939873 ] Lewis John McGibbney commented on NUTCH-1738: - Assigned to you [~talat] for

[jira] [Resolved] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1738. - Resolution: Fixed Committed @revision 1579072 in 2.x HEAD Thank you [~talat]

[jira] [Commented] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-18 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13939912#comment-13939912 ] Hudson commented on NUTCH-1738: --- SUCCESS: Integrated in Nutch-nutchgora #958 (See