Build failed in Jenkins: Nutch-trunk #2267

2013-07-03 Thread Apache Jenkins Server
See -- [...truncated 1295 lines...] A src/plugin/parse-metatags/src A src/plugin/parse-metatags/src/test A src/plugin/parse-metatags/src/test/org A src/plugin/parse-metatags/sr

[jira] [Commented] (NUTCH-1596) NodeWalker NPE on next node

2013-07-03 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13699508#comment-13699508 ] Markus Jelsma commented on NUTCH-1596: -- Of course! I was already a bit suspicious abo

[jira] [Updated] (NUTCH-1596) NodeWalker NPE on next node

2013-07-03 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-1596: --- Attachment: NUTCH-1596-v1.patch Hi [~markus17], there may be concurrency if there are multipl

[jira] [Commented] (NUTCH-1599) Obtain consensus on new description of Nutch

2013-07-03 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13699362#comment-13699362 ] Markus Jelsma commented on NUTCH-1599: -- nice! thanks > Obtain consen

[jira] [Resolved] (NUTCH-1599) Obtain consensus on new description of Nutch

2013-07-03 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1599. - Resolution: Fixed http://nutch.apache.org/#What+is+Apache+Nutch%3F Please check

[jira] [Updated] (NUTCH-1524) Internal links are not being saved even with change in parameter (db.ignore.internal.links)

2013-07-03 Thread Brian (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brian updated NUTCH-1524: - Attachment: NUTCH-1524.patch > Internal links are not being saved even with change in parameter > (db.ignore

[jira] [Comment Edited] (NUTCH-1524) Internal links are not being saved even with change in parameter (db.ignore.internal.links)

2013-07-03 Thread Brian (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13699127#comment-13699127 ] Brian edited comment on NUTCH-1524 at 7/3/13 4:28 PM: -- Well this was

[jira] [Comment Edited] (NUTCH-1524) Internal links are not being saved even with change in parameter (db.ignore.internal.links)

2013-07-03 Thread Brian (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13699127#comment-13699127 ] Brian edited comment on NUTCH-1524 at 7/3/13 4:12 PM: -- Well this was

[jira] [Commented] (NUTCH-1524) Internal links are not being saved even with change in parameter (db.ignore.internal.links)

2013-07-03 Thread Brian (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13699127#comment-13699127 ] Brian commented on NUTCH-1524: -- Well this was frustrating... it turned out to be due to a bug

[jira] [Commented] (NUTCH-1599) Obtain consensus on new description of Nutch

2013-07-03 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13699115#comment-13699115 ] Tejas Patil commented on NUTCH-1599: I agree with Julien: Nutch should be described as

[jira] [Commented] (NUTCH-1602) improve the readability of metadata in readdb dump normal

2013-07-03 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13699096#comment-13699096 ] Tejas Patil commented on NUTCH-1602: Hi Lufeng, +1 from me too. One minor suggestion:

[jira] [Created] (NUTCH-1602) improve the readability of metadata in readdb dump normal

2013-07-03 Thread lufeng (JIRA)
lufeng created NUTCH-1602: - Summary: improve the readability of metadata in readdb dump normal Key: NUTCH-1602 URL: https://issues.apache.org/jira/browse/NUTCH-1602 Project: Nutch Issue Type: Improv

[jira] [Commented] (NUTCH-1602) improve the readability of metadata in readdb dump normal

2013-07-03 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13699051#comment-13699051 ] Markus Jelsma commented on NUTCH-1602: -- Nice, i've been annoyed with the output as we

[jira] [Updated] (NUTCH-1602) improve the readability of metadata in readdb dump normal

2013-07-03 Thread lufeng (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lufeng updated NUTCH-1602: -- Attachment: NUTCH-1602.patch > improve the readability of metadata in readdb dump normal > ---

[jira] [Commented] (NUTCH-1600) Injector overwrite does not always work properly

2013-07-03 Thread lufeng (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13699034#comment-13699034 ] lufeng commented on NUTCH-1600: --- test work fine. +1 > Injector overwrite d

[jira] [Commented] (NUTCH-1595) Upgrade to Tika 1.4

2013-07-03 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13699021#comment-13699021 ] Julien Nioche commented on NUTCH-1595: -- good idea! > Upgrade to Tika

[jira] [Commented] (NUTCH-1595) Upgrade to Tika 1.4

2013-07-03 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13699006#comment-13699006 ] Markus Jelsma commented on NUTCH-1595: -- Pfff, thanks. Perhaps we should add a how_to_

[jira] [Commented] (NUTCH-1595) Upgrade to Tika 1.4

2013-07-03 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13698831#comment-13698831 ] Julien Nioche commented on NUTCH-1595: -- You've forgotten to upgrade quite a few depen

[jira] [Updated] (NUTCH-1601) ElasticSearchIndexer fails to properly delete documents

2013-07-03 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1601: - Attachment: NUTCH-1601-1.8.patch Patch for trunk! Deletes are coming through! >

[jira] [Created] (NUTCH-1601) ElasticSearchIndexer fails to properly delete documents

2013-07-03 Thread Markus Jelsma (JIRA)
Markus Jelsma created NUTCH-1601: Summary: ElasticSearchIndexer fails to properly delete documents Key: NUTCH-1601 URL: https://issues.apache.org/jira/browse/NUTCH-1601 Project: Nutch Issue T

[jira] [Updated] (NUTCH-1595) Upgrade to Tika 1.4

2013-07-03 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1595: - Attachment: NUTCH-1595-2x.patch ..and patch for 2x. > Upgrade to Tika 1.4 >

[jira] [Updated] (NUTCH-1595) Upgrade to Tika 1.4

2013-07-03 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1595: - Attachment: NUTCH-1595-1.8.patch New patch for trunk, it was missing the poi upgrade.

[jira] [Updated] (NUTCH-1600) Injector overwrite does not always work properly

2013-07-03 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1600: - Attachment: NUTCH-1600-1.8.patch Patch for trunk. > Injector overwrite does not

[jira] [Created] (NUTCH-1600) Injector overwrite does not always work properly

2013-07-03 Thread Markus Jelsma (JIRA)
Markus Jelsma created NUTCH-1600: Summary: Injector overwrite does not always work properly Key: NUTCH-1600 URL: https://issues.apache.org/jira/browse/NUTCH-1600 Project: Nutch Issue Type: Bu

[jira] [Commented] (NUTCH-1599) Obtain consensus on new description of Nutch

2013-07-03 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13698738#comment-13698738 ] Markus Jelsma commented on NUTCH-1599: -- Highly extensible and scalable web crawler so

RE: [ANNOUNCE] Apache Nutch v2.2.1 Released

2013-07-03 Thread Markus Jelsma
Great news, thanks Lewis! -Original message- From: Lewis John Mcgibbney Sent: Tuesday 2nd July 2013 18:32 To: u...@nutch.apache.org; dev@nutch.apache.org Subject: [ANNOUNCE] Apache Nutch v2.2.1 Released Good Afternoon Everyone, The Apache Nutch PMC are very pleased to announce the immedi