[jira] [Commented] (NUTCH-2144) Plugin to override db.ignore.external to exempt interesting external domain URLs

2016-02-28 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171503#comment-15171503 ] ASF GitHub Bot commented on NUTCH-2144: --- Github user asfgit closed the pull request at:

[jira] [Resolved] (NUTCH-2144) Plugin to override db.ignore.external to exempt interesting external domain URLs

2016-02-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2144. -- Resolution: Fixed OK all fixed thanks [~thammegowda]!

[GitHub] nutch pull request: NUTCH-2144 Added an extension point and a plug...

2016-02-28 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/nutch/pull/93 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[jira] [Commented] (NUTCH-2222) re-fetch deletes all metadata except _csh_ and _rs_

2016-02-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171487#comment-15171487 ] Lewis John McGibbney commented on NUTCH-: - Hi, I can replicate this on

[jira] [Commented] (NUTCH-2144) Plugin to override db.ignore.external to exempt interesting external domain URLs

2016-02-28 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171369#comment-15171369 ] ASF GitHub Bot commented on NUTCH-2144: --- Github user thammegowda closed the pull request at:

[GitHub] nutch pull request: NUTCH-2144 : override db.ignore.external to ex...

2016-02-28 Thread thammegowda
Github user thammegowda closed the pull request at: https://github.com/apache/nutch/pull/89

[jira] [Commented] (NUTCH-2144) Plugin to override db.ignore.external to exempt interesting external domain URLs

2016-02-28 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171366#comment-15171366 ] ASF GitHub Bot commented on NUTCH-2144: --- GitHub user thammegowda opened a pull request:

[GitHub] nutch pull request: NUTCH-2144 Added an extension point and a plug...

2016-02-28 Thread thammegowda
GitHub user thammegowda opened a pull request: https://github.com/apache/nutch/pull/93 NUTCH-2144 Added an extension point and a plugin to accept external links This PR is a duplicate of #89 Recreated due to the issues caused while moving to writable git.

[Nutch Wiki] Trivial Update of "Nutch2Tutorial" by LewisJohnMcgibbney

2016-02-28 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "Nutch2Tutorial" page has been changed by LewisJohnMcgibbney: https://wiki.apache.org/nutch/Nutch2Tutorial?action=diff=16=17 == Obtaining Software and Configuration == *

[jira] [Updated] (NUTCH-2222) re-fetch deletes all metadata except _csh_ and _rs_

2016-02-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-: Fix Version/s: 2.3.2 > re-fetch deletes all metadata except _csh_ and _rs_ >

[jira] [Updated] (NUTCH-1741) Support of Sitemaps in Nutch 2.x

2016-02-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1741: Fix Version/s: (was: 2.4) 2.3.2 > Support of Sitemaps in

[Nutch Wiki] Trivial Update of "UsingGit" by LewisJohnMcgibbney

2016-02-28 Thread Apache Wiki
The "UsingGit" page has been changed by LewisJohnMcgibbney: https://wiki.apache.org/nutch/UsingGit?action=diff=2=3 Apache Nutch uses the [[http://git-scm.com/|Git]] version control

[jira] [Commented] (NUTCH-2234) Upgrade to elasticsearch 2.1.1

2016-02-28 Thread Tien Nguyen Manh (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171264#comment-15171264 ] Tien Nguyen Manh commented on NUTCH-2234: - elasticsearch 2.1.1 uses httpclient 4.3.6 > Upgrade to

[jira] [Updated] (NUTCH-2236) Upgrade to Hadoop 2.7.1

2016-02-28 Thread Tien Nguyen Manh (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tien Nguyen Manh updated NUTCH-2236: Attachment: NUTCH-2236.patch I run Nutch 1.11 on Hadoop 2.7.1 with this patch. We also need

[jira] [Created] (NUTCH-2236) Upgrade to Hadoop 2.7.1

2016-02-28 Thread Tien Nguyen Manh (JIRA)
Tien Nguyen Manh created NUTCH-2236: --- Summary: Upgrade to Hadoop 2.7.1 Key: NUTCH-2236 URL: https://issues.apache.org/jira/browse/NUTCH-2236 Project: Nutch Issue Type: Improvement