[GitHub] nutch pull request: Fix the issue of the bad tstamp

2016-02-29 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/nutch/pull/94 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[jira] [Resolved] (NUTCH-2213) CommonCrawlDataDumper saves gzipped body in extracted form

2016-02-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-2213. -- Resolution: Fixed Fix Version/s: 1.12 > CommonCrawlDataDumper saves gzipped body

[jira] [Commented] (NUTCH-2213) CommonCrawlDataDumper saves gzipped body in extracted form

2016-02-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15173188#comment-15173188 ] Chris A. Mattmann commented on NUTCH-2213: -- Fixed thanks [~jnioche]! {noformat}

[GitHub] nutch pull request: NUTCH-2213 : do not store the headers verbatim...

2016-02-29 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/nutch/pull/88 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[jira] [Commented] (NUTCH-2213) CommonCrawlDataDumper saves gzipped body in extracted form

2016-02-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15173182#comment-15173182 ] ASF GitHub Bot commented on NUTCH-2213: --- Github user asfgit closed the pull request at:

[jira] [Work started] (NUTCH-2213) CommonCrawlDataDumper saves gzipped body in extracted form

2016-02-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2213 started by Chris A. Mattmann. > CommonCrawlDataDumper saves gzipped body in extracted form >

[GitHub] nutch pull request: Fix the issue of the bad tstamp

2016-02-29 Thread jeremie70
GitHub user jeremie70 opened a pull request: https://github.com/apache/nutch/pull/94 Fix the issue of the bad tstamp The tstamp was everytime equal to "1970-01-01T00:00:00.000Z" cause of this. You can merge this pull request into a Git repository by running: $ git pull

[jira] [Commented] (NUTCH-2222) re-fetch deletes all metadata except _csh_ and _rs_

2016-02-29 Thread Adnane B. (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171889#comment-15171889 ] Adnane B. commented on NUTCH-: -- Thank you very match! > re-fetch deletes all metadata except _csh_

[jira] [Commented] (NUTCH-2236) Upgrade to Hadoop 2.7.1

2016-02-29 Thread Tien Nguyen Manh (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171725#comment-15171725 ] Tien Nguyen Manh commented on NUTCH-2236: - No problem, just to make it run on Hadoop 2.7.1 >

[jira] [Commented] (NUTCH-2236) Upgrade to Hadoop 2.7.1

2016-02-29 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171705#comment-15171705 ] Markus Jelsma commented on NUTCH-2236: -- Hello Tien - what problem does this patch solve? Thanks! >

[jira] [Assigned] (NUTCH-2236) Upgrade to Hadoop 2.7.1

2016-02-29 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma reassigned NUTCH-2236: Assignee: Markus Jelsma > Upgrade to Hadoop 2.7.1 > --- > >

[jira] [Updated] (NUTCH-2236) Upgrade to Hadoop 2.7.1

2016-02-29 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated NUTCH-2236: Fix Version/s: 1.12 > Upgrade to Hadoop 2.7.1 > --- > >