[jira] [Commented] (NUTCH-2222) re-fetch deletes all metadata except _csh_ and _rs_

2016-02-26 Thread Adnane B. (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15170226#comment-15170226 ] Adnane B. commented on NUTCH-: -- Hello, Did you reproduce this issue? Please let me

[jira] [Commented] (NUTCH-2234) Upgrade to elasticsearch 2.1.1

2016-02-26 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15169521#comment-15169521 ] Lewis John McGibbney commented on NUTCH-2234: - Out of curiosity. What versions

[Nutch Wiki] Update of "FrontPage" by ChrisMattmann

2016-02-26 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "FrontPage" page has been changed by ChrisMattmann: https://wiki.apache.org/nutch/FrontPage?action=diff&rev1=302&rev2=303 * PluginCentral -- How to write your own plugins and use

[NOTICE] Nutch now using Writeable Git repos at the ASF

2016-02-26 Thread Mattmann, Chris A (3980)
Hi Team, Nutch now officially uses Git to manage its source repos. You can see the final elements to that here: https://issues.apache.org/jira/browse/INFRA-11300 I’ve written a guide for the wiki describing how to migrate your existing SVN checkout to Nutch if you are a user or a developer. Ple

[Nutch Wiki] Update of "UsingGit" by ChrisMattmann

2016-02-26 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "UsingGit" page has been changed by ChrisMattmann: https://wiki.apache.org/nutch/UsingGit?action=diff&rev1=1&rev2=2 Comment: - make Nutch specific - Apache Tika uses the [[http://gi

[Nutch Wiki] Update of "UsingGit" by ChrisMattmann

2016-02-26 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "UsingGit" page has been changed by ChrisMattmann: https://wiki.apache.org/nutch/UsingGit New page: Apache Tika uses the [[http://git-scm.com/|Git]] version control system. Apache p

[jira] [Commented] (NUTCH-2234) Upgrade to elasticsearch 2.1.1

2016-02-26 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15169182#comment-15169182 ] Markus Jelsma commented on NUTCH-2234: -- Nice! I'll get this in once I have that Git t

[jira] [Updated] (NUTCH-2234) Upgrade to elasticsearch 2.1.1

2016-02-26 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-2234: - Fix Version/s: 1.12 > Upgrade to elasticsearch 2.1.1 > -- > >

[jira] [Assigned] (NUTCH-2234) Upgrade to elasticsearch 2.1.1

2016-02-26 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma reassigned NUTCH-2234: Assignee: Markus Jelsma > Upgrade to elasticsearch 2.1.1 > -- >

[Nutch Wiki] Update of "DownloadingNutch" by ChrisMattmann

2016-02-26 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "DownloadingNutch" page has been changed by ChrisMattmann: https://wiki.apache.org/nutch/DownloadingNutch?action=diff&rev1=13&rev2=14 Comment: - git change You have two choices in

[jira] [Commented] (NUTCH-2234) Upgrade to elasticsearch 2.1.1

2016-02-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15169167#comment-15169167 ] Otis Gospodnetic commented on NUTCH-2234: - +1, works for us. > Upgrade to elastic

[jira] [Commented] (NUTCH-1228) Change mapred.task.timeout to mapreduce.task.timeout in fetcher

2016-02-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15169171#comment-15169171 ] Otis Gospodnetic commented on NUTCH-1228: - I think we are using this with Nutch 1.

[jira] [Updated] (NUTCH-1228) Change mapred.task.timeout to mapreduce.task.timeout in fetcher

2016-02-26 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Otis Gospodnetic updated NUTCH-1228: Fix Version/s: 1.12 > Change mapred.task.timeout to mapreduce.task.timeout in fetcher >

Re: [RESULT] [VOTE] Moving to Git

2016-02-26 Thread Mattmann, Chris A (3980)
haha, hopefully you’ll grow to not think that Markus :( ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailsto

RE: [RESULT] [VOTE] Moving to Git

2016-02-26 Thread Markus Jelsma
Thanks Chris! Looks like a hassle, oh well.. M. -Original message- > From:Mattmann, Chris A (3980) > Sent: Tuesday 23rd February 2016 17:59 > To: dev@nutch.apache.org > Subject: Re: [RESULT] [VOTE] Moving to Git > > Would this page help out? I’ll look to replicate it on the Nutch > wik

[jira] [Commented] (NUTCH-961) Expose Tika's boilerpipe support

2016-02-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168821#comment-15168821 ] ASF GitHub Bot commented on NUTCH-961: -- GitHub user jeremie70 opened a pull request:

[GitHub] nutch pull request: Add the boilerpipe parsing adapted from NUTCH-...

2016-02-26 Thread jeremie70
GitHub user jeremie70 opened a pull request: https://github.com/apache/nutch/pull/92 Add the boilerpipe parsing adapted from NUTCH-961 You can merge this pull request into a Git repository by running: $ git pull https://github.com/jeremie70/nutch my-branch Alternatively you c