[
https://issues.apache.org/jira/browse/NUTCH-1924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14302984#comment-14302984
]
Talat UYARER commented on NUTCH-1924:
-------------------------------------
Hi [~rrydziu] and [~lewismc],
First of all Thanks for this wonderful stuff. I have some comments. You can
see below:
- Do we need Hadoop 2.x on our docker images ? Dockers small containers. Nutch
can run local mode. Building Hadoop and other things. IMHO they are overkill.
- At the present Nutch 2.x write/read on Hbase that runs on Hadoop 2 or Hadoop
1. Nutch 2.x can not run on Map Reduce of Hadoop 2. Because of Nutch's Hadoop 1
dependecies confilct with Hadoop 2 cluster. In our dockerfile Hbase is
compiled for Hadoop 2. AFAIK This is unnecessary. There is not difference
clients of Hbase 0.94
- Our docker file is too big. we can update it this way: We create a docker
file which Hbase and Nutch (local mode) run on it without Hadoop 2.
> Nutch + HBase Docker
> --------------------
>
> Key: NUTCH-1924
> URL: https://issues.apache.org/jira/browse/NUTCH-1924
> Project: Nutch
> Issue Type: Sub-task
> Components: build
> Reporter: Lewis John McGibbney
> Assignee: Radosław Stankiewicz
> Fix For: 2.3.1
>
> Attachments: NUTCH-1924.patch
>
>
> ZooKeeper 3.4.5 Hadoop 0.20.204 HBase 0.90.4 Nutch 2.2.1
> https://registry.hub.docker.com/u/stankiewicz/hbase_hadoop_nutch/dockerfile/
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)