[ 
https://issues.apache.org/jira/browse/HADOOP-13397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15396023#comment-15396023
 ] 

Allen Wittenauer commented on HADOOP-13397:
-------------------------------------------

I had a long discussion yesterday with some folks about this topic, especially 
around how to make it something consumable for a wide variety of 
installation-types.  One of the big asks was to make it work as a 
self-contained Dockerfile (so no COPY/ADD, RUNs can only reference things 
already inside the image, etc, etc.) as much as possible to allow the 
Dockerfile to be used by some other service and/or the basis of adding more 
content.  This means if I'm using a configuration service such as bcfg2 or 
puppet, it would be able to put down the necessary components at docker build 
or docker run time.  If I'm using something that takes a supplied tar ball, 
then COPY is unavoidable. [1]  It also means it's not a "do everything 
imaginable" feature like HBASE-12721 .  One would still need something to 
actually launch the containers and give some control information.

I've been playing around a bit and have a simple prototype built based upon 
those discussions and what I've seen in Klaus' github repo. I'm going to clean 
it up and flesh it out a bit and probably post a patch in week or so, time 
permitting. (i.e., remove all the hard codes and start making it take 
options... haha.)

[1] I mean, we *could* do something like base64 encode the tar ball and 
extract, but that seems a little extreme. ;)

> Add dockerfile for Hadoop
> -------------------------
>
>                 Key: HADOOP-13397
>                 URL: https://issues.apache.org/jira/browse/HADOOP-13397
>             Project: Hadoop Common
>          Issue Type: Bug
>            Reporter: Klaus Ma
>            Assignee: Allen Wittenauer
>
> For now, there's no community version Dockerfile in Hadoop; most of docker 
> images are provided by vendor, e.g. 
> 1. Cloudera's image: https://hub.docker.com/r/cloudera/quickstart/
> 2.  From HortonWorks sequenceiq: 
> https://hub.docker.com/r/sequenceiq/hadoop-docker/
> 3. MapR provides the mapr-sandbox-base: 
> https://hub.docker.com/r/maprtech/mapr-sandbox-base/
> The proposal of this JIRA is to provide a community version Dockerfile in 
> Hadoop, and here's some requirement:
> 1. Seperated docker image for master & agents, e.g. resource manager & node 
> manager
> 2. Default configuration to start master & agent instead of configurating 
> manually
> 3. Start Hadoop process as no-daemon
> Here's my dockerfile to start master/agent: 
> https://github.com/k82cn/outrider/tree/master/kubernetes/imgs/yarn
> I'd like to contribute it after polishing :).
> Email Thread : 
> http://mail-archives.apache.org/mod_mbox/hadoop-user/201607.mbox/%3CSG2PR04MB162977CFE150444FA022510FB6370%40SG2PR04MB1629.apcprd04.prod.outlook.com%3E



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to