Hi,

I am creating a multi-node Hadoop cluster for a personal project, and I would 
like to use the official docker image 
(apache/hadoop<https://hub.docker.com/r/apache/hadoop>).

However, looking at the official docker image documentation and the 
docker-compose file I have seen the following environment variable:

environment:
          ENSURE_NAMENODE_DIR: "/tmp/hadoop-root/dfs/name"

I would like to know if it is safe to create the namenode directory in the /tmp 
folder since this kind of folder is neither secure nor data persistent. Thus, I 
would like to understand which path is the best practice for this. Moreover, 
which are other environment variables I could use of.

Thanks a lot, in advance.

Kind regards,

Nikos Spanos

M.Sc Business Analytics & Big Data| Athens University of Economics & Business
Phone Number: +306982310494
Linkedin profile <http://linkedin.com/in/spanosnikos>

Reply via email to