The Hadoop Docker image is not intended for production use. That's why the NameNode directory sits under /tmp. But we should update that if people are thinking of using it for production. I'm not familiar with Docker Compose, but contributions are welcome: https://github.com/apache/hadoop/blob/docker-hadoop-3/docker-compose.yaml
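
For anyone who wants the NameNode metadata to survive container restarts, here is a rough, untested sketch of what a compose override might look like. The ENSURE_NAMENODE_DIR variable comes from the existing compose file, and dfs.namenode.name.dir is the standard HDFS property for the metadata directory. Everything else is an assumption on my part: the image tag, the command, the volume name "namenode-data", the mount path "/data/namenode", and the idea that the image turns "HDFS-SITE.XML_"-prefixed variables into hdfs-site.xml entries. The user inside the container also needs write access to the mounted path, which I have not checked.

```yaml
# Sketch only: volume name, mount path, image tag and command are assumptions.
services:
  namenode:
    image: apache/hadoop:3
    command: ["hdfs", "namenode"]
    environment:
      # Assumed naming convention for generating hdfs-site.xml entries;
      # dfs.namenode.name.dir itself is a standard HDFS property.
      HDFS-SITE.XML_dfs.namenode.name.dir: "/data/namenode"
      # Same variable as in the existing compose file, pointed at the mounted path.
      ENSURE_NAMENODE_DIR: "/data/namenode"
    volumes:
      # A named volume keeps the metadata outside the container's writable layer.
      - namenode-data:/data/namenode

volumes:
  namenode-data:
```

The two paths are kept identical on purpose: if ENSURE_NAMENODE_DIR and dfs.namenode.name.dir disagree, the directory that gets prepared would not be the one the NameNode actually uses.
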
On Fri, Sep 22, 2023 at 5:44 AM Nikos Spanos <spanos.nikol...@outlook.com> wrote:

> Hi,
>
> I am creating a multi-node Hadoop cluster for a personal project, and I
> would like to use the official docker image (apache/hadoop
> <https://hub.docker.com/r/apache/hadoop>).
>
> However, looking at the official docker image documentation and the
> docker-compose file I have seen the following environment variable:
>
> environment:
>   ENSURE_NAMENODE_DIR: "/tmp/hadoop-root/dfs/name"
>
> I would like to know if it is safe to create the namenode directory in the
> /tmp folder since this kind of folder is neither secure nor data
> persistent. Thus, I would like to understand which path is the best
> practice for this. Moreover, which are other environment variables I could
> use of.
>
> Thanks a lot, in advance.
>
> Kind regards,
>
> *Nikos Spanos*
>
> M.Sc Business Analytics & Big Data | Athens University of Economics &
> Business
>
> Phone Number: +306982310494
>
> Linkedin profile <http://linkedin.com/in/spanosnikos>