Hello,

How should I handle environment separation in HDFS? My initial thought was
to use a directory structure like /data/<env>/<table-user>/<table-name>,
but I'm running into problems when reading the files back out of HDFS (for
example, to merge small files into larger ones). The ListHDFS processor
doesn't allow input connections, so I can't specify the environment with
an attribute. Would this require me to run two instances of NiFi and use
some sort of environment/system variable lookup in the Expression Language
(EL)? Is it even common practice to encode the environment in the directory
structure, or do people generally have an HDFS instance per environment
instead? Sorry if this question extends outside the scope of NiFi.
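
To make the layout concrete, here is a minimal Python sketch of the path
scheme I have in mind. It is only an illustration: the variable name
NIFI_ENV, the "dev" default, and the helper hdfs_table_dir are hypothetical
names I made up, not anything NiFi or HDFS provides.

```python
import os
import posixpath


def hdfs_table_dir(table_user, table_name, env=None):
    """Build /data/<env>/<table-user>/<table-name> for one environment.

    NIFI_ENV is a hypothetical per-instance environment variable; if it is
    unset and no env is passed in, fall back to an assumed "dev" default.
    """
    if env is None:
        env = os.environ.get("NIFI_ENV", "dev")
    # HDFS paths always use forward slashes, so join with posixpath
    return posixpath.join("/data", env, table_user, table_name)


if __name__ == "__main__":
    print(hdfs_table_dir("sales", "orders", env="prod"))
```

With one such variable set per NiFi instance, the same flow definition
would resolve to a different directory in each environment.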

Thanks!
