Thanks Mark, for the response.
I have my input on the server as local files. we haven't thought if we might set-up a NFS server. We have configured the server machine - installed Hadoop and have HFS setup. To achieve my goal, What is the change that you would recommend over the pipeline I suggested ?
