I have a machine that stores a lot of the data I need to put into my
cluster's HDFS.  It's on the same private network as the nodes, but it isn't
a node itself.

What is the easiest way to have it be able to directly inject the data files
into HDFS, without it acting as a datanode for replicas?

I tried an NFS mount, but something either within Hadoop, NFS, my hardware,
or somewhere else, and it would always hang when transferring more than a
few hundred files.

I'm hoping for a more direct solution, like setting up a dummy datanode
without a local storage space or something.  Just wondering if there's a
trick to that, or something.

-Colin

Reply via email to