I have few questions regarding the usage of DistCP for copying files in the
same cluster.
1) Which one is better within a same cluster and what factors (like file
size etc) wouldinfluence the usage of one over te other?
2) when we run a cp command like below from a client node of the cluster
(not a data node), How does the cp command work
i) like an MR job
ii) copy files locally and then it copy it back at the new location.
Example of the copy command
hdfs dfs -cp /<some_location>/file /<new_location>/
Thanks, your responses are appreciated.
-- Kay