I have hundreds of large files ( ~ 100MB ) in a /mnt/ location which is shared by all my hadoop nodes. Was wondering if I could directly use "hadoop distcp file:///mnt/data/tr* /input" to parallelize/distribute hadoop push. Hadoop push is indeed becoming a bottle neck for me and any help in this regard is greatly appreciated. Currently I am using "hadoop -moveFromlocal ..." and it is taking too much of time.
-Thanks, Prasen
