hello , Yes you can do this by specify in hadoop-site.xml about the location of namenode , where your data is already get distributed.
--------------------------------------------------------------- <property> <name>fs.default.name</name> <value> <IPAddress:PortNo> </value> </property> --------------------------------------------------------------- Thanks --- Peeyush On Thu, 2008-03-27 at 15:41 -0400, Jean-Pierre wrote: > Hello, > > I'm working on large amount of logs, and I've noticed that the > distribution of data on the network (./hadoop dfs -put input input) > takes a lot of time. > > Let's says that my data is already distributed among the network, is > there anyway to say to hadoop to use the already existing > distribution ?. > > Thanks >
