I am not sure that you are following the right techniqs. We have the same
issue concerning loading master/slave, still trying to find some more
details how to do it better but could not advice you now..
keep posting probably sombody can give you the correct answer, good
questions actually
thanks,
DT
www.ejinz.com
Search News
----- Original Message -----
From: "Venkates .P.B." <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Friday, August 03, 2007 1:41 AM
Subject: Re: Loading data into HDFS
Am I missing something very fundamental ? Can someone comment on these
queries ?
Thanks,
Venkates P B
On 8/1/07, Venkates .P.B. <[EMAIL PROTECTED]> wrote:
Few queries regarding the way data is loaded into HDFS.
-Is it a common practice to load the data into HDFS only through the
master node ? We are able to copy only around 35 logs (64K each) per
minute
in a 2 slave configuration.
-We are concerned about time it would take to update filenames and block
maps in the master node when data is loaded from few/all the slave nodes.
Can anyone let me know how long generally it takes for this update to
happen.
And one more question, what if the node crashes soon after the data is
copied into one it. How is data consistency maintained here ?
Thanks in advance,
Venkates P B