Any fix for this?

2011-05-11 Thread Steve Cohen
that connects has precedence. Is it possible to just wipe one of the datanodes so it is blank or do we have to format the entire hdfs filesystem from the namenode to add the new datanode. Thanks, Steve Cohen

Re: Any fix for this?

2011-05-11 Thread Steve Cohen
f its blocks as well. > > When you add a "fresh" datanode to the cluster, you add one that has an > empty dfs.data.dir. > > Try clearing out dfs.data.dir before adding the new node. > > Jeff > > > > On Wed, May 11, 2011 at 1:59 PM, Steve Cohen wrote:

NotReplicatedYetException .20.2

2011-05-18 Thread Steve Cohen
ee several other reports of this type of error but I haven't seen a solution. Thanks, Steve Cohen

replicating existing blocks?

2011-05-18 Thread Steve Cohen
Say I add a datanode to a pseudo cluster and I want to change the replication factor to 2. I see that I can either run hadoop fs -setrep or change the hdfs-site.xml value for dfs.replication. But do either of these cause the existing blocks to replicate? Thanks, Steve Cohen

Re: replicating existing blocks?

2011-05-18 Thread Steve Cohen
Thanks for the answer. Earlier, I asked about why I get occasional not replicated yet errors. Now, I had dfs.replication set to one. What replication could it have been doing? Did the error messages actually mean that the file couldn't get created in the cluster? Thanks, Steve Cohen O

Re: replicating existing blocks?

2011-05-18 Thread Steve Cohen
les is 10, which > obviously doesn't work well on a psuedo-distributed cluster. > > -Joey > > On Wed, May 18, 2011 at 5:07 PM, Steve Cohen wrote: >> Thanks for the answer. Earlier, I asked about why I get occasional not >> replicated yet errors. Now, I had dfs.rep

Re: replicating existing blocks?

2011-05-19 Thread Steve Cohen
hould be around the square root of the number of nodes. >   > > > You can set it per job in the job specific conf and/or in mapred-site.xml. > > > Friso > > > > On 19 mei 2011, at 03:42, Steve Cohen wrote: > >> Where is the default replication factor on jo

Re: replicating existing blocks?

2011-05-19 Thread Steve Cohen
One last question about these replication values. If dfs.replication and mapred.submit.replication are set to 1, does that mean they get copied one time so there are two dfs blocks and two job files or does it mean there is one dfs block and one job file? Thanks, Steve Cohen On Thu, May 19, 2011

Re: replicating existing blocks?

2011-05-19 Thread Steve Cohen
Thanks for clarifying. On Thu, May 19, 2011 at 1:50 PM, Joey Echeverria wrote: > The later. Replication of 1 means there's only one copy of any given > data block. If you lose that replica, you lose your data. > > -Joey > > On Thu, May 19, 2011 at 9:44 AM, Steve Co