Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The following page has been changed by MarkSchnitzius:
http://wiki.apache.org/hadoop/FAQ

------------------------------------------------------------------------------
  On a large cluster removing one or two data-nodes will not lead to any data loss, because name-node will replicate their blocks as long as it will detect that the nodes are dead.
- With a large number of nodes getting removed or dying the probability of loosing data is higher.
+ With a large number of nodes getting removed or dying the probability of losing data is higher.
  Hadoop offers the ''decommission'' feature to retire a set of existing data-nodes.
  The nodes to be retired should be included into the ''exclude file'', and the exclude file name should
