Assume for a moment that you have a large cluster of 500 AWS spot instances running, and you want to keep the bid price low. At some point the spot price will likely rise above the bid, and the whole cluster will be terminated until the price comes back down.
To maintain HDFS continuity, I'd want, say, 10 servers running as normal (on-demand) instances, and I'd want to ensure that HDFS keeps a replica of 100% of the data on those 10, since they don't run the risk of being eliminated as a group. Is it possible for HDFS to guarantee replication to these "primary" nodes?
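
One direction I've been looking at is HDFS rack awareness: as far as I understand, the default block placement policy tries to spread each block's replicas across at least two racks when more than one rack exists. If the on-demand nodes are reported as one "rack" and the spot nodes as another, every block with a replication factor of 2 or more should end up with a copy on each side. Below is a minimal sketch of a topology script for that idea. The IP addresses and the rack names `/on-demand` and `/spot` are made up for illustration, and I haven't verified that this gives a hard 100% guarantee (the 10 nodes would also need enough disk to hold a full copy of the data).

```python
#!/usr/bin/env python3
"""Hypothetical HDFS topology script: maps each host to one of two racks."""
import sys

# Assumption: placeholder IPs standing in for the 10 on-demand DataNodes.
ON_DEMAND_HOSTS = {
    "10.0.0.11",
    "10.0.0.12",
    "10.0.0.13",
}


def rack_for(host):
    """Return the rack path for a host; anything not listed counts as spot."""
    return "/on-demand" if host in ON_DEMAND_HOSTS else "/spot"


def main(argv):
    # Hadoop passes one or more IPs/hostnames as arguments and expects
    # one rack path per argument on stdout, separated by whitespace.
    print(" ".join(rack_for(host) for host in argv))


if __name__ == "__main__":
    main(sys.argv[1:])
```

The script would be made executable and referenced from `core-site.xml` through the `net.topology.script.file.name` property.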
