Hi, I have a customer who wants to make sure that copies of their data are distributed across datacenters. To that end they are using rack names like /dc1/rack1, /dc1/rack2, /dc2/rack1, etc.
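To make the requirement concrete, here is a rough sketch of the check I'd like the replica locations of every block to pass. It is only an illustration: the DatacenterSpread class and its methods are hypothetical names, not part of HDFS, and it assumes every network location has the /<dc>/<rack> form shown above.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper, NOT an HDFS class: it only expresses the desired invariant.
final class DatacenterSpread {
    private DatacenterSpread() {}

    // Returns the first path component of a network location,
    // e.g. "/dc1/rack2" -> "dc1". Assumes the /<dc>/<rack> layout.
    static String datacenterOf(String networkLocation) {
        String[] parts = networkLocation.split("/");
        // parts[0] is the empty string before the leading "/".
        if (parts.length < 3 || !parts[0].isEmpty() || parts[1].isEmpty()) {
            throw new IllegalArgumentException(
                "expected /<dc>/<rack>, got: " + networkLocation);
        }
        return parts[1];
    }

    // True if the replica locations of one block cover at least two datacenters.
    static boolean spansMultipleDatacenters(List<String> replicaLocations) {
        Set<String> datacenters = new HashSet<>();
        for (String location : replicaLocations) {
            datacenters.add(datacenterOf(location));
        }
        return datacenters.size() >= 2;
    }
}
```

For example, a block with replicas on /dc1/rack1, /dc1/rack2 and /dc2/rack1 passes this check, while one with replicas on /dc1/rack1, /dc1/rack2 and /dc1/rack1 does not.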
Unfortunately, BlockPlacementPolicyDefault sometimes seems to place all copies of a block on /dc1/*. Is there a way to guarantee that both /dc1/* and /dc2/* will be used in this scenario? Looking at chooseRandomWithStorageTypeTwoTrial, it seems to consider the full "scope" and not its components. I couldn't find anything in the code, but I'm hoping I've missed something: is there a way to configure HDFS for the behaviour I'd like? Thanks! Lars