Brian Bockelman wrote:
To some extent, this whole issue is caused because we only have enough space for 2 replicas; I'd imagine that at 3 replicas, the issue would be much harder to trigger.
The unfortunate reality is that if you run a configuration that's different than most you'll likely run into bugs that others do not. (As a side note, this is why we should try to minimize configuration options, so that everyone is running much the same system.) Hopefully squashing the two bugs you've filed will substantially help things.
Doug
