Doug Cutting wrote:
Brian Bockelman wrote:
To some extent, this whole issue arises because we only have enough
space for 2 replicas; I'd imagine that at 3 replicas, the issue would
be much harder to trigger.
The unfortunate reality is that if you run a configuration that's
different from most, you'll likely run into bugs that others do not. (As
a side note, this is why we should try to minimize configuration
options, so that everyone is running much the same system.)
Alternatively, "why we should be exploring the configuration space more
widely".