> cluster restart, is there any "memory" of which region servers last served > which regions or some other method to improve data locality?
Nope, not yet. The new master code for 0.90 has some basics, but it's a bit complicated and we're not there yet. It basically requires asking the Namenode for the locations of every block of every regions, and compute what should go where. J-D