Does anyone have any tips or strategies for keeping an index in sync with its data? I'd of course update the index immediately after the data, but over time there will inevitably be inconsistencies. Do people just run periodic clean-up jobs?
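To make the clean-up question concrete, here is a rough sketch of the kind of periodic job I have in mind. It uses plain Python dicts as stand-ins for the data table and the index table, so nothing here is HBase API. The index layout (a composite key of indexed value plus row key) and the function names `find_index_inconsistencies` and `repair_index` are just my assumptions for illustration.

```python
def find_index_inconsistencies(data, index):
    """Compare a data table against its secondary index.

    data:  maps row_key -> indexed value (stand-in for the data table)
    index: maps (indexed value, row_key) -> True (hypothetical index layout)
    Returns (missing, stale) as two sets of index keys.
    """
    expected = {(value, row_key) for row_key, value in data.items()}
    actual = set(index)
    missing = expected - actual  # data rows with no matching index entry
    stale = actual - expected    # index entries with no matching data row
    return missing, stale


def repair_index(index, missing, stale):
    """Bring the index back in line with the data, in place."""
    for entry in stale:
        del index[entry]
    for entry in missing:
        index[entry] = True


# Toy example: row2's index entry is out of date, row3's was never written.
data = {"row1": "alice", "row2": "bob", "row3": "carol"}
index = {("alice", "row1"): True, ("bobby", "row2"): True}

missing, stale = find_index_inconsistencies(data, index)
repair_index(index, missing, stale)
```

A real job would have to scan both tables rather than hold them in memory, and would need some way to avoid "repairing" rows that are mid-update, which is part of what I'm asking about.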
On a related note, how important is batching updates from a performance standpoint? In MySQL it is significant, but the write path in HBase seems so fast that it may not matter much except for network latency. Would you recommend updating 1000 data rows, then applying the 1000 index updates, or interleaving the updates row-by-row?

Congrats on the new release! Looks awesome.

Matt
