I, like many others, am having difficulty getting a mapred job that bulk imports data into an HBase table to run successfully to completion.
At this time, rather than get into specifics of my configuration, the exceptions I'm receiving, etc..., I wanted to ask a general question: Should I expect my bulk import to be more likely to succeed if my data is sorted by its key? Or should I expect my bulk import to be more likely to succeed if my data is randomized? Or should I expect the ordering of my input data to have no effect on my ability to successfully bulk import records? Thanks.
