Bulk import - does sort order of input data affect success rate?

Stuart White Thu, 02 Apr 2009 13:28:48 -0700

I, like many others, am having difficulty getting a mapred job that
bulk imports data into an HBase table to run successfully to
completion.


At this time, rather than get into specifics of my configuration, the
exceptions I'm receiving, etc..., I wanted to ask a general question:

Should I expect my bulk import to be more likely to succeed if my data
is sorted by its key?
Or should I expect my bulk import to be more likely to succeed if my
data is randomized?
Or should I expect the ordering of my input data to have no effect on
my ability to successfully bulk import records?

Thanks.

Bulk import - does sort order of input data affect success rate?

Reply via email to