Hi all, Ryan wrote on a different thread:
"It should be possible to randomly insert data from a pre-existing data set. There is some work to directly import straight into hfiles and skipping the regionserver, but that would only really work on 1 time imports to new tables." Could someone please elaborate on this a little and outline the steps needed? Do you write an hfile in a custom mapreduce output format and then somehow write the table metadata file afterwards? Cheers, Tim
