Hello Lashing, MapReduce would be great :
Each mapper addresses a different MySQL DB and "TableOutputFormat" to the corresponding HTable. maybe pig : UNION after LOAD on different MySQL DB and then STORE on the différent table according to your policy (may need several M/R jobs all managed by pig workflow). The more efficient (1 job) would be pure home made Java MapReduce (mapper only for each MySQL DB bulk loading on HTables) Cheers, -- Damien HARDY
