Hi all, I am using Sqoop to import a big MySql table (around 910 milions of records) in Hbase. The command line that I'm using is something like: sqoop import --connect jdbc:mysql://<server>/<db>?zeroDateTimeBehavior=round --username <usr> -P --query <query>' --split-by <date-field> --hbase-table "<hbase_table>" --column-family "<fam>" --hbase-row-key "hash"
The strange thing is that it takes a lot to complete the last part of the map. This is part of the log: [...] 12/09/04 17:16:45 INFO mapred.JobClient: Running job: job_201209031227_0007 12/09/04 17:16:46 INFO mapred.JobClient: map 0% reduce 0% 12/09/04 17:24:20 INFO mapred.JobClient: map 25% reduce 0% 12/09/04 17:24:21 INFO mapred.JobClient: map 50% reduce 0% 12/09/04 17:24:23 INFO mapred.JobClient: map 75% reduce 0% As you can see it does not take much time to from start to 75%, but the last part hasn't been finished (although it is working by a day continuously). Is there something wrong? I've tried to take a look to the logs but it seems to be ok. Thanks, Alberto -- Alberto Cordioli
