sqoop hive snappy sequence file

Chalcy Thu, 21 Jun 2012 08:38:11 -0700

Hi sqoop, hive, compression experts,



When I use Sqoop to import into Hive as a sequence file with Snappy
compression, the log reports the correct number of imported rows
(12/06/21 09:34:24 INFO mapreduce.ImportJobBase: Retrieved 10000
records.), but count(*) on the Hive table returns 13714 rows. Also,
select * from table limit 100; returns garbage.

What am I not setting right?

We also found an open issue, https://issues.cloudera.org/browse/SQOOP-200.
Is it resolved in later Sqoop versions?

Thanks,

Chalcy
