Hi sqoop, hive, compression experts,
When sqoop importing into hive with snappy compression and as sequence file, the number of rows imported is shown correctly in the logging(12/06/21 09:34:24 INFO mapreduce.ImportJobBase: Retrieved 10000 records.) but when I do count(*) on the hive table I get 13714 rows. Also the data when I do select * from table limit 100; gives garbage. What am I not setting right? Also we found a open issue https://issues.cloudera.org/browse/SQOOP-200 - Is this resolved in future sqoop versions? Thanks, Chalcy
