Thanks for the response. 1) How do I use Gz compression, and does it come with Hadoop? If not, how do I build a compression method for use in Hive? I would like to run an evaluation across compression methods. What is the default compression used in Hadoop?
2) Kindly bear with me if this question is stupid. I am not talking about compression within intermediate steps. How can storing the raw data in compressed format be useful, since the data needs to be decompressed before a job can execute, right?