Hi Friends, I am facing an issue with Buckets in Hive , Need some inputs / guidance
We initially created one table and whenever query (simple aggregation query on one table) was executed multiple mappers were launched. Later on considering optimization perspective, we altered table to include bucketing on one column. Since then queries on this table are executing with single mapper and hence significant delay in production queries execution. As mentioned in hive documentation, altering table to include bucketing doesn't impact previous data, but this behavior was not expected. Table is ORC formatted and with Snappy Compression. Disk size of table is approx. 3.5 GB with 70 million records Thanks, Ashish Singh