Hi Friends,

I am facing an issue with bucketing in Hive and need some input / guidance.

We initially created a table, and whenever a query (a simple aggregation
on that one table) was executed, multiple mappers were launched.
Later, as an optimization, we altered the table to add bucketing on one
column. Since then, queries on this table run with a single mapper, which
significantly delays our production queries.

As mentioned in the Hive documentation, altering a table to add bucketing
does not affect existing data, so this behavior was unexpected.
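
For reference, the alteration was of the following form. The table name,
column name and bucket count shown here are placeholders, not our actual
values:

```sql
-- Placeholder names and bucket count, for illustration only.
-- This statement changes table metadata only; existing files are not rewritten.
ALTER TABLE my_table CLUSTERED BY (my_column) INTO 32 BUCKETS;
```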

The table is stored as ORC with Snappy compression.
Its size on disk is approx. 3.5 GB, with 70 million records.


Thanks,
Ashish Singh
