bucketed column seems great but i dont understand why they are being used for 
just for optimizing joins and not where clause (filter).i have a huge table 
(billions of records)  which includes a field with medium cardinality 
(~100,000). user usually filter with that field (at least). using partitions, 
or full table scan, are both inefficient. Hash partition, or bucketing seems to 
be the way to go. i saw HIVE-5831, but it seems the solution is not going into 
trunk for some reason.any comments?thanks. 

Reply via email to