[GitHub] [iceberg] RussellSpitzer commented on issue #5626: Support bucket transform on multiple data columns

via GitHub Tue, 18 Jul 2023 07:25:46 -0700


RussellSpitzer commented on issue #5626:
URL: https://github.com/apache/iceberg/issues/5626#issuecomment-1640326506


   The example you give is a problem regardless of the bucketing function,
though, if the number of buckets is roughly equal to the cardinality of the
column (or group of columns). The other thing to note about this example is
that we would probably get just as good a distribution of rows if we just
bucketed by (col_b, 16). If we really wanted to include col_a, we would use
an identity transform, correct?
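
   To illustrate the cardinality point, here is a minimal sketch. It is not
Iceberg's implementation: the `bucket` helper and `distinct_values_per_bucket`
are hypothetical names, and MD5 is used as a stand-in hash purely so the
example runs with the standard library. It counts how many distinct values
land in each of 16 buckets when the column has about as many distinct values
as there are buckets, and again when it has far more.

```python
import hashlib
from collections import Counter


def bucket(value, num_buckets):
    """Stand-in bucket function: stable hash of the value modulo num_buckets.

    Assumption: MD5 is used here only as a convenient, deterministic hash.
    It is NOT the hash that Iceberg's bucket transform uses.
    """
    digest = hashlib.md5(str(value).encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_buckets


def distinct_values_per_bucket(values, num_buckets):
    """Count distinct values per bucket, including buckets that stay empty."""
    counts = Counter(bucket(v, num_buckets) for v in set(values))
    return [counts.get(b, 0) for b in range(num_buckets)]


# Cardinality roughly equal to the bucket count: 16 distinct values, 16 buckets.
low_cardinality = distinct_values_per_bucket(range(16), 16)

# Cardinality far above the bucket count: 16000 distinct values, 16 buckets.
high_cardinality = distinct_values_per_bucket(range(16000), 16)

print("16 distinct values:   ", low_cardinality)
print("16000 distinct values:", high_cardinality)
```

   With only 16 distinct values, any hash is likely to leave some buckets
empty and put several values into others, so the skew comes from the low
cardinality rather than from the particular bucketing function. With many
more distinct values than buckets, the per-bucket counts should even out.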


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


