Re: string bucketing compatibility issue

2021-07-19 Thread Ryan Blue
Thanks, Piotr! I've added this to the agenda for our next sync. I think the main question is whether we think users have hit this problem or not. If we don't think that it is something users have probably hit, then we can just fix the problem. If we think someone has data stored with the incorrect

Re: string bucketing compatibility issue

2021-07-19 Thread Piotr Findeisen
Hi, I've filed https://github.com/apache/iceberg/issues/2837 for this as well. Best PF On Sat, Jul 17, 2021 at 12:48 AM Piotr Findeisen wrote: > Hi, > > It was discovered by @Mateusz Gajewski > that Iceberg bucketing > transformation for string isn't regular Murmur3 32-bit hash. > > Upon cl