tustvold commented on issue #6888: URL: https://github.com/apache/arrow-rs/issues/6888#issuecomment-2549782482
> Having number of values more than number of keys is really unlikely when you merge dictionary that have the same value multiple times (which is the whole point of the dictionary) See https://github.com/apache/arrow-rs/pull/3558, it can occur as a result of the way a single dictionary may be shared across multiple arrays, or as a result of previous processing. The intent is to optimise the dictionary when we know the performance cost will pay off -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org