brancz commented on issue #37538:
URL: https://github.com/apache/arrow/issues/37538#issuecomment-1710319216
> I'm OK with inserts potentially creating duplicates in the dictionary. That's why I wrote this. I don't actually want the "initial dictionary" values to be tracked by the memotable. The two sources are so distinct that they will almost never contain duplicates, and even if they do, I'm fine with that happening occasionally. The problem is that I want to iterate over my data only once to insert everything into a reasonably deeply nested struct, sometimes inserting just indices into the initial dictionary and sometimes adding new values.
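To make the requested behavior concrete, here is a minimal sketch in plain Python. It is not an Arrow API: `SketchDictBuilder`, `append_index`, and `append_value` are hypothetical names invented for illustration. The sketch assumes the initial dictionary is copied into the builder without being registered in the memo table, so appending by index does no hashing, while appending by value deduplicates only against values added the same way.

```python
from typing import Dict, List


class SketchDictBuilder:
    """Toy dictionary-encoding builder (hypothetical, not an Arrow class)."""

    def __init__(self, initial_dictionary: List[str]) -> None:
        # Initial values go into the dictionary but are deliberately
        # NOT registered in the memo table.
        self.dictionary: List[str] = list(initial_dictionary)
        self.memo: Dict[str, int] = {}
        self.indices: List[int] = []

    def append_index(self, index: int) -> None:
        """Append a reference to an existing dictionary slot, with no hashing."""
        if not 0 <= index < len(self.dictionary):
            raise IndexError(f"dictionary index {index} out of range")
        self.indices.append(index)

    def append_value(self, value: str) -> None:
        """Append a value, deduplicating only against values added this way."""
        index = self.memo.get(value)
        if index is None:
            index = len(self.dictionary)
            self.dictionary.append(value)
            self.memo[value] = index
        self.indices.append(index)

    def decoded(self) -> List[str]:
        """Resolve every stored index back to its dictionary value."""
        return [self.dictionary[i] for i in self.indices]


builder = SketchDictBuilder(["cpu", "memory"])
builder.append_index(1)       # refers to "memory" in the initial dictionary
builder.append_value("disk")  # new value, gets slot 2
builder.append_value("disk")  # memoized, reuses slot 2
builder.append_value("cpu")   # not memoized as an initial value: duplicate slot 3
```

In this sketch the final `append_value("cpu")` creates a second `"cpu"` entry in the dictionary, which is the occasional duplicate the comment says is acceptable. In exchange, both kinds of append can be mixed freely in a single pass over the data.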
