brancz commented on issue #37538:
URL: https://github.com/apache/arrow/issues/37538#issuecomment-1710319216

   > I'm ok with inserts potentially creating duplicates in the dictionary.
   
   That's why I wrote this. I don't actually want the "initial dictionary" 
values to be tracked by the memotable. I'm ok with potential duplicates as the 
two sources are so distinct they almost never will contain duplicates, and even 
if, then I'm ok with that happening occasionally.
   
   The problem is that I want to only iterate once over my data to insert 
everything in a reasonably deeply nested struct, sometimes inserting just 
indices into the initial dictionary and sometimes adding new values.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to