pchintar commented on PR #9836:
URL: https://github.com/apache/arrow-rs/pull/9836#issuecomment-4372346054

   Hi @alamb,
   
   My sincere apologies for the very late response. I missed this earlier because 
it's finals week for me right now (I'm a student).
   
   Yes, I do think this is worth pursuing. In my commit, all existing 
supported non-dictionary types keep the same encoding logic, including:
   
   ```text
   Numeric types          → fast path
   Union                  → fast path
   Utf8 / Binary          → fast path
   List / Struct / Map    → fast path
   Boolean                → fast path
   RunEndEncoded          → fast path
   ```
   
   Only the following falls back to the current heap reallocation path:
   
   ```text
   DataType::Dictionary(...)
   or nested dictionary inside Struct/List/Map/Union/etc.
   ```
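   
   Roughly, the split above amounts to a recursive check on the data type. Here 
   is a minimal sketch of that idea; the `DataType` enum is a simplified stand-in 
   (not arrow's real `DataType`), and `contains_dictionary` / `can_use_fast_path` 
   are hypothetical names for illustration, not the code in this PR:
   
   ```rust
   // Simplified stand-in for arrow's `DataType` (hypothetical, illustration only).
   #[derive(Debug, Clone)]
   enum DataType {
       Int32,
       Utf8,
       Boolean,
       List(Box<DataType>),
       Struct(Vec<DataType>),
       Map(Box<DataType>, Box<DataType>),
       Union(Vec<DataType>),
       RunEndEncoded(Box<DataType>, Box<DataType>),
       Dictionary(Box<DataType>, Box<DataType>),
   }
   
   /// Returns true if `dt` is a dictionary or contains one at any nesting depth.
   fn contains_dictionary(dt: &DataType) -> bool {
       match dt {
           DataType::Dictionary(_, _) => true,
           DataType::List(child) => contains_dictionary(child),
           DataType::Struct(fields) | DataType::Union(fields) => {
               fields.iter().any(contains_dictionary)
           }
           DataType::Map(a, b) | DataType::RunEndEncoded(a, b) => {
               contains_dictionary(a) || contains_dictionary(b)
           }
           DataType::Int32 | DataType::Utf8 | DataType::Boolean => false,
       }
   }
   
   /// Hypothetical helper: the fast path applies only when no dictionary is present.
   fn can_use_fast_path(dt: &DataType) -> bool {
       !contains_dictionary(dt)
   }
   ```
   
   With this sketch, `List(Utf8)` would take the fast path, while a `Struct` 
   holding a `List` of `Dictionary` values would fall back.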
   
   As you suggested earlier, I'll work on the dictionary path too after this 
PR and set up a new PR for it once my finals finish in the next 3 days. In 
the meantime, I would really appreciate a review if you're able to do one.
   
   Thanks again!

