zeevm commented on PR #4156:
URL: https://github.com/apache/arrow-rs/pull/4156#issuecomment-1554231892

   Understood.
   
   My considerations are:
   
   1. Using Arrow, I would have to allocate double the memory and incur many 
memory copies into our engine's native in-memory format.
   
   2. Our engine already does predicate pushdown, late materialization, 
applying predicates to dictionaries, and very fast scanning of raw dictionary 
IDs (skipping dictionary decoding entirely) with SIMD instructions and other 
optimizations.
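
To make point 2 concrete, here is a minimal sketch of applying a predicate to a dictionary and then scanning only the raw dictionary IDs. It is an illustration under stated assumptions, not the engine's actual code: `DictColumn` and `matching_rows` are hypothetical names, the column is a plain in-memory structure rather than a Parquet page, and the scan is scalar rather than SIMD.

```rust
/// Hypothetical dictionary-encoded string column (an assumption for this
/// sketch): each row stores an ID that indexes into `dictionary`.
struct DictColumn {
    dictionary: Vec<String>,
    ids: Vec<u32>,
}

/// Returns the row indices whose value satisfies `pred`.
///
/// The predicate runs once per distinct dictionary entry; the row scan then
/// looks only at raw IDs and never decodes a value.
fn matching_rows<F: Fn(&str) -> bool>(col: &DictColumn, pred: F) -> Vec<usize> {
    // Step 1: evaluate the predicate against the dictionary only.
    let matches: Vec<bool> = col.dictionary.iter().map(|v| pred(v)).collect();

    // Step 2: scan the IDs and keep rows whose ID maps to a matching entry.
    col.ids
        .iter()
        .enumerate()
        .filter_map(|(row, &id)| matches[id as usize].then_some(row))
        .collect()
}
```

With a dictionary of `["apple", "banana", "cherry"]` and IDs `[0, 2, 1, 2, 0]`, a "starts with c" predicate is evaluated three times (once per dictionary entry) instead of five times (once per row), and the scan itself only compares integers.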
   
   ________________________________
   From: Raphael Taylor-Davies ***@***.***>
   Sent: Friday, May 19, 2023 11:05:15 AM
   To: apache/arrow-rs ***@***.***>
   Cc: zeevm ***@***.***>; Comment ***@***.***>
   Subject: Re: [apache/arrow-rs] Cleanup ChunkReader (#4118) (PR #4156)
   
   
   Correct, the high-level API reads into arrow and supports object stores, in 
addition to predicate pushdown, late materialization, row filtering, etc. The 
low-level APIs support advanced usage, but are not nearly as 
batteries-included. I would encourage you to try out the arrow APIs; you may 
find they are actually faster 😅
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
