[GitHub] [arrow-rs] zeevm commented on pull request #4156: Cleanup ChunkReader (#4118)

via GitHub Fri, 19 May 2023 01:32:41 -0700


zeevm commented on PR #4156:
URL: https://github.com/apache/arrow-rs/pull/4156#issuecomment-1554231892

Understood.

My considerations are:

1. Using arrow, I'll have to allocate double the memory and incur a lot of
memory copies into our engine's native in-memory format.

2. Our engine already does predicate pushdown, late materialization,
applying predicates to dictionaries, and very fast scanning of raw dictionary IDs
(skipping dictionary decoding entirely) with SIMD instructions and other
optimizations.
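
To illustrate point 2 only, here is a minimal sketch of evaluating a predicate against the dictionary once and then scanning the raw dictionary IDs without decoding any values. It is an assumption-laden illustration, not the engine's actual code: the names `matching_ids` and `filter_rows` are hypothetical, the column is modelled as plain slices, and the scan is scalar rather than SIMD.

```rust
/// Evaluate the predicate once per distinct dictionary value.
/// The result is a lookup table indexed by dictionary ID.
/// (Hypothetical helper, for illustration only.)
fn matching_ids<F: Fn(&str) -> bool>(dictionary: &[&str], predicate: F) -> Vec<bool> {
    dictionary.iter().map(|&value| predicate(value)).collect()
}

/// Scan the raw dictionary IDs and collect the row indices whose ID
/// matched. No dictionary value is decoded during the scan.
/// (Hypothetical helper, for illustration only.)
fn filter_rows(ids: &[u32], id_matches: &[bool]) -> Vec<usize> {
    ids.iter()
        .enumerate()
        .filter(|(_, id)| id_matches[**id as usize])
        .map(|(row, _)| row)
        .collect()
}

fn main() {
    // A dictionary-encoded column: distinct values plus one ID per row.
    let dictionary = ["apple", "banana", "cherry"];
    let ids = [0u32, 2, 1, 2, 0];

    // The predicate runs three times (once per dictionary entry),
    // not five times (once per row).
    let id_matches = matching_ids(&dictionary, |v| v.starts_with('c'));
    let rows = filter_rows(&ids, &id_matches);
    println!("{:?}", rows); // prints [1, 3]
}
```

The predicate cost scales with the dictionary size rather than the row count, which is why the approach pays off when a column has few distinct values.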

________________________________
From: Raphael Taylor-Davies ***@***.***>
Sent: Friday, May 19, 2023 11:05:15 AM
To: apache/arrow-rs ***@***.***>
Cc: zeevm ***@***.***>; Comment ***@***.***>
Subject: Re: [apache/arrow-rs] Cleanup ChunkReader (#4118) (PR #4156)

Correct, the high-level API reads into arrow and supports object stores, in
addition to predicate pushdown, late materialization, row filtering, etc. The
low-level APIs support advanced usage, but are not nearly as
batteries-included. I would encourage you to try out the arrow APIs; you may
find they are actually faster 😅


