Hi Arun,

I took a brief look at your branch. One thing that is missing is the
set of proposed public APIs that would use the index pages -- seeing
those would be very helpful for this discussion.

I don't think we have any code for doing random access to a particular
data page in a column chunk, so having that in place as an initial
step would also be helpful.
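
To make the lookup half of that concrete, here is a rough sketch only,
not code from parquet-cpp or from your branch. It assumes a plain local
struct that mirrors the three PageLocation fields from parquet.thrift
(offset, compressed_page_size, first_row_index); FindPageForRow is a
hypothetical helper name. It finds which page of a column chunk holds a
given row (relative to the row group) by binary search on
first_row_index. Reading and decoding the page at that offset is the
part that still needs to be written.

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Local stand-in mirroring the fields of PageLocation in parquet.thrift.
// This is NOT the Thrift-generated class; it exists only for illustration.
struct PageLocation {
  int64_t offset;                // file offset of the page
  int32_t compressed_page_size;  // page size in bytes, including the header
  int64_t first_row_index;       // index of the page's first row in the row group
};

// Hypothetical helper: returns the position in `pages` of the page that
// contains `row`, or -1 if `pages` is empty or `row` precedes the first page.
// Assumes `pages` is sorted by first_row_index. The caller must still check
// `row` against the row group's row count, which the offset index lacks.
int64_t FindPageForRow(const std::vector<PageLocation>& pages, int64_t row) {
  if (pages.empty() || row < pages.front().first_row_index) {
    return -1;
  }
  // First page whose first_row_index is strictly greater than `row`;
  // the page just before it is the one that contains `row`.
  auto it = std::upper_bound(
      pages.begin(), pages.end(), row,
      [](int64_t r, const PageLocation& p) { return r < p.first_row_index; });
  return static_cast<int64_t>(it - pages.begin()) - 1;
}
```

With the page position in hand, the offset and compressed_page_size of
that entry give the byte range to read, and row - first_row_index gives
the number of rows to skip inside the decoded page.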

- Wes

On Tue, Feb 4, 2020 at 2:28 PM Lekshmi Narayanan, Arun Balajiee
<[email protected]> wrote:
>
> Hi Parquet dev
>
> Deepak Majeti was my dev lead during my summer internship, and since then I
> have been trying to make a few changes in the Arrow Parquet project for the ticket below
>
> https://issues.apache.org/jira/browse/PARQUET-1404 (Assigned to Deepak)
>
> In this regard, I am making a few changes to src/parquet/file_reader.cc
> (in a fork in my repository)
>
> https://github.com/a2un/arrow/tree/PARQUET-1404-Add-index-pages-to-the-format-to-support-efficient-page-skipping-to-parquet-cpp/cpp
>
> I am stuck trying to read a particular row using the index that I get from
> the page_location array struct of the offset index. Could you help me with this?
> If there have been discussions on the forums about this as well, could you
> direct me to that link?
>
> Regards,
> Arun Balajiee
>
