hi Arun, I took a brief look at your branch. One thing that is missing is the proposed public APIs that use the index pages -- that would be very helpful for this discussion.
I don't think we have any code for doing random access of a particular data page in a column chunk, so having as an initial matter would also be helpful. - Wes On Tue, Feb 4, 2020 at 2:28 PM Lekshmi Narayanan, Arun Balajiee <[email protected]> wrote: > > Hi Parquet dev > > Deepak Majeti was my dev lead during my summer internship, from when I am > trying to add a few changes in the Arrow Parquet Project for the ticket below > > https://issues.apache.org/jira/browse/PARQUET-1404 (Assigned to Deepak) > > With this regard, I am making a few changes to src/parquet/file_reader.cc ( > in a fork on my repository) > > https://github.com/a2un/arrow/tree/PARQUET-1404-Add-index-pages-to-the-format-to-support-efficient-page-skipping-to-parquet-cpp/cpp > > I am stuck at trying to read a particular row using the index that I get in > the page_location array struct of offset index. Could you help me with this ? > and if there have been discussions on the forums for this as well, could you > direct me to that link? > > Regards, > Arun Balajiee >
