Actually, I made some changes after the date on the pull request (even this
year), which are not reflected in this compare link.

Regards,
Arun Balajiee

From: Wes McKinney<mailto:[email protected]>
Sent: Tuesday, February 4, 2020 6:43 PM
To: Parquet Dev<mailto:[email protected]>
Cc: Deepak Majeti<mailto:[email protected]>; Anatoli 
Shein<mailto:[email protected]>
Subject: Re: Arrow 1404: Adding index for Page-level Skipping

Here's a compare link in case others want to have a look

https://github.com/apache/arrow/compare/master...a2un:PARQUET-1404-Add-index-pages-to-the-format-to-support-efficient-page-skipping-to-parquet-cpp

On Tue, Feb 4, 2020 at 5:41 PM Wes McKinney <[email protected]> wrote:
>
> hi Arun,
>
> I took a brief look at your branch. One thing that is missing is the
> proposed public APIs that use the index pages -- that would be very
> helpful for this discussion.
>
> I don't think we have any code for doing random access of a particular
> data page in a column chunk, so having that as an initial matter would also
> be helpful.
>
> - Wes
>
> On Tue, Feb 4, 2020 at 2:28 PM Lekshmi Narayanan, Arun Balajiee
> <[email protected]> wrote:
> >
> > Hi Parquet dev
> >
> > Deepak Majeti was my dev lead during my summer internship. Since then, I
> > have been trying to add a few changes to the Arrow Parquet project for the
> > ticket below:
> >
> > https://issues.apache.org/jira/browse/PARQUET-1404
> >  (Assigned to Deepak)
> >
> > In this regard, I am making a few changes to src/parquet/file_reader.cc
> > (in my fork of the repository):
> >
> > https://github.com/a2un/arrow/tree/PARQUET-1404-Add-index-pages-to-the-format-to-support-efficient-page-skipping-to-parquet-cpp/cpp
> >
> > I am stuck trying to read a particular row using the index that I get from
> > the page_location struct array of the offset index. Could you help me with
> > this? If there have been discussions on the forums about this as well,
> > could you direct me to that link?
> >
> > Regards,
> > Arun Balajiee
> >
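
For anyone following the thread, the row lookup asked about above could be sketched roughly as follows. This is not code from the branch and not an existing parquet-cpp API. The PageLocation struct below only mirrors the three fields of the same-named struct in the Parquet format's offset index, and FindPageForRow is a hypothetical helper name. It assumes the entries are ordered by first_row_index and that the caller has already checked the row against the row group's row count.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Illustration only: mirrors the fields of PageLocation in the Parquet
// format's offset index; it is not the generated Thrift type.
struct PageLocation {
  int64_t offset;                // byte offset of the page in the file
  int32_t compressed_page_size;  // size of the page in bytes, including its header
  int64_t first_row_index;       // row-group-relative index of the page's first row
};

// Hypothetical helper: returns the position in `locations` of the page that
// holds `row` (an index within the row group), or -1 if no page can hold it.
// A row past the end of the row group maps to the last page, so the caller
// is assumed to have validated `row` against the row count beforehand.
int64_t FindPageForRow(const std::vector<PageLocation>& locations, int64_t row) {
  // Precondition: entries are sorted by first_row_index.
  assert(std::is_sorted(locations.begin(), locations.end(),
                        [](const PageLocation& a, const PageLocation& b) {
                          return a.first_row_index < b.first_row_index;
                        }));
  if (locations.empty() || row < locations.front().first_row_index) return -1;
  // Binary search for the first page that starts strictly after `row`;
  // the page immediately before it is the one containing `row`.
  auto it = std::upper_bound(
      locations.begin(), locations.end(), row,
      [](int64_t r, const PageLocation& loc) { return r < loc.first_row_index; });
  return static_cast<int64_t>(it - locations.begin()) - 1;
}
```

With the page found, a reader would seek to that entry's offset, read compressed_page_size bytes, decode the page, and skip row - first_row_index rows inside it to reach the requested row.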
