Yes, this was it. I was not able to find it in the documentation but managed to surmise it by reading the source code for a few minutes. Thank you for your help!
Regards, Bipin On Wed, Sep 27, 2023 at 1:44 AM Micah Kornfield <[email protected]> wrote: > I think you are looking for RowGroup Metadata defined at [1] and > accessible through the file reader [2] > [1] > https://github.com/apache/arrow/blob/main/cpp/src/parquet/metadata.h#L198 > [2] > https://github.com/apache/arrow/blob/main/cpp/src/parquet/file_reader.h#L55 > > On Fri, Sep 15, 2023 at 1:26 PM Bipin Mathew <[email protected]> > wrote: > >> Good Afternoon Everyone, >> >> I am looking for a C++ analog to the python SDK's >> pyarrow.parquet.RowGroupMetaData >> function ( >> https://arrow.apache.org/docs/python/generated/pyarrow.parquet.RowGroupMetaData.html >> ). Does one exist? At a high level I am trying to obtain the layout of a >> parquet file without decoding the actual data content. Specifically, I need >> to obtain the number of row groups ( which I know how to get ) and the >> number of rows in *each* row group. What is the best way to do this? >> >> Regards, >> >> Bipin >> >>
