Hello, I have been writing some code to read Parquet files and it would be useful if there was an easy way to get the number of bytes in a string column as well as the null indices of that column. I would have expected this to be available in metadata somewhere, but I have not seen any way to query that from the API and don’t see anything like this using `parquet-tools` to inspect the files.
Is there any way to get the null indices of a Parquet string column besides reading the whole file and manually checking for nulls? Is there any way to get the byte lengths of string columns without reading each string and summing the number of bytes of each string? Thank you. Best, Ben McDonald
