alamb commented on code in PR #6450:
URL: https://github.com/apache/arrow-rs/pull/6450#discussion_r1776086910


##########
parquet/src/file/metadata/reader.rs:
##########
@@ -247,13 +248,18 @@ impl ParquetMetaDataReader {
             ));
         }
 
-        // TODO(ets): what is the correct behavior for missing page indexes? 
MetadataLoader would
-        // leave them as `None`, while the parser in 
`index_reader::read_columns_indexes` returns a
-        // vector of empty vectors.
-        // I think it's best to leave them as `None`.
+        // FIXME: there are differing implementations in the case where page 
indexes are missing
+        // from the file. `MetadataLoader` will leave them as `None`, while 
the parser in
+        // `index_reader::read_columns_indexes` returns a vector of empty 
vectors.
+        // It is best for this function to replicate the latter behavior for 
now, but in the future
+        // the two paths to retrieve metadata should be made consistent. Note 
that this is only
+        // an issue if the user requested page indexes, so there is no need to 
provide empty
+        // vectors in `try_parse_sized()`.

Review Comment:
   ```suggestion
           // It is best for this function to replicate the latter behavior for 
now, but in a future
           // breaking release, the two paths to retrieve metadata should be 
made consistent. Note that this is only
           // an issue if the user requested page indexes, so there is no need 
to provide empty
           // vectors in `try_parse_sized()`.
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to