jorisvandenbossche commented on code in PR #36290:
URL: https://github.com/apache/arrow/pull/36290#discussion_r1254470409
##########
python/pyarrow/parquet/core.py:
##########
@@ -867,6 +867,10 @@ def _sanitize_table(table, new_schema, flavor):
it will restore the timezone (Parquet only stores the UTC values without
timezone), or columns with duration type will be restored from the int64
Parquet column.
+write_page_index : bool, default False
Review Comment:
So if I understand correctly, we are currently not yet using the PageIndex
when reading files (through the python APIs) for pruning pages when given a
filter?
Should we mention that in the docstring to note that you can already write a
PageIndex, but it will not yet be used when reading using pyarrow?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]