amoeba opened a new issue, #35645: URL: https://github.com/apache/arrow/issues/35645
### Describe the enhancement requested PyArrow's ParquetWriter has an argument, `use_deprecated_int96_timestamps`, which is used to write timestamps as int96 which is deprecated in Parquet. I think the docstring may be confusing to new users as it's not entirely clear if this is something deprecated in PyArrow that may be removed at some point or if it's just deprecated elsewhere and PyArrow maintains support for it. The case is the latter. See the docstring: https://github.com/apache/arrow/blob/f59e37f8a676a13f0bd82a09c211ddb45c6b48ac/python/pyarrow/parquet/core.py#L772-L774 I think the Parquet section in the PyArrow docs does a better job of explaining this feature, > Older Parquet implementations use ``INT96`` based storage of > timestamps, but this is now deprecated. This includes some older > versions of Apache Impala and Apache Spark. To write timestamps in > this format, set the ``use_deprecated_int96_timestamps`` option to > ``True`` in ``write_table``. I think the docstring could be improved by using similar language to the above. ### Component(s) Documentation, Python -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
