jorisvandenbossche commented on a change in pull request #12078:
URL: https://github.com/apache/arrow/pull/12078#discussion_r781073566
##########
File path: python/pyarrow/array.pxi
##########
@@ -159,9 +159,10 @@ def array(object obj, type=None, mask=None, size=None,
from_pandas=None,
Notes
-----
- Localized timestamps will currently be returned as UTC (pandas's native
- representation). Timezone-naive data will be implicitly interpreted as
- UTC.
+ Timezone will be preserved in the returned array for timezone-aware data,
+ else no timezone will be returned for naive timestamps.
+ Internally, UTC values are stored for timezone-aware data,
+ whereas timezone-naive data is implicitly interpreted as if in UTC.
Review comment:
```suggestion
Internally, UTC values are stored for timezone-aware data with the
timezone set in the data type.
```
I would personally leave out the "as if in UTC for timezone-naive data"
part, as this is a detail for how this data is interpreted internally in other
kernels, but can be confusing for the general reader I think (in the end, what
matters for the user is that they get a timestamp array without any timezone
set).
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]