jorisvandenbossche commented on a change in pull request #12078:
URL: https://github.com/apache/arrow/pull/12078#discussion_r781073566



##########
File path: python/pyarrow/array.pxi
##########
@@ -159,9 +159,10 @@ def array(object obj, type=None, mask=None, size=None, 
from_pandas=None,
 
     Notes
     -----
-    Localized timestamps will currently be returned as UTC (pandas's native
-    representation). Timezone-naive data will be implicitly interpreted as
-    UTC.
+    Timezone will be preserved in the returned array for timezone-aware data,
+    else no timezone will be returned for naive timestamps.
+    Internally, UTC values are stored for timezone-aware data,
+    whereas timezone-naive data is implicitly interpreted as if in UTC.

Review comment:
       ```suggestion
       Internally, UTC values are stored for timezone-aware data with the
       timezone set in the data type.
   ```
   
   I would personally leave out the "as if in UTC for timezone-naive data" 
part, as this is a detail for how this data is interpreted internally in other 
kernels, but can be confusing for the general reader I think (in the end, what 
matters for the user is that they get a timestamp array without any timezone 
set). 




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to