ianmcook commented on issue #39315:
URL: https://github.com/apache/arrow/issues/39315#issuecomment-2108677475

   I think this would be a very nice usability improvement.
   
   The way PyArrow currently prints timezone-aware timestamp values can be very 
confusing. For example, you might try to create a Table like this:
   
   ```py
   from datetime import datetime
   import pyarrow as pa
   import pandas as pd
   
   t = pa.table(
       {'ts': [datetime(1969, 1, 1, 1, 1, 1)]},
       schema=pa.schema([("ts", pa.timestamp("us", tz="America/New_York"))])
   )
   ```
   
   When you print it, it looks like the time represents the 01:01:01 EST:
   
   ```
   t
   ## pyarrow.Table
   ## ts: timestamp[us, tz=America/New_York]
   ## ----
   ## ts: [[1969-01-01 01:01:01.000000Z]]
   ```
   
   But upon closer inspection, it is actually representing the time 01:01:01 
UTC which converts to 20:01:01 EST:
   
   ```
   t["ts"][0]
   ## <pyarrow.TimestampScalar: '1968-12-31T20:01:01.000000-0500'>
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to