MarcoGorelli commented on issue #15047:
URL: https://github.com/apache/arrow/issues/15047#issuecomment-1592523553

   Does it really fall back to pytz?
   
   I don't have pytz installed:
   ```console
   $ python -c 'import pytz'
   Traceback (most recent call last):
     File "<string>", line 1, in <module>
   ModuleNotFoundError: No module named 'pytz'
   ```
   but trying to convert `2038-04-01 09:00` from 'America/Boise' to UTC, and 
I'm getting the same (wrong) result with pyarrow as I would with pandas:
   ```python
   import pyarrow as pa
   import pyarrow.compute as pc
   
   from datetime import datetime, timezone
   from zoneinfo import ZoneInfo
   
   string = '2038-04-01 03:00:00.000000'
   
   dt = datetime.fromisoformat(string)
   dt = dt.replace(tzinfo=ZoneInfo('America/Boise'))
   tz = ZoneInfo('UTC')
   converted_dt = dt.astimezone(tz)
   print(converted_dt)
   
   ts = pc.assume_timezone(pa.array([datetime(2038, 4, 1, 3)]), 
timezone='America/Boise')
   print(ts)
   ```
   
   This outputs
   ```python
   2038-04-01 09:00:00+00:00
   [
     2038-04-01 10:00:00.000000
   ]
   ```
   whereas I was expecting
   ```python
   2038-04-01 09:00:00+00:00
   [
     2038-04-01 09:00:00.000000
   ]
   ```
   
   Is there a way to "force" zoneinfo usage?
   
   I think this is what's causing issues when converting polars to pandas: 
https://github.com/pola-rs/polars/issues/9315


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to