jorisvandenbossche commented on issue #39809:
URL: https://github.com/apache/arrow/issues/39809#issuecomment-1916309229
We actually also raise an error if you try to _use_ the lowercase "utc",
precisely because it is not part of the timezone database. For example:
```
In [18]: arr_naive = pa.array([pd.Timestamp("2021-01-01 09:00")])
In [19]: pc.assume_timezone(arr_naive, "utc")
...
ArrowInvalid: Cannot locate timezone 'utc': utc not found in timezone database
```
However, the issue here is that we don't validate the timezone at all when
the type is constructed (or when a file is read, and so on). At the moment
you can write complete nonsense like `pa.timestamp("us", tz="blabla")`
without getting an error.
I assume one reason for this is that a timezone database is not guaranteed
to be available to perform this check (typically the case on Windows).
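
To illustrate the trade-off, here is a minimal sketch of what a user-side check could look like. It uses Python's standard `zoneinfo` module, not Arrow's own timezone lookup, and `check_timezone_name` is a hypothetical helper, not an existing pyarrow function. It returns `None` when no database can be found, which is the situation that makes validation at construction time hard to guarantee.

```python
from typing import Optional
from zoneinfo import available_timezones


def check_timezone_name(name: str) -> Optional[bool]:
    """Hypothetical helper: True/False if `name` can be checked, None otherwise."""
    known = available_timezones()
    if not known:
        # No IANA database was found (for example on Windows without the
        # `tzdata` package), so the name cannot be validated either way.
        return None
    # Set membership is case-sensitive, so "utc" is not treated as "UTC".
    return name in known


for tz in ["UTC", "utc", "Europe/Brussels", "blabla"]:
    print(tz, check_timezone_name(tz))
```

The results depend on the timezone database installed on the machine, so this only approximates what Arrow would report for the same names.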