jorisvandenbossche commented on issue #39809:
URL: https://github.com/apache/arrow/issues/39809#issuecomment-1916309229

   We also raise an error if you try to _use_ the lowercase "utc", precisely 
because it is not part of the timezone database. For example:
   
   ```
   In [18]: arr_naive = pa.array([pd.Timestamp("2021-01-01 09:00")])
   
   In [19]: pc.assume_timezone(arr_naive, "utc")
   ...
   ArrowInvalid: Cannot locate timezone 'utc': utc not found in timezone database
   ```
   
   However, the issue here is that we don't validate the timezone at all when 
the type is constructed (or when reading a file, etc.). At the moment you can 
write complete nonsense like `pa.timestamp("us", tz="blabla")` without an error.
   I assume one reason for this is that a timezone database is not guaranteed 
to be available to perform this check (typically the case on Windows).
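
To illustrate the kind of check that is missing at construction time, here is a minimal caller-side sketch. It is not part of pyarrow: `is_known_timezone` and its `known` parameter are hypothetical names. It also relies on Python's standard `zoneinfo` module rather than the database Arrow's compute kernels use, so the two could disagree on some names.

```python
import re
from zoneinfo import available_timezones

# Fixed-offset form such as "+01:00" or "-05:30".
_OFFSET_RE = re.compile(r"[+-]\d{2}:\d{2}")


def is_known_timezone(name, known=None):
    """Hypothetical helper: case-sensitive check of a timezone string.

    `known` is an optional set of accepted names; when omitted, the names
    reported by the local timezone database are used.
    """
    if _OFFSET_RE.fullmatch(name):
        return True
    if known is None:
        # This is an empty set if no timezone database can be found,
        # in which case every named timezone is rejected.
        known = available_timezones()
    return name in known


# Usage with an explicit set of names, so the result does not depend on
# which database happens to be installed:
names = {"UTC", "Europe/Brussels"}
print(is_known_timezone("UTC", known=names))     # True
print(is_known_timezone("utc", known=names))     # False (case-sensitive)
print(is_known_timezone("blabla", known=names))  # False
```

The explicit `known` argument is a design choice for the sketch: it keeps the check usable on systems where no timezone database is installed, which is the situation described above.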

