wgtmac opened a new issue, #34590: URL: https://github.com/apache/arrow/issues/34590
### Describe the enhancement requested Background: There was an effort to fix inconsistent timestamp types across different SQL-on-Hadoop engines: https://docs.google.com/document/d/1gNRww9mZJcHvUDCXklzjFEQGpefsuR_akCDfWsdE35Q In the Apache Orc, two timestamp types are provided: - TIMESTAMP: timestamp type without timezone, timestamp value is stored in the writer timezone . - TIMESTAMP_INSTANT: timestamp type with local timezone, timestamp value is stored in the UTC timezone. arrow::TimestampType has an optional timezone field: https://github.com/apache/arrow/blob/main/cpp/src/arrow/type.h#L1385 - If timezone is provided, values are normalized in UTC. - If timezone is missing, values can be in any timezone. Therefore, the type mapping should be as below: - orc::TIMESTAMP <=> arrow::TimestampType w/o timezone - orc::TIMESTAMP <=> arrow::TimestampType w/ timezone ### Component(s) C++ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
