[
https://issues.apache.org/jira/browse/ORC-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yiqun Zhang updated ORC-1055:
-----------------------------
Description:
I have a CSV file with a column containing the timestamp value 0001-01-01
00:00:00.0. I convert the CSV file to an ORC file using the CSV to ORC
converter and place the ORC file in a Hive table backed by ORC files. On
querying the data using Hive beeline and Spark SQL, different results are
obtained.
If the file is converted using the C++ tool, the value read by both Hive
beeline and Spark SQL queries is 0001-01-03 00:00:00.
Reported by [~vraval48]
> [C++] Timestamp values read in Hive are different when using ORC file created
> using CSV to ORC converter tools
> --------------------------------------------------------------------------------------------------------------
>
> Key: ORC-1055
> URL: https://issues.apache.org/jira/browse/ORC-1055
> Project: ORC
> Issue Type: Bug
> Components: C++
> Reporter: Yiqun Zhang
> Priority: Major
> Attachments: converted_by_cpp.orc, timestamp.csv
>
>
> I have a CSV file with a column containing the timestamp value 0001-01-01
> 00:00:00.0. I convert the CSV file to an ORC file using the CSV to ORC
> converter and place the ORC file in a Hive table backed by ORC files. On
> querying the data using Hive beeline and Spark SQL, different results are
> obtained.
> If the file is converted using the C++ tool, the value read by both Hive
> beeline and Spark SQL queries is 0001-01-03 00:00:00.
> Reported by [~vraval48]
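
The two-day shift in the report (0001-01-01 read back as 0001-01-03) is the
same size as the gap between the proleptic Gregorian calendar and the Julian
calendar in year 1. The issue does not state a root cause, so the following is
only a sketch of that arithmetic, assuming the writer and the reader count days
in different calendars. The function names are hypothetical and are not part of
ORC, Hive, or Spark.

```python
# Sketch only: shows that one absolute day has two different labels depending
# on the calendar used. That the ORC C++ writer and the Hive/Spark reader
# disagree on the calendar is an ASSUMPTION, not something the report confirms.

def _shifted(year, month):
    """Shift the year so it starts in March (standard day-number trick)."""
    a = (14 - month) // 12
    return year + 4800 - a, month + 12 * a - 3

def gregorian_to_jdn(year, month, day):
    """Julian Day Number of a proleptic Gregorian calendar date."""
    y, m = _shifted(year, month)
    return (day + (153 * m + 2) // 5 + 365 * y
            + y // 4 - y // 100 + y // 400 - 32045)

def julian_to_jdn(year, month, day):
    """Julian Day Number of a Julian calendar date."""
    y, m = _shifted(year, month)
    return day + (153 * m + 2) // 5 + 365 * y + y // 4 - 32083

def jdn_to_julian(jdn):
    """Julian calendar (year, month, day) for a Julian Day Number."""
    f = jdn + 1401
    e = 4 * f + 3
    g = (e % 1461) // 4
    h = 5 * g + 2
    day = (h % 153) // 5 + 1
    month = (h // 153 + 2) % 12 + 1
    year = e // 1461 - 4716 + (14 - month) // 12
    return year, month, day

# The day a proleptic Gregorian writer would call 0001-01-01 ...
written = gregorian_to_jdn(1, 1, 1)
# ... is labelled 0001-01-03 by a reader using the Julian calendar.
seen_by_julian_reader = jdn_to_julian(written)
# The two calendars are two days apart for this date.
offset_days = written - julian_to_jdn(1, 1, 1)

print(seen_by_julian_reader, offset_days)
```

If this assumption holds, the stored value is a valid day count and only its
interpretation differs between the tool that wrote the file and the engine that
reads it.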