[ https://issues.apache.org/jira/browse/HIVE-10488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14513641#comment-14513641 ]
Alexander Pivovarov commented on HIVE-10488:
--------------------------------------------
I cannot reproduce this issue in Hive 1.2.0.
I created two tables:
t3 - textfile
t3o - ORC
{code}
hive> desc formatted t3;
OK
# col_name data_type comment
rnum int
cdt date
# Detailed Table Information
Database: default
Owner: apivovarov
CreateTime: Sun Apr 26 23:58:29 PDT 2015
LastAccessTime: UNKNOWN
Protect Mode: None
Retention: 0
Location: hdfs://localhost/apps/apivovarov/warehouse/t3
Table Type: MANAGED_TABLE
Table Parameters:
transient_lastDdlTime 1430117909
# Storage Information
SerDe Library: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
InputFormat: org.apache.hadoop.mapred.TextInputFormat
OutputFormat: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
Compressed: No
Num Buckets: -1
Bucket Columns: []
Sort Columns: []
Storage Desc Params:
serialization.format 1
Time taken: 0.098 seconds, Fetched: 27 row(s)
{code}
{code}
hive> desc formatted t3o;
OK
# col_name data_type comment
rnum int
cdt date
# Detailed Table Information
Database: default
Owner: apivovarov
CreateTime: Mon Apr 27 00:00:11 PDT 2015
LastAccessTime: UNKNOWN
Protect Mode: None
Retention: 0
Location: hdfs://localhost/apps/apivovarov/warehouse/t3o
Table Type: MANAGED_TABLE
Table Parameters:
COLUMN_STATS_ACCURATE true
numFiles 1
numRows 4
rawDataSize 184
totalSize 302
transient_lastDdlTime 1430118011
# Storage Information
SerDe Library: org.apache.hadoop.hive.ql.io.orc.OrcSerde
InputFormat: org.apache.hadoop.hive.ql.io.orc.OrcInputFormat
OutputFormat: org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat
Compressed: No
Num Buckets: -1
Bucket Columns: []
Sort Columns: []
Storage Desc Params:
serialization.format 1
Time taken: 0.096 seconds, Fetched: 32 row(s)
{code}
{code}
hive> select * from t3;
OK
0 NULL
1 1996-01-01
2 2000-01-01
3 2000-12-31
Time taken: 0.086 seconds, Fetched: 4 row(s)
{code}
{code}
hive> select * from t3o;
OK
0 NULL
1 1996-01-01
2 2000-01-01
3 2000-12-31
Time taken: 0.086 seconds, Fetched: 4 row(s)
{code}
{code}
hive> select rnum, cdt, cast (cdt as timestamp) from t3;
OK
0 NULL NULL
1 1996-01-01 1996-01-01 00:00:00
2 2000-01-01 2000-01-01 00:00:00
3 2000-12-31 2000-12-31 00:00:00
Time taken: 0.091 seconds, Fetched: 4 row(s)
{code}
{code}
hive> select rnum, cdt, cast (cdt as timestamp) from t3o;
OK
0 NULL NULL
1 1996-01-01 1996-01-01 00:00:00
2 2000-01-01 2000-01-01 00:00:00
3 2000-12-31 2000-12-31 00:00:00
Time taken: 0.108 seconds, Fetched: 4 row(s)
{code}
Join of both tables on MR:
{code}
hive> select t3.rnum, t3.cdt, cast (t3.cdt as timestamp) cts, t3o.cdt cdt2,
cast(t3o.cdt as timestamp) cts2 from t3 join t3o on (t3.rnum = t3o.rnum);
Query ID = apivovarov_20150427000533_2734a9a1-63eb-45d4-83a4-4129ae3e7afc
Total jobs = 1
15/04/27 00:05:36 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
Execution log at: /tmp/apivovarov/apivovarov_20150427000533_2734a9a1-63eb-45d4-83a4-4129ae3e7afc.log
2015-04-27 00:05:37 Starting to launch local task to process map join; maximum memory = 477102080
2015-04-27 00:05:39 Dump the side-table for tag: 0 with group count: 4 into file: file:/tmp/apivovarov/fe4b8d14-3414-4790-a737-7a5d00bd04d0/hive_2015-04-27_00-05-33_412_2029315734201436275-1/-local-10003/HashTable-Stage-3/MapJoin-mapfile00--.hashtable
2015-04-27 00:05:39 Uploaded 1 File to: file:/tmp/apivovarov/fe4b8d14-3414-4790-a737-7a5d00bd04d0/hive_2015-04-27_00-05-33_412_2029315734201436275-1/-local-10003/HashTable-Stage-3/MapJoin-mapfile00--.hashtable (345 bytes)
2015-04-27 00:05:39 End of local task; Time Taken: 1.612 sec.
Execution completed successfully
MapredLocal task succeeded
Launching Job 1 out of 1
Number of reduce tasks is set to 0 since there's no reduce operator
Starting Job = job_1429923083119_0002, Tracking URL = http://c11.example.com:8088/proxy/application_1429923083119_0002/
Kill Command = /usr/lib/hadoop-2.6.0/bin/hadoop job -kill job_1429923083119_0002
Hadoop job information for Stage-3: number of mappers: 1; number of reducers: 0
2015-04-27 00:05:47,494 Stage-3 map = 0%, reduce = 0%
2015-04-27 00:05:54,942 Stage-3 map = 100%, reduce = 0%, Cumulative CPU 2.03 sec
MapReduce Total cumulative CPU time: 2 seconds 30 msec
Ended Job = job_1429923083119_0002
MapReduce Jobs Launched:
Stage-Stage-3: Map: 1 Cumulative CPU: 2.03 sec HDFS Read: 6756 HDFS Write: 206 SUCCESS
Total MapReduce CPU Time Spent: 2 seconds 30 msec
OK
0 NULL NULL NULL NULL
1 1996-01-01 1996-01-01 00:00:00 1996-01-01 1996-01-01 00:00:00
2 2000-01-01 2000-01-01 00:00:00 2000-01-01 2000-01-01 00:00:00
3 2000-12-31 2000-12-31 00:00:00 2000-12-31 2000-12-31 00:00:00
Time taken: 22.631 seconds, Fetched: 4 row(s)
{code}
> cast DATE as TIMESTAMP returns incorrect values
> -----------------------------------------------
>
> Key: HIVE-10488
> URL: https://issues.apache.org/jira/browse/HIVE-10488
> Project: Hive
> Issue Type: Bug
> Components: SQL
> Affects Versions: 0.13.1
> Reporter: N Campbell
> Assignee: Chaoyu Tang
>
> same data in textfile works
> same data loaded into an ORC table does not
> connection property of tez/mr makes no difference.
> select rnum, cdt, cast (cdt as timestamp) from tdt
> 0 <null> <null>
> 1 1996-01-01 1969-12-31 19:00:09.496
> 2 2000-01-01 1969-12-31 19:00:10.957
> 3 2000-12-31 1969-12-31 19:00:11.322
> vs
> 0 <null> <null>
> 1 1996-01-01 1996-01-01 00:00:00.0
> 2 2000-01-01 2000-01-01 00:00:00.0
> 3 2000-12-31 2000-12-31 00:00:00.0
> create table if not exists TDT ( RNUM int , CDT date )
> STORED AS orc ;
> insert overwrite table TDT select * from text.TDT;
> 0|\N
> 1|1996-01-01
> 2|2000-01-01
> 3|2000-12-31
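
A note on the incorrect values quoted in the report above: the fractional seconds (9.496, 10.957, 11.322) match the number of days between 1970-01-01 and each date, as if a day count had been read as milliseconds since the epoch. The Python sketch below only checks that arithmetic. It is not Hive code and not a confirmed diagnosis of the ORC code path. The function names are made up for illustration, and the UTC-05:00 offset is an assumption inferred from the 19:00 wall-clock time in the report.

```python
from datetime import date, datetime, timedelta, timezone

EPOCH_DATE = date(1970, 1, 1)
EPOCH_UTC = datetime(1970, 1, 1, tzinfo=timezone.utc)

# Assumption: the reporter's session ran at a fixed UTC-05:00 offset,
# which would explain the 19:00 wall-clock time on 1969-12-31.
ASSUMED_TZ = timezone(timedelta(hours=-5))


def days_since_epoch(d: date) -> int:
    """Number of days between 1970-01-01 and d."""
    return (d - EPOCH_DATE).days


def misread_days_as_millis(d: date) -> str:
    """Format the timestamp obtained if the day count is treated as milliseconds."""
    ts = EPOCH_UTC + timedelta(milliseconds=days_since_epoch(d))
    local = ts.astimezone(ASSUMED_TZ)
    millis = local.microsecond // 1000
    return local.strftime("%Y-%m-%d %H:%M:%S.") + f"{millis:03d}"


def expected_cast(d: date) -> str:
    """Format the expected result: midnight at the start of the same day."""
    return datetime(d.year, d.month, d.day).strftime("%Y-%m-%d %H:%M:%S")


if __name__ == "__main__":
    for d in (date(1996, 1, 1), date(2000, 1, 1), date(2000, 12, 31)):
        print(d, days_since_epoch(d), misread_days_as_millis(d), expected_cast(d))
```

Under these assumptions the three dates map to day counts 9496, 10957 and 11322, which reproduce the 19:00:09.496, 19:00:10.957 and 19:00:11.322 values from the report, while the expected cast gives midnight of the same day as in the Hive 1.2.0 output.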