[
https://issues.apache.org/jira/browse/IMPALA-9258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998856#comment-16998856
]
authur wang commented on IMPALA-9258:
-------------------------------------
[~tarmstrong], thanks for looking into the problem. We find that it is not
limited to one table or to a few specific tables. It most often occurs in the
following scenario: first we use MapReduce to generate some data, refresh the
table, and query it via Impala; at this point the Impala result is most likely
correct. Then we delete the data using an hdfs command and generate the data
again using MapReduce; this time the Impala result may be wrong.
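
The sequence above can be sketched roughly as follows. This is only an
illustration, not the exact commands we ran: the COUNT(*) query is a
placeholder for whatever query is being compared, and the MapReduce and hdfs
steps happen outside SQL, so they appear only as comments.

```sql
-- Round 1: a MapReduce job (run outside SQL) writes RCFiles under /user_inf.
REFRESH user_inf;                -- Impala: reload file metadata for the table
SELECT COUNT(*) FROM user_inf;   -- placeholder query; run in both Impala and Hive

-- Round 2: the files under /user_inf are deleted with an hdfs command and
-- the MapReduce job is run again (both outside SQL).
-- The comment above does not say whether REFRESH is issued again here,
-- so it is deliberately left out of this sketch.
SELECT COUNT(*) FROM user_inf;   -- run in both engines again and compare
```

Comparing the two engines' output after each round should show whether the
mismatch only appears after the delete-and-regenerate step.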
> impala and hive query result are different
> -------------------------------------------
>
> Key: IMPALA-9258
> URL: https://issues.apache.org/jira/browse/IMPALA-9258
> Project: IMPALA
> Issue Type: Bug
> Components: Clients
> Affects Versions: Impala 3.2.0
> Environment: CDH6.2.1
> Reporter: authur wang
> Priority: Major
> Labels: newbie
> Attachments: user_inf.zip
>
>
> After we use MapReduce to generate RCFiles, we find that Hive and Impala
> return different results for the same table. The Hive query returns the
> correct result, while the Impala query returns a wrong one.
>
> The data files are attached.
>
> The DDL of the table is:
> CREATE EXTERNAL TABLE user_inf (
>   id BIGINT,
>   user_id STRING,
>   cert_id STRING,
>   name STRING,
>   mobile STRING,
>   access_id STRING,
>   status STRING,
>   channel STRING,
>   rec_crt_ts STRING,
>   rec_upd_ts STRING,
>   ver INT
> )
> STORED AS RCFILE
> LOCATION '/user_inf'