[ 
https://issues.apache.org/jira/browse/IMPALA-9258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998856#comment-16998856
 ] 

authur wang commented on IMPALA-9258:
-------------------------------------

[~tarmstrong] , thanks for watching the problem. we find that it does not just 
happen on a table or on some tables. It most often occurs in the following 
scenarios :  first we use mr to generate some data,  refresh table,  and query 
tables via impala, this time impala most likely to be good.  then we delete 
data using hdfs command, and generate data again using mr, this time the result 
may be wrong.

 

 

 

> impala and hive query result are different 
> -------------------------------------------
>
>                 Key: IMPALA-9258
>                 URL: https://issues.apache.org/jira/browse/IMPALA-9258
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Clients
>    Affects Versions: Impala 3.2.0
>         Environment: CDH6.2.1
>            Reporter: authur wang
>            Priority: Major
>              Labels: newbie
>         Attachments: user_inf.zip
>
>
> After we use mapreduce to generate rcfiles, we find that the results between 
> hive and impala are different. The hive query will generate the right result 
> while impala will get wrong result.
>  
> the attachment is the data files.
>  
> the ddl of the table is :
> CREATE EXTERNAL TABLE user_inf (
>  id BIGINT,
>  user_id STRING,
>  cert_id STRING,
>  name STRING,
>  mobile STRING,
>  access_id STRING,
>  status STRING,
>  channel STRING,
>  rec_crt_ts STRING,
>  rec_upd_ts STRING,
>  ver INT
>  )
>  STORED AS RCFILE
>  LOCATION '/user_inf'



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to