DDL is
CREATE TABLE dw.fct_traffic_navpage_path_detl(
date_id string,
chanl_id bigint,
sessn_id string,
gu_id string,
prov_id string,
city_id string,
landing_page_type_id string,
landing_track_time string,
landing_url string,
nav_refer_tracker_id string,
nav_refer_page_type_id string,
nav_refer_page_value string,
nav_refer_link_position string,
nav_tracker_id string,
nav_page_categ_id string,
nav_page_type_id string,
nav_page_value string,
nav_srce_type string,
internal_keyword string,
internal_result_sum string,
pltfm_id int,
app_vers string,
nav_link_position string,
nav_button_position string,
nav_track_time string,
nav_next_tracker_id string,
sessn_last_time string,
sessn_pv int,
detl_tracker_id string,
detl_page_type_id string,
detl_page_value string,
detl_pm_id bigint,
detl_link_position string,
detl_position_track_id string,
cart_tracker_id string,
cart_page_type_id string,
cart_page_value string,
cart_link_postion string,
cart_button_position string,
cart_position_track_id string,
cart_prod_id bigint,
ordr_tracker_id string,
ordr_page_type_id string,
ordr_code string,
updt_time string,
cart_pm_id bigint,
brand_code string,
categ_type int,
os string,
end_user_id string,
add_cart_flag string,
navgation_page_flag int,
nav_page_url string,
detl_button_position string,
manul_flag int,
manul_track_date string,
nav_refer_tpa string,
nav_refer_tpa_id string,
nav_refer_tpc string,
nav_refer_tpi string,
nav_refer_tcs string,
nav_refer_tcsa string,
nav_refer_tcdt string,
nav_refer_tcd string,
nav_refer_tci string,
nav_refer_postn_type string,
nav_tpa_id string,
nav_tpa string,
nav_tpc string,
nav_tpi string,
nav_tcs string,
nav_tcsa string,
nav_tcdt string,
nav_tcd string,
nav_tci string,
nav_postn_type string,
detl_tpa_id string,
detl_tpa string,
detl_tpc string,
detl_tpi string,
detl_tcs string,
detl_tcsa string,
detl_tcdt string,
detl_tcd string,
detl_tci string,
detl_postn_type string,
cart_tpa_id string,
cart_tpa string,
cart_tpc string,
cart_tpi string,
cart_tcs string,
cart_tcsa string,
cart_tcdt string,
cart_tcd string,
cart_tci string,
cart_postn_type string,
sessn_chanl_id bigint,
gu_sec_flg bigint,
detl_refer_page_type_id string,
detl_refer_page_value string,
detl_event_id string,
nav_refer_intrn_reslt_sum string,
nav_intrn_reslt_sum string,
nav_refer_intrn_kw string,
nav_intrn_kw string,
detl_track_time string,
cart_track_time string)
PARTITIONED BY (
ds string)
ROW FORMAT SERDE
'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
STORED AS INPUTFORMAT
'org.apache.hadoop.mapred.TextInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION
'/user/hive/dw/fct_traffic_navpage_path_detl'
TBLPROPERTIES (
'numPartitions'='265',
'numFiles'='26677',
'last_modified_by'='bi_etl',
'last_modified_time'='1423633028',
'transient_lastDdlTime'='1427870517',
'numRows'='0',
'totalSize'='8268127466928',
'rawDataSize'='0')
My query is :
SELECT a1.sessn_id,
a1.ordr_code,
a1.cart_tracker_id,
a1.end_user_id,
a1.cart_track_time
FROM dw.fct_traffic_navpage_path_detl a1
WHERE a1.ds = '2015-01-19'
AND a1.cart_tracker_id > 0
AND (a1.cart_button_position IS NULL OR length(a1.cart_button_position) = 0)
AND a1.sessn_id IN ('Y49EY895ACABHS95DRQEE8DVFEB8JSE1',
'UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG',
'87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM')
I attach my sample data.
[email protected]
From: Thejas Nair
Date: 2015-04-02 15:28
To: dev
Subject: Re: hive 0.14 on some platform return some not NULL value as NULL
Can you give more details
- the query you are running
- schema of the table
- serialization format of the table, sample records if possible.
On Wed, Apr 1, 2015 at 6:32 PM, [email protected] <[email protected]> wrote:
>
> I use hive 1.1.0 cli on computer A (linux) the result is
>
> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM 2357378283356 91501191044440048
> 7326356 NULL
>
> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM 2357378283356 121501191035580028
> 7326356 NULL
>
> UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG 2362223711289 161501191549050061
> 14837289 NULL
>
> Y49EY895ACABHS95DRQEE8DVFEB8JSE1 2360853052224 111501191426280023
> 115883224 NULL
>
> But I use hive0.14 cli in my test enviroment the result is correct.
>
>
> I use hive 0.10 on computer B (linux) the result is
>
> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM 2357378283356 91501191044440048
> 7326356 2015-01-19 10:44:44
>
> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM 2357378283356 121501191035580028
> 7326356 2015-01-19 10:35:58
>
> UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG 2362223711289 161501191549050061
> 14837289 2015-01-19 15:49:05
>
> Y49EY895ACABHS95DRQEE8DVFEB8JSE1 2360853052224 111501191426280023
> 115883224 2015-01-19 14:26:28
>
>
> Why ?
>
> I attach my log. Also in my log I found 2015-04-01 09:55:38,409 WARN [main]
> org.apache.hadoop.hive.serde2.lazy.LazyStruct: Extra bytes detected at the
> end of the row! Ignoring similar problems.
>
>
> ________________________________
> [email protected]