Hi,
Happy holidays :).
I have 2 different pig scripts with the statement below
(1)
GeoRef_IP = LOAD '$TBL_GEOGRAPHY' USING
org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf_data:cq_geog_id
cf_data:cq_pc_sector cf_data:cq_district_code cf_data:cq_postal_town
cf_data:cq_postal_county cf_data:cq_mosaic_code cf_data:cq_mosaic_code_desc
cf_data:cq_mosaic_group cf_data:cq_sales_territory cf_data:cq_sales_area
cf_data:cq_sales_region cf_data:cq_dqtimestamp cf_data:cq_checkarray',
'-loadKey true');

and
(2)
GeoRef_IP = LOAD '$TBL_GEOGRAPHY' USING
org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf_data:cq_geog_id
cf_data:cq_pc_sector cf_data:cq_district_code cf_data:cq_postal_town
cf_data:cq_postal_county cf_data:cq_mosaic_code cf_data:cq_mosaic_code_desc
cf_data:cq_mosaic_group cf_data:cq_sales_territory cf_data:cq_sales_area
cf_data:cq_sales_region cf_data:cq_dqtimestamp cf_data:cq_checkarray',
'-loadKey true') as
(postcode,geog_id,pc_sector,district_code,postal_town,postal_county,mosaic_code,mosaic_code_desc,mosaic_group,sales_territory,sales_area,sales_region,dqtimestamp,checkarray);

the only difference is as statement.

now for example
A foreach of $0,$4,$5 and a dump gives me different results for statement 1
and 2.
where 1 is correct.

Has anyone faced this behavior before?.

Regards,
Krishna

Reply via email to