Can you pastebin the output for both queries ?

What version of hbase are you using ?

Cheers

On Sun, Dec 28, 2014 at 11:11 PM, Krishna Kalyan <krishnakaly...@gmail.com>
wrote:

> Hi,
> Happy holidays :).
> I have 2 different pig scripts with the statement below
> (1)
> GeoRef_IP = LOAD '$TBL_GEOGRAPHY' USING
> org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf_data:cq_geog_id
> cf_data:cq_pc_sector cf_data:cq_district_code cf_data:cq_postal_town
> cf_data:cq_postal_county cf_data:cq_mosaic_code cf_data:cq_mosaic_code_desc
> cf_data:cq_mosaic_group cf_data:cq_sales_territory cf_data:cq_sales_area
> cf_data:cq_sales_region cf_data:cq_dqtimestamp cf_data:cq_checkarray',
> '-loadKey true');
>
> and
> (2)
> GeoRef_IP = LOAD '$TBL_GEOGRAPHY' USING
> org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf_data:cq_geog_id
> cf_data:cq_pc_sector cf_data:cq_district_code cf_data:cq_postal_town
> cf_data:cq_postal_county cf_data:cq_mosaic_code cf_data:cq_mosaic_code_desc
> cf_data:cq_mosaic_group cf_data:cq_sales_territory cf_data:cq_sales_area
> cf_data:cq_sales_region cf_data:cq_dqtimestamp cf_data:cq_checkarray',
> '-loadKey true') as
>
> (postcode,geog_id,pc_sector,district_code,postal_town,postal_county,mosaic_code,mosaic_code_desc,mosaic_group,sales_territory,sales_area,sales_region,dqtimestamp,checkarray);
>
> the only difference is as statement.
>
> now for example
> A foreach of $0,$4,$5 and a dump gives me different results for statement 1
> and 2.
> where 1 is correct.
>
> Has anyone faced this behavior before?.
>
> Regards,
> Krishna
>

Reply via email to