Can you pastebin the output for both queries? What version of HBase are you using?
Cheers

On Sun, Dec 28, 2014 at 11:11 PM, Krishna Kalyan <krishnakaly...@gmail.com> wrote:

> Hi,
> Happy holidays :).
> I have 2 different pig scripts with the statement below
> (1)
> GeoRef_IP = LOAD '$TBL_GEOGRAPHY' USING
> org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf_data:cq_geog_id
> cf_data:cq_pc_sector cf_data:cq_district_code cf_data:cq_postal_town
> cf_data:cq_postal_county cf_data:cq_mosaic_code cf_data:cq_mosaic_code_desc
> cf_data:cq_mosaic_group cf_data:cq_sales_territory cf_data:cq_sales_area
> cf_data:cq_sales_region cf_data:cq_dqtimestamp cf_data:cq_checkarray',
> '-loadKey true');
>
> and
> (2)
> GeoRef_IP = LOAD '$TBL_GEOGRAPHY' USING
> org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf_data:cq_geog_id
> cf_data:cq_pc_sector cf_data:cq_district_code cf_data:cq_postal_town
> cf_data:cq_postal_county cf_data:cq_mosaic_code cf_data:cq_mosaic_code_desc
> cf_data:cq_mosaic_group cf_data:cq_sales_territory cf_data:cq_sales_area
> cf_data:cq_sales_region cf_data:cq_dqtimestamp cf_data:cq_checkarray',
> '-loadKey true') as
>
> (postcode,geog_id,pc_sector,district_code,postal_town,postal_county,mosaic_code,mosaic_code_desc,mosaic_group,sales_territory,sales_area,sales_region,dqtimestamp,checkarray);
>
> the only difference is as statement.
>
> now for example
> A foreach of $0,$4,$5 and a dump gives me different results for statement 1
> and 2.
> where 1 is correct.
>
> Has anyone faced this behavior before?.
>
> Regards,
> Krishna