Hi,

Happy holidays :)

I have two Pig scripts that load the same HBase table with the statements below. The only difference between them is the AS clause.

(1)
GeoRef_IP = LOAD '$TBL_GEOGRAPHY' USING org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf_data:cq_geog_id cf_data:cq_pc_sector cf_data:cq_district_code cf_data:cq_postal_town cf_data:cq_postal_county cf_data:cq_mosaic_code cf_data:cq_mosaic_code_desc cf_data:cq_mosaic_group cf_data:cq_sales_territory cf_data:cq_sales_area cf_data:cq_sales_region cf_data:cq_dqtimestamp cf_data:cq_checkarray', '-loadKey true');

(2)
GeoRef_IP = LOAD '$TBL_GEOGRAPHY' USING org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf_data:cq_geog_id cf_data:cq_pc_sector cf_data:cq_district_code cf_data:cq_postal_town cf_data:cq_postal_county cf_data:cq_mosaic_code cf_data:cq_mosaic_code_desc cf_data:cq_mosaic_group cf_data:cq_sales_territory cf_data:cq_sales_area cf_data:cq_sales_region cf_data:cq_dqtimestamp cf_data:cq_checkarray', '-loadKey true') as (postcode,geog_id,pc_sector,district_code,postal_town,postal_county,mosaic_code,mosaic_code_desc,mosaic_group,sales_territory,sales_area,sales_region,dqtimestamp,checkarray);

For example, a FOREACH that projects $0, $4, $5 followed by a DUMP gives me different results for statements 1 and 2. The output for statement 1 is the correct one.

Has anyone faced this behavior before?

Regards,
Krishna