I built the pig trunk with hbase 0.90.3 client lib (ant -Dhbase.version=0.90.3) and the issue is still here.

It makes me thing about an issue in the optimizer... Anyway the fact is that my request is not complex, so I wonder how such an issue can go through PIG test suite !

Any help ?

Le 27/07/11 14:38, Vincent Barat a écrit :
More info on this issue:

1- I use PIG 0.8.1 and HBase 0.90.3 and Hadoop 0.20-append
2- The issue can be reproduced with PIG trunk too

The script:

start_sessions = LOAD 'startSession.mde253811.preprod.ubithere.com' USING org.apache.pig.backend.hadoop.hbase.HBaseStorage('meta:sid meta:infoid meta:imei meta:timestamp') AS (sid:chararray, infoid:chararray, imei:chararray, start:long); end_sessions = LOAD 'endSession.mde253811.preprod.ubithere.com' USING org.apache.pig.backend.hadoop.hbase.HBaseStorage('meta:sid meta:timestamp meta:locid') AS (sid:chararray, end:long, locid:chararray);
sessions = JOIN start_sessions BY sid, end_sessions BY sid;
sessions = FILTER sessions BY end > start AND end - start < 86400000L; sessions = FOREACH sessions GENERATE start_sessions::sid, imei, start, end;
sessions = LIMIT sessions 100;
dump sessions;
<output 1>
dump sessions;
<output 2>

The issue:

<output 1> is empty
<output 2> is 100 lines

I can reproduce the issue systematically.

Please advice: this issue prevent me from moving to HBase 0.90.3 in production, as I need to upgrade to PIG 0.8.1 at the same time !


Reply via email to