HBase get from within UDF vs. PIG FILTER

Norbert Burger Fri, 19 Aug 2011 09:17:24 -0700

I have a need within a larger Pig script to pull just a few records from an
HBase table.  I know the exact key, so it'd be trivial with a get() from a
UDF.  Another alternative is to use a custom LOAD/FILTER combo, but this
would involve filtering off all but 3 of about 50 records.


From a performance angle, punting over to the UDF is faster, right?
Although it seems to break the model of only using UDFs when necessary...

How are others handling this situation?

Norbert
