Pig HBase Scan Performance using HBaseStorage API

2014-12-06 Thread Krishna Kalyan
Hi, Would there be a performance difference between query1 and query2? *query1 :* cc = LOAD '$TBL_CLEARCODE' USING org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf_data:cq_description cf_data:cq_category cf_data:cq_dqtimestamp cf_data:cq_checkarray', '-loadKey true ') as

Pig Writing a Pig UDF for checkandPut using HBaseStorage API

2014-12-06 Thread Krishna Kalyan
Hi, Currently all our batch processes are written in Pig. I need to store some data in HBase using Pig. Before storing the data, I need to check whether the value is already present; if it is, the put should be skipped. I plan to write a Pig UDF to do this, as all our data pipelines use Pig. My sample code is below:

Re: Help with Pig UDF?

2014-12-06 Thread Ryan
Got it, thanks! Any idea why Tika might not be working? I've been testing and while no exceptions are being thrown, neither is anything being appended when I call pdfText.append(contenthandler.toString()); On Fri, Dec 5, 2014 at 6:21 PM, Pradeep Gollakota pradeep...@gmail.com wrote: A static