I think you might be able to get away with HBase 0.20.2 if you don't use
the filtering options.
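
As a rough sketch of what that could look like with the built-in loader in
Pig 0.8: the statement below loads a table together with its row keys and
passes none of the key-range filtering options (such as -gt, -gte, -lt, -lte).
The table name, column names, and schema are hypothetical placeholders, and
the exact option spelling should be checked against the HBaseStorage
documentation for the installed Pig version.

```
-- 'mytable', the 'cf:...' columns, and the AS schema are placeholders.
-- -loadKey asks the loader to return the row key as the first field.
-- No -gt/-gte/-lt/-lte options are passed, so no key-range filter is requested.
raw = LOAD 'hbase://mytable'
      USING org.apache.pig.backend.hadoop.hbase.HBaseStorage(
          'cf:col1 cf:col2', '-loadKey true')
      AS (rowkey:chararray, col1:chararray, col2:chararray);
DUMP raw;
```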


On Mon, Oct 25, 2010 at 3:39 PM, Anze <[email protected]> wrote:
>
> Dmitriy, thanks for the answer!
>
> The problem with upgrading to HBase 0.20.6 is that Cloudera doesn't ship it
> yet and we would like to keep our install at "official" versions, even if
> beta. Of course, since this is a development / testing cluster, we could bend
> the rules if really necessary...
>
> I have written a small MR job (actually, just an "M" job :) that exports the
> tables to files (allowing me to use Pig 0.7), but that is a bit cumbersome and
> slow.
>
> If I install the latest Pig (0.8), will it work at all with HBase 0.20.2?
> In other words, are scan filters (which were fixed in 0.20.6) needed as part
> of user-defined parameters or as part of Pig optimizations in reading from
> HBase? Hope my question makes sense... :)
>
> Thanks again,
>
> Anze
>
>
> On Tuesday 26 October 2010, Dmitriy Ryaboy wrote:
>> Anze, the reason we bumped up to 20.6 in the ticket was because HBase
>> 20.2 had a bug in it. Ask the HBase folks, but I'd say you should
>> upgrade.
>> FWIW we upgraded to 20.6 from 20.2 a few months back and it's been
>> working smoothly.
>>
>> The Elephant-Bird hbase loader for pig 0.6 does add row keys and most
>> of the other features we added to the built-in loader for pig 0.8
>> (notably, it does not do storage). But I don't recommend downgrading
>> to pig 0.6, as 0.7 and especially 0.8 are great improvements to the
>> software.
>>
>> -D
>>
>> On Mon, Oct 25, 2010 at 7:01 AM, Anze <[email protected]> wrote:
>> > Hi all!
>> >
>> > I am struggling to find a working solution to load data from HBase
>> > directly. I am using Cloudera CDH3b3 which comes with Pig 0.7. What
>> > would be the easiest way to load data from HBase?
>> > If it matters: we need the row keys to be included, too.
>> >
>> > I have checked ElephantBird, but it seems to require Pig 0.6. I could
>> > downgrade, but it seems... well... :)
>> >
>> > On the other hand, loading from HBase with row keys was only added in Pig 0.8:
>> > https://issues.apache.org/jira/browse/PIG-915
>> > https://issues.apache.org/jira/browse/PIG-1205
>> > But judging from the last issue Pig 0.8 requires HBase 0.20.6?
>> >
>> > I can install latest Pig from source if needed, but I'd rather leave
>> > Hadoop and HBase at their versions (0.20.2 and 0.89.20100924
>> > respectively).
>> >
>> > Should I write my own UDF? I'd appreciate some pointers.
>> >
>> > Thanks,
>> >
>> > Anze
>
>
