Did I also say thank you to both of you and everyone involved for Elephant Bird?
An extremely useful set of tools (like a wonderful Christmas present).

Cheers
Saptarshi


On Mon, Mar 21, 2011 at 4:50 PM, Saptarshi Guha
<[email protected]> wrote:
> Hi Dmitriy and Gerrit,
>
> I did the following:
>
> - confirmed that LzoPigStorage is indeed reading my LZO files (took
> this from Gerrit's GitHub)
> - confirmed that my LzoBaseRegexLoader subclass was getting
> strings (from the pig-08 branch of Dmitriy's GitHub)
>
> Both work. I was (foolishly) messing around with the wrong file extensions.
>
> Thanks
> Saptarshi
>
>
>
>
> On Mon, Mar 21, 2011 at 1:22 PM, Dmitriy Ryaboy <[email protected]> wrote:
>> Try the more up-to-date version
>> at https://github.com/dvryaboy/elephant-bird/tree/pig-08
>> Please send me your class if it still fails, along with the (uncompressed)
>> input data needed to reproduce the error.
>> D
>>
>> On Mon, Mar 21, 2011 at 1:11 PM, Saptarshi Guha <[email protected]>
>> wrote:
>>>
>>> Hello,
>>>
>>> I have some LZO files, which I
>>>
>>> a) indexed via DistributedLzoIndexer to create index files
>>> b) did not index, so just some LZO files in a directory.
>>>
>>> Using both approaches, I tried creating a subclass of LzoBaseRegexLoader
>>> that returns a pattern.
>>> Sadly, not a single line matched. This is not a problem with the regex
>>> (I checked that it works with other strings).
>>> I modified LzoBaseRegexLoader.java to print the incoming strings, and
>>> I'm getting binary data, e.g.
>>>
>>> http://pastebin.com/wAveGzDy
>>>
>>> I'm using Pig 0.8 and ElephantBird checked out from
>>> https://github.com/gerritjvv/elephant-bird
>>>
>>> Any suggestions?
>>>
>>> Saptarshi
>>
>>
>
