Hi Dmitriy and Gerrit,

I did the following,

- confirmed that LzoPigStorage is indeed reading my lzo files (took
this from gerrit's github)
- confirmed that LzoBaseRegexLoader sublcass (mine) was getting
strings  (from pig-08 branch of Dmitriy's github)

Both work. I was messing around (foolishly) with wrong extensions.

Thanks
Saptarshi




On Mon, Mar 21, 2011 at 1:22 PM, Dmitriy Ryaboy <[email protected]> wrote:
> Try the more up-to-date version
> at https://github.com/dvryaboy/elephant-bird/tree/pig-08
> please send me your class if it still fails, and the input (uncompressed)
> data to reproduce the error.
> D
>
> On Mon, Mar 21, 2011 at 1:11 PM, Saptarshi Guha <[email protected]>
> wrote:
>>
>> Hello,
>>
>> I have some LZO files, which i
>>
>> a) indexed via DistributedLzoIndexer to create index files
>> b) did not index, so just some LZO files in a directory.
>>
>> Using  both approaches, I tried creating a subclass LzoBaseRegexLoader
>> that returns a pattern.
>> Sadly, not a single line matched. This is not a problem of the regex
>> (checked it works with other strings),
>> i modified LzoBaseRegexLoader.java to print the strings coming in and
>> I'm getting binary  e.g.
>>
>> http://pastebin.com/wAveGzDy
>>
>> I'm using Pig 0.8 and ElephantBird checked out from
>> https://github.com/gerritjvv/elephant-bird
>>
>> Any suggestions?
>>
>> Saptarshi
>
>

Reply via email to