Hi Dmitriy and Gerrit, I did the following,
- confirmed that LzoPigStorage is indeed reading my lzo files (took this from gerrit's github) - confirmed that LzoBaseRegexLoader sublcass (mine) was getting strings (from pig-08 branch of Dmitriy's github) Both work. I was messing around (foolishly) with wrong extensions. Thanks Saptarshi On Mon, Mar 21, 2011 at 1:22 PM, Dmitriy Ryaboy <[email protected]> wrote: > Try the more up-to-date version > at https://github.com/dvryaboy/elephant-bird/tree/pig-08 > please send me your class if it still fails, and the input (uncompressed) > data to reproduce the error. > D > > On Mon, Mar 21, 2011 at 1:11 PM, Saptarshi Guha <[email protected]> > wrote: >> >> Hello, >> >> I have some LZO files, which i >> >> a) indexed via DistributedLzoIndexer to create index files >> b) did not index, so just some LZO files in a directory. >> >> Using both approaches, I tried creating a subclass LzoBaseRegexLoader >> that returns a pattern. >> Sadly, not a single line matched. This is not a problem of the regex >> (checked it works with other strings), >> i modified LzoBaseRegexLoader.java to print the strings coming in and >> I'm getting binary e.g. >> >> http://pastebin.com/wAveGzDy >> >> I'm using Pig 0.8 and ElephantBird checked out from >> https://github.com/gerritjvv/elephant-bird >> >> Any suggestions? >> >> Saptarshi > >
