Did I also say, thank you to both of you and everyone involved for elephantbird. An extremely useful set of tools (like a wonderful christmas present)
Cheers Saptarshi On Mon, Mar 21, 2011 at 4:50 PM, Saptarshi Guha <[email protected]> wrote: > Hi Dmitriy and Gerrit, > > I did the following, > > - confirmed that LzoPigStorage is indeed reading my lzo files (took > this from gerrit's github) > - confirmed that LzoBaseRegexLoader sublcass (mine) was getting > strings (from pig-08 branch of Dmitriy's github) > > Both work. I was messing around (foolishly) with wrong extensions. > > Thanks > Saptarshi > > > > > On Mon, Mar 21, 2011 at 1:22 PM, Dmitriy Ryaboy <[email protected]> wrote: >> Try the more up-to-date version >> at https://github.com/dvryaboy/elephant-bird/tree/pig-08 >> please send me your class if it still fails, and the input (uncompressed) >> data to reproduce the error. >> D >> >> On Mon, Mar 21, 2011 at 1:11 PM, Saptarshi Guha <[email protected]> >> wrote: >>> >>> Hello, >>> >>> I have some LZO files, which i >>> >>> a) indexed via DistributedLzoIndexer to create index files >>> b) did not index, so just some LZO files in a directory. >>> >>> Using both approaches, I tried creating a subclass LzoBaseRegexLoader >>> that returns a pattern. >>> Sadly, not a single line matched. This is not a problem of the regex >>> (checked it works with other strings), >>> i modified LzoBaseRegexLoader.java to print the strings coming in and >>> I'm getting binary e.g. >>> >>> http://pastebin.com/wAveGzDy >>> >>> I'm using Pig 0.8 and ElephantBird checked out from >>> https://github.com/gerritjvv/elephant-bird >>> >>> Any suggestions? >>> >>> Saptarshi >> >> >
