Hello-

Using Pig version 0.10.0.

I'm trying to import BlueCoat proxy logs using Pig, and I'm having
difficulty with the field delimiters (multiple spaces, spaces embedded
within the browser description string, etc.). This seems like it would
be a common Pig application.

I found an old example using ...piggybank.evaluation.string.EXTRACT that
seemed to fit the bill, but alas, EXTRACT has been deprecated.

EXTRACT allowed regex-like parsing, so I checked out
...piggybank.storage.RegExLoader, and then
...piggybank.storage.apachelog.CommonLogLoader.

I can't work out what the usage comments mean. Basically, I don't see
where to set the regex pattern string for my needs.

Can anyone explain the usage of either of these classes or point me to
an example?

Thanks,

Terry
