Hello:

I want to do something simple:

1) I have a data file, mydata.log,  formatted like this:

a1 | b1 | c=foo&d=bar | e1
a2 | b2 | c=john&d=doe | e2
a3 | b3 | c=foo&d=doe | e3
...

2) and I want to LOAD the data USING something bu what ? in order to get the
AS as I want it i.e. fpr example :

data = LOAD 'mydata.log' USING ??? AS (A, B, C, D, E);

i.e. I want the third field (i.e. the one formatted as 'cx=foox&dx=barx') to
be parsed to yield the C and D in my AS liste of field
so that later on I can do things like:

data_cfoo = FILTER data BY c == 'foo';
data_cfoo_ddoe = FILTER data_cfoo BY d='doe';

for example.

There has to have a simple way way to do that ?
Passing a parsing shell cmd or script, a regex, a ruby script or what else
to PigStorage, or using something else than PigStorage?

Many thanks

Yves

YVES
DE FJORD

   YVES ROY DÉVELOPPEUR LOGICIEL DE FJORD
2100, RUE DRUMMOND, MONTRÉAL, QUÉBEC H3G 1X1 CANADA
T 514 270 8782 #4572 / F 514 270 4162 / cossette.com

Reply via email to