Awesome! I was trying to FLATTEN(*) without the TOBAG. Thanks Thejas.
On 6/2/11 11:52 AM, Thejas M Nair wrote:
> one_word_per_line = FOREACH words GENERATE FLATTEN(TOBAG(*));
>
> -Thejas
>
>
> On 6/2/11 11:38 AM, "Xavier Stevens" <[email protected]> wrote:
>
> > I'm currently trying to write a pig script to output a feature index. Is
> > there a built-in function for converting an unknown length tuple to
> > output once for each item in the tuple?
> >
> > Example code:
> >
> > raw = LOAD 'hbase://mytable' USING HBaseStorage('data:json') AS json:chararray;
> > genmap = FOREACH raw GENERATE com.mozilla.pig.eval.json.JsonMap(json) AS json_map:map[];
> > words = FOREACH genmap GENERATE FLATTEN(com.mozilla.pig.eval.text.Normalize(json_map#'text')) AS word_tuple;
> > dump words;
> > (the,quick,brown,fox,jumped,over,the,lazy,dog)
> >
> > I want to get:
> >
> > the
> > quick
> > brown
> > fox
> > jumped
> > over
> > lazy
> > dog
> >
> > Thanks,
> >
> > -Xavier
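
For anyone finding this thread later, here is a rough sketch of why the TOBAG matters. It assumes `words` is the relation from the original question; the alias `unique_words` and the DISTINCT step are additions for illustration, not part of Thejas's answer. FLATTEN applied to a tuple only un-nests its fields into columns of the same row, while FLATTEN applied to a bag emits one row per element of the bag.

```pig
-- Assumes `words` is defined as in the original question, where each row
-- holds the normalized words of one document as separate fields.

-- TOBAG(*) wraps every field of the row in a bag, so FLATTEN
-- produces one output row per word instead of one wide row.
one_word_per_line = FOREACH words GENERATE FLATTEN(TOBAG(*));

-- Assumption: the desired output in the question lists each word once.
-- If that is the goal, DISTINCT drops the repeats (e.g. the second "the").
unique_words = DISTINCT one_word_per_line;

DUMP unique_words;
```

DISTINCT does not guarantee any particular output order, so an ORDER BY may be needed if the index has to be sorted.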
