Yep. It's just JsonLoader.
By default it works on top of whatever's returned by TexInputFormat, but
you can override that, as long as the input format returns a string that's
valid json, we are cool (so in theory you could write a
TwitterAPIInputFormat or something, and get the json in Pig, not that I
would recommend that).

D

On Wed, Sep 26, 2012 at 9:34 PM, Russell Jurney <russell.jur...@gmail.com>wrote:

> Does that work without lzo?
>
> Russell Jurney http://datasyndrome.com
>
> On Sep 26, 2012, at 9:00 PM, Dmitriy Ryaboy <dvrya...@gmail.com> wrote:
>
> > Try asking Michael May on gihub? This seems to be an issue with his
> Loader..
> >
> > The JsonLoader in ElephantBird should work in this case if you turn on
> > nested parsing (
> >
> https://github.com/kevinweil/elephant-bird/blob/master/pig/src/main/java/com/twitter/elephantbird/pig/load/JsonLoader.java
> > )
> >
> > D
> >
> > On Wed, Sep 26, 2012 at 2:31 PM, Deepak Tiwari <dtiwari...@gmail.com>
> wrote:
> >
> >> My bad.. I think I have compiled from
> >> https://github.com/mmay/PigJsonLoader/blob/master/JsonLoader.java long
> >> time
> >> back in my piggybank area..it indeed didnt come with the original jar...
> >>
> >> Regards,
> >>
> >> Deepak
> >>
> >> On Tue, Sep 25, 2012 at 8:14 AM, Bill Graham <billgra...@gmail.com>
> wrote:
> >>
> >>> I missed the part about Piggybank, but I'm confused because I don't see
> >>> that class in SVN:
> >>>
> >>>
> >>
> http://svn.apache.org/viewvc/pig/branches/branch-0.10/contrib/piggybank/java/src/main/java/org/apache/pig/piggybank/storage/
> >>>
> >>> Either way your error seems to be issues with parsing the doubles.
> >>>
> >>>
> >>> On Mon, Sep 24, 2012 at 2:24 PM, Vivek Shrivastava <
> >>> vivshrivast...@gmail.com
> >>>> wrote:
> >>>
> >>>> Thanks for responding Bill, However I am using JsonLoader that is in
> >> the
> >>>> Piggybank with Pig-0.10.0.
> >>>>
> >>>> It doesnt need any schema and converts Json data as map (
> >>>> org.apache.pig.piggybank.storage.JsonLoader() as (json:map[]) ) and I
> >>>> extract data from there using keys. I have processed huge amount of
> >> data
> >>>> without any problem and no schema was required.
> >>>>
> >>>> Regards,
> >>>>
> >>>> Vivek
> >>>>
> >>>> On Mon, Sep 24, 2012 at 2:03 PM, Bill Graham <billgra...@gmail.com>
> >>> wrote:
> >>>>
> >>>>> This loader only works for data stored using JsonStorage. From the
> >>>>> javadocs:
> >>>>>
> >>>>> A loader for data stored using
> >>>>> JsonStorage<
> >>>>>
> >>>
> >>
> http://pig.apache.org/docs/r0.10.0/api/org/apache/pig/builtin/JsonStorage.html
> >>>>>> .
> >>>>>
> >>>>> This is not a generic JSON loader. It depends on the schema being
> >> stored
> >>>>> with the data when conceivably you could write a loader that
> >> determines
> >>>>> the
> >>>>> schema from the JSON.
> >>>>>
> >>>>> Was this data produced via JsonStorage? If not, you'll need to write
> a
> >>>>> custom loader.
> >>>>>
> >>>>> On Mon, Sep 24, 2012 at 12:04 PM, Deepak Tiwari <
> dtiwari...@gmail.com
> >>>>>> wrote:
> >>>>>
> >>>>>> Hi,
> >>>>>>
> >>>>>> I am try to parse this data using Pig parser
> >>>>>> org.apache.pig.piggybank.storage.JsonLoader
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>
> >>>
> >>
> {"geo":{"type":"Polygon","coordinates":[[[-91.3061478,-30.2688069],[-91.012471,-60.2688069],[-91.012471,-69.9306357],[-91.3061478,-29.9306357]]]},
> >>>>>>
> >>>>>> I need to extract this array
> >>>>>>
> >>>>>>
> >>>>>
> >>>
> >>
> [[[-91.3061478,-30.2688069],[-91.012471,-60.2688069],[-91.012471,-69.9306357],[-91.3061478,-29.9306357]]]
> >>>>>>
> >>>>>> I am getting this error while accessing flatten(geo#'coordinates')
> >> , I
> >>>>>> think that's the limitation ( "only standard Pig type is supported")
> >>> of
> >>>>> the
> >>>>>> the parser, but wondering if someone has any workaround
> >>>>>>
> >>>>>> "java.lang.RuntimeException: Unexpected data type
> >>>>>> org.codehaus.jackson.node.DoubleNode found in stream. Note only
> >>> standard
> >>>>>> Pig type is supported when you output from UDF/LoadFunc"
> >>>>>>
> >>>>>>
> >>>>>> Thanks very much,
> >>>>>>
> >>>>>> Deepak
> >>>>>>
> >>>>>
> >>>>>
> >>>>>
> >>>>> --
> >>>>> *Note that I'm no longer using my Yahoo! email address. Please email
> >> me
> >>> at
> >>>>> billgra...@gmail.com going forward.*
> >>>>>
> >>>>
> >>>>
> >>>
> >>>
> >>> --
> >>> *Note that I'm no longer using my Yahoo! email address. Please email me
> >> at
> >>> billgra...@gmail.com going forward.*
> >>>
> >>
>

Reply via email to