"Fixed columns" here means that with a JSON SerDe you have to define the table's columns in advance. If new fields are added to the JSON, the table definition has to be changed. Instead, create a table that has only a single column, such as value, where each row is one line of JSON. Then use the json_tuple UDF to parse the JSON on the fly.
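A rough sketch of what I mean, in HiveQL. The table names (json_raw, json_typed) and the file path are made up for illustration, and I have not run this against your data, so treat it as a starting point rather than a tested recipe. The column names come from the sample record in your first mail.

```sql
-- One STRING column; each row holds one complete line of JSON.
CREATE TABLE json_raw (value STRING);

-- Hypothetical path; point this at your own file.
LOAD DATA LOCAL INPATH '/tmp/sample.json' INTO TABLE json_raw;

-- json_tuple returns every field as a STRING, so cast numbers explicitly.
-- Nested values can be reached with get_json_object and a JSON path.
SELECT t.columnA,
       CAST(t.columnB AS DOUBLE)                                        AS columnB,
       CAST(get_json_object(r.value, '$.columnC[0]') AS DOUBLE)         AS columnC_first,
       CAST(get_json_object(r.value, '$.columnD.some_string') AS DOUBLE) AS columnD_value
FROM json_raw r
LATERAL VIEW json_tuple(r.value, 'columnA', 'columnB') t AS columnA, columnB;

-- GROUP BY works on the extracted fields like on any other column.
SELECT t.columnA,
       SUM(CAST(t.columnB AS DOUBLE)) AS total_b
FROM json_raw r
LATERAL VIEW json_tuple(r.value, 'columnA', 'columnB') t AS columnA, columnB
GROUP BY t.columnA;

-- For contrast, the fixed-column SerDe approach with typed, nested columns.
-- Assumption: the HCatalog JsonSerDe is available; on some installs the
-- hive-hcatalog-core jar has to be added to the session first.
CREATE TABLE json_typed (
  columnA STRING,
  columnB DOUBLE,
  columnC ARRAY<DOUBLE>,
  columnD MAP<STRING, DOUBLE>
)
ROW FORMAT SERDE 'org.apache.hive.hcatalog.data.JsonSerDe'
STORED AS TEXTFILE;
```

So the STRING type is only needed for the raw line itself. With the UDF approach you cast each extracted value to the type you want at query time, while the SerDe table fixes the types (including ARRAY and MAP for the nested parts) in the table definition.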
On Fri, Jun 5, 2015 at 11:55 AM, Ayazur Rehman <rehman.ayazu...@gmail.com> wrote:

> What is the meaning of fix column...does it mean non-nested columns?
> So I decided to use the UDF ...
> But I am still struggling with a command to create a table .... Is it
> necessary to use String datatype...even if I have integers....and how do I
> specify the nested data types....like the following ....
>
> "columnD": {"some_string":10.0}
>
> Also, how can I run group by queries on this file...
>
> Apologies for the silly questions...I haven't dealt with complicated
> things on hive and..the latest stable release of Pig has a bug...where it
> can't read array data type...
>
>
> On Fri, Jun 5, 2015 at 11:33 AM, DU DU <will...@gmail.com> wrote:
>
>> Using Json serde requires you have fix column. In your case, you should
>> consider using HIVE UDF mentioned above
>>
>> On Fri, Jun 5, 2015 at 11:13 AM, Wollert, Fabian <
>> fabian.woll...@zalando.de> wrote:
>>
>>> you can either pursuit the approaches from the others (reading the whole
>>> table with JSON Input Format) or you read the File with normal Input Format
>>> and STRING Datatype and then apply the function i mentioned afterwards.
>>> Both Approaches work, i usually use mine. I don't know though which one is
>>> best for performance.
>>>
>>> Cheers
>>> Fabian
>>>
>>> 2015-06-05 17:03 GMT+02:00 Ayazur Rehman <rehman.ayazu...@gmail.com>:
>>>
>>>> Hi Fabian,
>>>>
>>>> The documentation only points to how to run some queries with json
>>>> format table and I could not find the documentation for creating a Json
>>>> Table. Did I miss something in the documentation?
>>>> I apologize if my question was not clear.
>>>> Thanks
>>>>
>>>> On Fri, Jun 5, 2015 at 10:38 AM, Wollert, Fabian <
>>>> fabian.woll...@zalando.de> wrote:
>>>>
>>>>> Hey Ayazur, please see here for doc's on how to read JSON in Hive:
>>>>>
>>>>> https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-get_json_object
>>>>>
>>>>> Cheers
>>>>> Fabian
>>>>>
>>>>> 2015-06-05 16:30 GMT+02:00 Ayazur Rehman <rehman.ayazu...@gmail.com>:
>>>>>
>>>>>> Hi,
>>>>>>
>>>>>> Can someone tell me how to read Json file in hive
>>>>>> My Hive version is 0.14
>>>>>>
>>>>>> the schema is something like this-
>>>>>>
>>>>>> {"columnA": "some_string", "columnB": 227958.0, "columnC": [0.0, 0.0,
>>>>>> 0.0, 0.0], "columnD": {"some_string":10.0}}
>>>>>>
>>>>>> If there is documentation about it, please let me know as well
>>>>>>
>>>>>> --
>>>>>> Thanking You,
>>>>>> Ayazur Rehman
>>>>>> +1 720 917 8540
>>>>>>
>>>>>
>>>>> --
>>>>> *Fabian Wollert*
>>>>> Business Intelligence
>>>>>
>>>>> *POSTAL ADDRESS*
>>>>> Zalando SE
>>>>> 11501 Berlin
>>>>>
>>>>> *OFFICE*
>>>>> Zalando SE
>>>>> Mollstraße 1
>>>>> 10178 Berlin
>>>>> Germany
>>>>>
>>>>> Phone: +49 30 20968 1819
>>>>> Fax: +49 30 27594 693
>>>>> E-Mail: fabian.woll...@zalando.de
>>>>> Web: www.zalando.de
>>>>> Jobs: jobs.zalando.de
>>>>>
>>>>> Zalando SE, Tamara-Danz-Straße 1, 10243 Berlin
>>>>> Company registration: Amtsgericht Charlottenburg, HRB 158855 B
>>>>> Tax ID: 29/560/00596 * VAT registration number: DE 260543043
>>>>> Management Board: Robert Gentz, David Schneider, Rubin Ritter
>>>>> Chairperson of the Supervisory Board: Cristina Stenbeck
>>>>> Registered office: Berlin
>>>>>
>>
>> --
>> Thanks,
>> Will
>>

--
Thanks,
Will