Fix column here means you have to define the table using SerDe in advance.
If you have new columns added to the json, the table has to be changed.
Create a table only have a column, such as value. Each row is line of json.
Then use the jason tuple UDF to parcer the json on the fly

On Fri, Jun 5, 2015 at 11:55 AM, Ayazur Rehman <rehman.ayazu...@gmail.com>
wrote:

> What is the meaning of fix column...does it mean non-nested columns?
> So I decided to use the UDF ...
> But I am still struggling with a command to create a table .... Is it
> necessary to use String datatype...even if I have integers....and how do I
> specify the nested data types....like the following ....
>
> "columnD": {"some_string":10.0}
>
> Also, how can I run group by queries on this file...
>
> Apologies for the silly questions...I haven't dealt with complicated
> things on hive and..the latest stable release of Pig has a bug...where it
> can't read array data type...
>
>
> On Fri, Jun 5, 2015 at 11:33 AM, DU DU <will...@gmail.com> wrote:
>
>> Using Json serde requires you have fix column. In your case, you should
>> consider using HIVE UDF mentioned above
>>
>> On Fri, Jun 5, 2015 at 11:13 AM, Wollert, Fabian <
>> fabian.woll...@zalando.de> wrote:
>>
>>> you can either pursuit the approaches from the others (reading the whole
>>> table with JSON Input Format) or you read the File with normal Input Format
>>> and STRING Datatype and then apply the function i mentioned afterwards.
>>> Both Approaches work, i usually use mine. I don't know though which one is
>>> best for performance.
>>>
>>> Cheers
>>> Fabian
>>>
>>> 2015-06-05 17:03 GMT+02:00 Ayazur Rehman <rehman.ayazu...@gmail.com>:
>>>
>>>> Hi Fabian,
>>>>
>>>> The documentation only points to how to run some queries with json
>>>> format table and I could not find the documentation for creating a Json
>>>> Table. Did I miss something in the documentation?
>>>> I apologize if my question was not clear.
>>>> Thanks
>>>>
>>>> On Fri, Jun 5, 2015 at 10:38 AM, Wollert, Fabian <
>>>> fabian.woll...@zalando.de> wrote:
>>>>
>>>>> Hey Ayazur, please see here for doc's on how to read JSON in Hive:
>>>>>
>>>>>
>>>>> https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-get_json_object
>>>>>
>>>>> Cheers
>>>>> Fabian
>>>>>
>>>>> 2015-06-05 16:30 GMT+02:00 Ayazur Rehman <rehman.ayazu...@gmail.com>:
>>>>>
>>>>>> Hi,
>>>>>>
>>>>>> Can someone tell me how to read Json file in hive
>>>>>> My Hive version is 0.14
>>>>>>
>>>>>> the schema is something like this-
>>>>>>
>>>>>> {"columnA": "some_string", "columnB": 227958.0, "columnC": [0.0, 0.0, 
>>>>>> 0.0, 0.0], "columnD": {"some_string":10.0}}
>>>>>>
>>>>>> If there is documentation about it, please let me know as well
>>>>>>
>>>>>> --
>>>>>> Thanking You,
>>>>>> Ayazur Rehman
>>>>>> +1 720 917 8540
>>>>>>
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> *Fabian Wollert*
>>>>> Business Intelligence
>>>>>
>>>>>
>>>>>
>>>>> *POSTAL ADDRESS*
>>>>> Zalando SE
>>>>> 11501 Berlin
>>>>>
>>>>> *OFFICE*
>>>>> Zalando SE
>>>>> Mollstraße 1
>>>>> 10178 Berlin
>>>>> Germany
>>>>>
>>>>> Phone: +49 30 20968 1819
>>>>> Fax:   +49 30 27594 693
>>>>> E-Mail: fabian.woll...@zalando.de
>>>>> Web: www.zalando.de
>>>>> Jobs: jobs.zalando.de
>>>>>
>>>>> Zalando SE, Tamara-Danz-Straße 1, 10243 Berlin
>>>>> Company registration: Amtsgericht Charlottenburg, HRB 158855 B
>>>>> Tax ID: 29/560/00596 * VAT registration number: DE 260543043
>>>>> Management Board: Robert Gentz, David Schneider, Rubin Ritter
>>>>> Chairperson of the Supervisory Board: Cristina Stenbeck
>>>>> Registered office: Berlinn
>>>>>
>>>>
>>>>
>>>>
>>>> --
>>>> Thanking You,
>>>> Ayazur Rehman
>>>> +1 720 917 8540
>>>>
>>>
>>>
>>>
>>> --
>>> *Fabian Wollert*
>>> Business Intelligence
>>>
>>>
>>>
>>> *POSTAL ADDRESS*
>>> Zalando SE
>>> 11501 Berlin
>>>
>>> *OFFICE*
>>> Zalando SE
>>> Mollstraße 1
>>> 10178 Berlin
>>> Germany
>>>
>>> Phone: +49 30 20968 1819
>>> Fax:   +49 30 27594 693
>>> E-Mail: fabian.woll...@zalando.de
>>> Web: www.zalando.de
>>> Jobs: jobs.zalando.de
>>>
>>> Zalando SE, Tamara-Danz-Straße 1, 10243 Berlin
>>> Company registration: Amtsgericht Charlottenburg, HRB 158855 B
>>> Tax ID: 29/560/00596 * VAT registration number: DE 260543043
>>> Management Board: Robert Gentz, David Schneider, Rubin Ritter
>>> Chairperson of the Supervisory Board: Cristina Stenbeck
>>> Registered office: Berlinn
>>>
>>
>>
>>
>> --
>> Thanks,
>> Will
>>
>
>
>
> --
> Thanking You,
> Ayazur Rehman
> +1 720 917 8540
>



-- 
Thanks,
Will

Reply via email to