Re: Hive error when loading csv data.

Sandeep Reddy P Tue, 26 Jun 2012 19:53:26 -0700

Thanks Michael Sorry i didnt get that soon. I'll try that and reply you
back.


On Tue, Jun 26, 2012 at 10:13 PM, Michel Segel <michael_se...@hotmail.com>wrote:

> Sorry,
> I was saying  that you can write a python script that replaces the
> delimiter with a | and ignore the commas within quotes.
>
>
> Sent from a remote device. Please excuse any typos...
>
> Mike Segel
>
> On Jun 26, 2012, at 8:58 PM, Sandeep Reddy P <sandeepreddy.3...@gmail.com>
> wrote:
>
> > If i do that my data will be d|"abc|def"|abcd my problem is not solved
> >
> > On Tue, Jun 26, 2012 at 6:48 PM, Michel Segel <michael_se...@hotmail.com
> >wrote:
> >
> >> Yup. I just didnt add the quotes.
> >>
> >> Sent from a remote device. Please excuse any typos...
> >>
> >> Mike Segel
> >>
> >> On Jun 26, 2012, at 4:30 PM, Sandeep Reddy P <
> sandeepreddy.3...@gmail.com>
> >> wrote:
> >>
> >>> Thanks for the reply.
> >>> I didnt get that Michael. My f2 should be "abc,def"
> >>>
> >>> On Tue, Jun 26, 2012 at 4:00 PM, Michael Segel <
> >> michael_se...@hotmail.com>wrote:
> >>>
> >>>> Alternatively you could write a simple script to convert the csv to a
> >> pipe
> >>>> delimited file so that "abc,def" will be abc,def.
> >>>>
> >>>> On Jun 26, 2012, at 2:51 PM, Harsh J wrote:
> >>>>
> >>>>> Hive's delimited-fields-format record reader does not handle quoted
> >>>>> text that carry the same delimiter within them. Excel supports such
> >>>>> records, so it reads it fine.
> >>>>>
> >>>>> You will need to create your table with a custom InputFormat class
> >>>>> that can handle this (Try using OpenCSV readers, they support this),
> >>>>> instead of relying on Hive to do this for you. If you're successful
> in
> >>>>> your approach, please also consider contributing something back to
> >>>>> Hive/Pig to help others.
> >>>>>
> >>>>> On Wed, Jun 27, 2012 at 12:37 AM, Sandeep Reddy P
> >>>>> <sandeepreddy.3...@gmail.com> wrote:
> >>>>>>
> >>>>>>
> >>>>>> Hi all,
> >>>>>> I have a csv file with 46 columns but i'm getting error when i do
> some
> >>>>>> analysis on that data type. For simplification i have taken 3
> columns
> >>>> and
> >>>>>> now my csv is like
> >>>>>> c,zxy,xyz
> >>>>>> d,"abc,def",abcd
> >>>>>>
> >>>>>> i have created table for this data using,
> >>>>>> hive> create table test3(
> >>>>>>> f1 string,
> >>>>>>> f2 string,
> >>>>>>> f3 string)
> >>>>>>> row format delimited
> >>>>>>> fields terminated by ",";
> >>>>>> OK
> >>>>>> Time taken: 0.143 seconds
> >>>>>> hive> load data local inpath '/home/training/a.csv'
> >>>>>>> into table test3;
> >>>>>> Copying data from file:/home/training/a.csv
> >>>>>> Copying file: file:/home/training/a.csv
> >>>>>> Loading data to table default.test3
> >>>>>> OK
> >>>>>> Time taken: 0.276 seconds
> >>>>>> hive> select * from test3;
> >>>>>> OK
> >>>>>> c       zxy     xyz
> >>>>>> d       "abc    def"
> >>>>>> Time taken: 0.156 seconds
> >>>>>>
> >>>>>> When i do select f2 from test3;
> >>>>>> my results are,
> >>>>>> OK
> >>>>>> zxy
> >>>>>> "abc
> >>>>>> but this should be abc,def
> >>>>>> When i open the same csv file with Microsoft Excel i got abc,def
> >>>>>> How should i solve this error??
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>> --
> >>>>>> Thanks,
> >>>>>> sandeep
> >>>>>>
> >>>>>> --
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>
> >>>>>
> >>>>>
> >>>>> --
> >>>>> Harsh J
> >>>>>
> >>>>
> >>>>
> >>>
> >>>
> >>> --
> >>> Thanks,
> >>> sandeep
> >>
> >
> >
> >
> > --
> > Thanks,
> > sandeep
>



-- 
Thanks,
sandeep

Re: Hive error when loading csv data.

Reply via email to