Hi Namit Jain!

The XML file is not generated from Hive. I already have the XML files from
somewhere else.

On Sat, Jun 12, 2010 at 5:07 AM, Namit Jain <[email protected]> wrote:

> I don't think this will work.
>
> If the data is already in text format, I think the SequenceFileRecordReader
> will still be used to read it (the table is declared STORED AS SEQUENCEFILE),
> which will break.
>
> Do you already have the XML file from somewhere, or is it also generated by
> Hive?
>
>
>
> ________________________________________
> From: Ashish Thusoo [[email protected]]
> Sent: Friday, June 11, 2010 4:03 PM
> To: [email protected]
> Subject: RE: Create Table with Line Terminated other than '\n'
>
> The other option is to use the regular expression SerDe, something along
> these lines:
>
> create table xyz(doc STRING)
> ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
> WITH SERDEPROPERTIES (
>  "input.regex" = "java regular expression",
>  "output.format.string" = "%1$s"
> )
> STORED AS SEQUENCEFILE;
>
> I think that may work for you.
>
> The input.regex parameter takes a Java regular expression whose capturing
> groups define the columns of a row (in your case there will be only one
> column). The output.format.string says that the first group (%1$s) is the
> only column in this row, and it is of type string.
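
To make the grouping concrete, here is a minimal plain-Java sketch of what the
two properties describe. It is not Hive's RegexSerDe code. The class name
RegexColumnSketch, its two methods, the pattern "(.*)" and the sample line are
all assumptions for illustration, since the thread leaves the actual regular
expression unspecified.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class RegexColumnSketch {
    // Hypothetical stand-in for "input.regex": a single capturing group
    // that spans the whole record, so the row has exactly one column.
    public static final Pattern INPUT_REGEX = Pattern.compile("(.*)");

    // Same value as "output.format.string" in the thread: emit argument 1 as a string.
    public static final String OUTPUT_FORMAT = "%1$s";

    // Reading: return capturing group 1 when the whole record matches.
    // This sketch returns null for a record that does not match.
    public static String readColumn(String record) {
        Matcher m = INPUT_REGEX.matcher(record);
        return m.matches() ? m.group(1) : null;
    }

    // Writing: format the single column back into a record.
    public static String writeRow(String column) {
        return String.format(OUTPUT_FORMAT, column);
    }

    public static void main(String[] args) {
        String record = "<doc><id>1</id></doc>"; // made-up sample record
        String column = readColumn(record);
        System.out.println(column);
        System.out.println(writeRow(column));
    }
}
```

The pattern is applied to one record at a time, so it decides how a record is
cut into columns, not where one record ends and the next begins.
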
>
> Ashish
>
>
>
> ________________________________
> From: Shuja Rehman [mailto:[email protected]]
> Sent: Friday, June 11, 2010 4:38 AM
> To: [email protected]
> Subject: Re: Create Table with Line Terminated other than '\n'
>
> Zheng Shao, is there any other solution?
>
> On Fri, Jun 11, 2010 at 10:38 AM, Shuja Rehman <[email protected]> wrote:
> Hi,
> Yes Zheng, Hadoop does not allow line terminators other than \n. I tried this:
>
> create table test (xmlFile STRING)
> ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
> LINES TERMINATED BY '\001';
>
> but it gives me this error:
>
> ERROR ql.Driver: FAILED: Error in semantic analysis: LINES TERMINATED BY
> only supports newline '\n' right now
>
> What could be the solution then? Any help would be appreciated.
>
>
> On Fri, Jun 11, 2010 at 7:22 AM, Zheng Shao <[email protected]> wrote:
> Also, changing "LINES TERMINATED BY" probably won't work, because
> hadoop's TextInputFormat does not allow line terminators other than
> "\n".
>
> Zheng
>
> On Thu, Jun 10, 2010 at 6:31 PM, Carl Steinbach <[email protected]> wrote:
> > Hi Shuja,
> > The grammar for Hive's CREATE TABLE statement is discussed
> > here: http://wiki.apache.org/hadoop/Hive/LanguageManual/DDL#Create_Table
> > You need to use the "LINES TERMINATED BY" clause in the CREATE TABLE
> > statement in order to specify a line terminator other than "\n".
> > Carl
> >
> > On Thu, Jun 10, 2010 at 5:39 PM, Shuja Rehman <[email protected]> wrote:
> >>
> >> Hi,
> >> I want to create a table in Hive whose row format uses a line terminator
> >> other than '\n', so that I can read an XML file as a single cell (one row
> >> and one column) of the table.
> >> Kindly let me know how to do this.
> >> Thanks
> >>
> >>
> >>
> >> --
> >> Regards
> >> Shuja-ur-Rehman Baig
> >> _________________________________
> >> MS CS - School of Science and Engineering
> >> Lahore University of Management Sciences (LUMS)
> >> Sector U, DHA, Lahore, 54792, Pakistan
> >> Cell: +92 3214207445
> >
> >
>
>
>
> --
> Yours,
> Zheng
> http://www.linkedin.com/in/zshao
>
>



-- 
Regards
Shuja-ur-Rehman Baig
_________________________________
MS CS - School of Science and Engineering
Lahore University of Management Sciences (LUMS)
Sector U, DHA, Lahore, 54792, Pakistan
Cell: +92 3214207445
