I dont think this will work. If the data is already in TextFormat, I think the SequenceFileRecordReader will be used to read the data which will break
Do you already have the xml file from somewhere, or is also generated from hive ________________________________________ From: Ashish Thusoo [[email protected]] Sent: Friday, June 11, 2010 4:03 PM To: [email protected] Subject: RE: Create Table with Line Terminated other than '\n' The other option is to write use the regular expression serde .. something on the lines... create table xyz(doc STRING) ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe' WITH SERDEPROPERTIES ( "input.regex" = "java regular expression", "output.format.string" = "%1$s" ) STORED AS SEQUENCEFILE; I think that may work for you. The input.regex parameter has a java regular expression that groups columns in a row (in your case there will be only one column). The output.format.string says that %1 grouping is the only column in this row which is of type string. Ashish ________________________________ From: Shuja Rehman [mailto:[email protected]] Sent: Friday, June 11, 2010 4:38 AM To: [email protected] Subject: Re: Create Table with Line Terminated other than '\n' Zheng Shao !!!! Any other solution??? On Fri, Jun 11, 2010 at 10:38 AM, Shuja Rehman <[email protected]<mailto:[email protected]>> wrote: Hi yeah Zheng,hadoop does not allowing other than \n. as i tried like this create table test (xmlFile String)ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' LINES TERMINATED BY '\001' ; but it giving me the error saying that ERROR ql.Driver: FAILED: Error in semantic analysis: LINES TERMINATED BY only supports newline '\n' right now Then what can be the solution???? ANY HELP????????????? On Fri, Jun 11, 2010 at 7:22 AM, Zheng Shao <[email protected]<mailto:[email protected]>> wrote: Also, changing "LINES TERMINATED BY" probably won't work, because hadoop's TextInputFormat does not allow line terminators other than "\n". Zheng On Thu, Jun 10, 2010 at 6:31 PM, Carl Steinbach <[email protected]<mailto:[email protected]>> wrote: > Hi Shuja, > The grammar for Hive's CREATE TABLE statement is discussed > here: http://wiki.apache.org/hadoop/Hive/LanguageManual/DDL#Create_Table > You need to use the "LINES TERMINATED BY" clause in the CREATE TABLE > statement in order to specify a line terminator other than "\n". > Carl > > On Thu, Jun 10, 2010 at 5:39 PM, Shuja Rehman > <[email protected]<mailto:[email protected]>> wrote: >> >> Hi >> I want to create a table in hive which should have row formated line >> terminated other than '\n'. so i can read xml file as single cell in one row >> and column of table. >> kindly let me know how to do this? >> THanks >> >> >> >> -- >> Regards >> Shuja-ur-Rehman Baig >> _________________________________ >> MS CS - School of Science and Engineering >> Lahore University of Management Sciences (LUMS) >> Sector U, DHA, Lahore, 54792, Pakistan >> Cell: +92 3214207445 > > -- Yours, Zheng http://www.linkedin.com/in/zshao -- Regards Shuja-ur-Rehman Baig _________________________________ MS CS - School of Science and Engineering Lahore University of Management Sciences (LUMS) Sector U, DHA, Lahore, 54792, Pakistan Cell: +92 3214207445 -- Regards Shuja-ur-Rehman Baig _________________________________ MS CS - School of Science and Engineering Lahore University of Management Sciences (LUMS) Sector U, DHA, Lahore, 54792, Pakistan Cell: +92 3214207445
