Re: Processing xml documents using StreamXmlRecordReader

2012-06-21 Thread Mohammad Tariq
Hello Madhu, I really appreciate your efforts. I am sorry I did not respond back.Actually I was struggling with it, so had anything to let you know.Many thanks. Regards,     Mohammad Tariq On Thu, Jun 21, 2012 at 12:37 PM, madhu phatak wrote: > Hi, >  Jira for the new API code > htt

Re: Processing xml documents using StreamXmlRecordReader

2012-06-19 Thread Mohammad Tariq
But I have downloaded "hadoop-streaming-0.20.205.0.jar" and it contains StreamXmlRecordReader.class file. This means it should support StreamInputFormat. Regards,     Mohammad Tariq On Tue, Jun 19, 2012 at 5:54 PM, Mohammad Tariq wrote: > Thanks Madhu. I'll do that. > > Regards, >     Mohammad

Re: Processing xml documents using StreamXmlRecordReader

2012-06-19 Thread Mohammad Tariq
Thanks Madhu. I'll do that. Regards,     Mohammad Tariq On Tue, Jun 19, 2012 at 5:43 PM, madhu phatak wrote: > Seems like StreamInputFormat not yet ported to new API.That's why you are > not able to set as InputFormatClass. You can file a  jira for this issue. > > > On Tue, Jun 19, 2012 at 4:49

Re: Processing xml documents using StreamXmlRecordReader

2012-06-19 Thread madhu phatak
Seems like StreamInputFormat not yet ported to new API.That's why you are not able to set as InputFormatClass. You can file a jira for this issue. On Tue, Jun 19, 2012 at 4:49 PM, Mohammad Tariq wrote: > My driver function looks like this - > > public static void main(String[] args) throws IOEx

Re: Processing xml documents using StreamXmlRecordReader

2012-06-19 Thread Mohammad Tariq
My driver function looks like this - public static void main(String[] args) throws IOException, InterruptedException, ClassNotFoundException { // TODO Auto-generated method stub Configuration conf = new Configuration(); Job job = new Job();

Re: Processing xml documents using StreamXmlRecordReader

2012-06-19 Thread Mohammad Tariq
Hello Madhu, Thanks for the response. Actually I was trying to use the new API (Job). Have you tried that. I was not able to set the InputFormat using the Job API. Regards,     Mohammad Tariq On Tue, Jun 19, 2012 at 4:28 PM, madhu phatak wrote: > Hi, >  Set the following propertie

Re: Processing xml documents using StreamXmlRecordReader

2012-06-19 Thread madhu phatak
Hi, Set the following properties in driver class jobConf.set("stream.recordreader.class", "org.apache.hadoop.streaming.StreamXmlRecordReader"); jobConf.set("stream.recordreader.begin", "start-tag"); jobConf.set("stream.recordreader.end", "end-tag"); jobConf.setInputForma