Hi Arsalan, Are you trying to insert this data into HBase or are you trying to just process this log file using Hadoop? I am not sure how your question is related to HBase, so if it is unrelated, you can seek help on the mapreduce user lists.
For a MR job, you can use TextInputFormat and specify the custom separator. See https://issues.apache.org/jira/browse/HADOOP-3295. Best Regards, Sonal Crux: Reporting for HBase <https://github.com/sonalgoyal/crux> Nube Technologies <http://www.nubetech.co> <http://in.linkedin.com/in/sonalgoyal> On Fri, Sep 30, 2011 at 12:26 PM, Arsalan Bilal <[email protected]>wrote: > No , i did not try Guava's Splitter > I am asking about such input format class that takes also separator. > As example, job.setInputFormatClass(<Class Format>, <Seperator>); > What will be inputFormatClass here that support separator? > > > On Thu, Sep 29, 2011 at 8:39 PM, Buttler, David <[email protected]> wrote: > > > Have you considered just taking the line of text as is and using Guava's > > Splitter? > > > > Not sure how this is related to HBase > > > > -----Original Message----- > > From: Arsalan Bilal [mailto:[email protected]] > > Sent: Thursday, September 29, 2011 7:24 AM > > To: [email protected] > > Subject: Problem in Input Format Class > > > > i want to read text file (sample given below) separated with semicolon(;) > > using mapper. Each one record is separated with semicolon(;). > > Should i write my own custom input format class? OR > > Is there exist any input format class that ask about separator? > > > > input File look > > > > 1;00000003;310:012:8001:01;-05:00;04:04;2010;45;56164773;3;1;0;1; > > > > > > > -- > Best Regards, > Arsalan Bilal >
