I am asking about such* **InputFormat* and *RecordReader* that read strings
of text separated by (;) semicolon characters


On Fri, Sep 30, 2011 at 11:11 AM, Sonal Goyal <[email protected]> wrote:

> Sorry, I think I got confused by the question and talked about the
> OutputFormat not the input format, which is apparently what you are looking
> for. Please ignore my answer. Apologies!
>
> Best Regards,
> Sonal
> Crux: Reporting for HBase <https://github.com/sonalgoyal/crux>
> Nube Technologies <http://www.nubetech.co>
>
> <http://in.linkedin.com/in/sonalgoyal>
>
>
>
>
>
> On Fri, Sep 30, 2011 at 12:32 PM, Sonal Goyal <[email protected]>
> wrote:
>
> > Hi Arsalan,
> >
> > Are you trying to insert this data into HBase or are you trying to just
> > process this log file using Hadoop? I am not sure how your question is
> > related to HBase, so if it is unrelated, you can seek help on the
> mapreduce
> > user lists.
> >
> > For a MR job, you can use TextInputFormat and specify the custom
> separator.
> > See https://issues.apache.org/jira/browse/HADOOP-3295.
> >
> > Best Regards,
> > Sonal
> > Crux: Reporting for HBase <https://github.com/sonalgoyal/crux>
> > Nube Technologies <http://www.nubetech.co>
> >
> > <http://in.linkedin.com/in/sonalgoyal>
> >
> >
> >
> >
> >
> >
> > On Fri, Sep 30, 2011 at 12:26 PM, Arsalan Bilal <
> [email protected]>wrote:
> >
> >> No , i did not try Guava's Splitter
> >> I am asking about such input format class that takes also separator.
> >> As example, job.setInputFormatClass(<Class Format>, <Seperator>);
> >> What will be inputFormatClass here that support separator?
> >>
> >>
> >> On Thu, Sep 29, 2011 at 8:39 PM, Buttler, David <[email protected]>
> >> wrote:
> >>
> >> > Have you considered just taking the line of text as is and using
> Guava's
> >> > Splitter?
> >> >
> >> > Not sure how this is related to HBase
> >> >
> >> > -----Original Message-----
> >> > From: Arsalan Bilal [mailto:[email protected]]
> >> > Sent: Thursday, September 29, 2011 7:24 AM
> >> > To: [email protected]
> >> > Subject: Problem in Input Format Class
> >> >
> >> > i want to read text file (sample given below) separated with
> >> semicolon(;)
> >> > using mapper. Each one record is separated with semicolon(;).
> >> > Should i write my own custom input format class? OR
> >> > Is there exist any input format class that ask about separator?
> >> >
> >> > input File look
> >> >
> >> > 1;00000003;310:012:8001:01;-05:00;04:04;2010;45;56164773;3;1;0;1;
> >> >
> >> >
> >>
> >>
> >> --
> >> Best Regards,
> >> Arsalan Bilal
> >>
> >
> >
>



-- 
Best Regards,
Arsalan Bilal

Reply via email to