Re: Writing custom parser

AD Mon, 26 Dec 2011 09:55:01 -0800

Also I wrote my own processor by just mimicking one of the existing like
TSProcessor.  Its pretty simple you only need to implement a couple methods
in a new classfile.  Here is a massively oversimplified example.


package org.apache.hadoop.chukwa.extraction.demux.processor.mapper;

public class MyCustomProcessor extends AbstractProcessor {

@Override
  protected void parse(String recordEntry,OutputCollector<ChukwaRecordKey,
ChukwaRecord> output, Reporter reporter) {

  ChukwaRecord record = new ChukwaRecord();
        this.buildGenericRecord(record, recordEntry, archiveKey
            .getTimePartition(), chunk.getDataType());

   // Do all your parsing here then add whatever fields you want to the
record

   record.add("some_field","some_value");
   output.collect(key,record);
 }
}


On Mon, Dec 26, 2011 at 12:24 PM, Eric Yang <[email protected]> wrote:

> To store data to HBase, the demux parser must be implanted with Chukwa
> HBase annotation.
> For more details, check out this document:
>
> http://people.apache.org/~eyang/chukwa-0.5.0-docs/programming.html
>
> Section: Demux Data To HBase
>
> Hope this helps.
>
> regards,
> Eric
>
> On Mon, Dec 26, 2011 at 5:47 AM, Mohammad Tariq <[email protected]>
> wrote:
> > Hello list,
> >
> >    I have used Chukwa, without any modification till now. Now I want
> > to collect data that is in form of "xml" and store it in Hbase tables
> > and for that I have to write the parser.Since it is my first attempt
> > of doing such thing, I would like to have some guidance from the
> > experts. I am using this link for getting started
> > "http://wiki.apache.org/hadoop/DemuxModification".If anyone is aware
> > of some other place that can help me, please let me know.Many thanks.
> >
> > Regards,
> >     Mohammad Tariq
>

Re: Writing custom parser

Reply via email to