We can write our own mapper because MapProcessorFactory instantiates it with
Class.forName(parserClass).getConstructor().newInstance();
and parserClass is set in the config file.
But the ReduceProcessorFactory instantiates the reducer class using the class name
"org.apache.hadoop.chukwa.extraction.demux.processor.reducer." + reduceType.
So the reducer class must be put in the package
org.apache.hadoop.chukwa.extraction.demux.processor.reducer and named
after the value of reduceType.
Why not put the reducer class name in the config file as well?
For example, {customdatatype_mapper} could name the mapper class
and {customdatatype_reducer} the reducer class.
This would give Chukwa more flexibility.
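
A rough sketch of the idea in Java, to make the proposal concrete. This is
not the existing ReduceProcessorFactory code: the class name
ReducerLookupSketch, both method names, and the "<reduceType>_reducer" config
key are assumptions made up for illustration, and java.util.Properties stands
in for whatever configuration object Chukwa actually passes around. The lookup
prefers a class name from the config and falls back to the current
package-prefix behaviour when none is set.

```java
import java.util.Properties;

class ReducerLookupSketch {
    // Package prefix quoted above; used when no class is configured.
    static final String DEFAULT_PACKAGE =
        "org.apache.hadoop.chukwa.extraction.demux.processor.reducer.";

    // Returns the fully qualified class name to load for reduceType.
    // The "<reduceType>_reducer" key is a hypothetical naming scheme.
    static String resolveReducerClassName(Properties conf, String reduceType) {
        String configured = conf.getProperty(reduceType + "_reducer");
        if (configured != null && !configured.trim().isEmpty()) {
            return configured.trim();
        }
        return DEFAULT_PACKAGE + reduceType;
    }

    // Instantiates the resolved class the same way the mapper is created:
    // reflection through a public no-argument constructor.
    static Object newReducer(Properties conf, String reduceType)
            throws ReflectiveOperationException {
        String className = resolveReducerClassName(conf, reduceType);
        return Class.forName(className).getConstructor().newInstance();
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        // Hypothetical user-supplied reducer living outside the Chukwa package.
        conf.setProperty("MyDataType_reducer", "com.example.MyReducer");

        System.out.println(resolveReducerClassName(conf, "MyDataType"));
        System.out.println(resolveReducerClassName(conf, "OtherType"));
    }
}
```

With a fallback like this, existing reducers in the default package would keep
working without any config change.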
--
Best regards,
Ivy Tang