hello,

 I currently push apache logs into Chukwa.  I am trying to figure out how to
get all those logs into Cassandra and run mapreduce there.  Is the best
place to do this in Demux (right my own version of TSProcessor?)

 Also the data flow seems to miss a step.  The page
http://incubator.apache.org/chukwa/docs/r0.4.0/dataflow.html says in 3.3
that

   - demux moves complete files to: dataSinkArchives/[yyyyMMdd]/*/*.done
 - the next step is to move files from
postProcess/demuxOutputDir_*/[clusterName]/[dataType]/[dataType]_[yyyyMMdd]_[HH].R.evt

  How do they get from dataSinkArchives to postProcess?  does this run
inside of DemuxManager or a separate process (bin/chukwa demux) ?

 Thanks
 AD

Reply via email to