Dear Wiki user, You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.
The following page has been changed by andyk: http://wiki.apache.org/hadoop/Chukwa_Quick_Start ------------------------------------------------------------------------------ The cluster deployment process is still under active development, thus it is possible that the following instructions may not work yet, but they will soon, so please don't delete them. Eventually, even the single machine setup (for newcomers to Chukwa who want to try it out of the box on their) above will be replaced by the below process, renaming the conf/slaves.template and conf/collectors.template files (to remove the .template suffix) for the defaults of localhost for the collector and agent. '''Configure Chukwa''' + (For an explanation of each configuration file in the conf directory, see ["Chukwa Configuration"]) + 1. Specify your JAVA_HOME and HADOOP_HOME in conf/chukwa-env.sh 1. Specify which hosts to run collectors on in the conf/collectors file. 1. Like in Hadoop, you need to specify a set of nodes on which you want to run Chukwa agents (similar to conf/slaves in Hadoop) using a conf/slaves file. The local agents on each machine will also reference the conf/collectors file, selecting a collector at random from this list to talk to. Thus, like Hadoop, it is common to run Chukwa from a shared file system where all of the agents (i.e. slaves) can access the same conf files. - 1. Setup the initial adaptors you want to run on every agent in the chukwa cluster by copying conf/initial_adaptors.template to conf/initial_adaptors and adding whichever adaptors you see fit. See the ["Chukwa_Adaptors_List"] for a catalog of pre-made adaptors. + 1. Setup the initial adaptors you want to run on every agent in the chukwa cluster by copying conf/initial_adaptors.template to conf/initial_adaptors and adding whichever adaptors you see fit. See the ["Chukwa Adaptors List"] for a catalog of pre-made adaptors. '''Run Chukwa''' 1. run bin/start-all.sh to start all agents, collectors, data loaders, and schedule the demux MapReduce job to run every 5 minutes over top of the datasink. Use bin/stop-all.sh to shut everything down.
