Folks, I am currently attempting to run the Synthetic_control data example on Mahout. I have installed Hadoop-1.2.1 and Mahout 0.6 in my Ubuntu.
I prepared the following hadoop runtime configuration file to set all the paths required. the following are the contents of the hadooprc.sh *export JAVA_HOME=/usr/lib/jvm/java-1.7.0-openjdk-i386 export HADOOP_HOME=/home/hduser/hadoop-1.2.1 export MAHOUT_HOME=/home/hduser/mahout-distribution-0.6 export PATH=$MAHOUT_HOME/bin:$JAVA_HOME/bin:$HADOOP_HOME/bin:$PATH export CLASSPATH=$JAVA_HOME:/home/hduser/hadoop-1.2.1/hadoop-core-1.2.1.jar:$MAHOUT_HOME/mahout-core-0.6.jar * And also tried the following as suggested by Saeed Iqbal's blog for runtime configuration file *export JAVA_HOME=/usr/lib/jvm/java-1.7.0-openjdk-i386 export HADOOP_HOME=/home/hduser/hadoop-1.2.1 export HADOOP_CONF_DIR=/home/hduser/hadoop-1.2.1/conf export MAHOUT_HOME=/home/hduser/mahout-distribution-0.6/bin export PATH=$PATH:$MAHOUT_HOME* The above file initializes Mahout and I followed the commands below to write the synthetic control data into HDFS. fRom this link: https://cwiki.apache.org/confluence/display/MAHOUT/Clustering+of+synthetic+control+data $HADOOP_HOME/bin/hadoop fs -mkdir testdata $HADOOP_HOME/bin/hadoop fs -put <PATH TO synthetic_control.data> testdata the mvn clean install option gave me a build failure error but when i typed maven -version i got the latest maven installed. I tried to enter this command from mahout_home/bin org.apache.mahout.clustering.syntheticcontrol.kmeans.Job and got the following error: org.apache.mahout.clustering.syntheticcontrol.kmeans.Job command not found Can anyone tell me where I am going wrong? how to fix this? really appreciate your help Regads Pavan
