Dear Kiran, It seems that you are not using the correct version of Mahout. Please use version 0.6 as indicated. The easiest way to ensure that you use the correct version of the prerequisite packages is to download the whole benchmark from the CloudSuite website. I believe this will solve your problem.
Regards, Djordje ________________________________________ From: kiran yadav [[email protected]] Sent: Sunday, January 20, 2013 12:27 PM To: [email protected] Subject: [cloudsuite] Error running Data analytics benchmark Hi I am running Data Analytics Benchmark, getting the following error RAM size is 3 GB , i m running on single-node hadoop root@kiran-Inspiron-1525:/usr/local/hadoop# mahout wikipediaDataSetCreator -i wikipedia-training/chunks -o traininginput -c /usr/local/mahout-distribution-0.7/examples/temp/categories.txt Running on hadoop, using HADOOP_HOME=/usr/local/hadoop HADOOP_CONF_DIR=/usr/local/hadoop/conf MAHOUT-JOB: /usr/lib/mahout/mahout-examples-0.5-cdh3u5-job.jar Warning: $HADOOP_HOME is deprecated. 13/01/20 16:45:06 WARN driver.MahoutDriver: No wikipediaDataSetCreator.props found on classpath, will use command-line arguments only 13/01/20 16:45:06 INFO bayes.WikipediaDatasetCreatorDriver: Input: wikipedia-training/chunks Out: traininginput Categories: /usr/local/mahout-distribution-0.7/examples/temp/categories.txt 13/01/20 16:45:07 INFO common.HadoopUtil: Deleting traininginput 13/01/20 16:45:07 WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same. 13/01/20 16:45:09 INFO input.FileInputFormat: Total input paths to process : 85 13/01/20 16:45:09 INFO util.NativeCodeLoader: Loaded the native-hadoop library 13/01/20 16:45:09 WARN snappy.LoadSnappy: Snappy native library not loaded 13/01/20 16:45:11 INFO mapred.JobClient: Running job: job_201301201643_0001 13/01/20 16:45:12 INFO mapred.JobClient: map 0% reduce 0% 13/01/20 16:45:34 INFO mapred.JobClient: Task Id : attempt_201301201643_0001_m_000000_0, Status : FAILED java.lang.ArrayStoreException: [C at java.util.AbstractCollection.toArray(AbstractCollection.java:171) at org.apache.mahout.analysis.WikipediaAnalyzer.<init>(WikipediaAnalyzer.java:38) at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27) at java.lang.reflect.Constructor.newInstance(Constructor.java:513) at java.lang.Class.newInstance0(Class.java:355) at java.lang.Class.newInstance(Class.java:308) at org.apache.mahout.classifier.bayes.WikipediaDatasetCreatorMapper.setup(WikipediaDatasetCreatorMapper.java:107) at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370) at org.apache.hadoop.mapred.Child$4.run(Child.java:255) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:396) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121) at org.apache.hadoop.mapred.Child.main(Child.java:249) 13/01/20 16:45:34 INFO mapred.JobClient: Task Id : attempt_201301201643_0001_m_000001_0, Status : FAILED java.lang.ArrayStoreException: [C at java.util.AbstractCollection.toArray(AbstractCollection.java:171) at org.apache.mahout.analysis.WikipediaAnalyzer.<init>(WikipediaAnalyzer.java:38) at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27) at java.lang.reflect.Constructor.newInstance(Constructor.java:513) at java.lang.Class.newInstance0(Class.java:355) at java.lang.Class.newInstance(Class.java:308) at org.apache.mahout.classifier.bayes.WikipediaDatasetCreatorMapper.setup(WikipediaDatasetCreatorMapper.java:107) at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370) at org.apache.hadoop.mapred.Child$4.run(Child.java:255) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:396) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121) at org.apache.hadoop.mapred.Child.main(Child.java:249) 13/01/20 16:45:43 INFO mapred.JobClient: Task Id : attempt_201301201643_0001_m_000000_1, Status : FAILED java.lang.ArrayStoreException: [C at java.util.AbstractCollection.toArray(AbstractCollection.java:171) at org.apache.mahout.analysis.WikipediaAnalyzer.<init>(WikipediaAnalyzer.java:38) at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27) at java.lang.reflect.Constructor.newInstance(Constructor.java:513) at java.lang.Class.newInstance0(Class.java:355) at java.lang.Class.newInstance(Class.java:308) at org.apache.mahout.classifier.bayes.WikipediaDatasetCreatorMapper.setup(WikipediaDatasetCreatorMapper.java:107) at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370) at org.apache.hadoop.mapred.Child$4.run(Child.java:255) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:396) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121) at org.apache.hadoop.mapred.Child.main(Child.java:249) 13/01/20 16:45:43 INFO mapred.JobClient: Task Id : attempt_201301201643_0001_m_000001_1, Status : FAILED java.lang.ArrayStoreException: [C at java.util.AbstractCollection.toArray(AbstractCollection.java:171) at org.apache.mahout.analysis.WikipediaAnalyzer.<init>(WikipediaAnalyzer.java:38) at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27) at java.lang.reflect.Constructor.newInstance(Constructor.java:513) at java.lang.Class.newInstance0(Class.java:355) at java.lang.Class.newInstance(Class.java:308) at org.apache.mahout.classifier.bayes.WikipediaDatasetCreatorMapper.setup(WikipediaDatasetCreatorMapper.java:107) at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370) at org.apache.hadoop.mapred.Child$4.run(Child.java:255) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:396) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121) at org.apache.hadoop.mapred.Child.main(Child.java:249) 13/01/20 16:45:52 INFO mapred.JobClient: Task Id : attempt_201301201643_0001_m_000000_2, Status : FAILED java.lang.ArrayStoreException: [C at java.util.AbstractCollection.toArray(AbstractCollection.java:171) at org.apache.mahout.analysis.WikipediaAnalyzer.<init>(WikipediaAnalyzer.java:38) at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27) at java.lang.reflect.Constructor.newInstance(Constructor.java:513) at java.lang.Class.newInstance0(Class.java:355) at java.lang.Class.newInstance(Class.java:308) at org.apache.mahout.classifier.bayes.WikipediaDatasetCreatorMapper.setup(WikipediaDatasetCreatorMapper.java:107) at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370) at org.apache.hadoop.mapred.Child$4.run(Child.java:255) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:396) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121) at org.apache.hadoop.mapred.Child.main(Child.java:249) 13/01/20 16:45:52 INFO mapred.JobClient: Task Id : attempt_201301201643_0001_m_000001_2, Status : FAILED java.lang.ArrayStoreException: [C at java.util.AbstractCollection.toArray(AbstractCollection.java:171) at org.apache.mahout.analysis.WikipediaAnalyzer.<init>(WikipediaAnalyzer.java:38) at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27) at java.lang.reflect.Constructor.newInstance(Constructor.java:513) at java.lang.Class.newInstance0(Class.java:355) at java.lang.Class.newInstance(Class.java:308) at org.apache.mahout.classifier.bayes.WikipediaDatasetCreatorMapper.setup(WikipediaDatasetCreatorMapper.java:107) at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370) at org.apache.hadoop.mapred.Child$4.run(Child.java:255) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:396) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121) at org.apache.hadoop.mapred.Child.main(Child.java:249) 13/01/20 16:46:06 INFO mapred.JobClient: Job complete: job_201301201643_0001 13/01/20 16:46:06 INFO mapred.JobClient: Counters: 7 13/01/20 16:46:06 INFO mapred.JobClient: Job Counters 13/01/20 16:46:06 INFO mapred.JobClient: SLOTS_MILLIS_MAPS=67588 13/01/20 16:46:06 INFO mapred.JobClient: Total time spent by all reduces waiting after reserving slots (ms)=0 13/01/20 16:46:06 INFO mapred.JobClient: Total time spent by all maps waiting after reserving slots (ms)=0 13/01/20 16:46:06 INFO mapred.JobClient: Launched map tasks=8 13/01/20 16:46:06 INFO mapred.JobClient: Data-local map tasks=8 13/01/20 16:46:06 INFO mapred.JobClient: SLOTS_MILLIS_REDUCES=0 13/01/20 16:46:06 INFO mapred.JobClient: Failed map tasks=1 13/01/20 16:46:06 INFO driver.MahoutDriver: Program took 60579 ms Please help for this error -- Regards kiran
