Hi,

   I am running the item recommender on the Netflix data, but the job
always failed because there was not enough local disk space. So I cut the
user IDs in half (the user IDs, not the user accounts) to reduce the
temporary data. Now the job finishes, but it takes 40 hours. The command
is as follows:

hadoop jar /app/mahout-distribution-0.5/core/target/mahout-core-0.5-job.jar 
org.apache.mahout.cf.taste.hadoop.item.RecommenderJob -Dmapred.map.tasks=196 
-Dmapred.reduce.tasks=196 -Dmapred.input.dir=NetFlix_data_new 
-Dmapred.output.dir=output_netflix8

My Hadoop cluster:

28 nodes
16 GB memory per node
8 cores per node
250 GB local disk per node
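
To put these numbers in context, here is a rough sketch of the aggregate capacity. It assumes that all 28 nodes are identical and that the 196 map and 196 reduce tasks are spread evenly across them. The function name cluster_totals is made up for illustration only.

```python
def cluster_totals(nodes, mem_gb_per_node, cores_per_node, disk_gb_per_node):
    """Sum per-node resources, assuming every node has the same spec."""
    return {
        "memory_gb": nodes * mem_gb_per_node,
        "cores": nodes * cores_per_node,
        "disk_gb": nodes * disk_gb_per_node,
    }


# Figures taken from the cluster description above.
totals = cluster_totals(nodes=28, mem_gb_per_node=16,
                        cores_per_node=8, disk_gb_per_node=250)

# 196 tasks over 28 nodes, assuming an even spread (an assumption,
# the scheduler may place tasks differently).
tasks_per_node = 196 / 28

print(totals)
print(tasks_per_node)
```

That is 224 cores, 448 GB of memory and 7000 GB of local disk in total, with about 7 map tasks and 7 reduce tasks per node if they are spread evenly.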



