Yeah, I tried out HDFS and set up the magic conf files for it. I see how having HADOOP_HOME set sent Mahout off to hdfs-land: it picked up the conf directory under HADOOP_HOME, and with it the hdfs:// default filesystem. Thanks!
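For anyone who hits the same thing, here is a rough Python sketch of the resolution rule as I understand it from this thread. It is not Hadoop's actual code: `qualify` and its `default_fs` parameter are made-up names standing in for what Path does with fs.default.name, and it assumes the shell has already expanded ~ into an absolute path.

```python
from urllib.parse import urlparse


def qualify(path, default_fs="file:///"):
    """Hypothetical stand-in for how an unqualified path gets a filesystem.

    `default_fs` plays the role of fs.default.name. This only illustrates
    the rule discussed in this thread; it is not Hadoop's implementation.
    """
    # A path that already names a scheme (file://, hdfs://, ...) is left alone.
    if urlparse(path).scheme:
        return path
    # Otherwise borrow the scheme and authority from the default filesystem.
    fs = urlparse(default_fs)
    return f"{fs.scheme}://{fs.netloc}{path}"


train_input = "/Users/lancenorskog/Documents/open/datasets/20news-bydate/bayes-train-input"

# With no Hadoop config picked up, the path stays on the local disk.
print(qualify(train_input))
# With the pseudo-distributed setting, the same path is looked up in HDFS.
print(qualify(train_input, "hdfs://localhost:9000"))
# An explicit file:// prefix wins regardless of the default.
print(qualify("file://" + train_input, "hdfs://localhost:9000"))
```

The second call produces the same hdfs://localhost:9000/Users/... path that shows up in the InvalidInputException below, and the third shows why spelling out file:/// made the job work again.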
On Fri, Feb 11, 2011 at 12:54 AM, Sean Owen <[email protected]> wrote:
> My guess is you've set "fs.default.name" in your Hadoop XML config
> files now? that controls how Path objects decide how to construe
> paths. I think it defaults to "file://", but setting up for
> pseudo-distributed mode, you're asked to set it to "hdfs://host:9000"
>
> On Fri, Feb 11, 2011 at 5:14 AM, Lance Norskog <[email protected]> wrote:
>> This is new, within the last week. When I changed ~/Documents/* to
>> file:///Users/lancenorskog/Documents/* this started working.
>> Somehow, file paths without url protocol handlers don't default to
>> file:// anymore.
>>
>> Lance-Norskogs-MacBook-Pro:mahout lancenorskog$ bin/mahout
>> trainclassifier -o
>> ~/Documents/open/datasets/20news-bydate/bayes-model-bgrams -i
>> ~/Documents/open/datasets/20news-bydate/bayes-train-input -type cbayes
>> -ng 2
>> Running on hadoop, using
>> HADOOP_HOME=/Users/lancenorskog/Documents/open/hadoop-0.20.2
>> No HADOOP_CONF_DIR set, using
>> /Users/lancenorskog/Documents/open/hadoop-0.20.2/conf
>> 11/02/10 21:07:18 INFO bayes.TrainClassifier: Training Complementary
>> Bayes Classifier
>> 11/02/10 21:07:19 INFO cbayes.CBayesDriver: Reading features...
>> 11/02/10 21:07:19 WARN mapred.JobClient: Use GenericOptionsParser for
>> parsing the arguments. Applications should implement Tool for the
>> same.
>> Exception in thread "main"
>> org.apache.hadoop.mapred.InvalidInputException: Input path does not
>> exist:
>> hdfs://localhost:9000/Users/lancenorskog/Documents/open/datasets/20news-bydate/bayes-train-input
>>         at
>> org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:190)
>>         at
>> org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:201)
>>         at
>> org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
>>         at
>> org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
>>         at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
>>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
>>
>> --
>> Lance Norskog
>> [email protected]
>>
>

--
Lance Norskog
[email protected]
