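My guess is that you have now set "fs.default.name" in your Hadoop XML config files? That property controls how Path objects resolve paths that have no scheme. I think it defaults to "file:///", but when setting up pseudo-distributed mode you're asked to set it to "hdfs://host:9000". That would explain the hdfs://localhost:9000 prefix in your error, and why an explicit file:/// URI works.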
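
To make the effect concrete, here is a rough sketch in plain Python of how a scheme-less path ends up qualified against the default filesystem. It is not Hadoop code: the function name qualify is made up for illustration, and it assumes absolute paths only (your shell has already expanded ~ by the time Mahout sees the argument).

```python
from urllib.parse import urlsplit


def qualify(path: str, default_fs: str) -> str:
    """Hypothetical helper: mimic how a path without a scheme is
    resolved against the configured default filesystem URI."""
    if urlsplit(path).scheme:
        # The path already names its filesystem (file://, hdfs://, ...),
        # so the default filesystem setting is not consulted.
        return path
    fs = urlsplit(default_fs)
    # Assumption: `path` is absolute, so it can be appended directly
    # to the scheme and authority of the default filesystem.
    return f"{fs.scheme}://{fs.netloc}{path}"


train_input = "/Users/lancenorskog/Documents/open/datasets/20news-bydate/bayes-train-input"

# Local default: the bare path stays on the local disk.
print(qualify(train_input, "file:///"))
# Pseudo-distributed default: the same bare path now points into HDFS.
print(qualify(train_input, "hdfs://localhost:9000"))
# An explicit scheme is left alone regardless of the default.
print(qualify("file://" + train_input, "hdfs://localhost:9000"))
```

With the hdfs://localhost:9000 default, the second call produces the same URI that appears in your InvalidInputException.
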
On Fri, Feb 11, 2011 at 5:14 AM, Lance Norskog <[email protected]> wrote:
> This is new, within the last week. When I changed ~/Documents/* to
> file:///Users/lancenorskog/Documents/* this started working.
> Somehow, file paths without url protocol handlers don't default to
> file:// anymore.
>
> Lance-Norskogs-MacBook-Pro:mahout lancenorskog$ bin/mahout
> trainclassifier -o
> ~/Documents/open/datasets/20news-bydate/bayes-model-bgrams -i
> ~/Documents/open/datasets/20news-bydate/bayes-train-input -type cbayes
> -ng 2
> Running on hadoop, using
> HADOOP_HOME=/Users/lancenorskog/Documents/open/hadoop-0.20.2
> No HADOOP_CONF_DIR set, using
> /Users/lancenorskog/Documents/open/hadoop-0.20.2/conf
> 11/02/10 21:07:18 INFO bayes.TrainClassifier: Training Complementary
> Bayes Classifier
> 11/02/10 21:07:19 INFO cbayes.CBayesDriver: Reading features...
> 11/02/10 21:07:19 WARN mapred.JobClient: Use GenericOptionsParser for
> parsing the arguments. Applications should implement Tool for the
> same.
> Exception in thread "main"
> org.apache.hadoop.mapred.InvalidInputException: Input path does not
> exist:
> hdfs://localhost:9000/Users/lancenorskog/Documents/open/datasets/20news-bydate/bayes-train-input
>         at org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:190)
>         at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:201)
>         at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
>         at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
>         at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
>
> --
> Lance Norskog
> [email protected]
>
