Adding instructions for MAHOUT-1794 to the readme.

Project: http://git-wip-us.apache.org/repos/asf/mahout/repo
Commit: http://git-wip-us.apache.org/repos/asf/mahout/commit/b25a70a1
Tree: http://git-wip-us.apache.org/repos/asf/mahout/tree/b25a70a1
Diff: http://git-wip-us.apache.org/repos/asf/mahout/diff/b25a70a1

Branch: refs/heads/master
Commit: b25a70a1bc6b9f8cb6c89947e0eaba5588463652
Parents: 67f0093
Author: Andrew Musselman <[email protected]>
Authored: Sat Mar 19 14:46:48 2016 -0700
Committer: Andrew Musselman <[email protected]>
Committed: Sat Mar 19 14:46:48 2016 -0700

----------------------------------------------------------------------
 examples/bin/README.txt | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/mahout/blob/b25a70a1/examples/bin/README.txt
----------------------------------------------------------------------
diff --git a/examples/bin/README.txt b/examples/bin/README.txt
index 503a687..7ad3a38 100644
--- a/examples/bin/README.txt
+++ b/examples/bin/README.txt
@@ -1,5 +1,8 @@
 This directory contains helpful shell scripts for working with some of Mahout's examples.
 
+To set a non-default temporary work directory: `export MAHOUT_WORK_DIR=/path/in/hdfs/to/temp/dir`
+ Note that this requires the same path to be writable both on the local file system as well as on HDFS.
+
 Here's a description of what each does:
 
 classify-20newsgroups.sh -- Run SGD and Bayes classifiers over the classic 20 News Groups. Downloads the data set automatically.
@@ -7,4 +10,4 @@ cluster-reuters.sh -- Cluster the Reuters data set using a variety of algorithms
 cluster-syntheticcontrol.sh -- Cluster the Synthetic Control data set. Downloads the data set automatically.
 factorize-movielens-1m.sh -- Run the Alternating Least Squares Recommender on the Grouplens data set (size 1M).
 factorize-netflix.sh -- (Deprecated due to lack of availability of the data set) Run the ALS Recommender on the Netflix data set.
-spark-document-classifier.mscala -- A mahout-shell script which trains and tests a Naive Bayes model on the Wikipedia XML dump and defines simple methods to classify new text.
\ No newline at end of file
+spark-document-classifier.mscala -- A mahout-shell script which trains and tests a Naive Bayes model on the Wikipedia XML dump and defines simple methods to classify new text.
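
The README note added in this commit could be applied along the following lines. This is only a minimal sketch and not part of the commit: the path /tmp/mahout-work-example is a hypothetical placeholder, and the HDFS step assumes the `hdfs` client is installed and configured (it is skipped otherwise). The write probe file name `.write-probe` is likewise made up for illustration.

```shell
#!/bin/sh
# Hypothetical location (an assumption, not from the commit); any path works
# as long as it is writable on both the local file system and HDFS.
MAHOUT_WORK_DIR="${MAHOUT_WORK_DIR:-/tmp/mahout-work-example}"
export MAHOUT_WORK_DIR

# Local side: create the directory and check that it is writable.
mkdir -p "$MAHOUT_WORK_DIR"
if [ ! -w "$MAHOUT_WORK_DIR" ]; then
  echo "local path $MAHOUT_WORK_DIR is not writable" >&2
fi

# HDFS side: only attempted when the hdfs client is on the PATH.
if command -v hdfs >/dev/null 2>&1; then
  # Create the same path on HDFS, then probe write access with an empty file.
  hdfs dfs -mkdir -p "$MAHOUT_WORK_DIR" \
    && hdfs dfs -touchz "$MAHOUT_WORK_DIR/.write-probe" \
    && hdfs dfs -rm "$MAHOUT_WORK_DIR/.write-probe" >/dev/null \
    || echo "HDFS path $MAHOUT_WORK_DIR is not writable" >&2
fi
```

With the variable exported in the same shell session, the example scripts in this directory would presumably pick up the non-default work directory when run afterwards.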
