[
https://issues.apache.org/jira/browse/MAHOUT-1336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mike Percy updated MAHOUT-1336:
-------------------------------
Attachment: MAHOUT-1336.patch
This patch adds an additional assert check for output path existence and
removes the problematic --mapred argument to SparseVectorsFromSequenceFiles
from the unit test. Test now correctly passes.
> HighDFWordsPrunerTest is failing silently
> -----------------------------------------
>
> Key: MAHOUT-1336
> URL: https://issues.apache.org/jira/browse/MAHOUT-1336
> Project: Mahout
> Issue Type: Bug
> Affects Versions: 0.8
> Reporter: Mike Percy
> Fix For: 0.9
>
> Attachments: MAHOUT-1336.patch
>
>
> Apparently ToolRunner does not allow the --mapred option. The validation is
> not very foolproof, so there is a resulting silent failure in
> HighDFWordsPrunerTest.
> Error message:
> {noformat}
> org.apache.commons.cli2.OptionException: Unexpected --mapred while processing
> Options
> at org.apache.commons.cli2.commandline.Parser.parse(Parser.java:99)
> at
> org.apache.mahout.vectorizer.SparseVectorsFromSequenceFiles.run(SparseVectorsFromSequenceFiles.java:154)
> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
> at
> org.apache.mahout.vectorizer.HighDFWordsPrunerTest.runTest(HighDFWordsPrunerTest.java:111)
> at
> org.apache.mahout.vectorizer.HighDFWordsPrunerTest.testHighDFWordsPruning(HighDFWordsPrunerTest.java:85)
> ...
> Usage:
>
> [--minSupport <minSupport> --analyzerName <analyzerName> --chunkSize
>
> <chunkSize> --output <output> --input <input> --minDF <minDF> --maxDFSigma
>
> <maxDFSigma> --maxDFPercent <maxDFPercent> --weight <weight> --norm <norm>
>
> --minLLR <minLLR> --numReducers <numReducers> --maxNGramSize <ngramSize>
>
> --overwrite --help --sequentialAccessVector --namedVector --logNormalize]
>
> O
> {noformat}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira