[
https://issues.apache.org/jira/browse/MAHOUT-916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13640575#comment-13640575
]
Ted Dunning commented on MAHOUT-916:
------------------------------------
Or not.
I am running on a 16 core machine and 1.5C parallelism causes 10% of tests to
fail.
Even 4 way parallelism causes errors like this:
{code}
Running org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFilesTest
Tests run: 4, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 3.806 sec <<<
FAILURE!
testCreateNamed(org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFilesTest)
Time elapsed: 0.265 sec <<< ERROR!
java.lang.RuntimeException: org.xml.sax.SAXParseException; lineNumber: 34;
columnNumber: 105; XML document structures must start and end within the same
entity.
at
com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:253)
at
com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:288)
at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:121)
at
org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1161)
at
org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:1109)
at
org.apache.hadoop.conf.Configuration.getProps(Configuration.java:1045)
at org.apache.hadoop.conf.Configuration.get(Configuration.java:397)
at
org.apache.hadoop.mapred.JobConf.checkAndWarnDeprecation(JobConf.java:1910)
at org.apache.hadoop.mapred.JobConf.<init>(JobConf.java:378)
at
org.apache.hadoop.mapred.LocalJobRunner$Job.<init>(LocalJobRunner.java:150)
at
org.apache.hadoop.mapred.LocalJobRunner.submitJob(LocalJobRunner.java:437)
at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:983)
at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:912)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:416)
at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1149)
at
org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:912)
at org.apache.hadoop.mapreduce.Job.submit(Job.java:500)
at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:530)
at
org.apache.mahout.vectorizer.SimpleTextEncodingVectorizer.createVectors(SimpleTextEncodingVectorizer.java:63)
at
org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFiles.run(EncodedVectorsFromSequenceFiles.java:99)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at
org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFiles.main(EncodedVectorsFromSequenceFiles.java:37)
at
org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFilesTest.runTest(EncodedVectorsFromSequenceFilesTest.java:110)
at
org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFilesTest.testCreateNamed(EncodedVectorsFromSequenceFilesTest.java:77)
Running org.apache.mahout.vectorizer.collocations.llr.GramKeyPartitionerTest
{code}
> Make Mahout's tests run in parallel
> -----------------------------------
>
> Key: MAHOUT-916
> URL: https://issues.apache.org/jira/browse/MAHOUT-916
> Project: Mahout
> Issue Type: Improvement
> Components: build
> Reporter: Grant Ingersoll
> Assignee: Isabel Drost
> Priority: Minor
> Labels: MAHOUT_INTRO_CONTRIBUTE
> Attachments: MAHOUT-916.patch, MAHOUT-916.patch
>
>
> Maven now supports parallel execution of tests. We should hook this in to
> Mahout.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira