[ 
https://issues.apache.org/jira/browse/MAHOUT-916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13640575#comment-13640575
 ] 

Ted Dunning commented on MAHOUT-916:
------------------------------------

Or not.

I am running on a 16 core machine and 1.5C parallelism causes 10% of tests to 
fail.

Even 4 way parallelism causes errors like this:

{code}
Running org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFilesTest
Tests run: 4, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 3.806 sec <<< 
FAILURE!
testCreateNamed(org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFilesTest)
  Time elapsed: 0.265 sec  <<< ERROR!
java.lang.RuntimeException: org.xml.sax.SAXParseException; lineNumber: 34; 
columnNumber: 105; XML document structures must start and end within the same 
entity.
        at 
com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:253)
        at 
com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:288)
        at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:121)
        at 
org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1161)
        at 
org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:1109)
        at 
org.apache.hadoop.conf.Configuration.getProps(Configuration.java:1045)
        at org.apache.hadoop.conf.Configuration.get(Configuration.java:397)
        at 
org.apache.hadoop.mapred.JobConf.checkAndWarnDeprecation(JobConf.java:1910)
        at org.apache.hadoop.mapred.JobConf.<init>(JobConf.java:378)
        at 
org.apache.hadoop.mapred.LocalJobRunner$Job.<init>(LocalJobRunner.java:150)
        at 
org.apache.hadoop.mapred.LocalJobRunner.submitJob(LocalJobRunner.java:437)
        at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:983)
        at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:912)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:416)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1149)
        at 
org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:912)
        at org.apache.hadoop.mapreduce.Job.submit(Job.java:500)
        at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:530)
        at 
org.apache.mahout.vectorizer.SimpleTextEncodingVectorizer.createVectors(SimpleTextEncodingVectorizer.java:63)
        at 
org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFiles.run(EncodedVectorsFromSequenceFiles.java:99)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at 
org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFiles.main(EncodedVectorsFromSequenceFiles.java:37)
        at 
org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFilesTest.runTest(EncodedVectorsFromSequenceFilesTest.java:110)
        at 
org.apache.mahout.vectorizer.EncodedVectorsFromSequenceFilesTest.testCreateNamed(EncodedVectorsFromSequenceFilesTest.java:77)

Running org.apache.mahout.vectorizer.collocations.llr.GramKeyPartitionerTest
{code}
                
> Make Mahout's tests run in parallel
> -----------------------------------
>
>                 Key: MAHOUT-916
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-916
>             Project: Mahout
>          Issue Type: Improvement
>          Components: build
>            Reporter: Grant Ingersoll
>            Assignee: Isabel Drost
>            Priority: Minor
>              Labels: MAHOUT_INTRO_CONTRIBUTE
>         Attachments: MAHOUT-916.patch, MAHOUT-916.patch
>
>
> Maven now supports parallel execution of tests.  We should hook this in to 
> Mahout.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to