[ 
https://issues.apache.org/jira/browse/MAHOUT-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13729302#comment-13729302
 ] 

Stevo Slavic commented on MAHOUT-1302:
--------------------------------------

Looking at standard error output at
https://builds.apache.org/job/Mahout-Quality/2184/testReport/org.apache.mahout.text/SequenceFilesFromMailArchivesTest/testSequential/
I'm no longer sure my initial analysis is entirely correct. There is something 
environment specific going on.
But, according to log for a failed execution like one above, processing order 
is same as for successful test run e.g. on my dev env:
{noformat}
Aug 05, 2013 12:48:21 AM org.slf4j.impl.JCLLoggerAdapter info
INFO: Command line arguments: {--body=null, --bodySeparator=[
], --charset=[UTF-8], --chunkSize=[64], --endPhase=[2147483647], 
--input=[/home/sslavic/svn/mahout/mahout/integration/target/test-data/mahout-SequenceFilesFromMailArchivesTest-8799937166382529536/mail-archives-in],
 --keyPrefix=[TEST], --method=[sequential], 
--output=[/home/sslavic/svn/mahout/mahout/integration/target/test-data/mahout-SequenceFilesFromMailArchivesTest-8799937166382529536/mail-archives-out],
 --separator=[], --startPhase=[0], --subject=null, --tempDir=[temp]}
Aug 05, 2013 12:48:22 AM org.apache.hadoop.util.NativeCodeLoader <clinit>
WARNING: Unable to load native-hadoop library for your platform... using 
builtin-java classes where applicable
Aug 05, 2013 12:48:22 AM org.slf4j.impl.JCLLoggerAdapter info
INFO: At 
/home/sslavic/svn/mahout/mahout/integration/target/test-data/mahout-SequenceFilesFromMailArchivesTest-8799937166382529536/mail-archives-in/subdir
Aug 05, 2013 12:48:22 AM org.slf4j.impl.JCLLoggerAdapter info
INFO: At 
/home/sslavic/svn/mahout/mahout/integration/target/test-data/mahout-SequenceFilesFromMailArchivesTest-8799937166382529536/mail-archives-in/subdir/subsubdir
Aug 05, 2013 12:48:22 AM org.slf4j.impl.JCLLoggerAdapter info
INFO: Parsed 2 messages from directory 
/home/sslavic/svn/mahout/mahout/integration/target/test-data/mahout-SequenceFilesFromMailArchivesTest-8799937166382529536/mail-archives-in/subdir/subsubdir
Aug 05, 2013 12:48:22 AM org.slf4j.impl.JCLLoggerAdapter info
INFO: Parsed 4 messages from directory 
/home/sslavic/svn/mahout/mahout/integration/target/test-data/mahout-SequenceFilesFromMailArchivesTest-8799937166382529536/mail-archives-in/subdir
Aug 05, 2013 12:48:22 AM org.slf4j.impl.JCLLoggerAdapter info
INFO: Parsed 4 messages from 
/home/sslavic/svn/mahout/mahout/integration/target/test-data/mahout-SequenceFilesFromMailArchivesTest-8799937166382529536/mail-archives-in
Aug 05, 2013 12:48:22 AM org.slf4j.impl.JCLLoggerAdapter info
INFO: Conversion took 465ms
{noformat}

Still seems like environment specific issue. Guessing maybe hdfs config is 
different, or local FS, or JVM vendor/version.
                
> SequenceFilesFromMailArchivesTest.testSequential failing
> --------------------------------------------------------
>
>                 Key: MAHOUT-1302
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1302
>             Project: Mahout
>          Issue Type: Bug
>          Components: Integration
>    Affects Versions: 0.8
>         Environment: ubuntu-3 and ubuntu-6 Apache Jenkins nodes
>            Reporter: Stevo Slavic
>            Assignee: Suneel Marthi
>            Priority: Minor
>              Labels: test
>             Fix For: 0.9
>
>
> SequenceFilesFromMailArchivesTest.testSequential is failing only on ubuntu3 
> and ubuntu6 Jenkins nodes. Because of that, MahoutQuality and integration job 
> builds either fail or are successful depending on where they get run.
> Test fails because it expects entries in chunk-0 SequenceFile to be in 
> specific order, but that order is not guaranteed because of the way the 
> chunk-0 is created/filled - SequenceFilesFromMailArchives traverses input 
> using Java's
> File[] java.io.File.listFiles(FileFilter filter)
> which does not guarantee order of files/directories.
> Unless we want in SequenceFileIterator to guarantee order by sorting, test 
> needs to be changed to verify presence of given files and their content, but 
> not their exact order.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to