[ 
https://issues.apache.org/jira/browse/MAHOUT-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13752049#comment-13752049
 ] 

Ted Dunning commented on MAHOUT-1302:
-------------------------------------

{quote}
I don't like the design of {{SequenceFilesFromMailArchives}} - it's using 
{{PrefixAdditionFilter}} which is a {{FileFilter}} to traverse the FS tree. 
That felt unnatural before my change, and feels even more unnatural now after 
the change.
{quote}
Stevo,

I really don't think that *anybody* is attached to the current implementation.  
If you see something that is better, I would very much expect that everybody 
would applaud and not complain.
                
> SequenceFilesFromMailArchivesTest.testSequential failing
> --------------------------------------------------------
>
>                 Key: MAHOUT-1302
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1302
>             Project: Mahout
>          Issue Type: Bug
>          Components: Integration
>    Affects Versions: 0.8
>         Environment: ubuntu-3 and ubuntu-6 Apache Jenkins nodes
>            Reporter: Stevo Slavic
>            Assignee: Suneel Marthi
>            Priority: Minor
>              Labels: test
>             Fix For: 0.9
>
>
> SequenceFilesFromMailArchivesTest.testSequential fails only on the ubuntu3 
> and ubuntu6 Jenkins nodes. As a result, MahoutQuality and integration job 
> builds pass or fail depending on which node they run on.
> The test fails because it expects the entries in the chunk-0 SequenceFile to 
> be in a specific order, but that order is not guaranteed given how chunk-0 
> is created and filled: SequenceFilesFromMailArchives traverses the input 
> using Java's
> File[] java.io.File.listFiles(FileFilter filter)
> which does not guarantee the order of the returned files/directories.
> Unless we want SequenceFileIterator to guarantee order by sorting, the test 
> needs to be changed to verify the presence of the given files and their 
> content, but not their exact order.
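
For illustration only, here is a minimal sketch of the "guarantee order by sorting" option mentioned in the description. It is not the Mahout code or the eventual fix; the class {{SortedListing}} and the helpers {{listSorted}} and {{demoDir}} are hypothetical names made up for this example. It relies only on the documented facts that {{File.listFiles(FileFilter)}} makes no ordering promise and that {{File}} implements {{Comparable<File>}}.

```java
import java.io.File;
import java.io.FileFilter;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.util.Arrays;

// Hypothetical helper class, not part of Mahout.
final class SortedListing {

  /** Lists the children of dir accepted by filter, sorted by pathname. */
  static File[] listSorted(File dir, FileFilter filter) {
    File[] children = dir.listFiles(filter);
    if (children == null) {
      // listFiles returns null if dir is not a directory or an I/O error occurs.
      return new File[0];
    }
    // File.compareTo orders abstract pathnames lexicographically, so the
    // result no longer depends on the order the file system reports entries.
    Arrays.sort(children);
    return children;
  }

  /** Creates a temporary directory containing empty files with the given names. */
  static File demoDir(String... names) {
    try {
      File dir = Files.createTempDirectory("mail-archives").toFile();
      for (String name : names) {
        Files.createFile(dir.toPath().resolve(name));
      }
      return dir;
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }

  public static void main(String[] args) {
    // Files are created out of order on purpose.
    File dir = demoDir("c.txt", "a.txt", "b.txt");
    for (File f : listSorted(dir, File::isFile)) {
      System.out.println(f.getName());
    }
  }
}
```

The other option from the description, leaving the traversal alone and making the test order-insensitive, would instead collect the entries read from chunk-0 into a set or map and compare that against the expected files and contents.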

