[
https://issues.apache.org/jira/browse/MAHOUT-1319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13853776#comment-13853776
]
Suneel Marthi commented on MAHOUT-1319:
---------------------------------------
Uploading a new patch that takes a filter class that implements PathFilter.
Unlike the sequential version the MR version already handles the keyprefix and
chunk sizes without the need of a filter class (like PrefixAdditionFilter).
With this patch it should be possible to pass in a CustomFilter that implements
PathFilter to the MR version of seqdirectory.
> seqdirectory -filter argument silently ignored when run as MR
> -------------------------------------------------------------
>
> Key: MAHOUT-1319
> URL: https://issues.apache.org/jira/browse/MAHOUT-1319
> Project: Mahout
> Issue Type: Bug
> Components: Integration
> Affects Versions: 0.8
> Reporter: Liz Merkhofer
> Assignee: Suneel Marthi
> Labels: seqdirectory, text
> Fix For: 0.9
>
> Attachments: MAHOUT-1319-custom-filter.patch, MAHOUT-1319.patch
>
>
> Running "seqdirectory" (Sequence Files from Input Directory) from the command
> line and specifying a custom filter using the -filter parameter, the argument
> is ignored and the default "PrefixAdditionFilter" is used on the input. No
> exception is thrown.
> When the same command is run with "-xm sequential", the filter is found and
> works as expected.
--
This message was sent by Atlassian JIRA
(v6.1.4#6159)