[ https://issues.apache.org/jira/browse/MAHOUT-560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Isabel Drost updated MAHOUT-560:
--------------------------------
Attachment: MAHOUT-560.patch
Attached are the changes I made. Any comments are welcome (especially if there
is an easier, more obvious way that I have overlooked).
> Support for more flexible file handling in text to sequence file conversion
> ---------------------------------------------------------------------------
>
> Key: MAHOUT-560
> URL: https://issues.apache.org/jira/browse/MAHOUT-560
> Project: Mahout
> Issue Type: Improvement
> Components: Utils
> Affects Versions: 0.5
> Reporter: Isabel Drost
> Assignee: Isabel Drost
> Priority: Trivial
> Fix For: 0.5
>
> Attachments: MAHOUT-560.patch
>
>
> Currently SequenceFilesFromDirectory supports converting text files to
> sequence files. The exact selection of files (and potentially of the text
> taken from each file) is not configurable. I'd like to re-use most of the
> conversion logic but change
> the exact text selection. (More information on what exactly I want to do:
> http://tinyurl.com/35pv8jg )
> I slightly changed SequenceFilesFromDirectory to make that possible: I added
> one optional parameter, and by default the current behaviour is used.
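
The idea in the description above (conversion logic that stays the same, with an optional hook that decides which files are selected) could be sketched roughly as follows. This is only an illustration in plain Java and is not the content of MAHOUT-560.patch: the class `FlexibleTextConversion`, the method `collect`, and the `selector` parameter are hypothetical names, and an in-memory map stands in for the sequence file so that the sketch needs no Hadoop dependency.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Predicate;
import java.util.stream.Stream;

// Hypothetical sketch, not Mahout code: the file selection is pluggable,
// while the walk-and-read logic is shared by every caller.
public class FlexibleTextConversion {

    // Default selection, used when no selector is supplied: accept every file.
    static final Predicate<Path> ACCEPT_ALL = path -> true;

    // Walks 'root' recursively and returns (key, text) records for each
    // regular file accepted by 'selector'. Passing null keeps the default.
    static Map<String, String> collect(Path root, Predicate<Path> selector) throws IOException {
        Predicate<Path> effective = (selector == null) ? ACCEPT_ALL : selector;
        Map<String, String> records = new TreeMap<>();
        try (Stream<Path> paths = Files.walk(root)) {
            for (Path p : (Iterable<Path>) paths::iterator) {
                if (Files.isRegularFile(p) && effective.test(p)) {
                    // Key: path relative to the root, with forward slashes.
                    String key = "/" + root.relativize(p).toString().replace('\\', '/');
                    // Value: the file content, assumed here to be UTF-8 text.
                    String value = new String(Files.readAllBytes(p), StandardCharsets.UTF_8);
                    records.put(key, value);
                }
            }
        }
        return records;
    }
}
```

Calling `collect(dir, null)` keeps the default behaviour, while something like `collect(dir, p -> p.getFileName().toString().endsWith(".txt"))` narrows the selection without touching the conversion code.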