Support for more flexible file handling in text to sequence file conversion
---------------------------------------------------------------------------
Key: MAHOUT-560
URL: https://issues.apache.org/jira/browse/MAHOUT-560
Project: Mahout
Issue Type: Improvement
Components: Utils
Affects Versions: 0.5
Reporter: Isabel Drost
Assignee: Isabel Drost
Priority: Trivial
Fix For: 0.5
Currently SequenceFilesFromDirectory supports for conversion of texts to
sequence file. The exact file (and potentially text from file) selection is not
configurable. I'd like to re-use most of the conversion logic but change the
exact text selection. (More information on what exactly I want to do:
http://tinyurl.com/35pv8jg )
I slightly changed SequenceFilesFromDirectory to make that possible. (Added one
additional optional parameter, but by default the current behaviour is used).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.