Take a look at MAHOUT-833; the patch there gives you this functionality.

On Sat, Feb 16, 2013 at 10:55 AM, Claudio Reggiani <[email protected]> wrote:
> Hello,
>
> I have a text dataset. Running "seqdirectory" command on it I see it's not
> written in MapReduce style (looking at the source code of
> SequenceFilesFromDirectory confirms that).
>
> What if I have a big dataset stored in HDFS and I would like to convert it
> in SequenceFile format? Do I need to create my own custom job or
> seqdirectory does that?
>
> Thanks
> Claudio Reggiani
