with Mahout 0.8, I'd try: mahout vectordump --seqFile *-i* tn/topics/part-m-00000 --dictionary tn/vectors/dictionary.file-0 --dictionaryType sequencefile --output topicdump.txt -sort *true*--vectorSize 1
On Fri, Aug 23, 2013 at 8:36 AM, Charly Lizarralde < [email protected]> wrote: > Thanks! Docs are in spanish, so, maybe I should provide the spanish list > then... > > The vector dump comand is: mahout vectordump --seqFile > tn/topics/part-m-00000 --dictionary tn/vectors/dictionary.file-0 > --dictionaryType sequencefile --output topicdump.txt -sort --vectorSize 10 > > And the output ( topicdump.txt) is: > > > {yo:0.05347025391826375,pero:0.01621850129739,hay:0.01256010577070346,como:0.015645488997146385,apellido:0.0138762425391 > > 95612,quiero:0.02141736852909945,mis:0.011207260571060144,mi:0.10256988732241464,me:0.13765803016629644,zi:0.03454083289 > 2116805} > > As you can see, the topicterms are not sorted. >
