I like this idea very much. More like adding metadata over sparse vectors To the ideo make it more verbose Vectors currently have a name. Which is the id of the original document/data point the vector points to ? It could also have fields or labels in which the vector belong to.
Question is, is there any other metadata we want to associate with a Vector ? Robin On Tue, Jan 19, 2010 at 6:22 PM, Olivier Grisel <olivier.gri...@ensta.org>wrote: > Hi all, > > Is there any existing LabeledVectorWritable implementation to combine > a possibly empty list of label indexes and a vector of features to > process and store labeled datasets for supervised learning algorithms? > > Maybe it would be worth having a two versions of such as Writable, one > for singly classified documents (one and only one label index perf > instance) and another for multi categorized documents (0 or more > category indexes per instance). > > -- > Olivier > http://twitter.com/ogrisel - http://code.oliviergrisel.name >