Of course. This is now pretty easy if the dictionary is separated. The labeled matrix only cares about object identity of dictionary, not about what kinds of labels are there (except for get and put).
For that matter, it also doesn't care if the LabelDictionary is strictly 1:1 or probabilistically 1:~1. As you say, allowing collisions judiciously is very, very powerful. On Sat, Oct 3, 2009 at 11:30 PM, Jake Mannix <[email protected]> wrote: > > interface LabelDictionary<T> { > T getLabel(int index); > int getIndex(T label); > } > > allows for LabelDictionary<String> as choice, but allows for flexibility > (such > as having if they're strings or token ngrams keep track of their IDF or > number > of tokens or underlying type [i.e. you're doing regression on some model > with > a lot of numeric parameters, but pre-normalization they all carried > different > units]). > -- Ted Dunning, CTO DeepDyve
