Of course.  This is now pretty easy if the dictionary is separated.  The
labeled matrix only cares about object identity of dictionary, not about
what kinds of labels are there (except for get and put).

For that matter, it also doesn't care if the LabelDictionary is strictly 1:1
or probabilistically 1:~1.  As you say, allowing collisions judiciously is
very, very powerful.

On Sat, Oct 3, 2009 at 11:30 PM, Jake Mannix <[email protected]> wrote:

>
> interface LabelDictionary<T> {
>  T getLabel(int index);
>  int getIndex(T label);
> }
>
> allows for LabelDictionary<String> as choice, but allows for flexibility
> (such
> as having if they're strings or token ngrams keep track of their IDF or
> number
> of tokens or underlying type [i.e. you're doing regression on some model
> with
> a lot of numeric parameters, but pre-normalization they all carried
> different
> units]).
>



-- 
Ted Dunning, CTO
DeepDyve

Reply via email to