Sorry, I mean classifier, my bad... Unfortunatly I can't release the raw data. And I don't really know how to blind the data correctly. I could give the categories numbers and hash the text (title and abstract) if that would be enough...
Thanks and regards, David 2011/11/8 Ted Dunning <[email protected]> > Recommender? Recommenders are not normally used for adding categories to > documents. > > Is it possible for you to release blinded data in which all terms and > categories are replaced by numbers and permuted? Or even just stemmed and > sorted as with the RCV1 corpus? > > Having such a test corpus would make it much easier to get folks to help. > > On Tue, Nov 8, 2011 at 7:56 AM, David Rahman <[email protected] > >wrote: > > > I have a lots of data from where I work. The data are documents (title + > > abstract) and each document can have one or more categories (e.g. social > > sciences + policics). We want to build a recommender and analyze the > output > > for further testing. > > > > Thanks and regards, > > David > > > > 2011/11/8 Ted Dunning <[email protected]> > > > > > The practical techniques for such problems are pretty diverse. > > > > > > One method is to simply define multiple binary classifiers. If you can > > > stratify your labels, then you can have some labels depend on others. > > > Another option is to find commonly occurring sets of labels and build > > > classifiers for those sets directly. > > > > > > Do you have an application mind? Do you have data available? > > > > > > On Tue, Nov 8, 2011 at 6:37 AM, David Rahman < > [email protected] > > > >wrote: > > > > > > > Yes, I was asking for an example where multiple labels might be > aplied > > > to a > > > > single example. > > > > > > > > Thanks and regards, > > > > David > > > > > > > > 2011/11/8 Ted Dunning <[email protected]> > > > > > > > > > What exactly do you mean by multi-label classification? > > > > > > > > > > The 20 newsgroup example has many possible label values. > > > > > > > > > > Are you asking for an example where multiple labels might be > applied > > > to a > > > > > single example? If so, no, we don't have a nice example of that. > > > > > > > > > > On Tue, Nov 8, 2011 at 5:36 AM, David Rahman < > > > [email protected] > > > > > >wrote: > > > > > > > > > > > Hi, > > > > > > > > > > > > I have a general question about multi-label classification. > Binary- > > > or > > > > > > single-label classification is working, as shown in several > > examples > > > > > > (Wikipedia and 20Newsgroup, Mahout In Action book...). > > > > > > > > > > > > Are there some working examples on multi-label calssification for > > > > trying > > > > > > out? > > > > > > Or is there some data available on how mahout performs on > > mult-label > > > > > > classification problems? > > > > > > > > > > > > Thanks and regards, > > > > > > David > > > > > > > > > > > > > > > > > > > > >
