Re: DocumentCategorizerME data

Dan Russ Thu, 20 Jul 2017 07:45:54 -0700

Hi Neeraj,
   I believe that the Document Categorizer picks the best label for document.  
It sounds like you want multiple classifers (if you have each sentence a 
document and you want to classify the sentence) OR use the NameFinderME to 
build a sequence classifier that can identify 1)sarasm, 2)surprise …
Daniel


> On Jul 20, 2017, at 8:53 AM, Kulkarni, Neeraj <neera...@gmail.com> wrote:
> 
> Hello all,
> 
> I wondered if anyone had insights on classifying data through multiple data
> sets or through a single data set for document categorizer ?
> 
> I am building a sentence category classifier (e.g. surprise, disgust,
> sarcasm, unknown) and wondered if one DocumentCategorizerME instance with
> training data with all four types is more effective (with confidence
> threshold of say 0.25 to decide on a category) or if I should seek to
> categorize sentences through three DocumentCategorizerME instances (one
> each for surprise, disgust, sarcasm, with confidence threshold of say 0.6
> to decide on a category, unknown otherwise).
> 
> I am a newbie to this mailing list, and apologize if this question is
> irrelevant. Please help me by pointing to the right direction.
> 
> Best,
> Neeraj

Re: DocumentCategorizerME data

Reply via email to