Re: Tags generation?

Dawid Weiss Fri, 03 Aug 2012 12:08:01 -0700

> Unstemming is pretty simple.  Just build an unstemming dictionary based on
> seeing what word forms have lead to a stemmed form.  Include frequencies.


This can lead to very funny (or not, depends how you look at it)
mistakes when different lemmas stem to the same token. How frequent
and important this phenomenon is varies from language to language (and
can be calculated apriori).

Dawid

Re: Tags generation?

Reply via email to