> Unstemming is pretty simple.  Just build an unstemming dictionary based on
> seeing what word forms have lead to a stemmed form.  Include frequencies.

This can lead to very funny (or not, depends how you look at it)
mistakes when different lemmas stem to the same token. How frequent
and important this phenomenon is varies from language to language (and
can be calculated apriori).

Dawid

Reply via email to