> Unstemming is pretty simple. Just build an unstemming dictionary based on > seeing what word forms have lead to a stemmed form. Include frequencies.
This can lead to very funny (or not, depends how you look at it) mistakes when different lemmas stem to the same token. How frequent and important this phenomenon is varies from language to language (and can be calculated apriori). Dawid
